No menu items!

    OpenAI president shares first picture generated by GPT-4o

    Date:

    Share post:

    Be part of us in returning to NYC on June fifth to collaborate with govt leaders in exploring complete strategies for auditing AI fashions concerning bias, efficiency, and moral compliance throughout numerous organizations. Discover out how one can attend right here.


    OpenAI’s president Greg Brockman has posted from his X account what seems to be the primary public picture generated utilizing the corporate’s model new GPT-4o mannequin.

    As you’ll see within the picture under, it’s fairly convincingly photorealistic, exhibiting an individual sporting a black T-shirt with an OpenAI brand writing chalk textual content on a blackboard that reads “Transfer between Modalities. Suppose we directly model P (text, pixels, sound) with one big autoregressive transformer. What are the pros and cons?”

    The brand new GPT-4o mannequin, which debuted on Monday, improves upon the prior GPT-4 household of fashions (GPT-4, GPT-4 Imaginative and prescient, and GPT-4 Turbo) by being quicker, cheaper, and retaining extra info from inputs akin to audio and imaginative and prescient.

    It’s in a position to take action as a result of OpenAI took a distinct method from its prior GPT-4 class LLMs. Whereas these chained a number of totally different fashions collectively and transformed different media akin to audio and visuals to textual content and again, the brand new GPT-4o was skilled on multimedia tokens from the get-go, permitting it to immediately analyze and interpret imaginative and prescient and audio with out first changing it into textual content.

    VB Occasion

    The AI Affect Tour: The AI Audit

    Be part of us as we return to NYC on June fifth to interact with high govt leaders, delving into methods for auditing AI fashions to make sure equity, optimum efficiency, and moral compliance throughout numerous organizations. Safe your attendance for this unique invite-only occasion.


    Request an invitation

    Based mostly on the above picture, the brand new method is a noticeable enchancment over OpenAI’s final picture era mannequin DALL-E 3 which debuted in September 2023. I ran an analogous immediate via DALL-E 3 in ChatGPT and right here is the end result.

    As you’ll be able to see, the picture shared by Brockman created with GPT-4o improves considerably in high quality, photorealism, and accuracy of textual content era.

    Nevertheless, GPT-4o’s native picture era capabilities are usually not but publicly obtainable. As Brockman alluded to in his X put up by saying “Team is working hard to bring those to the world.”

    Related articles

    Hugging Face brings ‘Pi-Zero’ to LeRobot, making AI-powered robots simpler to construct and deploy

    Be a part of our every day and weekly newsletters for the most recent updates and unique content...

    Pour one out for Cruise and why autonomous car check miles dropped 50%

    Welcome again to TechCrunch Mobility — your central hub for information and insights on the way forward for...

    Anker’s newest charger and energy financial institution are again on sale for record-low costs

    Anker made a variety of bulletins at CES 2025, together with new chargers and energy banks. We noticed...

    GitHub Copilot previews agent mode as marketplace for agentic AI coding instruments accelerates

    Be a part of our every day and weekly newsletters for the newest updates and unique content material...