• TIMER= 25.07.2024
  • bitcoinBitcoin (BTC) $ 42,977.00 0.18%
  • ethereumEthereum (ETH) $ 2,365.53 1.12%
  • tetherTether (USDT) $ 1.00 0.2%
  • bnbBNB (BNB) $ 302.66 0.19%
  • solanaSolana (SOL) $ 95.44 1.28%
  • xrpXRP (XRP) $ 0.501444 0.1%
  • usd-coinUSDC (USDC) $ 0.996294 0.34%
  • staked-etherLido Staked Ether (STETH) $ 2,367.26 1.4%
  • cardanoCardano (ADA) $ 0.481226 2.68%
  • avalanche-2Avalanche (AVAX) $ 34.37 1.19%
  • bitcoinBitcoin (BTC) $ 42,977.00 0.18%
    ethereumEthereum (ETH) $ 2,365.53 1.12%
    tetherTether (USDT) $ 1.00 0.2%
    bnbBNB (BNB) $ 302.66 0.19%
    solanaSolana (SOL) $ 95.44 1.28%
    xrpXRP (XRP) $ 0.501444 0.1%
    usd-coinUSDC (USDC) $ 0.996294 0.34%
    staked-etherLido Staked Ether (STETH) $ 2,367.26 1.4%
    cardanoCardano (ADA) $ 0.481226 2.68%
    avalanche-2Avalanche (AVAX) $ 34.37 1.19%
image-alt-13BTC Dominance: 51.25%
image-alt-14 ETH Dominance: 16.27%
image-alt-15 BTC/ETH Ratio: 13%
image-alt-16 Total Market Cap 24h: $1.65T
image-alt-17Volume 24h: $42.89B
image-alt-18 ETH Gas Price: 26 Gwei

Ideogram Is A New AI Image Generator That Obliterates the Competition, Outperforming MidJourney and Dall-E 3

Ideogram AI—a startup founded by former Google engineers alongside members from prestigious institutions like UC Berkeley, Carnegie Mellon University, and...
Firefly An Ancient African Statue; Science Fiction Art; Futuristic Feel; Cinematic Shot In A Jungle

“We’re excited to release Ideogram 1.0, our most advanced text-to-image model to date,” Ideogram AI said in an official blog post. “Trained from scratch like all Ideogram models, Ideogram 1.0 offers state-of-the-art text rendering, unprecedented photorealism, and prompt adherence—and a new feature called Magic Prompt that helps you write detailed prompts for beautiful, creative images.”

The release comes alongside news of a $80 million Series A fundraise led by Andreessen Horowitz, along with Redpoint Ventures, Pear VC, and SV Angel.

Decrypt was able to test the model and Ideogram AI’s claims are not wildly overstated—a side by side comparison can be found below. Version one of Ideogram is a clear improvement over its v0.1 and v0.2 predecessors: it excels in prompt adherence, image quality, and text generation capabilities.

The model is not open-source, so there is limited visibility into its plumbing and no research paper to evaluate. But the results obtained with the model spoke for themselves, potentially making it the best model currently available—at least until Stable Diffusion 3 is publicly released.

The new model is arguably the most capable image generator in terms of text capabilities, generating longer text strings with fewer errors than Dall-E 3 or MidJourney. The current free tier also gives it an edge over competitors like Dall-E 3 and MidJourney, the latter of which has no free tier. Microsoft Copilot also uses Dall-E 3, but it only generates square 1:1 images, whereas Ideogram supports a wider set of aspect ratios.

Ideogram also offers two paid plans of $7 and $15 per month, which give access to over 400 generations per day along with other perks like an image editor, better quality downloads, img2img—which allows modifications or variations on an existing image—and private generations. All lower tiers display requested images publicly.

Ideogram is capable of understanding long prompts, going toe to toe with Stable Diffusion 3, and beating all other image generators in this field.

One of the standout features of Ideogram is “Prompt Magic,” which can be turned on and off. This feature analyzes the prompt and enhances it to create images of better quality, essentially giving the model the ability to understand natural language like Dall-E 3. However, Ideogram is more versatile because this feature is optional. It’s always turned on with ChatGPT Plus, which sometimes leads to inaccuracies.

Finally, Ideogram is less aggressively censored than MidJourney and Dall-E 3, and is so far capable of generating images of famous people, company logos, and art styles. It does not go fully NSFW, but it is more discrete when it comes to censoring prompts.

And early testers seem to prefer Ideogram over other models. “Using an evaluation protocol like that of DALL·E 3, we find that human raters prefer Ideogram 1.0 over DALL·E 3 and Midjourney V6 in prompt alignment, image coherence, overall preference, and text rendering quality,” the startup said.

Decrypt tested Ideogram’s capabilities and compared it against its top competitors, MidJourney and Dall-E 3. Stable Diffusion 3 and Google’s top-of-the-line ImageFX are not being evaluated here because SD3 is not released yet and ImageFX is not widely available.

Generating long strings of text

Prompt: A futuristic Android in Cyberpunk City with a sign that reads, “Don’t be late in the AI trend: Emerge by Decrypt”

Ideogram AI was able to portray both the requested aesthetics and the text. It had a typo, however, generating “thee” instead of “the.”

MidJourney was not able to generate any coherent text at all, and focused on generating a futuristic android with detail. It is the main subject of the whole composition. The city is not cyberpunk at all.

Dall-E 3 ranks in the middle. It was able to generate the futuristic robot, the city is cyberpunk, but the sign didn’t feature the word “Emerge.”

Interestingly enough, Ideogram understood that the robot was in the city and associated with the sign, whereas Dall-E assumed that the sign was part of the cityscape.

Long prompts and spatial capabilities

Prompt: A surreal and intriguing scene featuring a cat perched on top of a television next to a sign that reads “Emerge.” In the background, a futuristic android stands on one side and an astronaut on the other. The room’s walls are adorned with a striking image of a molecule and a DNA chain.

Free and widely available out of the gate, Ideogram may be the best image generator currently on the market. It is great at natural language understanding and has outstanding spatial capabilities and prompt adherence. It is also the best text generator currently available.

If aesthetics are the most important consideration—to the point where adherence and text is less important—then MidJourney might remain a solid competitor for specific use cases. While not especially strong and heavily censored, Dall-E 3 may still make sense as part of a ChatGPT Plus subscription.






Latest Iron Podcast episode