Hurry! 1M Free Tokens Waiting for You – Register Today!

  • Home
  • Models
    • Grok 4 API
    • Suno v4.5
    • GPT-image-1 API
    • GPT-4.1 API
    • Qwen 3 API
    • Llama 4 API
    • GPT-4o API
    • GPT-4.5 API
    • Claude Opus 4 API
    • Claude Sonnet 4 API
    • DeepSeek R1 API
    • Gemini2.5 pro
    • Runway Gen-3 Alpha API
    • FLUX 1.1 API
    • Kling 1.6 Pro API
    • All Models
  • Enterprise
  • Pricing
  • API Docs
  • Blog
  • Contact
Sign Up
Log in
Technology

Google Major Launch Imagen 4, Imagen 4 Ultra and Veo 3 models at Google I/O 2025

2025-05-19 anna No comments yet

Google is set to unveil its next-generation generative AI models—Imagen 4, Imagen 4 Ultra, and Veo 3—during its annual Google I/O developer conference on May 20, 2025. Early leaks of preview identifiers (e.g., imagen-4.0-generate-preview-05-20, imagen-4.0-ultra-generate-exp-05-20, veo-3.0-generate-preview) signal a staged rollout and multiple capability tiers across both image and video synthesis domains . Imagen 4 aims to deliver significant gains in photorealism, prompt fidelity, and stylistic consistency over Imagen 3, while the “Ultra” variant may offer even higher resolution or specialized performance modes . On the video side, Veo 3 promises more coherent clip-to-clip continuity and robust style adherence compared to Veo 2 . All three models are expected to integrate tightly with Google’s Gemini AI ecosystem, enabling seamless transitions from text prompts to images or videos within the same workflow .


Preview Identifiers and Rollout Strategy

Staged Previews: Internal references such as

  • imagen-4.0-generate-preview-05-20
  • imagen-4.0-ultra-generate-exp-05-20
  • veo-3.0-generate-preview

Have surfaced in code repositories and API previews, indicating Google’s intention to offer both standard and “Ultra” performance tiers for image generation, as well as an advanced video model preview for early testers.

Google I/O Launch:

These identifiers strongly suggest Google will showcase and potentially grant preview access to developers at I/O on May 20, 2025, mirroring previous rollouts for Imagen 3 and Veo 2.


What’s New in Imagen 4

Photorealism and Fidelity

  • Enhanced Rendering: Imagen 4 reportedly achieves greater photorealistic detail, reducing artifacts and improving color accuracy. Early rumors suggest improvements in understanding complex prompts, such as nuanced lighting or reflections .
  • Prompt Adherence: The model is expected to follow user instructions more precisely, delivering images that better match both content and style directives (e.g., “oil painting of sunset over mountains”) .

Style Consistency

  • Multi-Image Cohesion: Imagen 4 is designed to maintain a consistent visual style across multiple outputs, benefiting use cases like storyboarding or product catalog creation, where uniformity is critical .
  • Ultra Variant: The “Ultra” tier (imagen‑4.0‑ultra) likely offers higher-resolution outputs or specialized optimizations (e.g., ultra-high fidelity for print media) for enterprise and creative professionals .

What’s New in Veo 3

Improved Coherence

  • Clip-to-Clip Continuity: Veo 3 aims to generate video sequences where successive shots maintain consistent framing, lighting, and character appearance, addressing limitations in Veo 2 around visual drift over time .
  • Style Fidelity: The model focuses on replicating artistic or cinematic styles more faithfully, making it easier to produce videos in a desired aesthetic (e.g., noir, pastel animation).

Integration of SynthID Watermarking

  • Digital Watermarking: Leveraging DeepMind’s SynthID technology (introduced with Veo 2), Veo 3 will embed imperceptible watermarks to help identify AI-generated content and curb misuse.

Integration with Gemini AI

  • Seamless Access: Both Imagen 4 and Veo 3 are expected to be directly accessible through Google’s Gemini interfaces—enabling users to generate images or videos within chat-based prompts or through product interfaces like Google Photos and Google Slides.
  • Gemini Gems: Customized AI “Gems” may incorporate these models, allowing users to create specialized assistants (e.g., a travel-planning Gem that generates itinerary images and overview videos) and share them in a marketplace similar to ChatGPT’s GPT Store .

Availability and Next Steps

Public Preview: Developers and enterprise testers may receive invites to experiment with Imagen 4 (standard and Ultra) and Veo 3 beginning May 20, 2025 at Google I/O, with broader rollout to Labs and Vertex AI in the following weeks .

Feedback and Iteration: As with prior launches, Google will likely solicit user feedback to refine safety filters, watermarking robustness, and performance optimizations before general availability.

Watch This Space: interested developers should monitor the CometAPI.

The new model API will be listed on CometAPI, and it is promised to provide lower prices than Google to facilitate your integration. Please continue to pay attention API doc.

  • Google
  • Imagen 4
  • Veo 3
Start Today

One API
Access 500+ AI Models!

Free For A Limited Time! Register Now
Get 1M Free Token Instantly!

Get Free API Key
API Docs
anna

Anna, an AI research expert, focuses on cutting-edge exploration of large language models and generative AI, and is dedicated to analyzing technical principles and future trends with academic depth and unique insights.

Post navigation

Previous
Next

Search

Start Today

One API
Access 500+ AI Models!

Free For A Limited Time! Register Now
Get 1M Free Token Instantly!

Get Free API Key
API Docs

Categories

  • AI Company (2)
  • AI Comparisons (60)
  • AI Model (103)
  • Model API (29)
  • new (10)
  • Technology (437)

Tags

Alibaba Cloud Anthropic API Black Forest Labs ChatGPT Claude Claude 3.7 Sonnet Claude 4 claude code Claude Opus 4 Claude Opus 4.1 Claude Sonnet 4 cometapi deepseek DeepSeek R1 DeepSeek V3 FLUX Gemini Gemini 2.0 Gemini 2.0 Flash Gemini 2.5 Flash Gemini 2.5 Pro Google GPT-4.1 GPT-4o GPT -4o Image GPT-5 GPT-Image-1 GPT 4.5 gpt 4o grok 3 grok 4 Midjourney Midjourney V7 o3 o4 mini OpenAI Qwen Qwen 2.5 Qwen3 sora Stable Diffusion Suno Veo 3 xAI

Related posts

Veo 3
Technology

How Much does Veo 3 Cost? All You Need to Know

2025-08-13 anna No comments yet

Google’s Veo 3 — the company’s latest video-generation model that produces synchronized visuals and native audio from text or images — has been rolled out across several access channels (Gemini / Google AI consumer plans, the Gemini API, and Vertex AI for enterprise). That means “how much it costs” depends on how you plan to […]

Seedance 1.0 vs Google Veo 3
Technology, AI Comparisons

Seedance 1.0 VS Google Veo 3: Which one should You choose?

2025-07-31 anna No comments yet

Seedance 1.0 and Google Veo  3 represent two of the most advanced video generation models available today, each pushing the boundaries of what neural networks can achieve in transforming text or images into dynamic, cinematic experiences. Developed by ByteDance’s Volcano Engine (formerly known as Toutiao’s engine) and Google DeepMind respectively, these models cater to a rapidly […]

AI Model

Gemini 2.5 Pro API

2025-07-12 anna No comments yet

Gemini 2.5 Pro API, an advanced AI model designed to enhance reasoning, encoding and multimodal capabilities. Its multimodal design enables it to interpret and generate text, audio, images, videos and code, thereby expanding its applicability in various fields.

500+ AI Model API,All In One API. Just In CometAPI

Models API
  • GPT API
  • Suno API
  • Luma API
  • Sora API
Developer
  • Sign Up
  • API DashBoard
  • Documentation
  • Quick Start
Resources
  • Pricing
  • Enterprise
  • Blog
  • AI Model API Articles
  • Discord Community
Get in touch
  • [email protected]

© CometAPI. All Rights Reserved.  

  • Terms & Service
  • Privacy Policy