
Google to Launch Imagen 4, Imagen 4 Ultra, and Veo 3 Models at Google I/O 2025

2025-05-19 anna

Google is set to unveil its next-generation generative AI models (Imagen 4, Imagen 4 Ultra, and Veo 3) during its annual Google I/O developer conference on May 20, 2025. Early leaks of preview identifiers (e.g., imagen-4.0-generate-preview-05-20, imagen-4.0-ultra-generate-exp-05-20, veo-3.0-generate-preview) signal a staged rollout and multiple capability tiers across both image and video synthesis. Imagen 4 aims to deliver significant gains in photorealism, prompt fidelity, and stylistic consistency over Imagen 3, while the “Ultra” variant may offer even higher resolution or specialized performance modes. On the video side, Veo 3 promises more coherent clip-to-clip continuity and more robust style adherence than Veo 2. All three models are expected to integrate tightly with Google’s Gemini AI ecosystem, enabling seamless transitions from text prompts to images or videos within the same workflow.


Preview Identifiers and Rollout Strategy

Staged Previews: Internal references such as

  • imagen-4.0-generate-preview-05-20
  • imagen-4.0-ultra-generate-exp-05-20
  • veo-3.0-generate-preview

have surfaced in code repositories and API previews, indicating that Google intends to offer both standard and “Ultra” performance tiers for image generation, as well as an advanced video model preview for early testers.
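To make the tiering visible, the three leaked strings can be split into their apparent parts. The following is a minimal sketch, not an official Google schema: the pattern is inferred only from the three identifiers listed above, and the names PreviewId and parse_preview_id are hypothetical helpers introduced for illustration. Reading “exp” as an experimental stage and the trailing digits as a month-day stamp are assumptions as well.

```python
import re
from typing import NamedTuple, Optional


class PreviewId(NamedTuple):
    """Apparent parts of a leaked identifier (hypothetical structure)."""
    family: str          # e.g. "imagen" or "veo"
    version: str         # e.g. "4.0"
    tier: str            # "ultra" if present, otherwise "standard"
    stage: str           # "preview" or "exp" (assumed to mean experimental)
    date: Optional[str]  # trailing "MM-DD" stamp, if any (assumed meaning)


# Pattern inferred from the three leaked strings only; other identifiers
# may not follow it.
_PATTERN = re.compile(
    r"^(?P<family>[a-z]+)-(?P<version>\d+\.\d+)"
    r"(?:-(?P<tier>ultra))?-generate-(?P<stage>preview|exp)"
    r"(?:-(?P<date>\d{2}-\d{2}))?$"
)


def parse_preview_id(identifier: str) -> PreviewId:
    """Split an identifier into family, version, tier, stage, and date."""
    match = _PATTERN.match(identifier)
    if match is None:
        raise ValueError(f"unrecognized identifier: {identifier!r}")
    return PreviewId(
        family=match["family"],
        version=match["version"],
        tier=match["tier"] or "standard",  # no "-ultra" segment means standard
        stage=match["stage"],
        date=match["date"],                # None when no date suffix is present
    )


if __name__ == "__main__":
    for raw in (
        "imagen-4.0-generate-preview-05-20",
        "imagen-4.0-ultra-generate-exp-05-20",
        "veo-3.0-generate-preview",
    ):
        print(parse_preview_id(raw))
```

Parsed this way, the two Imagen strings differ in both tier (standard vs. ultra) and stage (preview vs. exp), while the Veo string carries no date suffix, which is consistent with the staged, multi-tier rollout the leaks suggest.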

Google I/O Launch:

These identifiers strongly suggest that Google will showcase the models, and potentially grant developers preview access, at I/O on May 20, 2025, mirroring previous rollouts for Imagen 3 and Veo 2.


What’s New in Imagen 4

Photorealism and Fidelity

  • Enhanced Rendering: Imagen 4 reportedly achieves greater photorealistic detail, reducing artifacts and improving color accuracy. Early rumors also suggest better handling of complex prompts, such as those involving nuanced lighting or reflections.
  • Prompt Adherence: The model is expected to follow user instructions more precisely, delivering images that better match both content and style directives (e.g., “oil painting of sunset over mountains”).

Style Consistency

  • Multi-Image Cohesion: Imagen 4 is reportedly designed to maintain a consistent visual style across multiple outputs, benefiting use cases like storyboarding or product catalog creation, where uniformity is critical.
  • Ultra Variant: The “Ultra” tier (imagen-4.0-ultra) likely offers higher-resolution outputs or specialized optimizations (e.g., ultra-high fidelity for print media) for enterprise and creative professionals.

What’s New in Veo 3

Improved Coherence

  • Clip-to-Clip Continuity: Veo 3 aims to generate video sequences where successive shots maintain consistent framing, lighting, and character appearance, addressing limitations in Veo 2 around visual drift over time.
  • Style Fidelity: The model focuses on replicating artistic or cinematic styles more faithfully, making it easier to produce videos in a desired aesthetic (e.g., noir, pastel animation).

Integration of SynthID Watermarking

  • Digital Watermarking: Leveraging DeepMind’s SynthID technology (already applied to Veo 2 output), Veo 3 is expected to embed imperceptible watermarks to help identify AI-generated content and curb misuse.

Integration with Gemini AI

  • Seamless Access: Both Imagen 4 and Veo 3 are expected to be directly accessible through Google’s Gemini interfaces, letting users generate images or videos from chat-based prompts or through product interfaces like Google Photos and Google Slides.
  • Gemini Gems: Customized AI “Gems” may incorporate these models, allowing users to create specialized assistants (e.g., a travel-planning Gem that generates itinerary images and overview videos) and share them in a marketplace similar to ChatGPT’s GPT Store.

Availability and Next Steps

Public Preview: Developers and enterprise testers may receive invites to experiment with Imagen 4 (standard and Ultra) and Veo 3 beginning May 20, 2025 at Google I/O, with broader rollout to Labs and Vertex AI in the following weeks.

Feedback and Iteration: As with prior launches, Google will likely solicit user feedback to refine safety filters, watermarking robustness, and performance optimizations before general availability.

Watch This Space: Interested developers should monitor CometAPI.

The new model APIs will be listed on CometAPI, which promises lower prices than Google to ease integration. Keep an eye on the API docs for updates.
