📘 Grok Imagine Video 的技術規格

規格	詳細資訊
Model ID	grok-imagine-video
Provider	xAI
Type	影片生成與編輯 AI
Input Types	Text (prompt); optional image or video Text prompts (natural language); optional image input (image→video); optional video_url for editing existing clips. Editing input video max durations differ by endpoint — reported ~8.7s for some editing flows.
Output Types	透過臨時 URL 提供 .mp4 影片
Duration Range (generate)	1–15 秒
Resolution	480p、720p（可設定）
Aspect Ratios	1:1、16:9、9:16
Edit Support	是 — 可為影片添加動畫並修改，最長至 8.7s
Moderation	包含內容審核
Pricing	依秒計費，費率隨解析度而變

🚀 什麼是 Grok Imagine Video？

Grok Imagine Video 是 xAI 的進階影片生成與編輯 AI 模型，透過 CometAPI 對外提供。它讓開發者能從自然語言提示生成短、客製化影片，並可選擇將靜態圖片動畫化或編輯現有片段。該模型支援可設定的輸出長度、解析度與長寬比，且內建內容審核以確保遵循政策。

🧠 主要功能（Grok Imagine 的差異點）

原生音訊 + 對嘴同步：生成同步的環境音效、效果與短語音/旁白，並提供近似的對嘴同步。
圖像→影片 / 提示詞編輯：可將靜態圖片動畫化，或透過文字提示編輯現有影片（移除/替換物件、調整時間、改變風格）。
快速迭代與低延遲：為快速回饋循環設計，適用於創作流程與產品原型。
生產級 API：Imagine API 提供可程式化端點，支援批量生成、整合至剪輯流程與企業級控管。
多種「模式」/ 風格：面向使用者的模式（據報示例：Normal / Fun / Spicy 或類似預設）以偏向輸出風格或寬鬆程度（注意：「Spicy」模式歷史上曾啟用 NSFW）。

模型（公司）	最高解析度（公開）	最大片長（公開）	原生音訊？	優勢	注意事項
Grok Imagine (xAI)	720p	6–15s	是	快速迭代、成本/延遲表現佳、整合式編輯、原生音訊	僅支援至 720p；審核引發顧慮；真實世界擬真度表現不一
Sora (OpenAI)	720p–1080p（取決於等級）	短（6–15s）	是	高視覺擬真度；與 OpenAI 生態高度整合	成本較高；審核/控管更為嚴格
Veo (Google DeepMind)	最高至 1080p+	短（不定）	是	高擬真度、動作穩定	成本更高；公開試驗較少
Runway Gen-4.5	1080p+	短（不定）	是	業界採用度高、適合創作流程、擬真度高	成本較高；專注於創作工具
Vidu / Kling / Pika（各類專家）	最高至 1080p	短（不定）	不一	部分提供利基功能（Smart Cuts、多鏡頭串接）	音訊支援不一；API 成熟度差異

⚠️ 限制

最長影片長度上限為 15 秒。
編輯保留輸入影片長度（≤ 8.7s）。
產生的 URL 為臨時連結 — 請盡速下載。

如何存取與整合 Grok Imagine Video

Step 1: 申請 API Key

登入 cometapi.com。若您尚未成為使用者，請先註冊。登入您的 CometAPI console。取得介面存取憑證 API key。點擊個人中心的 API token 處「Add Token」，取得 token key：sk-xxxxx 並提交。

Step 2: 向 `Grok Imagine Video` API 發送請求

選擇 “grok-imagine-video” 端點以發送 API 請求並設定請求主體。請求方法與請求主體可從我們網站的 API 文件取得。我們的網站亦提供 Apifox 測試以利使用。請將 <YOUR_API_KEY> 替換為您帳戶中的實際 CometAPI key。呼叫位置：GROK 影片生成與影片編輯。

Step 3: 向 `Grok Imagine Video` API 發送請求

輸入文字或上傳圖片（您可以選擇提供來源圖片以進行動畫化）。Grok Imagine AI API 會分析您的輸入並準備可供存取的 URL。支援文字轉影片與圖片轉影片。

來源圖片可透過以下方式提供：

指向圖片的公開 URL
base64 編碼的 data URI（例如，data:image/jpeg;base64,<YOUR_BASE64_IMAGE>）

Step 4: 取得並驗證結果

處理 API 回應以取得生成結果。提交後，API 會回應任務狀態與輸出資料；會立即返回 request_id，請使用 GET 端點查詢狀態並取得生成的影片。影片編輯為非同步流程，您可能需要多次輪詢該端點直到任務完成。請儘速下載。

You send a POST request with model 'grok-imagine-video' including a text prompt and optional image/video source; it returns a task ID, then poll this ID until the video status is 'done'.

It accepts a natural language prompt and optional image URLs (or base64 images) for animation; for editing, a video URL is provided.

The model supports video generation up to 15 seconds and resolutions up to 720p with configurable aspect ratios like 16:9 or 1:1.

Yes, you can animate a still image into motion based on your prompt, using image URLs or encoded images in the request.

Yes — provide the source video URL and your edit instructions; the output keeps the original video’s duration and resolution.

Generated videos are subject to content moderation; flagged content may be filtered or blocked during generation.

Yes, the API returns a request ID which you poll to check when the video is ready for download.

Downloaded videos should be saved quickly; temporary URLs may expire and become inaccessible after generation.

Pricing Overview

Category	Item	Price
Input Pricing	Text	N/A (Free)
	Image	$0.0016
	Video per second	$0.008
Output Pricing	480p	$0.04
(Per second by resolution)	720p	$0.056

Note: When generating video via API, you are charged per second. You will also be charged when using video or images as input.