模型支援企業部落格
500+ AI 模型 API,全部整合在一個 API 中。就在 CometAPI
模型 API
開發者
快速入門說明文件API 儀表板
資源
AI模型部落格企業更新日誌關於
2025 CometAPI. 保留所有權利。隱私政策服務條款
Home/Models/Kling/Kling Image Recognize
K

Kling Image Recognize

每次請求:$0.013216
Keling 圖像元素辨識 API,可用於多張圖像參考的影片生成、多模態影片編輯功能 ● 可辨識主體、人臉、服裝等,每次請求可獲得 4 組結果(如可用)。
新
商業用途
概覽
功能
定價
API

Technical Specifications of kling-image-recognize

SpecificationDetails
Model IDkling-image-recognize
CategoryImage recognition / multimodal analysis
Primary CapabilityRecognizes image elements for downstream creative workflows, including multi-image reference video generation and multimodal video editing
Input TypeImage input
Output TypeStructured recognition results
Recognition ScopeSubjects, faces, clothing, and other visual elements
Result VolumeCan return up to 4 sets of results per request, if available
Use CasesVisual asset analysis, reference preparation for video generation, content understanding for editing pipelines, subject and apparel recognition

What is kling-image-recognize?

kling-image-recognize is a Keling image element recognition API designed to analyze visual content and identify important elements within an image. It is especially useful in workflows that require multi-image reference video generation or multimodal video editing, where understanding the contents of source images is an important preprocessing step.

The model can recognize a range of visual attributes such as subjects, faces, clothing, and related image components. Depending on the input, it can provide up to 4 sets of recognition results in a single request, helping developers capture multiple possible detections or interpretations when available.

Main features of kling-image-recognize

  • Image element recognition: Detects and identifies important visual elements contained in an input image.
  • Subject analysis: Recognizes primary subjects that can be used in downstream media generation or editing workflows.
  • Face recognition support: Extracts face-related recognition results when faces are present in the image.
  • Clothing identification: Detects apparel and clothing-related elements to support more detailed visual understanding.
  • Multi-image reference workflow support: Useful for preparing and analyzing image references used in video generation pipelines.
  • Multimodal video editing compatibility: Helps power editing scenarios where image content needs to be understood before transformation or composition.
  • Multiple result sets per request: Can obtain up to 4 sets of results per request, if available, enabling richer recognition output.
  • Integration-friendly API usage: Suitable for developers building automated media analysis and creative application pipelines.

How to access and integrate kling-image-recognize

Step 1: Sign Up for API Key

To get started, sign up on the CometAPI platform and generate your API key from the dashboard. After obtaining your key, store it securely and use it to authenticate every request to the kling-image-recognize API.

Step 2: Send Requests to kling-image-recognize API

Once you have your API key, send requests to the CometAPI endpoint using kling-image-recognize as the model ID. Include your authentication headers and provide the required image input payload based on your application workflow.

curl --request POST \
  --url https://api.cometapi.com/v1/responses \
  --header "Authorization: Bearer YOUR_COMETAPI_KEY" \
  --header "Content-Type: application/json" \
  --data '{
    "model": "kling-image-recognize",
    "input": [
      {
        "role": "user",
        "content": [
          {
            "type": "input_text",
            "text": "Recognize the main visual elements in this image."
          },
          {
            "type": "input_image",
            "image_url": "YOUR_IMAGE_URL"
          }
        ]
      }
    ]
  }'

Step 3: Retrieve and Verify Results

After submission, the API returns recognition results generated by kling-image-recognize. Parse the response in your application, verify the detected subjects or attributes, and store the returned data for use in video generation, editing, or other downstream automation tasks.

Kling Image Recognize 的功能

探索 Kling Image Recognize 的核心功能,專為提升效能和可用性而設計。了解這些功能如何為您的專案帶來效益並改善使用者體驗。

Kling Image Recognize 的定價

探索 Kling Image Recognize 的競爭性定價,專為滿足各種預算和使用需求而設計。我們靈活的方案確保您只需為實際使用量付費,讓您能夠隨著需求增長輕鬆擴展。了解 Kling Image Recognize 如何在保持成本可控的同時提升您的專案效果。
彗星價格 (USD / M Tokens)官方價格 (USD / M Tokens)折扣
每次請求:$0.013216
每次請求:$0.01652
-20%

Kling Image Recognize 的範例程式碼和 API

存取完整的範例程式碼和 API 資源,以簡化您的 Kling Image Recognize 整合流程。我們詳盡的文件提供逐步指引,協助您在專案中充分發揮 Kling Image Recognize 的潛力。

更多模型

O

Sora 2 Pro

每秒:$0.24
Sora 2 Pro 是我們最先進且最強大的媒體生成模型,能生成帶有同步音訊的影片。它可以從自然語言或圖像創建細節豐富、動態的影片片段。
O

Sora 2

每秒:$0.08
超強大的影片生成模型,具備音效,支援對話格式。
M

mj_fast_video

每次請求:$0.6
Midjourney video generation
X

Grok Imagine Video

每秒:$0.04
可根據文字提示生成影片,將靜態圖片動態化,或以自然語言編輯現有影片。API 支援為生成的影片自訂時長、寬高比與解析度 — SDK 會自動處理非同步輪詢。
G

Veo 3.1 Pro

每秒:$0.25
Veo 3.1-Pro 指的是 Google 的 Veo 3.1 系列的高階存取/設定 — 這是一代支援音訊的短影片模型,並加入更豐富的原生音訊、改進的敘事/剪輯控制與場景延伸工具。
G

Veo 3.1

每秒:$0.05
Veo 3.1 是 Google 對其 Veo 文本與圖像→影片系列的一次漸進但意義重大的更新,新增更豐富的原生音訊、更長且更可控的影片輸出,以及更精細的編輯與場景層級控制。