What exactly is Nano Banana 2 and what does it do?

Nano Banana 2 is Google’s latest AI image generation and editing model, built on Gemini Flash image technology to deliver fast, high-quality visual generation and precise instruction following across text and image inputs.

How does Nano Banana 2 relate to Gemini 3.1 Flash Image?

Nano Banana 2 is essentially the consumer-facing branding for Google’s Gemini 3.1 Flash Image model, combining advanced capabilities from previous Nano Banana versions with the speed of Flash models.

What improvements does Nano Banana 2 add over earlier Nano Banana models?

Nano Banana 2 brings faster generation speed, sharper detail, better instruction fidelity, enhanced text rendering/localized translation, and broader creative control while making many Pro-grade features available at base tier.

What kinds of images and resolutions can Nano Banana 2 generate?

The model supports flexible output with various aspect ratios and resolutions up to 4K, suitable for social media, ads, displays, and professional content.

Can Nano Banana 2 maintain consistency in complex compositions?

Yes — it preserves consistency across multiple subjects and objects (e.g., up to five characters and 14 objects in a single prompt workflow), helping with narrative scenes and storyboard-style tasks.

What image generation use cases is Gemini 3.1 Flash Image best suited for?

It’s well-suited for professional-grade image creation and editing, infographics, multi-image consistency, text rendering, and localized multilingual outputs, especially when workflows need precise control and repeated iterations.

Does Nano Banana 2 use real-time information or world knowledge?

Nano Banana 2 incorporates real-world knowledge and image search integration to help generate more accurate subjects, infographics, and location-aware visuals.

Can Gemini 3.1 Flash Image generate detailed text within images or diagrams?

Yes — it can generate and render clear text within images, but extremely small or dense multi-paragraph text sometimes remains challenging.

實惠的 Nano Banana 2 API | text-to-image

Gemini 3.1 Flash Image Preview 的技術規格

項目	Gemini 3.1 Flash Image Preview
提供者	Google
模型系列	Gemini 3.1 (Flash tier)
主要重點	快速多模態生成與影像預覽
輸入類型	文字、影像
輸出類型	文字、影像（預覽生成）
上下文視窗	最多 1M tokens（Gemini 3.x Flash 層級標準）
延遲層級	低延遲、高吞吐
串流支援	是
工具呼叫	是（Gemini API 工具框架）
版本	3.1

什麼是 Nano Banana 2

Nano Banana 2 是媒體與開發者社群對新近發布的 Gemini-3.1-Flash-Image 模型所使用的流行暱稱。Google 將其定位為「Flash」層級的影像引擎，在更低延遲與成本的層級上提供接近 Pro 的視覺保真度——適用於大規模生成、快速迭代編輯，以及橫跨 Google 服務的整合產品工作流程。它承襲 Gemini 3.1 的多模態推理，並新增以影像為中心的能力（影像中文字清晰可辨、多影像合成、超寬長寬比支援、原生 4K）。

主要功能

高速、多解析度生成： 以 Flash 層級速度提供 0.5K / 1K / 2K / 4K 輸出選項，並支援新的極端長寬比（1:4、4:1、1:8、8:1）。
即時網路對齊： 當啟用 “Thinking” 或搜尋對齊時，整合文字與影像搜尋結果，將生成內容錨定於當前網路資訊。適用於最新參考與事實型資訊圖表。
改進的文字渲染： 相較於較早的 Flash 模型，對短文字與圖形文字（字體、尺寸）的渲染更佳；對長段落／小字仍不完美。
多輸入編輯與多輪工作流程： 對將多張影像作為輸入並在多輪中反覆編輯提供強力支援。

📊 基準表現 — 影像生成與編輯（Elo 分數）

能力	Gemini 3.1 Flash Image (Nano Banana 2)	Gemini 2.5 Flash Image (Nano Banana)	Gemini 3 Pro Image (Nano Banana Pro)	GPT-Image 1.5	Seedream 5.0 Lite	Grok Imagine Image Pro
文字轉影像 — 整體偏好	1079.0 ± 7.0	1073.0 ± 5.0	942.0 ± 6.0	1021.0 ± 5.0	1047.0 ± 5.0	928.0 ± 8.0
文字轉影像 — 視覺品質	1140.0 ± 6.0	1129.0 ± 6.0	929.0 ± 6.0	1043.0 ± 5.0	975.0 ± 5.0	759.0 ± 10.0
文字轉影像 — 資訊圖表（事實性）	1114.0 ± 14.0	1074.0 ± 12.0	881.0 ± 13.0	1102.0 ± 13.0	985.0 ± 12.0	890.0 ± 22.0
編輯 — 一般	1065.0 ± 9.0	1047.0 ± 9.0	913.0 ± 9.0	1051.0 ± 10.0	995.0 ± 8.0	937.0 ± 9.0
編輯 — 角色	1056.0 ± 7.0	1049.0 ± 7.0	952.0 ± 7.0	1050.0 ± 8.0	1025.0 ± 7.0	894.0 ± 8.0
編輯 — 創意	1023.0 ± 7.0	1031.0 ± 7.0	976.0 ± 7.0	1004.0 ± 7.0	1017.0 ± 7.0	938.0 ± 7.0
編輯 — 物件／環境	1029.0 ± 8.0	1018.0 ± 8.0	945.0 ± 8.0	1042.0 ± 10.0	976.0 ± 8.0	946.0 ± 9.0
編輯 — 多輸入	1037.0 ± 8.0	1016.0 ± 8.0	919.0 ± 9.0	1056.0 ± 12.0	1014.0 ± 9.0	N/A
編輯 — 風格化	1045.0 ± 7.0	1031.0 ± 7.0	862.0 ± 8.0	1045.0 ± 9.0	996.0 ± 7.0	984.0 ± 7.0

本基準表的關鍵觀察：

橫跨 文字轉影像生成 與 影像編輯 類別，Gemini 3.1 Flash Image 在分數上持續領先或追平 Flash 層級與多款競爭影像模型中的最高表現。
該模型在 視覺品質 與 資訊圖表（事實性） 基準上表現尤為強勢——顯示其不僅在美學品質上優異，也能在結構準確性上出色呈現。
在 多輸入編輯 上，Nano Banana 2 亦展現穩健的泛化能力，分數高於前代 Flash 版本。

這些評估透過多樣化基準套件上的 人工並排 Elo 比較 進行，反映常見影像生成／編輯任務中的偏好與保真度。

Nano Banana 2 vs Nano Banana vs Nano Banana Pro

模型	定位	代表性基準／備註
Gemini 3.1 Flash Image (Nano Banana 2)	Flash 層級：速度 + 高視覺品質（2K–4K）	整體偏好 1079.0 ± 7.0；視覺品質 1140 ± 6.0（內部 GenAI-Bench）。
Gemini 2.5 Flash Image (Nano Banana)	較早的 Flash 發佈（保真度較低）	相較 3.1，偏好／視覺分數略低。
Gemini 3 Pro Image (Nano Banana Pro)	Pro 層級：在複雜任務上具有更高的感知保真度，成本／延遲更高	權衡不同；部分指標在專項任務中的相對排名有所差異。
GPT-Image 1.5 / other commercial models	競品（開源／閉源）	在 Google 的內部基準中，於所報告的評測裡，GPT-Image 等在視覺品質與整體偏好上低於 Gemini 3.1。第三方獨立比較結果可能有所差異。

何時選擇 Flash Image Preview：

應用程式中的即時影像預覽
對成本敏感的大規模影像生成
互動式設計助理

如何存取並整合 Nano Banana 2

步驟 1：申請 API 金鑰

登入 cometapi.com。若您尚未成為我們的使用者，請先註冊。登入您的 CometAPI 控制台。取得介面存取憑證 API 金鑰。在個人中心的 API token 處點擊 “Add Token”，取得 token key：sk-xxxxx 並提交。

步驟 2：向 `Nano Banana 2` API 發送請求

選擇 “gemini-3.1-flash-image-preview8” 端點發送 API 請求並設定請求體。請求方法與請求體請參見我們網站的 API 文件。我們的網站也提供 Apifox 測試以便使用。將 <YOUR_API_KEY> 替換為您帳戶中的實際 CometAPI 金鑰。呼叫入口：Gemini 生成影像

Nano Banana 2 支援影像編輯、影像生成與多影像工作流程。進行影像編輯時，需上傳影像 URL。更多參數請參閱文件。

步驟 3：擷取並驗證結果

處理 API 回應以取得生成結果。處理完成後，API 會回傳任務狀態與輸出資料。您可以在 playground 中直接下載影像到本機（通常為 PNG 格式）。API 過程中會產生影像 URL；請及時下載。

Nano Banana 2 的定價

探索 Nano Banana 2 的競爭性定價，專為滿足各種預算和使用需求而設計。我們靈活的方案確保您只需為實際使用量付費，讓您能夠隨著需求增長輕鬆擴展。了解 Nano Banana 2 如何在保持成本可控的同時提升您的專案效果。

nano-banana-2（image）

variant / alias	Price
gemini-3.1-flash-image (0.5K)	≈ $0.03600
gemini-3.1-flash-image (1K)	≈ $0.05360
gemini-3.1-flash-image (2K)	≈ $0.08080
gemini-3.1-flash-image (4K)	≈ $0.12080
gemini-3.1-flash-image-preview (0.5K)	≈ $0.03600
gemini-3.1-flash-image-preview (1K)	≈ $0.05360
gemini-3.1-flash-image-preview (2K)	≈ $0.08080
gemini-3.1-flash-image-preview (4K)	≈ $0.12080

Nano Banana 2 的範例程式碼和 API

存取完整的範例程式碼和 API 資源，以簡化您的 Nano Banana 2 整合流程。我們詳盡的文件提供逐步指引，協助您在專案中充分發揮 Nano Banana 2 的潛力。

Python
JavaScript
Curl

from google import genai
from google.genai import types
from PIL import Image
import os

# Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
COMETAPI_KEY = os.environ.get("COMETAPI_KEY") or "<YOUR_COMETAPI_KEY>"
BASE_URL = "https://api.cometapi.com"

client = genai.Client(
    http_options={"api_version": "v1beta", "base_url": BASE_URL},
    api_key=COMETAPI_KEY,
)

prompt = (
    "A woman leaning on a wooden railing of a traditional Chinese building. "
    "She is wearing a blue cheongsam with pink and red floral motifs and a headdress "
    "made of colorful flowers, including roses and lilacs. Realistic painting style, "
    "focusing on the textural details of the clothing patterns and wooden buildings."
)
aspect_ratio = "9:16"  # "1:1","2:3","3:2","3:4","4:3","4:5","5:4","9:16","16:9","21:9"

response = client.models.generate_content(
    model="gemini-3.1-flash-image-preview",
    contents=[prompt],
    config=types.GenerateContentConfig(
        response_modalities=["IMAGE"],
        image_config=types.ImageConfig(aspect_ratio=aspect_ratio),
    ),
)

os.makedirs("./output", exist_ok=True)

for part in response.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = part.as_image()
        output_path = "./output/gemini-3.1-flash-image-preview.png"
        image.save(output_path)
        print(f"Image saved to {output_path}")

Python Code Example

from google import genai
from google.genai import types
from PIL import Image
import os

# Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
COMETAPI_KEY = os.environ.get("COMETAPI_KEY") or "<YOUR_COMETAPI_KEY>"
BASE_URL = "https://api.cometapi.com"

client = genai.Client(
    http_options={"api_version": "v1beta", "base_url": BASE_URL},
    api_key=COMETAPI_KEY,
)

prompt = (
    "A woman leaning on a wooden railing of a traditional Chinese building. "
    "She is wearing a blue cheongsam with pink and red floral motifs and a headdress "
    "made of colorful flowers, including roses and lilacs. Realistic painting style, "
    "focusing on the textural details of the clothing patterns and wooden buildings."
)
aspect_ratio = "9:16"  # "1:1","2:3","3:2","3:4","4:3","4:5","5:4","9:16","16:9","21:9"

response = client.models.generate_content(
    model="gemini-3.1-flash-image-preview",
    contents=[prompt],
    config=types.GenerateContentConfig(
        response_modalities=["IMAGE"],
        image_config=types.ImageConfig(aspect_ratio=aspect_ratio),
    ),
)

os.makedirs("./output", exist_ok=True)

for part in response.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = part.as_image()
        output_path = "./output/gemini-3.1-flash-image-preview.png"
        image.save(output_path)
        print(f"Image saved to {output_path}")

JavaScript Code Example

import fs from "fs";
import path from "path";

// Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
const api_key = process.env.COMETAPI_KEY || "<YOUR_COMETAPI_KEY>";
const base_url = "https://api.cometapi.com/v1beta";
const model = "gemini-3.1-flash-image-preview";

const prompt =
  "A woman leaning on a wooden railing of a traditional Chinese building. " +
  "She is wearing a blue cheongsam with pink and red floral motifs and a headdress " +
  "made of colorful flowers, including roses and lilacs. Realistic painting style, " +
  "focusing on the textural details of the clothing patterns and wooden buildings.";

const response = await fetch(`${base_url}/models/${model}:generateContent`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    Authorization: api_key,
  },
  body: JSON.stringify({
    contents: [
      {
        role: "user",
        parts: [{ text: prompt }],
      },
    ],
    generationConfig: {
      responseModalities: ["IMAGE"],
      imageConfig: {
        aspectRatio: "9:16",
      },
    },
  }),
});

const data = await response.json();

const outputDir = "./output";
if (!fs.existsSync(outputDir)) {
  fs.mkdirSync(outputDir, { recursive: true });
}

for (const candidate of data.candidates) {
  for (const part of candidate.content.parts) {
    if (part.text) {
      console.log(part.text);
    } else if (part.inlineData) {
      const imageBuffer = Buffer.from(part.inlineData.data, "base64");
      const outputPath = path.join(outputDir, "gemini-3.1-flash-image-preview.png");
      fs.writeFileSync(outputPath, imageBuffer);
      console.log(`Image saved to ${outputPath}`);
    }
  }
}

Curl Code Example

# Get your CometAPI key from https://api.cometapi.com/console/token
# Export it as: export COMETAPI_KEY="your-key-here"

mkdir -p ./output

curl -s "https://api.cometapi.com/v1beta/models/gemini-3.1-flash-image-preview:generateContent" \
  -H "Authorization: $COMETAPI_KEY" \
  -H 'Content-Type: application/json' \
  -X POST \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [
          {
            "text": "A woman leaning on a wooden railing of a traditional Chinese building. She is wearing a blue cheongsam with pink and red floral motifs and a headdress made of colorful flowers, including roses and lilacs. Realistic painting style, focusing on the textural details of the clothing patterns and wooden buildings."
          }
        ]
      }
    ],
    "generationConfig": {
      "responseModalities": ["IMAGE"],
      "imageConfig": {
        "aspectRatio": "9:16"
      }
    }
  }' | python3 -c "
import sys, json, base64
data = json.load(sys.stdin)
parts = data['candidates'][0]['content']['parts']
for part in parts:
    if 'text' in part:
        print(part['text'])
    elif 'inlineData' in part:
        img = base64.b64decode(part['inlineData']['data'])
        with open('./output/gemini-3.1-flash-image-preview.png', 'wb') as f:
            f.write(img)
        print('Image saved to ./output/gemini-3.1-flash-image-preview.png')
"

Nano Banana 2的版本

Nano Banana 2擁有多個快照的原因可能包括：更新後輸出結果存在差異需保留舊版快照以確保一致性、為開發者提供適應與遷移的過渡期，以及不同快照對應全球或區域端點以優化使用者體驗等潛在因素。各版本間的具體差異請參閱官方文件說明。

Model id	描述	可用性	請求
gemini-3.1-flash-image	推薦，指向最新模型	✅	Gemini 生成圖片
gemini-3.1-flash-image-preview	官方預覽版	✅	Gemini 生成圖片