Nano Banana 2 到底是什么，它能做什么？

Nano Banana 2 是 Google 最新的 AI 图像生成与编辑模型，基于 Gemini Flash 图像技术构建，可通过文本和图像输入实现快速、高质量的视觉生成以及精确的指令遵循。

Nano Banana 2 与 Gemini 3.1 Flash Image 有什么关系？

Nano Banana 2 本质上是 Google 的 Gemini 3.1 Flash Image 模型面向消费者的品牌名称，结合了早期 Nano Banana 版本的先进能力与 Flash 模型的速度。

与早期的 Nano Banana 模型相比，Nano Banana 2 有哪些改进？

Nano Banana 2 带来了更快的生成速度、更锐利的细节、更好的指令忠实度、增强的文本渲染/本地化翻译能力，以及更广泛的创意控制，同时让许多专业版级别的功能在基础层级即可使用。

Nano Banana 2 可以生成哪些类型的图像和分辨率？

该模型支持灵活输出，提供多种宽高比和最高达 4K 的分辨率，适用于社交媒体、广告、显示屏和专业内容。

Nano Banana 2 能在复杂构图中保持一致性吗？

可以——它能够在多个主体和物体之间保持一致性（例如，在单个提示工作流中最多支持五个角色和 14 个物体），有助于叙事场景和故事板风格的任务。

Gemini 3.1 Flash Image 最适合哪些图像生成用例？

它非常适合专业级图像创作与编辑、信息图、多图一致性、文本渲染以及本地化多语言输出，尤其适用于需要精确控制和反复迭代的工作流。

Nano Banana 2 会使用实时信息或世界知识吗？

Nano Banana 2 融合了现实世界知识和图像搜索集成，有助于生成更准确的主体、信息图以及具有位置感知能力的视觉内容。

Gemini 3.1 Flash Image 能在图像或图表中生成详细文本吗？

可以——它能够在图像中生成并呈现清晰文本，但对于极小字号或高密度的多段落文本，有时仍然具有挑战性。

实惠的 Nano Banana 2 API | text-to-image

Gemini 3.1 Flash Image Preview 技术规格

项目	Gemini 3.1 Flash Image Preview
提供方	Google
模型系列	Gemini 3.1（Flash 等级）
主要定位	快速多模态生成，支持图像预览
输入类型	文本、图像
输出类型	文本、图像（预览生成）
上下文窗口	最多 1M tokens（Gemini 3.x Flash 等级标准）
延迟等级	低延迟、高吞吐
流式传输支持	支持
工具调用	支持（Gemini API 工具框架）
版本	3.1

What is Nano Banana 2

Nano Banana 2 是媒体和开发者社区对新发布的 Gemini-3.1-Flash-Image 模型的常用昵称。Google 将其定位为“Flash”等级的图像引擎，在显著降低延迟与成本的同时，带来接近 Pro 级的视觉保真度——适用于高频生成、快速迭代编辑，以及在 Google 服务中的集成产品工作流。它继承了 Gemini 3.1 的多模态推理能力，并新增以图像为中心的能力（图像中文字可读、多图合成、超宽纵横比支持、原生 4K）。

主要特性

高速、多分辨率生成：具备 Flash 等级的速度，支持 0.5K / 1K / 2K / 4K 输出，以及新的极端纵横比（1:4、4:1、1:8、8:1）。
实时网页锚定：在启用“Thinking”或搜索锚定时，将文本与图像搜索结果整合，使生成内容锚定于当前网页信息。适用于最新参考资料与事实信息图。
改进的文本渲染：相较早期 Flash 模型，在短文本与图形文字（字体、字号）渲染方面更佳；在长段落/小字号文本上仍不完美。
多输入编辑与多轮工作流：对将多张图像作为输入以及跨轮次的迭代编辑提供有力支持。

📊 基准表现——图像生成与编辑（Elo 分数）

能力	Gemini 3.1 Flash Image (Nano Banana 2)	Gemini 2.5 Flash Image (Nano Banana)	Gemini 3 Pro Image (Nano Banana Pro)	GPT-Image 1.5	Seedream 5.0 Lite	Grok Imagine Image Pro
文生图 — 整体偏好	1079.0 ± 7.0	1073.0 ± 5.0	942.0 ± 6.0	1021.0 ± 5.0	1047.0 ± 5.0	928.0 ± 8.0
文生图 — 视觉质量	1140.0 ± 6.0	1129.0 ± 6.0	929.0 ± 6.0	1043.0 ± 5.0	975.0 ± 5.0	759.0 ± 10.0
文生图 — 信息图（事实性）	1114.0 ± 14.0	1074.0 ± 12.0	881.0 ± 13.0	1102.0 ± 13.0	985.0 ± 12.0	890.0 ± 22.0
编辑 — 通用	1065.0 ± 9.0	1047.0 ± 9.0	913.0 ± 9.0	1051.0 ± 10.0	995.0 ± 8.0	937.0 ± 9.0
编辑 — 角色	1056.0 ± 7.0	1049.0 ± 7.0	952.0 ± 7.0	1050.0 ± 8.0	1025.0 ± 7.0	894.0 ± 8.0
编辑 — 创意	1023.0 ± 7.0	1031.0 ± 7.0	976.0 ± 7.0	1004.0 ± 7.0	1017.0 ± 7.0	938.0 ± 7.0
编辑 — 物体/环境	1029.0 ± 8.0	1018.0 ± 8.0	945.0 ± 8.0	1042.0 ± 10.0	976.0 ± 8.0	946.0 ± 9.0
编辑 — 多输入	1037.0 ± 8.0	1016.0 ± 8.0	919.0 ± 9.0	1056.0 ± 12.0	1014.0 ± 9.0	N/A
编辑 — 风格化	1045.0 ± 7.0	1031.0 ± 7.0	862.0 ± 8.0	1045.0 ± 9.0	996.0 ± 7.0	984.0 ± 7.0

该基准表的关键要点：

在文生图与图像编辑两大类中，Gemini 3.1 Flash Image 在 Flash 等级及众多竞品图像模型中持续领先或达到最高分。
该模型在视觉质量与**信息图（事实性）**基准上表现尤为突出——表明其不仅在美学质量上表现优异，也擅长呈现结构准确的内容。
在多输入编辑方面，Nano Banana 2 也展现出稳健的泛化能力，得分高于其上一代 Flash 模型。

这些评测基于多样化基准套件的人类并排 Elo 比较，反映了常见图像生成/编辑任务中的偏好与保真度。

Nano Banana 2 vs Nano Banana vs Nano Banana Pro

模型	定位	代表性基准/备注
Gemini 3.1 Flash Image (Nano Banana 2)	Flash 等级：速度 + 高视觉质量（2K–4K）	Overall preference 1079.0 ± 7.0；visual quality 1140 ± 6.0（内部 GenAI-Bench）。
Gemini 2.5 Flash Image (Nano Banana)	较早的 Flash 版本（保真度较低）	相较 3.1，在整体偏好/视觉评分上略低。
Gemini 3 Pro Image (Nano Banana Pro)	Pro 等级：在复杂任务上具备更高的感知保真度，成本/延迟更高	不同取舍；某些指标在特定任务上的相对排名不同。
GPT-Image 1.5 / 其他商业模型	竞争对手（开源/闭源）	在 Google 的内部基准中，报告的评测显示 GPT-Image 等在视觉质量与整体偏好上低于 Gemini 3.1。独立第三方对比结果可能不同。

何时选择 Flash Image Preview：

应用内实时图像预览
对成本敏感的大规模图像生成
交互式设计助手

如何获取并集成 Nano Banana 2

步骤 1：申请 API Key

步骤 2：向 `Nano Banana 2` API 发送请求

选择“gemini-3.1-flash-image-preview8”接口发送 API 请求并设置请求体。请求方式与请求体可在我们的网站 API 文档中获取。我们的网站还提供 Apifox 测试以便使用。将 <YOUR_API_KEY> 替换为你账号中的实际 CometAPI Key。调用入口：Gemini 生成图像

Nano Banana 2 支持图像编辑、图像生成和多图工作流。进行图像编辑时，需要上传图像 URL。更多参数请参见文档。

步骤 3：获取并验证结果

处理 API 响应以获取生成结果。处理完成后，API 会返回任务状态与输出数据。你可以在 playground 中直接将图像下载到本地（通常为 PNG 格式）。API 过程中会生成图像 URL；请及时下载。

Nano Banana 2 的定价

查看 Nano Banana 2 的竞争性定价，满足不同预算与使用需求，灵活方案确保随需求扩展。

nano-banana-2（image）

variant / alias	Price
gemini-3.1-flash-image (0.5K)	≈ $0.03600
gemini-3.1-flash-image (1K)	≈ $0.05360
gemini-3.1-flash-image (2K)	≈ $0.08080
gemini-3.1-flash-image (4K)	≈ $0.12080
gemini-3.1-flash-image-preview (0.5K)	≈ $0.03600
gemini-3.1-flash-image-preview (1K)	≈ $0.05360
gemini-3.1-flash-image-preview (2K)	≈ $0.08080
gemini-3.1-flash-image-preview (4K)	≈ $0.12080

Nano Banana 2 的示例代码与 API

获取完整示例代码与 API 资源，简化 Nano Banana 2 的集成流程，我们提供逐步指导，助你发挥模型潜能。

Python
JavaScript
Curl

from google import genai
from google.genai import types
from PIL import Image
import os

# Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
COMETAPI_KEY = os.environ.get("COMETAPI_KEY") or "<YOUR_COMETAPI_KEY>"
BASE_URL = "https://api.cometapi.com"

client = genai.Client(
    http_options={"api_version": "v1beta", "base_url": BASE_URL},
    api_key=COMETAPI_KEY,
)

prompt = (
    "A woman leaning on a wooden railing of a traditional Chinese building. "
    "She is wearing a blue cheongsam with pink and red floral motifs and a headdress "
    "made of colorful flowers, including roses and lilacs. Realistic painting style, "
    "focusing on the textural details of the clothing patterns and wooden buildings."
)
aspect_ratio = "9:16"  # "1:1","2:3","3:2","3:4","4:3","4:5","5:4","9:16","16:9","21:9"

response = client.models.generate_content(
    model="gemini-3.1-flash-image-preview",
    contents=[prompt],
    config=types.GenerateContentConfig(
        response_modalities=["IMAGE"],
        image_config=types.ImageConfig(aspect_ratio=aspect_ratio),
    ),
)

os.makedirs("./output", exist_ok=True)

for part in response.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = part.as_image()
        output_path = "./output/gemini-3.1-flash-image-preview.png"
        image.save(output_path)
        print(f"Image saved to {output_path}")

Python Code Example

from google import genai
from google.genai import types
from PIL import Image
import os

# Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
COMETAPI_KEY = os.environ.get("COMETAPI_KEY") or "<YOUR_COMETAPI_KEY>"
BASE_URL = "https://api.cometapi.com"

client = genai.Client(
    http_options={"api_version": "v1beta", "base_url": BASE_URL},
    api_key=COMETAPI_KEY,
)

prompt = (
    "A woman leaning on a wooden railing of a traditional Chinese building. "
    "She is wearing a blue cheongsam with pink and red floral motifs and a headdress "
    "made of colorful flowers, including roses and lilacs. Realistic painting style, "
    "focusing on the textural details of the clothing patterns and wooden buildings."
)
aspect_ratio = "9:16"  # "1:1","2:3","3:2","3:4","4:3","4:5","5:4","9:16","16:9","21:9"

response = client.models.generate_content(
    model="gemini-3.1-flash-image-preview",
    contents=[prompt],
    config=types.GenerateContentConfig(
        response_modalities=["IMAGE"],
        image_config=types.ImageConfig(aspect_ratio=aspect_ratio),
    ),
)

os.makedirs("./output", exist_ok=True)

for part in response.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = part.as_image()
        output_path = "./output/gemini-3.1-flash-image-preview.png"
        image.save(output_path)
        print(f"Image saved to {output_path}")

JavaScript Code Example

import fs from "fs";
import path from "path";

// Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
const api_key = process.env.COMETAPI_KEY || "<YOUR_COMETAPI_KEY>";
const base_url = "https://api.cometapi.com/v1beta";
const model = "gemini-3.1-flash-image-preview";

const prompt =
  "A woman leaning on a wooden railing of a traditional Chinese building. " +
  "She is wearing a blue cheongsam with pink and red floral motifs and a headdress " +
  "made of colorful flowers, including roses and lilacs. Realistic painting style, " +
  "focusing on the textural details of the clothing patterns and wooden buildings.";

const response = await fetch(`${base_url}/models/${model}:generateContent`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    Authorization: api_key,
  },
  body: JSON.stringify({
    contents: [
      {
        role: "user",
        parts: [{ text: prompt }],
      },
    ],
    generationConfig: {
      responseModalities: ["IMAGE"],
      imageConfig: {
        aspectRatio: "9:16",
      },
    },
  }),
});

const data = await response.json();

const outputDir = "./output";
if (!fs.existsSync(outputDir)) {
  fs.mkdirSync(outputDir, { recursive: true });
}

for (const candidate of data.candidates) {
  for (const part of candidate.content.parts) {
    if (part.text) {
      console.log(part.text);
    } else if (part.inlineData) {
      const imageBuffer = Buffer.from(part.inlineData.data, "base64");
      const outputPath = path.join(outputDir, "gemini-3.1-flash-image-preview.png");
      fs.writeFileSync(outputPath, imageBuffer);
      console.log(`Image saved to ${outputPath}`);
    }
  }
}

Curl Code Example

# Get your CometAPI key from https://api.cometapi.com/console/token
# Export it as: export COMETAPI_KEY="your-key-here"

mkdir -p ./output

curl -s "https://api.cometapi.com/v1beta/models/gemini-3.1-flash-image-preview:generateContent" \
  -H "Authorization: $COMETAPI_KEY" \
  -H 'Content-Type: application/json' \
  -X POST \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [
          {
            "text": "A woman leaning on a wooden railing of a traditional Chinese building. She is wearing a blue cheongsam with pink and red floral motifs and a headdress made of colorful flowers, including roses and lilacs. Realistic painting style, focusing on the textural details of the clothing patterns and wooden buildings."
          }
        ]
      }
    ],
    "generationConfig": {
      "responseModalities": ["IMAGE"],
      "imageConfig": {
        "aspectRatio": "9:16"
      }
    }
  }' | python3 -c "
import sys, json, base64
data = json.load(sys.stdin)
parts = data['candidates'][0]['content']['parts']
for part in parts:
    if 'text' in part:
        print(part['text'])
    elif 'inlineData' in part:
        img = base64.b64decode(part['inlineData']['data'])
        with open('./output/gemini-3.1-flash-image-preview.png', 'wb') as f:
            f.write(img)
        print('Image saved to ./output/gemini-3.1-flash-image-preview.png')
"

Nano Banana 2 的版本

Nano Banana 2 可能存在多个快照，原因包括：更新后保持一致性需要保留旧版、给开发者留出迁移窗口，以及全球/区域端点提供的优化差异。具体差异请参考官方文档。

模型 ID	描述	可用性	请求
gemini-3.1-flash-image	推荐，指向最新模型	✅	Gemini 生成图像
gemini-3.1-flash-image-preview	官方预览	✅	Gemini 生成图像

Gemini 3.1 Flash Image Preview 技术规格

项目	Gemini 3.1 Flash Image Preview
提供方	Google
模型系列	Gemini 3.1（Flash 等级）
主要定位	快速多模态生成，支持图像预览
输入类型	文本、图像
输出类型	文本、图像（预览生成）
上下文窗口	最多 1M tokens（Gemini 3.x Flash 等级标准）
延迟等级	低延迟、高吞吐
流式传输支持	支持
工具调用	支持（Gemini API 工具框架）
版本	3.1

What is Nano Banana 2

主要特性

高速、多分辨率生成：具备 Flash 等级的速度，支持 0.5K / 1K / 2K / 4K 输出，以及新的极端纵横比（1:4、4:1、1:8、8:1）。
实时网页锚定：在启用“Thinking”或搜索锚定时，将文本与图像搜索结果整合，使生成内容锚定于当前网页信息。适用于最新参考资料与事实信息图。
改进的文本渲染：相较早期 Flash 模型，在短文本与图形文字（字体、字号）渲染方面更佳；在长段落/小字号文本上仍不完美。
多输入编辑与多轮工作流：对将多张图像作为输入以及跨轮次的迭代编辑提供有力支持。

📊 基准表现——图像生成与编辑（Elo 分数）

能力	Gemini 3.1 Flash Image (Nano Banana 2)	Gemini 2.5 Flash Image (Nano Banana)	Gemini 3 Pro Image (Nano Banana Pro)	GPT-Image 1.5	Seedream 5.0 Lite	Grok Imagine Image Pro
文生图 — 整体偏好	1079.0 ± 7.0	1073.0 ± 5.0	942.0 ± 6.0	1021.0 ± 5.0	1047.0 ± 5.0	928.0 ± 8.0
文生图 — 视觉质量	1140.0 ± 6.0	1129.0 ± 6.0	929.0 ± 6.0	1043.0 ± 5.0	975.0 ± 5.0	759.0 ± 10.0
文生图 — 信息图（事实性）	1114.0 ± 14.0	1074.0 ± 12.0	881.0 ± 13.0	1102.0 ± 13.0	985.0 ± 12.0	890.0 ± 22.0
编辑 — 通用	1065.0 ± 9.0	1047.0 ± 9.0	913.0 ± 9.0	1051.0 ± 10.0	995.0 ± 8.0	937.0 ± 9.0
编辑 — 角色	1056.0 ± 7.0	1049.0 ± 7.0	952.0 ± 7.0	1050.0 ± 8.0	1025.0 ± 7.0	894.0 ± 8.0
编辑 — 创意	1023.0 ± 7.0	1031.0 ± 7.0	976.0 ± 7.0	1004.0 ± 7.0	1017.0 ± 7.0	938.0 ± 7.0
编辑 — 物体/环境	1029.0 ± 8.0	1018.0 ± 8.0	945.0 ± 8.0	1042.0 ± 10.0	976.0 ± 8.0	946.0 ± 9.0
编辑 — 多输入	1037.0 ± 8.0	1016.0 ± 8.0	919.0 ± 9.0	1056.0 ± 12.0	1014.0 ± 9.0	N/A
编辑 — 风格化	1045.0 ± 7.0	1031.0 ± 7.0	862.0 ± 8.0	1045.0 ± 9.0	996.0 ± 7.0	984.0 ± 7.0

该基准表的关键要点：

在文生图与图像编辑两大类中，Gemini 3.1 Flash Image 在 Flash 等级及众多竞品图像模型中持续领先或达到最高分。
该模型在视觉质量与**信息图（事实性）**基准上表现尤为突出——表明其不仅在美学质量上表现优异，也擅长呈现结构准确的内容。
在多输入编辑方面，Nano Banana 2 也展现出稳健的泛化能力，得分高于其上一代 Flash 模型。

这些评测基于多样化基准套件的人类并排 Elo 比较，反映了常见图像生成/编辑任务中的偏好与保真度。

Nano Banana 2 vs Nano Banana vs Nano Banana Pro

模型	定位	代表性基准/备注
Gemini 3.1 Flash Image (Nano Banana 2)	Flash 等级：速度 + 高视觉质量（2K–4K）	Overall preference 1079.0 ± 7.0；visual quality 1140 ± 6.0（内部 GenAI-Bench）。
Gemini 2.5 Flash Image (Nano Banana)	较早的 Flash 版本（保真度较低）	相较 3.1，在整体偏好/视觉评分上略低。
Gemini 3 Pro Image (Nano Banana Pro)	Pro 等级：在复杂任务上具备更高的感知保真度，成本/延迟更高	不同取舍；某些指标在特定任务上的相对排名不同。
GPT-Image 1.5 / 其他商业模型	竞争对手（开源/闭源）	在 Google 的内部基准中，报告的评测显示 GPT-Image 等在视觉质量与整体偏好上低于 Gemini 3.1。独立第三方对比结果可能不同。

何时选择 Flash Image Preview：

应用内实时图像预览
对成本敏感的大规模图像生成
交互式设计助手

如何获取并集成 Nano Banana 2

步骤 1：申请 API Key

步骤 2：向 `Nano Banana 2` API 发送请求

Nano Banana 2 支持图像编辑、图像生成和多图工作流。进行图像编辑时，需要上传图像 URL。更多参数请参见文档。

Nano Banana 2

更多模型

Doubao Seedream 5

FLUX 2 MAX

Black Forest Labs/FLUX 2 MAX

GPT Image 1.5

Doubao Seedream 4.5

Black Forest Labs/FLUX 2 PRO

Black Forest Labs/FLUX 2 PRO

相关博客

OpenClaw 在 2026 年的费用是多少？完整定价明细

GPT Image 1.5 与 Seedream 4.5：2026 年哪个更好？

ChatGPT 在 2026 年生成一张图像需要多长时间？

阿里巴巴 Wan2.7-Image 2026 年评测：革命性的统一 AI 图像模型

Luma AI Unit-1 图像模型（2026）：全面分析与对比

Nano Banana 2

更多模型

Doubao Seedream 5

FLUX 2 MAX

Black Forest Labs/FLUX 2 MAX

GPT Image 1.5

Doubao Seedream 4.5

Black Forest Labs/FLUX 2 PRO

Black Forest Labs/FLUX 2 PRO

相关博客

OpenClaw 在 2026 年的费用是多少？完整定价明细

GPT Image 1.5 与 Seedream 4.5：2026 年哪个更好？

ChatGPT 在 2026 年生成一张图像需要多长时间？

阿里巴巴 Wan2.7-Image 2026 年评测：革命性的统一 AI 图像模型

Luma AI Unit-1 图像模型（2026）：全面分析与对比