What exactly is Nano Banana 2 and what does it do?

Nano Banana 2 is Google’s latest AI image generation and editing model, built on Gemini Flash image technology to deliver fast, high-quality visual generation and precise instruction following across text and image inputs.

How does Nano Banana 2 relate to Gemini 3.1 Flash Image?

Nano Banana 2 is essentially the consumer-facing branding for Google’s Gemini 3.1 Flash Image model, combining advanced capabilities from previous Nano Banana versions with the speed of Flash models.

What improvements does Nano Banana 2 add over earlier Nano Banana models?

Nano Banana 2 brings faster generation speed, sharper detail, better instruction fidelity, enhanced text rendering/localized translation, and broader creative control while making many Pro-grade features available at base tier.

What kinds of images and resolutions can Nano Banana 2 generate?

The model supports flexible output with various aspect ratios and resolutions up to 4K, suitable for social media, ads, displays, and professional content.

Can Nano Banana 2 maintain consistency in complex compositions?

Yes. It preserves consistency across multiple subjects and objects (e.g., up to five characters and 14 objects in a single prompt workflow), which helps with narrative scenes and storyboard-style tasks.

What image generation use cases is Gemini 3.1 Flash Image best suited for?

It’s well-suited for professional-grade image creation and editing, infographics, multi-image consistency, text rendering, and localized multilingual outputs, especially when workflows need precise control and repeated iterations.

Does Nano Banana 2 use real-time information or world knowledge?

Nano Banana 2 incorporates real-world knowledge and image search integration to help generate more accurate subjects, infographics, and location-aware visuals.

Can Gemini 3.1 Flash Image generate detailed text within images or diagrams?

Yes. It can render clear text within images, although extremely small text or dense multi-paragraph text can still be challenging.

Affordable Nano Banana 2 API | text-to-image

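Technical specifications of Gemini 3.1 Flash Image Preview

Item	Gemini 3.1 Flash Image Preview
Provider	Google
Model family	Gemini 3.1 (Flash tier)
Primary focus	High-speed multimodal generation, including image previews
Input types	Text, image
Output types	Text, image (preview generation)
Context window	Up to 1M tokens (standard for the Gemini 3.x Flash tier)
Latency tier	Low latency, high throughput
Streaming support	Yes
Tool calling	Yes (Gemini API tools framework)
Version	3.1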

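What is Nano Banana 2

Nano Banana 2 is the popular nickname used by the press and the developer community for the newly released Gemini-3.1-Flash-Image model. Google positions it as an image engine that delivers near-Pro visual fidelity in the low-latency, low-cost “Flash” tier, suited to high-volume generation, rapid iterative editing, and integration into product workflows across Google services. It inherits the multimodal reasoning of Gemini 3.1 and adds image-focused capabilities: legible text within images, multi-image composition, wide aspect-ratio support, and native 4K.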

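Key features

High-speed, multi-resolution generation: 0.5K / 1K / 2K / 4K output options at Flash-tier speed, plus support for new extreme aspect ratios (1:4, 4:1, 1:8, 8:1).
Real-time web grounding: when “Thinking” or search grounding is enabled, the model integrates text and image search results so that outputs are grounded in current web information. This is useful for up-to-date references and fact-based infographics.
Improved text rendering: better rendering of short text and graphic text (fonts, sizes) than previous Flash models; long paragraphs and small text are still not perfect.
Multi-input editing and multi-turn workflows: combine several images as input, with strong support for iterative editing across multiple turns.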
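When a layout has fixed pixel dimensions, one way to pick an aspect ratio is to choose the closest listed value. The sketch below is illustrative only: `SUPPORTED_RATIOS` is assembled from the ratios named on this page (the feature list and the sample-code comment) and may not match what the API actually accepts, and `closest_aspect_ratio` is a hypothetical helper, not part of any SDK.

```python
import math

# Assumption: ratios collected from this page, not an authoritative API list.
SUPPORTED_RATIOS = [
    "1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4",
    "9:16", "16:9", "21:9", "1:4", "4:1", "1:8", "8:1",
]


def closest_aspect_ratio(width: int, height: int) -> str:
    """Return the listed ratio nearest to width/height.

    Distances are compared on a log scale so that 1:2 and 2:1 are
    treated as equally far from 1:1.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = math.log(width / height)

    def distance(ratio: str) -> float:
        w, h = ratio.split(":")
        return abs(math.log(int(w) / int(h)) - target)

    return min(SUPPORTED_RATIOS, key=distance)


if __name__ == "__main__":
    print(closest_aspect_ratio(1080, 1920))  # a portrait phone screen
```

The returned string can then be passed as the `aspect_ratio` value used in the sample code further down the page.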

📊 Benchmark performance: image generation and editing (Elo scores)

Capability	Gemini 3.1 Flash Image (Nano Banana 2)	Gemini 2.5 Flash Image (Nano Banana)	Gemini 3 Pro Image (Nano Banana Pro)	GPT-Image 1.5	Seedream 5.0 Lite	Grok Imagine Image Pro
Text-to-image: overall preference	1079.0 ± 7.0	1073.0 ± 5.0	942.0 ± 6.0	1021.0 ± 5.0	1047.0 ± 5.0	928.0 ± 8.0
Text-to-image: visual quality	1140.0 ± 6.0	1129.0 ± 6.0	929.0 ± 6.0	1043.0 ± 5.0	975.0 ± 5.0	759.0 ± 10.0
Text-to-image: infographics (factuality)	1114.0 ± 14.0	1074.0 ± 12.0	881.0 ± 13.0	1102.0 ± 13.0	985.0 ± 12.0	890.0 ± 22.0
Editing: general	1065.0 ± 9.0	1047.0 ± 9.0	913.0 ± 9.0	1051.0 ± 10.0	995.0 ± 8.0	937.0 ± 9.0
Editing: character	1056.0 ± 7.0	1049.0 ± 7.0	952.0 ± 7.0	1050.0 ± 8.0	1025.0 ± 7.0	894.0 ± 8.0
Editing: creative	1023.0 ± 7.0	1031.0 ± 7.0	976.0 ± 7.0	1004.0 ± 7.0	1017.0 ± 7.0	938.0 ± 7.0
Editing: object/environment	1029.0 ± 8.0	1018.0 ± 8.0	945.0 ± 8.0	1042.0 ± 10.0	976.0 ± 8.0	946.0 ± 9.0
Editing: multi-input	1037.0 ± 8.0	1016.0 ± 8.0	919.0 ± 9.0	1056.0 ± 12.0	1014.0 ± 9.0	N/A
Editing: stylization	1045.0 ± 7.0	1031.0 ± 7.0	862.0 ± 8.0	1045.0 ± 9.0	996.0 ± 7.0	984.0 ± 7.0

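Key takeaways from this benchmark table:

Across text-to-image generation and image editing, Gemini 3.1 Flash Image scores level with or ahead of the best Flash-tier and competing image models in most categories (in this table, GPT-Image 1.5 scores higher on object/environment and multi-input editing).
It is particularly strong on the visual quality and infographics (factuality) benchmarks, which suggests it does well not only on aesthetics but also on rendering structurally accurate content.
In multi-input editing, Nano Banana 2 scores higher than the previous Flash generation, which points to robust generalization.

These evaluations were run as human side-by-side comparisons (Elo) across a variety of benchmark suites, and they reflect preference and fidelity across commonly used image generation and editing tasks.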
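To get a feel for what an Elo gap means, a rating difference can be converted into an expected head-to-head preference rate. The sketch below uses the conventional logistic Elo formula with a scale of 400; the page does not state which Elo variant or scale the benchmark uses, so both the formula and the resulting percentage are assumptions for illustration only.

```python
def elo_expected_win_rate(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected share of comparisons won by A under the standard Elo model.

    Assumption: the conventional base-10 logistic curve with scale 400.
    The benchmark's own Elo parameters are not given on this page.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    # Overall text-to-image preference from the table:
    # Gemini 3.1 Flash Image (1079.0) vs GPT-Image 1.5 (1021.0).
    p = elo_expected_win_rate(1079.0, 1021.0)
    print(f"{p:.3f}")  # prints 0.583
```

Under these assumptions, a 58-point gap corresponds to raters preferring the higher-rated model in roughly 58% of comparisons, which is a noticeable but not overwhelming margin. The ± intervals in the table are not accounted for here.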

Nano Banana 2 vs Nano Banana vs Nano Banana Pro

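Model	Positioning	Representative benchmarks / notes
Gemini 3.1 Flash Image (Nano Banana 2)	Flash tier: speed + high visual quality (2K–4K)	Overall preference 1079.0 ± 7.0; visual quality 1140.0 ± 6.0 (internal GenAI-Bench).
Gemini 2.5 Flash Image (Nano Banana)	Previous Flash release (lower fidelity)	Somewhat lower preference and visual scores than 3.1.
Gemini 3 Pro Image (Nano Banana Pro)	Pro tier: higher perceptual fidelity on complex tasks, at higher cost and latency	Different trade-offs; on some metrics the relative ranking differs for specialized tasks.
GPT-Image 1.5 / other commercial models	Competing models (open/closed)	In Google's internal benchmarks, Gemini 3.1 scores higher than GPT-Image and other models on visual quality and overall preference. Independent third-party comparisons may differ.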

When to choose Flash Image Preview:

Real-time image previews in apps
Cost-sensitive, large-scale image generation
Interactive design assistants

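How to access and integrate Nano Banana 2

Step 1: Sign up for an API key

Log in to cometapi.com. If you are not yet a user, register first. Then sign in to the CometAPI console and obtain an API key as your access credential: in the personal center, under API token, click “Add Token” to get a token key of the form sk-xxxxx, then submit.

Step 2: Send a request to the `Nano Banana 2` API

To send an API request, select the “gemini-3.1-flash-image-preview” endpoint and set the request body. The request method and body are described in the API documentation on our website, which also provides an Apifox test for convenience. Replace <YOUR_API_KEY> with the actual CometAPI key from your account. Where to call it: Gemini image generation

Nano Banana 2 supports image editing, image generation, and multi-image workflows. For image editing, you need to upload an image URL. See the documentation for more parameters.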
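A request body for an edit could be assembled along the following lines. This is a sketch under stated assumptions: it follows the Gemini REST convention of an `inlineData` part carrying base64-encoded image bytes, whereas the text above mentions supplying an image URL, so check the API documentation for the form this provider expects. `build_edit_request` is a hypothetical helper name, and the function only builds the JSON body; it does not send anything.

```python
import base64
from typing import Optional


def build_edit_request(
    instruction: str,
    image_bytes: bytes,
    mime_type: str = "image/png",
    aspect_ratio: Optional[str] = None,
) -> dict:
    """Build a generateContent-style body with one text part and one image part.

    Assumption: the endpoint accepts inline base64 image data in the
    same shape as the text-to-image examples on this page.
    """
    parts = [
        {"text": instruction},
        {
            "inlineData": {
                "mimeType": mime_type,
                "data": base64.b64encode(image_bytes).decode("ascii"),
            }
        },
    ]
    body = {
        "contents": [{"role": "user", "parts": parts}],
        "generationConfig": {"responseModalities": ["IMAGE"]},
    }
    if aspect_ratio is not None:
        body["generationConfig"]["imageConfig"] = {"aspectRatio": aspect_ratio}
    return body


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file read from disk.
    body = build_edit_request("Make the sky overcast.", b"abc", aspect_ratio="16:9")
    print(body["contents"][0]["parts"][1]["inlineData"]["data"])  # prints YWJj
```

For a multi-image workflow, additional image parts would be appended to the same `parts` list, again assuming the provider accepts that shape.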

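Step 3: Retrieve and verify results

Process the API response to receive the generated result. After processing, the API returns the task status and output data. In the playground you can download the image directly to your machine (usually in PNG format). An image URL is generated during API processing; download it promptly.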
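The retrieval and verification step can be sketched as a small parser over the response JSON. It assumes the response has the `candidates` → `content` → `parts` → `inlineData` shape used by the sample code on this page; `extract_images` and `is_png` are hypothetical helper names. The PNG check only looks at the standard 8-byte file signature, so it confirms the format, not the image content.

```python
import base64
from typing import Any, Dict, List

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"  # fixed first 8 bytes of every PNG file


def extract_images(response_json: Dict[str, Any]) -> List[bytes]:
    """Collect decoded image bytes from every inlineData part, if any."""
    images: List[bytes] = []
    for candidate in response_json.get("candidates") or []:
        parts = (candidate.get("content") or {}).get("parts") or []
        for part in parts:
            inline = part.get("inlineData")
            if inline and "data" in inline:
                images.append(base64.b64decode(inline["data"]))
    return images


def is_png(data: bytes) -> bool:
    """Return True if the bytes start with the PNG file signature."""
    return data.startswith(PNG_SIGNATURE)


if __name__ == "__main__":
    # A hand-built stand-in for a real API response.
    fake = {
        "candidates": [
            {
                "content": {
                    "parts": [
                        {"text": "Here is your image."},
                        {
                            "inlineData": {
                                "mimeType": "image/png",
                                "data": base64.b64encode(PNG_SIGNATURE + b"...").decode("ascii"),
                            }
                        },
                    ]
                }
            }
        ]
    }
    for index, image in enumerate(extract_images(fake)):
        print(index, len(image), is_png(image))
```

Returning an empty list for a response without candidates lets the caller decide whether that is an error, for example when a request was rejected and the body carries an error object instead.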

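Nano Banana 2 pricing

Explore competitive pricing for Nano Banana 2, designed for a range of budgets and usage needs. With flexible plans you pay only for what you use, so it is easy to scale as your requirements grow. See how Nano Banana 2 can enhance your projects while keeping costs manageable.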

nano-banana-2 (image)

variant / alias	Price
gemini-3.1-flash-image (0.5K)	≈ $0.03600
gemini-3.1-flash-image (1K)	≈ $0.05360
gemini-3.1-flash-image (2K)	≈ $0.08080
gemini-3.1-flash-image (4K)	≈ $0.12080
gemini-3.1-flash-image-preview (0.5K)	≈ $0.03600
gemini-3.1-flash-image-preview (1K)	≈ $0.05360
gemini-3.1-flash-image-preview (2K)	≈ $0.08080
gemini-3.1-flash-image-preview (4K)	≈ $0.12080
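For budgeting, the table can be turned into a quick batch estimate. The sketch below assumes the listed prices are in US dollars per generated image, which the table does not state explicitly, and the prices themselves are marked approximate, so treat the result as a rough figure. `estimate_cost` is a hypothetical helper.

```python
from typing import Dict

# Approximate prices copied from the table above.
# Assumption: USD per generated image at each output resolution.
PRICE_PER_IMAGE_USD: Dict[str, float] = {
    "0.5K": 0.03600,
    "1K": 0.05360,
    "2K": 0.08080,
    "4K": 0.12080,
}


def estimate_cost(image_counts: Dict[str, int]) -> float:
    """Estimate total cost for a batch, given image counts per resolution."""
    total = 0.0
    for resolution, count in image_counts.items():
        if resolution not in PRICE_PER_IMAGE_USD:
            raise KeyError(f"no listed price for resolution {resolution!r}")
        if count < 0:
            raise ValueError("image counts must be non-negative")
        total += PRICE_PER_IMAGE_USD[resolution] * count
    return round(total, 4)


if __name__ == "__main__":
    # 1,000 images at 1K plus 50 hero images at 4K.
    print(estimate_cost({"1K": 1000, "4K": 50}))
```

In this example the 1K batch contributes about $53.60 and the 4K images about $6.04, so resolution choice matters more than image count once 4K output is involved.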

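Sample code and API for Nano Banana 2

Access comprehensive sample code and API resources for Nano Banana 2 to streamline integration. The detailed documentation provides step-by-step guidance to help you use Nano Banana 2 to its full potential in your projects.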

Python Code Example

from google import genai
from google.genai import types
from PIL import Image
import os

# Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
COMETAPI_KEY = os.environ.get("COMETAPI_KEY") or "<YOUR_COMETAPI_KEY>"
BASE_URL = "https://api.cometapi.com"

client = genai.Client(
    http_options={"api_version": "v1beta", "base_url": BASE_URL},
    api_key=COMETAPI_KEY,
)

prompt = (
    "A woman leaning on a wooden railing of a traditional Chinese building. "
    "She is wearing a blue cheongsam with pink and red floral motifs and a headdress "
    "made of colorful flowers, including roses and lilacs. Realistic painting style, "
    "focusing on the textural details of the clothing patterns and wooden buildings."
)
aspect_ratio = "9:16"  # "1:1","2:3","3:2","3:4","4:3","4:5","5:4","9:16","16:9","21:9"

response = client.models.generate_content(
    model="gemini-3.1-flash-image-preview",
    contents=[prompt],
    config=types.GenerateContentConfig(
        response_modalities=["IMAGE"],
        image_config=types.ImageConfig(aspect_ratio=aspect_ratio),
    ),
)

os.makedirs("./output", exist_ok=True)

for part in response.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = part.as_image()
        output_path = "./output/gemini-3.1-flash-image-preview.png"
        image.save(output_path)
        print(f"Image saved to {output_path}")

JavaScript Code Example

// Run as an ES module on Node.js 18+ (uses top-level await and the built-in fetch).
import fs from "fs";
import path from "path";

// Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
const api_key = process.env.COMETAPI_KEY || "<YOUR_COMETAPI_KEY>";
const base_url = "https://api.cometapi.com/v1beta";
const model = "gemini-3.1-flash-image-preview";

const prompt =
  "A woman leaning on a wooden railing of a traditional Chinese building. " +
  "She is wearing a blue cheongsam with pink and red floral motifs and a headdress " +
  "made of colorful flowers, including roses and lilacs. Realistic painting style, " +
  "focusing on the textural details of the clothing patterns and wooden buildings.";

const response = await fetch(`${base_url}/models/${model}:generateContent`, {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    Authorization: api_key,
  },
  body: JSON.stringify({
    contents: [
      {
        role: "user",
        parts: [{ text: prompt }],
      },
    ],
    generationConfig: {
      responseModalities: ["IMAGE"],
      imageConfig: {
        aspectRatio: "9:16",
      },
    },
  }),
});

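// Fail early with the server's message instead of crashing later on a missing field.
if (!response.ok) {
  throw new Error(`Request failed: ${response.status} ${await response.text()}`);
}

const data = await response.json();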

const outputDir = "./output";
if (!fs.existsSync(outputDir)) {
  fs.mkdirSync(outputDir, { recursive: true });
}

for (const candidate of data.candidates) {
  for (const part of candidate.content.parts) {
    if (part.text) {
      console.log(part.text);
    } else if (part.inlineData) {
      const imageBuffer = Buffer.from(part.inlineData.data, "base64");
      const outputPath = path.join(outputDir, "gemini-3.1-flash-image-preview.png");
      fs.writeFileSync(outputPath, imageBuffer);
      console.log(`Image saved to ${outputPath}`);
    }
  }
}

Curl Code Example

# Get your CometAPI key from https://api.cometapi.com/console/token
# Export it as: export COMETAPI_KEY="your-key-here"

mkdir -p ./output

curl -s "https://api.cometapi.com/v1beta/models/gemini-3.1-flash-image-preview:generateContent" \
  -H "Authorization: $COMETAPI_KEY" \
  -H 'Content-Type: application/json' \
  -X POST \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [
          {
            "text": "A woman leaning on a wooden railing of a traditional Chinese building. She is wearing a blue cheongsam with pink and red floral motifs and a headdress made of colorful flowers, including roses and lilacs. Realistic painting style, focusing on the textural details of the clothing patterns and wooden buildings."
          }
        ]
      }
    ],
    "generationConfig": {
      "responseModalities": ["IMAGE"],
      "imageConfig": {
        "aspectRatio": "9:16"
      }
    }
  }' | python3 -c "
import sys, json, base64
data = json.load(sys.stdin)
parts = data['candidates'][0]['content']['parts']
for part in parts:
    if 'text' in part:
        print(part['text'])
    elif 'inlineData' in part:
        img = base64.b64decode(part['inlineData']['data'])
        with open('./output/gemini-3.1-flash-image-preview.png', 'wb') as f:
            f.write(img)
        print('Image saved to ./output/gemini-3.1-flash-image-preview.png')
"

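Nano Banana 2 versions

Nano Banana 2 may have multiple snapshots for several reasons: earlier snapshots are kept so that outputs stay consistent when an update changes them, developers are given a transition period to adapt and migrate, and different snapshots may be served from global or regional endpoints to optimize the user experience. For detailed differences between versions, refer to the official documentation.

Model id	Description	Availability	Request
gemini-3.1-flash-image	Recommended; points to the latest model	✅	Gemini image generation
gemini-3.1-flash-image-preview	Official preview	✅	Gemini image generation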