如何使用 OpenAI GPT-Image-1 API 编辑图像

OpenAI 的 GPT-Image-1 API 正在革新开发者与创意工作者对图像编辑的处理方式。它将先进的多模态能力与直观的文本提示相结合，使得可以直接通过代码实现精确且高质量的图像处理。无论你希望生成全新视觉、编辑现有图像，还是创建变体，gpt-image-1 都提供强大的解决方案。

What Is gpt-image-1?

GPT-Image-1 是 OpenAI 最新的图像生成模型，能够根据文本描述创建与编辑图像。它可以理解复杂提示，并生成高度逼真、与用户意图高度契合的图像。其关键特性包括：

高保真图像生成：生成细节丰富、准确的视觉效果。
多样化视觉风格：支持从照片级真实到抽象的多种美学风格。
精确图像编辑：可对生成图像进行有针对性的修改。
丰富的世界知识：在上下文中准确理解复杂提示。
一致的文本渲染：可在图像中可靠地渲染文本。

Real-World Applications

利用 gpt-image-1 的行业包括：

设计与原型：Figma 等工具集成 gpt-image-1 以提升创意工作流。
电商：平台使用它生成产品视觉与营销素材。
教育：为学习平台创建图表与视觉辅助材料。
营销：即时生成广告图形与社交媒体视觉内容。

Setting Up Your Environment

Prerequisites

开始之前，请确保你具备以下条件：

一个 OpenAI API 密钥。
系统已安装 Python。
已安装 openai Python 包。

你可以使用 pip 安装 openai 包：

bashpip install openai

Setting Up the OpenAI API Client

首先，在你的 Python 脚本中设置 OpenAI API 客户端：

pythonimport openai

openai.api_key = 'your-api-key-here'

将 'your-api-key-here' 替换为你的实际 OpenAI API 密钥。

How to Edit Images with GPT-Image-1

How Does Image Editing Work?

GPT-Image-1 允许你通过提供基础图像、可选的蒙版（用于指定可编辑区域），以及描述期望结果的文本提示来编辑图像。API 会处理这些输入，并返回与规范相符的修改后图像。

Preparing the Image and Mask

确保你的输入图像与蒙版满足以下条件：

二者均为正方形图像。
蒙版应为透明 PNG，透明区域表示需要编辑的区域。

Writing the Python Script

下面是一个使用 gpt-image-1 API 编辑图像的 Python 示例脚本：

import requests
edit_url = "https://api.openai.com/v1/images/edits"
headers = {
"Authorization": "Bearer YOUR_API_KEY"
}
files = {
"image": open("input-image.png", "rb")
"mask":open("path_to_your_mask.png", "rb"),
}
data = {
"model": "gpt-image-1",
"prompt": "Add a bright red balloon in the sky",
"n": 1,
"size": "1024x1024"
}
response = requests.post(edit_url, headers=headers, files=files, data=data)
image_url = response.json()
print("Edited Image URL:", image_url)

替换：

'Bearer YOUR_API_KEY' 为你的 OpenAI API 密钥。
"path_to_your_image.png" 为你原始图像的路径。
"path_to_your_mask.png" 为你的蒙版图像路径。
"Describe the desired edit here" 为描述你期望编辑的提示。

Example: Changing an Object’s Color

假设你有一张红色球的图像，希望将其颜色改为蓝色。你的提示可以是：

pythonprompt="Change the red ball to a blue ball"

确保你的蒙版仅突出红色球的区域。

GPT-Image-1

Advanced Tips and Considerations

What Are Some Advanced Features?

风格迁移：通过调整提示应用不同的艺术风格。
对象添加/移除：使用描述性提示在图像中添加或移除元素。
文本渲染：以指定字体与位置在图像中插入文本。

Image Size and Aspect Ratio

GPT-Image-1 API 要求图像为正方形，支持的尺寸包括 256×256、512×512 或 1024×1024 像素。非正方形图像可能会被调整大小或裁剪，这可能影响最终输出。

Token Usage and Costs

使用 base64 编码图像时，载荷大小会增加约 33%，这会影响 token 使用与成本。为降低影响，考虑将图像进行托管并提供 URL，而不是使用 base64 数据。

Model Limitations

尽管 gpt-image-1 提供强大的图像编辑能力，但对于涉及多个对象或复杂细节的高难度编辑，可能不如专业图像编辑软件有效。它更适合由清晰提示引导的直观编辑。

Best Practices for Optimal Results

How to Enhance Image Editing Outcomes?

具体明确：详细的提示可带来更准确的结果。
使用高质量图像：确保基础图像清晰、光照良好。
测试不同提示：尝试多种描述以达到理想效果。
善用蒙版：通过蒙版精确控制可编辑区域。

Integrating gpt-image-1 into Design Workflows

将 gpt-image-1 集成到 Figma 与 Adobe Firefly 等工具中可简化设计流程。设计师可在这些平台内直接使用文本提示生成与编辑图像，实现快速原型与迭代。

例如，在 Figma 中，你可以选择某个设计元素，输入类似“为该对象添加阴影”的提示，GPT-Image-1 集成将据此应用相应编辑。

Conclusion

OpenAI 的 GPT-Image-1 API 代表了 AI 驱动图像编辑的重大进步。通过自然语言提示引导图像编辑，它赋能设计师与开发者高效创建与修改视觉内容。随着与设计工具的集成持续演进，gpt-image-1 有望成为创意工作流中不可或缺的资产。

Getting Started

开发者可通过 GPT-image-1 API 由 CometAPI 访问。开始使用前，可在 Playground 中探索该模型的能力，并查阅 API guide（model name: gpt-image-1）获取详细说明。请注意，部分开发者在使用该模型前可能需要完成组织验证。

`GPT-Image-1` API Pricing in CometAPI，20% off the official price:

Output Tokens: $32/ M tokens

Input Tokens: $8 / M tokens

What Is gpt-image-1?

Real-World Applications

Setting Up Your Environment

Prerequisites

Setting Up the OpenAI API Client

How to Edit Images with GPT-Image-1

How Does Image Editing Work?

Preparing the Image and Mask

Writing the Python Script

Example: Changing an Object’s Color

Advanced Tips and Considerations

What Are Some Advanced Features?

Image Size and Aspect Ratio

Token Usage and Costs

Model Limitations

Best Practices for Optimal Results

How to Enhance Image Editing Outcomes?

Integrating gpt-image-1 into Design Workflows

Conclusion

Getting Started

`GPT-Image-1` API Pricing in CometAPI，20% off the official price:

阅读更多

一个 API 中超 500 个模型

如何使用 OpenAI GPT-Image-1 API 编辑图像

What Is gpt-image-1?

Real-World Applications

Setting Up Your Environment

Prerequisites

Setting Up the OpenAI API Client

How to Edit Images with GPT-Image-1

How Does Image Editing Work?

Preparing the Image and Mask

Writing the Python Script

Example: Changing an Object’s Color

Advanced Tips and Considerations

What Are Some Advanced Features?

Image Size and Aspect Ratio

Token Usage and Costs

Model Limitations

Best Practices for Optimal Results

How to Enhance Image Editing Outcomes?

Integrating gpt-image-1 into Design Workflows

Conclusion

Getting Started

GPT-Image-1 API Pricing in CometAPI，20% off the official price:

阅读更多

一个 API 中超 500 个模型

`GPT-Image-1` API Pricing in CometAPI，20% off the official price: