Docs/Image Generation API

IMG

Image Generation API

Access top image generation models through a unified API -- text-to-image, image editing, mask inpainting, and multi-image fusion.

Quick Start

Get your API key from Console

Choose an image provider below

POST to create a generation task

Poll GET to retrieve the image URL

Authentication

Add Authorization header to all requests:

Authorization: Bearer YOUR_API_KEY

Endpoints

POST/api/v1/images/generations

Create an image generation task

GET/api/v1/images/generations?task_id=xxx

Query task status and get image URL

API Reference

Select a provider to see its parameters and examples

OpenAI

GPT Image 2 Beta

OpenAI GPT Image 2 via the kie.ai beta channel — an alternate upstream to our primary gpt-image-2 route. One endpoint handles text-to-image and multi-image editing (up to 16 references), aspect_ratio drives composition, NSFW filtering is on by default. Async, typical latency 15–40s.

Models

GPT Image 2 Beta

gpt-image-2-beta

$0.06/image

Async · flat pricing · up to 16 references

Parameters

── Request parameters — POST /api/v1/images/generations ──

Content-Type: application/json

modelrequired

stringMust be "gpt-image-2-beta"

promptrequired

stringPrompt or editing instructions. When editing multiple images, reference them as "image 1", "image 2". Length 1–20000 chars.

aspect_ratio

stringDefault "auto". Options: auto / 1:1 / 2:3 / 3:2 / 3:4 / 4:3 / 4:5 / 5:4 / 9:16 / 16:9 / 21:9

image_url

stringSingle reference image URL (publicly reachable). Providing it routes to image-to-image; ≤ 30MB.

image_urls

string[]Multiple reference image URLs (up to 16, ≤ 30MB each) for multi-image fusion.

image_base64

stringBase64 or data-URI reference image (for local files). The server stages it to R2 before sending to kie. Mutually exclusive with image_url.

image_mime_type

stringMIME type of image_base64. Default image/jpeg. Supports image/png, image/jpeg, image/webp.

nsfw_checker

booleanContent filter toggle. Defaults to false (filtering OFF — raw model output, NSFW allowed). Pass true to enable safe-for-work filtering.

callback_url

stringWebhook URL called when task completes. Server will POST the same JSON as the poll response to this URL.

── Response fields — 2xx ──

application/json

code

integer200 on success; otherwise see the error-code table.

msg

string"success" or an error message.

data.taskId

stringTask ID used for status queries.

data.state

stringpending / processing / completed / failed.

data.resultUrls

string[]Present only when state=completed; R2-hosted image URLs.

data.failMsg

stringPresent only when state=failed; upstream failure reason.

data.costTime

integerTask duration in milliseconds.

data.completeTime

integerCompletion timestamp in ms.

Notes

-Async flow: POST returns taskId immediately with state=pending; poll GET ?task_id= until state=completed. Typical latency 15–40s.
-Endpoint auto-routes: prompt only → text-to-image; with image_url / image_urls / image_base64 → image-to-image.
-kie.ai only accepts URL-form reference images. Base64 inputs are automatically staged to R2 server-side before forwarding — no extra work for you.
-nsfw_checker defaults to false — content filtering is OFF and NSFW output is permitted. Pass true explicitly to enable safe-for-work filtering.
-Independent upstream from gpt-image-2 (RunningHub channel); they are mutually redundant routes to the same OpenAI model. Swap if the primary is having issues.
-Failed tasks are free: state=failed requests are not billed, retry is safe.
-Flat pricing: $0.06 per image, independent of aspect ratio or reference image count.

Code Example

# ─── 1. Text-to-image ──────────────────────────────────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "A cinematic night-city poster with neon reflections on a rainy street",
    "aspect_ratio": "16:9"
  }'


# ─── 2. Image editing with a reference URL ─────────────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Transform this product shot into a premium e-commerce poster style",
    "image_url": "https://example.com/product.jpg",
    "aspect_ratio": "4:3"
  }'


# ─── 3. Multi-image fusion (up to 16 references) ──────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Dress the model from image 1 in the outfit from image 2",
    "image_urls": [
      "https://example.com/model.jpg",
      "https://example.com/outfit.jpg"
    ],
    "aspect_ratio": "3:4"
  }'


# ─── 4. Enable safe-for-work filter (default is off) ──────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Family-friendly cartoon mascot",
    "aspect_ratio": "1:1",
    "nsfw_checker": true
  }'


# ─── 5. Poll task status ───────────────────────────────────
curl "https://apimodels.app/api/v1/images/generations?task_id=TASK_ID" \
  -H "Authorization: Bearer YOUR_API_KEY"


# ─── 6. Webhook (recommended for production) ──────────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Studio photo of a ceramic mug on a marble counter",
    "aspect_ratio": "1:1",
    "callback_url": "https://your-domain.com/webhook/image"
  }'

Response Format

Create Task Response

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "state": "pending"
  }
}

Success Response

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "state": "completed",
    "resultUrls": ["https://r2.apimodels.app/images/xxx.jpeg"],
    "createTime": 1705123450000,
    "completeTime": 1705123465000
  }
}

Failed Response

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "state": "failed",
    "failMsg": "Content policy violation"
  }
}

Webhook Callback (callback_url)

Pass callback_url in the create request. When the task reaches the completed or failed terminal state, our server sends a single HTTP POST to that URL with Content-Type: application/json (no signing header). Delivery is retried up to 3 times (exponential backoff 1s/2s/4s, 10s per attempt); if still unsuccessful, a background job keeps retrying for up to 30 minutes until your endpoint returns 2xx.

Payload Structure

POST {your callback_url}
Content-Type: application/json

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "model": "<provider>/<model_name>",
    "state": "completed" | "failed",
    "param": "<JSON string>",            // request params, JSON.parse once
    "resultJson": "<JSON string> | null", // result object, JSON.parse once
    "failCode": null | "string",
    "failMsg": null | "string",
    "costTime": 12345,                    // duration in ms
    "completeTime": 1705123465000,        // ms epoch
    "createTime": 1705123450000           // ms epoch
  }
}

Note: data.param and data.resultJson are both JSON strings — call JSON.parse once on each to get the underlying object.

Image task: shape after JSON.parse(data.resultJson)

{
  "resultUrls": [
    "https://r2.apimodels.app/images/xxx.png"
  ]
}

resultUrls is an array of Cloudflare R2-hosted image URLs. Multi-image models (SparkPix Image Edit, GPT Image 2 Lite with n>1) return more than one. When state=failed, resultJson is typically null or {"resultUrls":[]} — do not assume a link is present.

Node.js receiver example

app.post('/webhook/image', express.json(), (req, res) => {
  const { taskId, state, param, resultJson, failMsg } = req.body.data
  if (state === 'completed') {
    const { resultUrls } = JSON.parse(resultJson)
    console.log('image ready', taskId, resultUrls[0])
  } else {
    console.warn('image failed', taskId, failMsg)
  }
  res.status(200).end()                 // must be 2xx, otherwise we retry
})

Notes

- A task stops retrying only after a 2xx response — once delivered it is never pushed again.
- Callbacks are not signed today. Embed a random token in your callback_url path and verify it on receipt.
- GPT Image 2 Lite and SparkPix are synchronous — the POST create call usually already returns the result, so callbacks add little value. The async providers (gpt-image-2, gpt-image-2-beta, gemini-image, kling-image, …) are where callbacks matter — prefer them over polling in production.
- Use a public HTTPS endpoint that responds within 10 seconds (per-attempt timeout).

Task States

pendingQueued, waiting to start

processingImage is being generated

completedDone -- image URLs available

failedGeneration failed

Error Codes

400Bad Request -- invalid or missing parameters

401Unauthorized -- invalid API key

402Payment Required -- insufficient credits

404Not Found -- task ID not found

500Internal Server Error

Important Notes

-Image files are stored for 7 days -- download promptly
-Failed generations are not charged
-Poll every 2-3 seconds for status updates
-Use callback_url for production workloads to avoid polling
-Keep your API key secure

Try in Playground Get API Key

Docs/Image Generation API

IMG

Image Generation API

Access top image generation models through a unified API -- text-to-image, image editing, mask inpainting, and multi-image fusion.

Quick Start

Get your API key from Console

Choose an image provider below

POST to create a generation task

Poll GET to retrieve the image URL

Authentication

Add Authorization header to all requests:

Authorization: Bearer YOUR_API_KEY

Endpoints

POST/api/v1/images/generations

Create an image generation task

GET/api/v1/images/generations?task_id=xxx

Query task status and get image URL

API Reference

Select a provider to see its parameters and examples

OpenAI

GPT Image 2 Beta

Models

GPT Image 2 Beta

gpt-image-2-beta

$0.06/image

Async · flat pricing · up to 16 references

Parameters

── Request parameters — POST /api/v1/images/generations ──

Content-Type: application/json

modelrequired

stringMust be "gpt-image-2-beta"

promptrequired

stringPrompt or editing instructions. When editing multiple images, reference them as "image 1", "image 2". Length 1–20000 chars.

aspect_ratio

stringDefault "auto". Options: auto / 1:1 / 2:3 / 3:2 / 3:4 / 4:3 / 4:5 / 5:4 / 9:16 / 16:9 / 21:9

image_url

stringSingle reference image URL (publicly reachable). Providing it routes to image-to-image; ≤ 30MB.

image_urls

string[]Multiple reference image URLs (up to 16, ≤ 30MB each) for multi-image fusion.

image_base64

stringBase64 or data-URI reference image (for local files). The server stages it to R2 before sending to kie. Mutually exclusive with image_url.

image_mime_type

stringMIME type of image_base64. Default image/jpeg. Supports image/png, image/jpeg, image/webp.

nsfw_checker

booleanContent filter toggle. Defaults to false (filtering OFF — raw model output, NSFW allowed). Pass true to enable safe-for-work filtering.

callback_url

stringWebhook URL called when task completes. Server will POST the same JSON as the poll response to this URL.

── Response fields — 2xx ──

application/json

code

integer200 on success; otherwise see the error-code table.

msg

string"success" or an error message.

data.taskId

stringTask ID used for status queries.

data.state

stringpending / processing / completed / failed.

data.resultUrls

string[]Present only when state=completed; R2-hosted image URLs.

data.failMsg

stringPresent only when state=failed; upstream failure reason.

data.costTime

integerTask duration in milliseconds.

data.completeTime

integerCompletion timestamp in ms.

Notes

-Async flow: POST returns taskId immediately with state=pending; poll GET ?task_id= until state=completed. Typical latency 15–40s.
-Endpoint auto-routes: prompt only → text-to-image; with image_url / image_urls / image_base64 → image-to-image.
-kie.ai only accepts URL-form reference images. Base64 inputs are automatically staged to R2 server-side before forwarding — no extra work for you.
-nsfw_checker defaults to false — content filtering is OFF and NSFW output is permitted. Pass true explicitly to enable safe-for-work filtering.
-Independent upstream from gpt-image-2 (RunningHub channel); they are mutually redundant routes to the same OpenAI model. Swap if the primary is having issues.
-Failed tasks are free: state=failed requests are not billed, retry is safe.
-Flat pricing: $0.06 per image, independent of aspect ratio or reference image count.

Code Example

# ─── 1. Text-to-image ──────────────────────────────────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "A cinematic night-city poster with neon reflections on a rainy street",
    "aspect_ratio": "16:9"
  }'


# ─── 2. Image editing with a reference URL ─────────────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Transform this product shot into a premium e-commerce poster style",
    "image_url": "https://example.com/product.jpg",
    "aspect_ratio": "4:3"
  }'


# ─── 3. Multi-image fusion (up to 16 references) ──────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Dress the model from image 1 in the outfit from image 2",
    "image_urls": [
      "https://example.com/model.jpg",
      "https://example.com/outfit.jpg"
    ],
    "aspect_ratio": "3:4"
  }'


# ─── 4. Enable safe-for-work filter (default is off) ──────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Family-friendly cartoon mascot",
    "aspect_ratio": "1:1",
    "nsfw_checker": true
  }'


# ─── 5. Poll task status ───────────────────────────────────
curl "https://apimodels.app/api/v1/images/generations?task_id=TASK_ID" \
  -H "Authorization: Bearer YOUR_API_KEY"


# ─── 6. Webhook (recommended for production) ──────────────
curl -X POST https://apimodels.app/api/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-image-2-beta",
    "prompt": "Studio photo of a ceramic mug on a marble counter",
    "aspect_ratio": "1:1",
    "callback_url": "https://your-domain.com/webhook/image"
  }'

Response Format

Create Task Response

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "state": "pending"
  }
}

Success Response

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "state": "completed",
    "resultUrls": ["https://r2.apimodels.app/images/xxx.jpeg"],
    "createTime": 1705123450000,
    "completeTime": 1705123465000
  }
}

Failed Response

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "state": "failed",
    "failMsg": "Content policy violation"
  }
}

Webhook Callback (callback_url)

Payload Structure

POST {your callback_url}
Content-Type: application/json

{
  "code": 200,
  "msg": "success",
  "data": {
    "taskId": "clxxx...",
    "model": "<provider>/<model_name>",
    "state": "completed" | "failed",
    "param": "<JSON string>",            // request params, JSON.parse once
    "resultJson": "<JSON string> | null", // result object, JSON.parse once
    "failCode": null | "string",
    "failMsg": null | "string",
    "costTime": 12345,                    // duration in ms
    "completeTime": 1705123465000,        // ms epoch
    "createTime": 1705123450000           // ms epoch
  }
}

Note: data.param and data.resultJson are both JSON strings — call JSON.parse once on each to get the underlying object.

Image task: shape after JSON.parse(data.resultJson)

{
  "resultUrls": [
    "https://r2.apimodels.app/images/xxx.png"
  ]
}

Node.js receiver example

app.post('/webhook/image', express.json(), (req, res) => {
  const { taskId, state, param, resultJson, failMsg } = req.body.data
  if (state === 'completed') {
    const { resultUrls } = JSON.parse(resultJson)
    console.log('image ready', taskId, resultUrls[0])
  } else {
    console.warn('image failed', taskId, failMsg)
  }
  res.status(200).end()                 // must be 2xx, otherwise we retry
})

Notes

- A task stops retrying only after a 2xx response — once delivered it is never pushed again.
- Callbacks are not signed today. Embed a random token in your callback_url path and verify it on receipt.
- GPT Image 2 Lite and SparkPix are synchronous — the POST create call usually already returns the result, so callbacks add little value. The async providers (gpt-image-2, gpt-image-2-beta, gemini-image, kling-image, …) are where callbacks matter — prefer them over polling in production.
- Use a public HTTPS endpoint that responds within 10 seconds (per-attempt timeout).

Task States

pendingQueued, waiting to start

processingImage is being generated

completedDone -- image URLs available

failedGeneration failed

Error Codes

400Bad Request -- invalid or missing parameters

401Unauthorized -- invalid API key

402Payment Required -- insufficient credits

404Not Found -- task ID not found

500Internal Server Error

Important Notes

-Image files are stored for 7 days -- download promptly
-Failed generations are not charged
-Poll every 2-3 seconds for status updates
-Use callback_url for production workloads to avoid polling
-Keep your API key secure

Try in Playground Get API Key