This document describes the actual behavior of the current Go codebase.
Docs: Overview / Architecture / Deployment / Testing
- Basics
- Configuration Best Practice
- Authentication
- Route Index
- Health Endpoints
- OpenAI-Compatible API
- Claude-Compatible API
- Gemini-Compatible API
- Admin API
- Error Payloads
- cURL Examples
| Item | Details |
|---|---|
| Base URL | http://localhost:5001 or your deployment domain |
| Default Content-Type | application/json |
| Health probes | GET /healthz, GET /readyz |
| CORS | Enabled (Access-Control-Allow-Origin: *, allows Content-Type, Authorization, X-API-Key, X-Ds2-Target-Account, X-Vercel-Protection-Bypass) |
- OpenAI / Claude / Gemini protocols are now mounted on one shared chi router tree assembled in `internal/server/router.go`.
- Adapter responsibilities are streamlined to: request normalization → DeepSeek invocation → protocol-shaped rendering, reducing legacy split-logic paths.
- Tool-calling semantics are aligned between the Go and Node runtimes: structured parsing first (JSON/XML/invoke/markup), plus stream-time anti-leak filtering.
The Admin API separates static config from runtime policy: `/admin/config*` for configuration state, `/admin/settings*` for runtime behavior.
Use config.json as the single source of truth:

```bash
cp config.example.json config.json
# Edit config.json (keys/accounts)
```

Use it per deployment mode:

- Local run: read `config.json` directly
- Docker / Vercel: generate Base64 from `config.json`, then set `DS2API_CONFIG_JSON`, or paste raw JSON directly

```bash
DS2API_CONFIG_JSON="$(base64 < config.json | tr -d '\n')"
```

For Vercel one-click bootstrap, you can set only `DS2API_ADMIN_KEY` first, then import config at `/admin` and sync env vars from the "Vercel Sync" page.
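The same Base64 value can be produced without the shell one-liner. A minimal Python sketch; the round-trip mirrors what the server effectively does when decoding `DS2API_CONFIG_JSON` (assuming standard Base64 without line wrapping):

```python
import base64
import json

def encode_config(config: dict) -> str:
    """Produce a DS2API_CONFIG_JSON value: JSON -> UTF-8 -> Base64, no newlines."""
    raw = json.dumps(config, ensure_ascii=False).encode("utf-8")
    return base64.b64encode(raw).decode("ascii")  # b64encode never wraps lines

def decode_config(value: str) -> dict:
    """Reverse the encoding (the server-side view of the env var)."""
    return json.loads(base64.b64decode(value))
```

To use it on a real file, load the dict first with `json.load(open("config.json"))` and pass it to `encode_config`.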
Accepted credential formats:
| Method | Example |
|---|---|
| Bearer Token | Authorization: Bearer <token> |
| API Key Header | x-api-key: <token> (no Bearer prefix) |
| Gemini-compatible | x-goog-api-key: <token> or ?key=<token> / ?api_key=<token> |
Auth behavior:

- Token is in `config.keys` → Managed account mode: DS2API auto-selects an account via rotation
- Token is not in `config.keys` → Direct token mode: the value is treated as a DeepSeek token directly

Optional header: `X-Ds2-Target-Account: <email_or_mobile>` pins a specific managed account.
Gemini-compatible clients can also send `x-goog-api-key`, `?key=`, or `?api_key=` as the caller credential source.
| Endpoint | Auth |
|---|---|
| POST /admin/login | Public |
| GET /admin/verify | `Authorization: Bearer <jwt>` (JWT only) |
| Other /admin/* | `Authorization: Bearer <jwt>` or `Authorization: Bearer <admin_key>` |
| Method | Path | Auth | Description |
|---|---|---|---|
| GET | /healthz | None | Liveness probe |
| HEAD | /healthz | None | Liveness probe (no body) |
| GET | /readyz | None | Readiness probe |
| HEAD | /readyz | None | Readiness probe (no body) |
| GET | /v1/models | None | OpenAI model list |
| GET | /v1/models/{id} | None | OpenAI single-model query (alias accepted) |
| POST | /v1/chat/completions | Business | OpenAI chat completions |
| POST | /v1/responses | Business | OpenAI Responses API (stream/non-stream) |
| GET | /v1/responses/{response_id} | Business | Query stored response (in-memory TTL) |
| POST | /v1/embeddings | Business | OpenAI Embeddings API |
| GET | /anthropic/v1/models | None | Claude model list |
| POST | /anthropic/v1/messages | Business | Claude messages |
| POST | /anthropic/v1/messages/count_tokens | Business | Claude token counting |
| POST | /v1/messages | Business | Claude shortcut path |
| POST | /messages | Business | Claude shortcut path |
| POST | /v1/messages/count_tokens | Business | Claude token counting shortcut |
| POST | /messages/count_tokens | Business | Claude token counting shortcut |
| POST | /v1beta/models/{model}:generateContent | Business | Gemini non-stream |
| POST | /v1beta/models/{model}:streamGenerateContent | Business | Gemini stream |
| POST | /v1/models/{model}:generateContent | Business | Gemini non-stream compat path |
| POST | /v1/models/{model}:streamGenerateContent | Business | Gemini stream compat path |
| POST | /admin/login | None | Admin login |
| GET | /admin/verify | JWT | Verify admin JWT |
| GET | /admin/vercel/config | Admin | Read preconfigured Vercel creds |
| GET | /admin/config | Admin | Read sanitized config |
| POST | /admin/config | Admin | Update config |
| GET | /admin/settings | Admin | Read runtime settings |
| PUT | /admin/settings | Admin | Update runtime settings (hot reload) |
| POST | /admin/settings/password | Admin | Update admin password and invalidate old JWTs |
| POST | /admin/config/import | Admin | Import config (merge/replace) |
| GET | /admin/config/export | Admin | Export full config (config/json/base64) |
| POST | /admin/keys | Admin | Add API key |
| DELETE | /admin/keys/{key} | Admin | Delete API key |
| GET | /admin/accounts | Admin | Paginated account list |
| POST | /admin/accounts | Admin | Add account |
| DELETE | /admin/accounts/{identifier} | Admin | Delete account |
| GET | /admin/queue/status | Admin | Account queue status |
| POST | /admin/accounts/test | Admin | Test one account |
| POST | /admin/accounts/test-all | Admin | Test all accounts |
| POST | /admin/accounts/sessions/delete-all | Admin | Delete all sessions for one account |
| POST | /admin/import | Admin | Batch import keys/accounts |
| POST | /admin/test | Admin | Test API through service |
| POST | /admin/dev/raw-samples/capture | Admin | Fire one request and persist it as a raw sample |
| GET | /admin/dev/raw-samples/query | Admin | Search current in-memory capture chains by prompt keyword |
| POST | /admin/dev/raw-samples/save | Admin | Persist a selected in-memory capture chain as a raw sample |
| POST | /admin/vercel/sync | Admin | Sync config to Vercel |
| GET | /admin/vercel/status | Admin | Vercel sync status |
| POST | /admin/vercel/status | Admin | Vercel sync status / draft compare |
| GET | /admin/export | Admin | Export config JSON/Base64 |
| GET | /admin/dev/captures | Admin | Read local packet-capture entries |
| DELETE | /admin/dev/captures | Admin | Clear local packet-capture entries |
| GET | /admin/version | Admin | Check current version and latest Release |
GET /healthz returns `{"status": "ok"}`; GET /readyz returns `{"status": "ready"}`.

No auth required. Returns the currently supported DeepSeek native model list.
Response:
{
"object": "list",
"data": [
{"id": "deepseek-chat", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-reasoner", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-chat-search", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-reasoner-search", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-expert-chat", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-expert-reasoner", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-expert-chat-search", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-expert-reasoner-search", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-vision-chat", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-vision-reasoner", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-vision-chat-search", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []},
{"id": "deepseek-vision-reasoner-search", "object": "model", "created": 1677610602, "owned_by": "deepseek", "permission": []}
]
}

Note: `/v1/models` returns normalized DeepSeek native model IDs. Common aliases are accepted only as request input and are not expanded as separate items in this endpoint.
For chat / responses / embeddings, DS2API follows a wide-input/strict-output policy:

- Match DeepSeek native model IDs first.
- Then match exact keys in `model_aliases`.
- If still unmatched, fall back by known family heuristics (`o*`, `gpt-*`, `claude-*`, etc.).
- If still unmatched, return `invalid_request_error`.
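The fallback order can be sketched as follows. Illustrative only: the native/alias tables and the fallback target here are placeholders, not the server's real mapping, which is loaded from config.

```python
# Illustrative tables; the real server reads these from config / model_aliases.
NATIVE_MODELS = {"deepseek-chat", "deepseek-reasoner", "deepseek-chat-search"}
MODEL_ALIASES = {"gpt-4o": "deepseek-chat", "claude-sonnet-4-5": "deepseek-chat"}

def resolve_model(requested: str) -> str:
    if requested in NATIVE_MODELS:             # 1. native DeepSeek IDs first
        return requested
    if requested in MODEL_ALIASES:             # 2. exact alias keys
        return MODEL_ALIASES[requested]
    name = requested.lower()                   # 3. family heuristics (o*, gpt-*, ...)
    if name.startswith(("o", "gpt-", "claude-", "gemini-")):
        return "deepseek-chat"                 # placeholder fallback target
    raise ValueError("invalid_request_error")  # 4. strict reject
```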
Headers:

```
Authorization: Bearer your-api-key
Content-Type: application/json
```

Request body:
| Field | Type | Required | Notes |
|---|---|---|---|
| model | string | ✅ | DeepSeek native models + common aliases (gpt-4o, gpt-5-codex, o3, claude-sonnet-4-5, gemini-2.5-pro, etc.) |
| messages | array | ✅ | OpenAI-style messages |
| stream | boolean | ❌ | Default false |
| tools | array | ❌ | Function calling schema |
| temperature, etc. | any | ❌ | Accepted but final behavior depends on upstream |
{
"id": "<chat_session_id>",
"object": "chat.completion",
"created": 1738400000,
"model": "deepseek-reasoner",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "final response",
"reasoning_content": "reasoning trace (reasoner models)"
},
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 10,
"completion_tokens": 20,
"total_tokens": 30,
"completion_tokens_details": {
"reasoning_tokens": 5
}
}
}

SSE format: each frame is `data: <json>\n\n`, terminated by `data: [DONE]`.
data: {"id":"...","object":"chat.completion.chunk","choices":[{"delta":{"role":"assistant"},"index":0}]}
data: {"id":"...","object":"chat.completion.chunk","choices":[{"delta":{"reasoning_content":"..."},"index":0}]}
data: {"id":"...","object":"chat.completion.chunk","choices":[{"delta":{"content":"..."},"index":0}]}
data: {"id":"...","object":"chat.completion.chunk","choices":[{"delta":{},"index":0,"finish_reason":"stop"}],"usage":{...}}
data: [DONE]
Field notes:

- First delta includes `role: assistant`
- `deepseek-reasoner` / `deepseek-reasoner-search` models emit `delta.reasoning_content`
- Text emits `delta.content`
- Last chunk includes `finish_reason` and `usage`
- Token counting prefers pass-through from upstream DeepSeek SSE (`accumulated_token_usage` / `token_usage`), and only falls back to local estimation when upstream usage is absent
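The chunk layout above can be consumed with a few lines of client code. A sketch (function names are ours) that splits SSE frames and concatenates the two delta channels:

```python
import json

def parse_sse_stream(raw: str):
    """Yield chat.completion.chunk payloads from an OpenAI-style SSE body."""
    for frame in raw.split("\n\n"):
        frame = frame.strip()
        if not frame.startswith("data: "):
            continue
        payload = frame[len("data: "):]
        if payload == "[DONE]":               # stream terminator, not JSON
            return
        yield json.loads(payload)

def collect_text(raw: str) -> tuple[str, str]:
    """Concatenate reasoning_content and content deltas separately."""
    reasoning, text = [], []
    for chunk in parse_sse_stream(raw):
        delta = chunk["choices"][0].get("delta", {})
        reasoning.append(delta.get("reasoning_content") or "")
        text.append(delta.get("content") or "")
    return "".join(reasoning), "".join(text)
```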
When tools is present, DS2API performs anti-leak handling:
Non-stream: If detected, returns message.tool_calls, finish_reason=tool_calls, message.content=null.
{
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": null,
"tool_calls": [
{
"id": "call_xxx",
"type": "function",
"function": {
"name": "get_weather",
"arguments": "{\"city\":\"beijing\"}"
}
}
]
},
"finish_reason": "tool_calls"
}
]
}

Stream: once high-confidence tool-call features are matched, DS2API emits `delta.tool_calls` immediately (without waiting for full JSON closure), then keeps sending argument deltas; confirmed raw tool JSON is never forwarded as `delta.content`.
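Client-side, streamed tool-call fragments are typically merged by their `index` field, with `arguments` concatenated across chunks. A minimal sketch (our naming, not a DS2API API):

```python
def accumulate_tool_calls(chunks: list) -> list:
    """Merge delta.tool_calls fragments (index-keyed) into complete calls."""
    calls = {}
    for chunk in chunks:
        for tc in chunk["choices"][0].get("delta", {}).get("tool_calls", []):
            slot = calls.setdefault(tc["index"],
                                    {"id": "", "name": "", "arguments": ""})
            slot["id"] = tc.get("id") or slot["id"]        # id arrives once
            fn = tc.get("function", {})
            slot["name"] = fn.get("name") or slot["name"]  # name arrives once
            slot["arguments"] += fn.get("arguments", "")   # args stream in pieces
    return [calls[i] for i in sorted(calls)]
```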
No auth required. Alias values are accepted as path params (for example gpt-4o), and the returned object is the mapped DeepSeek model.
OpenAI Responses-style endpoint, accepting either input or messages.
| Field | Type | Required | Notes |
|---|---|---|---|
| model | string | ✅ | Supports native models + alias mapping |
| input | string/array/object | ❌ | One of input or messages is required |
| messages | array | ❌ | One of input or messages is required |
| instructions | string | ❌ | Prepended as a system message |
| stream | boolean | ❌ | Default false |
| tools | array | ❌ | Same tool detection/translation policy as chat |
| tool_choice | string/object | ❌ | Supports auto/none/required and forced function selection ({"type":"function","name":"..."}) |
Non-stream: Returns a standard response object with an ID like `resp_xxx`, and stores it in the in-memory TTL cache.
If tool_choice=required and no valid tool call is produced, DS2API returns HTTP 422 (error.code=tool_choice_violation).
Stream (SSE): minimal event sequence:
event: response.created
data: {"type":"response.created","id":"resp_xxx","status":"in_progress",...}
event: response.output_item.added
data: {"type":"response.output_item.added","response_id":"resp_xxx","item":{"type":"message|function_call",...},...}
event: response.content_part.added
data: {"type":"response.content_part.added","response_id":"resp_xxx","part":{"type":"output_text",...},...}
event: response.output_text.delta
data: {"type":"response.output_text.delta","response_id":"resp_xxx","item_id":"msg_xxx","output_index":0,"content_index":0,"delta":"..."}
event: response.function_call_arguments.delta
data: {"type":"response.function_call_arguments.delta","response_id":"resp_xxx","call_id":"call_xxx","delta":"..."}
event: response.function_call_arguments.done
data: {"type":"response.function_call_arguments.done","response_id":"resp_xxx","call_id":"call_xxx","name":"tool","arguments":"{...}"}
event: response.content_part.done
data: {"type":"response.content_part.done","response_id":"resp_xxx",...}
event: response.output_item.done
data: {"type":"response.output_item.done","response_id":"resp_xxx","item":{"type":"message|function_call",...},...}
event: response.completed
data: {"type":"response.completed","response":{...}}
data: [DONE]
If tool_choice=required is violated in stream mode, DS2API emits response.failed then [DONE] (no response.completed).
Current behavior: the parser tries to extract structured tool calls and does not enforce a hard allow-list reject; your tool executor should still validate against a whitelist before executing.
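The minimal event sequence above can be folded into a final result on the client. A sketch assuming the event/payload pairs listed (the helper name is ours):

```python
def replay_response_events(frames: list) -> tuple:
    """Fold (event, payload) pairs into final text and tool calls.

    `frames` follows the minimal Responses event sequence shown above.
    """
    text, calls = [], {}
    for event, payload in frames:
        if event == "response.output_text.delta":
            text.append(payload["delta"])
        elif event == "response.function_call_arguments.delta":
            slot = calls.setdefault(payload["call_id"], {"arguments": ""})
            slot["arguments"] += payload["delta"]
        elif event == "response.function_call_arguments.done":
            slot = calls.setdefault(payload["call_id"], {"arguments": ""})
            slot["name"] = payload["name"]
            slot["arguments"] = payload["arguments"]  # done carries the full string
    return "".join(text), calls
```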
Business auth required. Fetches cached responses created by POST /v1/responses (caller-scoped; only the same key/token can read).
Backed by an in-memory TTL store. Default TTL is 900s (configurable via `responses.store_ttl_seconds`).
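The store's behavior (caller scoping plus TTL expiry) can be modeled in a few lines. An illustrative sketch, not the actual Go implementation:

```python
import time

class TTLStore:
    """Minimal caller-scoped TTL store mirroring the behavior described above."""

    def __init__(self, ttl_seconds: float = 900.0):
        self.ttl = ttl_seconds
        self._items = {}  # (caller, response_id) -> (expires_at, value)

    def put(self, caller: str, response_id: str, value: dict) -> None:
        self._items[(caller, response_id)] = (time.monotonic() + self.ttl, value)

    def get(self, caller: str, response_id: str):
        entry = self._items.get((caller, response_id))
        if entry is None:
            return None                       # unknown id, or wrong caller
        expires_at, value = entry
        if time.monotonic() > expires_at:
            del self._items[(caller, response_id)]
            return None                       # expired
        return value
```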
Business auth required. Returns OpenAI-compatible embeddings shape.
| Field | Type | Required | Notes |
|---|---|---|---|
| model | string | ✅ | Supports native models + alias mapping |
| input | string/array | ✅ | Supports string, string array, token array |
Requires `embeddings.provider`. Current supported values: `mock` / `deterministic` / `builtin`. If missing/unsupported, returns the standard error shape with HTTP 501.
Besides /anthropic/v1/*, DS2API also supports shortcut paths: /v1/messages, /messages, /v1/messages/count_tokens, /messages/count_tokens.
Implementation-wise this path is unified on the OpenAI Chat Completions parse-and-translate pipeline to avoid maintaining divergent parsing chains.
No auth required.
Response:
{
"object": "list",
"data": [
{"id": "claude-sonnet-4-5", "object": "model", "created": 1715635200, "owned_by": "anthropic"},
{"id": "claude-haiku-4-5", "object": "model", "created": 1715635200, "owned_by": "anthropic"},
{"id": "claude-opus-4-6", "object": "model", "created": 1715635200, "owned_by": "anthropic"}
],
"first_id": "claude-opus-4-6",
"last_id": "claude-instant-1.0",
"has_more": false
}

Note: the example is partial; besides the current primary aliases, the real response also includes Claude 4.x snapshots plus historical 3.x / 2.x / 1.x IDs and common aliases.
Headers:
x-api-key: your-api-key
Content-Type: application/json
anthropic-version: 2023-06-01
`anthropic-version` is optional; DS2API auto-fills `2023-06-01` when absent.
Request body:
| Field | Type | Required | Notes |
|---|---|---|---|
| model | string | ✅ | For example claude-sonnet-4-5 / claude-opus-4-6 / claude-haiku-4-5 (compatible with claude-3-5-haiku-latest), plus historical Claude model IDs |
| messages | array | ✅ | Claude-style messages |
| max_tokens | number | ❌ | Auto-filled to 8192 when omitted; not strictly enforced by upstream bridge |
| stream | boolean | ❌ | Default false |
| system | string | ❌ | Optional system prompt |
| tools | array | ❌ | Claude tool schema |
{
"id": "msg_1738400000000000000",
"type": "message",
"role": "assistant",
"model": "claude-sonnet-4-5",
"content": [
{"type": "text", "text": "response"}
],
"stop_reason": "end_turn",
"stop_sequence": null,
"usage": {
"input_tokens": 12,
"output_tokens": 34
}
}

If tool use is detected, `stop_reason` becomes `tool_use` and `content` contains `tool_use` blocks.
SSE uses paired `event:` + `data:` lines; the event type is also repeated in the JSON `type` field.
event: message_start
data: {"type":"message_start","message":{...}}
event: content_block_start
data: {"type":"content_block_start","index":0,"content_block":{"type":"text","text":""}}
event: content_block_delta
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"hello"}}
event: ping
data: {"type":"ping"}
event: content_block_stop
data: {"type":"content_block_stop","index":0}
event: message_delta
data: {"type":"message_delta","delta":{"stop_reason":"end_turn","stop_sequence":null},"usage":{"output_tokens":12}}
event: message_stop
data: {"type":"message_stop"}
Notes:

- Models whose names contain `opus` / `reasoner` / `slow` stream `thinking_delta`
- `signature_delta` is not emitted (DeepSeek does not provide verifiable thinking signatures)
- In `tools` mode, the stream avoids leaking raw tool JSON and does not force `input_json_delta`
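A Claude-style stream is consumed by pairing the `event:` and `data:` lines inside each frame. A client-side sketch (function names are ours):

```python
import json

def parse_claude_sse(raw: str):
    """Yield (event, payload) pairs from a Claude-style SSE body."""
    for block in raw.split("\n\n"):
        event, payload = None, None
        for line in block.strip().splitlines():
            if line.startswith("event: "):
                event = line[len("event: "):]
            elif line.startswith("data: "):
                payload = json.loads(line[len("data: "):])
        if event is not None:
            yield event, payload

def collect_claude_text(raw: str) -> str:
    """Concatenate text_delta fragments from content_block_delta events."""
    out = []
    for event, payload in parse_claude_sse(raw):
        if (event == "content_block_delta"
                and payload.get("delta", {}).get("type") == "text_delta"):
            out.append(payload["delta"]["text"])
    return "".join(out)
```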
Request:
{
"model": "claude-sonnet-4-5",
"messages": [
{"role": "user", "content": "Hello"}
]
}

Response:
{
"input_tokens": 5
}

Supported paths:
- `/v1beta/models/{model}:generateContent`
- `/v1beta/models/{model}:streamGenerateContent`
- `/v1/models/{model}:generateContent` (compat path)
- `/v1/models/{model}:streamGenerateContent` (compat path)
Authentication is the same as other business routes (Authorization: Bearer <token> or x-api-key).
Implementation-wise this path is unified on the OpenAI Chat Completions parse-and-translate pipeline to avoid maintaining divergent parsing chains.
Request body accepts Gemini-style contents / tools. Model names can use aliases and are mapped to DeepSeek models.
Response uses Gemini-compatible fields, including:

- `candidates[].content.parts[].text`
- `candidates[].content.parts[].functionCall` (when a tool call is produced)
- `usageMetadata` (`promptTokenCount` / `candidatesTokenCount` / `totalTokenCount`)
Returns SSE (text/event-stream), each chunk as `data: <json>`:

- Regular text: incremental text chunks
- `tools` mode: buffered and emitted as `functionCall` at the finalize phase
- Final chunk: includes `finishReason: "STOP"` and `usageMetadata`
- Token counting prefers pass-through from upstream DeepSeek SSE (`accumulated_token_usage` / `token_usage`), and only falls back to local estimation when upstream usage is absent
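Extracting the fields listed above from a Gemini-shaped response is mechanical. A small sketch (helper names are ours):

```python
def gemini_text(response: dict) -> str:
    """Join text parts of the first candidate; functionCall parts carry no text."""
    parts = response["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts)

def gemini_function_calls(response: dict) -> list:
    """Collect functionCall parts (present when a tool call was produced)."""
    parts = response["candidates"][0]["content"]["parts"]
    return [part["functionCall"] for part in parts if "functionCall" in part]
```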
Public endpoint.
Request:
{
"admin_key": "admin",
"expire_hours": 24
}

`expire_hours` is optional, default 24.
Response:
{
"success": true,
"token": "<jwt>",
"expires_in": 86400
}

Requires JWT: `Authorization: Bearer <jwt>`
Response:
{
"valid": true,
"expires_at": 1738400000,
"remaining_seconds": 72000
}

Returns Vercel preconfiguration status.
{
"has_token": true,
"project_id": "prj_xxx",
"team_id": null
}

Returns sanitized config.
{
"keys": ["k1", "k2"],
"env_backed": false,
"env_source_present": true,
"env_writeback_enabled": true,
"config_path": "/data/config.json",
"accounts": [
{
"identifier": "user@example.com",
"email": "user@example.com",
"mobile": "",
"has_password": true,
"has_token": true,
"token_preview": "abcde..."
}
],
"claude_mapping": {
"fast": "deepseek-chat",
"slow": "deepseek-reasoner"
}
}

Only updates `keys`, `accounts`, and `claude_mapping`.
Request:
{
"keys": ["k1", "k2"],
"accounts": [
{"email": "user@example.com", "password": "pwd", "token": ""}
],
"claude_mapping": {
"fast": "deepseek-chat",
"slow": "deepseek-reasoner"
}
}

Reads runtime settings and status, including:

- `success`
- `admin` (`has_password_hash`, `jwt_expire_hours`, `jwt_valid_after_unix`, `default_password_warning`)
- `runtime` (`account_max_inflight`, `account_max_queue`, `global_max_inflight`, `token_refresh_interval_hours`)
- `compat` (`wide_input_strict_output`, `strip_reference_markers`)
- `responses` / `embeddings`
- `auto_delete` (`mode`: `none`/`single`/`all`; legacy `sessions=true` is still treated as `all`)
- `claude_mapping` / `model_aliases`
- `env_backed`, `needs_vercel_sync`
- `toolcall` policy is fixed to `feature_match + high` and is no longer returned or editable via settings
Hot-updates runtime settings. Supported fields:

- `admin.jwt_expire_hours`
- `runtime.account_max_inflight` / `runtime.account_max_queue` / `runtime.global_max_inflight` / `runtime.token_refresh_interval_hours`
- `compat.wide_input_strict_output` / `compat.strip_reference_markers`
- `responses.store_ttl_seconds`
- `embeddings.provider`
- `auto_delete.mode`
- `claude_mapping`
- `model_aliases`
- `toolcall` policy is fixed and is no longer writable through settings
Updates admin password and invalidates existing JWTs.
Request example:
`{"new_password":"your-new-password"}`

It also accepts `{"password":"your-new-password"}`.
Imports full config with:

- `mode=merge` (default)
- `mode=replace`

The request can send config directly, or wrapped as `{"config": {...}, "mode":"merge"}`.
Query params `?mode=merge` / `?mode=replace` are also supported.
Import accepts `keys`, `accounts`, `claude_mapping` / `claude_model_mapping`, `model_aliases`, `admin`, `runtime`, `responses`, `embeddings`, and `auto_delete`; legacy `toolcall` fields are ignored.

`compat` fields are managed via `/admin/settings` or the config file; this import endpoint does not update `compat`.
Exports full config in three forms: config, json, and base64.
Request: `{"key": "new-api-key"}`

Response: `{"success": true, "total_keys": 3}`
Response: `{"success": true, "total_keys": 2}`
Query params:
| Param | Default | Range |
|---|---|---|
| page | 1 | ≥ 1 |
| page_size | 10 | 1–100 |
| q | empty | Filter by identifier / email / mobile |
Response:
{
"items": [
{
"identifier": "user@example.com",
"email": "user@example.com",
"mobile": "",
"has_password": true,
"has_token": true,
"token_preview": "abc...",
"test_status": "ok"
}
],
"total": 25,
"page": 1,
"page_size": 10,
"total_pages": 3
}

Returned items also include `test_status`, usually `ok` or `failed`.
Request: `{"email": "user@example.com", "password": "pwd"}`

Response: `{"success": true, "total_accounts": 6}`
`identifier` can be email, mobile, or the synthetic id for token-only accounts (`token:<hash>`).

Response: `{"success": true, "total_accounts": 5}`
{
"available": 3,
"in_use": 1,
"total": 4,
"available_accounts": ["a@example.com"],
"in_use_accounts": ["b@example.com"],
"max_inflight_per_account": 2,
"global_max_inflight": 8,
"recommended_concurrency": 8,
"waiting": 0,
"max_queue_size": 8
}

| Field | Description |
|---|---|
| available | Accounts that still have spare inflight capacity |
| in_use | Number of occupied in-flight slots |
| total | Total accounts |
| available_accounts | List of account IDs with remaining inflight capacity |
| in_use_accounts | List of account IDs currently in use |
| max_inflight_per_account | Per-account inflight limit |
| global_max_inflight | Global inflight limit |
| recommended_concurrency | Suggested concurrency (total × max_inflight_per_account) |
| waiting | Number of queued requests currently waiting |
| max_queue_size | Waiting queue limit |
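The relationship between these fields can be made concrete. A sketch of how a client might size its concurrency and detect back-pressure; the `will_reject` rule is our reading of the 429 description, not a guaranteed server contract:

```python
def recommended_concurrency(total: int, max_inflight_per_account: int) -> int:
    """Matches the documented formula: total × max_inflight_per_account."""
    return total * max_inflight_per_account

def will_reject(status: dict) -> bool:
    """Heuristic: a new request is likely rejected (429) once no account has a
    free inflight slot and the waiting queue is already full."""
    return status["available"] == 0 and status["waiting"] >= status["max_queue_size"]
```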
| Field | Required | Notes |
|---|---|---|
| identifier | ✅ | email / mobile / token-only synthetic id |
| model | ❌ | default deepseek-chat |
| message | ❌ | if empty, only session creation is tested |
Response:
{
"account": "user@example.com",
"success": true,
"response_time": 1240,
"message": "API test successful (session creation only)",
"model": "deepseek-chat",
"session_count": 0,
"config_writable": true
}

If a message is provided, `thinking` may also be included when the upstream response carries reasoning text.
Optional request field: model.
{
"total": 5,
"success": 4,
"failed": 1,
"results": [...]
}

The internal concurrency limit is currently fixed at 5.
Deletes all DeepSeek sessions for a specific account. Request example:

`{"identifier":"user@example.com"}`

Response:

`{"success": true, "message": "删除成功"}`

If the account is missing or deletion fails, `success` becomes false and `message` contains the error.
The current handler returns the Chinese literal 删除成功 ("deletion succeeded") on success.
Batch import keys and accounts.
Request:
{
"keys": ["k1", "k2"],
"accounts": [
{"email": "user@example.com", "password": "pwd", "token": ""}
]
}

Response:
{
"success": true,
"imported_keys": 2,
"imported_accounts": 1
}

Test API availability through the service itself.
| Field | Required | Default |
|---|---|---|
| model | ❌ | deepseek-chat |
| message | ❌ | 你好 ("hello") |
| api_key | ❌ | First key in config |
Response:
{
"success": true,
"status_code": 200,
"response": {"id": "..."}
}

Internally issues one /v1/chat/completions request through the service, then persists the request metadata and raw upstream SSE into `tests/raw_stream_samples/<sample-id>/`.
Common request fields:
| Field | Required | Default | Notes |
|---|---|---|---|
| message | No | 你好 ("hello") | Convenience single-turn user message |
| messages | No | Auto-derived from message | OpenAI-style message array |
| model | No | deepseek-chat | Target model |
| stream | No | true | Recommended to keep streaming enabled so raw SSE is recorded |
| api_key | No | First configured key | Business API key to use |
| sample_id | No | Auto-generated | Sample directory name |
On success, the response headers include:
- `X-Ds2-Sample-Id`
- `X-Ds2-Sample-Dir`
- `X-Ds2-Sample-Meta`
- `X-Ds2-Sample-Upstream`
If the request itself succeeds but the process did not record a new upstream capture, the endpoint returns:
`{"detail":"no upstream capture was recorded"}`

Searches the current process's in-memory capture entries and groups completion + continue rounds by `chat_session_id`.
Query parameters:
| Param | Default | Notes |
|---|---|---|
| q | empty | Fuzzy match against request/response text |
| limit | 20 | Max number of chains returned |
Response fields include:
- `items[].chain_key`
- `items[].capture_ids`
- `items[].round_count`
- `items[].initial_label`
- `items[].request_preview`
- `items[].response_preview`
Persists one selected in-memory capture chain into tests/raw_stream_samples/<sample-id>/.
Any one of these selectors is accepted:
- `{"chain_key":"session:xxxx","sample_id":"tmp-from-memory"}`
- `{"capture_id":"cap_xxx","sample_id":"tmp-from-memory"}`
- `{"query":"Guangzhou weather","sample_id":"tmp-from-memory"}`

The success payload includes `sample_id`, `dir`, `meta_path`, and `upstream_path`.
| Field | Required | Notes |
|---|---|---|
| vercel_token | ❌ | If empty or `__USE_PRECONFIG__`, read env |
| project_id | ❌ | Fallback: VERCEL_PROJECT_ID |
| team_id | ❌ | Fallback: VERCEL_TEAM_ID |
| auto_validate | ❌ | Default true |
| save_credentials | ❌ | Default true |
Success response:
{
"success": true,
"validated_accounts": 3,
"message": "Config synced, redeploying...",
"deployment_url": "https://..."
}

Or manual deploy required:
{
"success": true,
"validated_accounts": 3,
"message": "Config synced to Vercel, please trigger redeploy manually",
"manual_deploy_required": true
}

Failed account checks are returned in `failed_accounts`, and any saved Vercel credentials are returned in `saved_credentials`.
{
"synced": true,
"last_sync_time": 1738400000,
"has_synced_before": true,
"env_backed": false,
"config_hash": "....",
"last_synced_hash": "....",
"draft_hash": "....",
"draft_differs": false
}

POST /admin/vercel/status can also accept `config_override` to compare a draft config against the current synced config.
{
"json": "{...}",
"base64": "ey4uLn0="
}

This is the same payload as GET /admin/config/export, just with a shorter path.
Checks the current build version and the latest GitHub Release:
{
"success": true,
"current_version": "3.0.0",
"current_tag": "v3.0.0",
"source": "file:VERSION",
"checked_at": "2026-03-29T00:00:00Z",
"latest_tag": "v3.0.0",
"latest_version": "3.0.0",
"release_url": "https://github.com/CJackHwang/ds2api/releases/tag/v3.0.0",
"published_at": "2026-03-28T12:00:00Z",
"has_update": false
}

If GitHub API access fails, the response includes `check_error` while still returning HTTP 200.
Reads local packet-capture status and recent entries (Admin auth required):
- `enabled`
- `limit`
- `max_body_bytes`
- `items`
Clears packet-capture entries:
`{"success":true,"detail":"capture logs cleared"}`

Compatible routes (/v1/*, /anthropic/*) use the same error envelope:
{
"error": {
"message": "...",
"type": "invalid_request_error",
"code": "invalid_request",
"param": null
}
}

Admin routes keep `{"detail":"..."}`.
Gemini routes use Google-style errors:
{
"error": {
"code": 400,
"message": "invalid json",
"status": "INVALID_ARGUMENT"
}
}

Clients should handle the HTTP status code plus the `error` / `detail` fields.
Common status codes:
| Code | Meaning |
|---|---|
| 401 | Authentication failed (invalid key/token, or expired admin JWT) |
| 429 | Too many requests (exceeded inflight + queue capacity) |
| 503 | Model unavailable or upstream error |
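Since the three route families use different envelopes, a client can normalize them in one place. An illustrative sketch:

```python
from typing import Optional

def extract_error(status_code: int, body: dict) -> Optional[str]:
    """Pull a message from OpenAI/Claude-style, Gemini-style, or admin envelopes."""
    if status_code < 400:
        return None
    err = body.get("error")
    if isinstance(err, dict):            # {"error": {"message": ...}} (both styles)
        return err.get("message")
    if "detail" in body:                 # admin routes: {"detail": "..."}
        return body["detail"]
    return f"HTTP {status_code}"         # fall back to the status code alone
```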
Non-stream chat:

```bash
curl http://localhost:5001/v1/chat/completions \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-chat",
    "messages": [{"role": "user", "content": "Hello"}],
    "stream": false
  }'
```

Streaming reasoner chat:

```bash
curl http://localhost:5001/v1/chat/completions \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-reasoner",
    "messages": [{"role": "user", "content": "Explain quantum entanglement"}],
    "stream": true
  }'
```

Responses API (streaming):

```bash
curl http://localhost:5001/v1/responses \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5-codex",
    "input": "Write a hello world in golang",
    "stream": true
  }'
```

Embeddings:

```bash
curl http://localhost:5001/v1/embeddings \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "input": ["first text", "second text"]
  }'
```

Search model (streaming):

```bash
curl http://localhost:5001/v1/chat/completions \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-chat-search",
    "messages": [{"role": "user", "content": "Latest news today"}],
    "stream": true
  }'
```

Function calling:

```bash
curl http://localhost:5001/v1/chat/completions \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-chat",
    "messages": [{"role": "user", "content": "What is the weather in Beijing?"}],
    "tools": [
      {
        "type": "function",
        "function": {
          "name": "get_weather",
          "description": "Get weather for a city",
          "parameters": {
            "type": "object",
            "properties": {
              "city": {"type": "string", "description": "City name"}
            },
            "required": ["city"]
          }
        }
      }
    ]
  }'
```

Gemini non-stream:

```bash
curl "http://localhost:5001/v1beta/models/gemini-2.5-pro:generateContent" \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [{"text": "Introduce Go in three sentences"}]
      }
    ]
  }'
```

Gemini stream:

```bash
curl "http://localhost:5001/v1beta/models/gemini-2.5-flash:streamGenerateContent" \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [{"text": "Write a short summary"}]
      }
    ]
  }'
```

Claude messages:

```bash
curl http://localhost:5001/anthropic/v1/messages \
  -H "x-api-key: your-api-key" \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-sonnet-4-5",
    "max_tokens": 1024,
    "messages": [{"role": "user", "content": "Hello"}]
  }'
```

Claude messages (streaming):

```bash
curl http://localhost:5001/anthropic/v1/messages \
  -H "x-api-key: your-api-key" \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [{"role": "user", "content": "Explain relativity"}],
    "stream": true
  }'
```

Admin login:

```bash
curl http://localhost:5001/admin/login \
  -H "Content-Type: application/json" \
  -d '{"admin_key": "admin"}'
```

Pin a managed account:

```bash
curl http://localhost:5001/v1/chat/completions \
  -H "Authorization: Bearer your-api-key" \
  -H "X-Ds2-Target-Account: user@example.com" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-chat",
    "messages": [{"role": "user", "content": "Hello"}]
  }'
```