curl --request POST \
--url https://xcompute.us/v1beta/models/gemini-2.5-pro:generateContent \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{
"contents": [
{
"role": "user",
"parts": [
{
"text": "你好,介绍一下自己"
}
]
}
]
}'
{
"code": 200,
"data": {
"candidates": [
{
"content": {
"role": "model",
"parts": [
{
"text": "你好!很高兴能向你介绍我自己。\n\n我是一个大型语言模型,由 Google 训练和开发..."
}
]
},
"finishReason": "STOP",
"index": 0,
"safetyRatings": [
{
"category": "HARM_CATEGORY_HATE_SPEECH",
"probability": "NEGLIGIBLE"
}
]
}
],
"promptFeedback": {
"safetyRatings": [
{
"category": "HARM_CATEGORY_HATE_SPEECH",
"probability": "NEGLIGIBLE"
}
]
]
},
"usageMetadata": {
"promptTokenCount": 4,
"candidatesTokenCount": 611,
"totalTokenCount": 2422,
"thoughtsTokenCount": 1807,
"promptTokensDetails": [
{
"modality": "TEXT",
"tokenCount": 4
}
]
}
}
文本接口
Gemini 原生格式
POST
https://xcompute.us
/
v1beta
/
models
/
gemini-2.5-pro:generateContent
curl --request POST \
--url https://xcompute.us/v1beta/models/gemini-2.5-pro:generateContent \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{
"contents": [
{
"role": "user",
"parts": [
{
"text": "你好,介绍一下自己"
}
]
}
]
}'
{
"code": 200,
"data": {
"candidates": [
{
"content": {
"role": "model",
"parts": [
{
"text": "你好!很高兴能向你介绍我自己。\n\n我是一个大型语言模型,由 Google 训练和开发..."
}
]
},
"finishReason": "STOP",
"index": 0,
"safetyRatings": [
{
"category": "HARM_CATEGORY_HATE_SPEECH",
"probability": "NEGLIGIBLE"
}
]
}
],
"promptFeedback": {
"safetyRatings": [
{
"category": "HARM_CATEGORY_HATE_SPEECH",
"probability": "NEGLIGIBLE"
}
]
]
},
"usageMetadata": {
"promptTokenCount": 4,
"candidatesTokenCount": 611,
"totalTokenCount": 2422,
"thoughtsTokenCount": 1807,
"promptTokensDetails": [
{
"modality": "TEXT",
"tokenCount": 4
}
]
}
}
- 使用 Google 原生 API 格式调用 Gemini 模型
- 同步处理模式,实时返回对话内容
- 最简化参数,快速上手
curl --request POST \
--url https://xcompute.us/v1beta/models/gemini-2.5-pro:generateContent \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{
"contents": [
{
"role": "user",
"parts": [
{
"text": "你好,介绍一下自己"
}
]
}
]
}'
{
"code": 200,
"data": {
"candidates": [
{
"content": {
"role": "model",
"parts": [
{
"text": "你好!很高兴能向你介绍我自己。\n\n我是一个大型语言模型,由 Google 训练和开发..."
}
]
},
"finishReason": "STOP",
"index": 0,
"safetyRatings": [
{
"category": "HARM_CATEGORY_HATE_SPEECH",
"probability": "NEGLIGIBLE"
}
]
}
],
"promptFeedback": {
"safetyRatings": [
{
"category": "HARM_CATEGORY_HATE_SPEECH",
"probability": "NEGLIGIBLE"
}
]
]
},
"usageMetadata": {
"promptTokenCount": 4,
"candidatesTokenCount": 611,
"totalTokenCount": 2422,
"thoughtsTokenCount": 1807,
"promptTokensDetails": [
{
"modality": "TEXT",
"tokenCount": 4
}
]
}
}
Authorizations
所有接口均需要使用Bearer Token进行认证获取 API Key:访问 API Key 管理页面 获取您的 API Key使用时在请求头中添加:
Authorization: Bearer YOUR_API_KEY
Path Parameters
模型名称示例中使用
gemini-2.5-pro,您可以将其替换为其他支持的 Gemini 模型:gemini-2.5-flash- Gemini 2.5 快速版gemini-2.5-pro- Gemini 2.5 专业版gemini-2.5-flash-lite- Gemini 2.5 超轻量版gemini-2.5-pro-thinking- Gemini 2.5 Pro 深度思考版
生成方法(快速开始推荐使用
generateContent):generateContent: 等待完整响应后一次性返回streamGenerateContent: 流式返回,逐块实时返回内容
generateContent, streamGenerateContentBody
对话内容列表最少需要1条消息
示例:
Show contents 对象结构
Show contents 对象结构
[
{
"role": "user",
"parts": [{ "text": "你好,介绍一下自己" }]
}
]
安全设置(可选)
Show safetySettings 对象结构
Show safetySettings 对象结构
Response
候选响应列表
Show candidates 对象结构
Show candidates 对象结构
完成原因:
STOP: 正常结束MAX_TOKENS: 达到最大 token 限制SAFETY: 因安全原因停止RECITATION: 因重复内容停止OTHER: 其他原因
候选响应的索引
使用量统计
Show usageMetadata 属性
Show usageMetadata 属性
⌘I