Comments (3)
OpenAI API: https://platform.openai.com/docs/api-reference/introduction
- Organizations and projects (optional headers):
curl https://api.openai.com/v1/models \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "OpenAI-Organization: $ORG_ID" \
  -H "OpenAI-Project: $PROJECT_ID"
- List models: GET https://api.openai.com/v1/models
curl https://api.openai.com/v1/models -H "Authorization: Bearer $OPENAI_API_KEY"
Response JSON:
{
  "data": [
    {
      "id": "model-id-0",
      "object": "model",
      "owned_by": "organization-owner",
      "permission": [...]
    },
    {
      "id": "model-id-1",
      "object": "model",
      "owned_by": "organization-owner",
      "permission": [...]
    },
    {
      "id": "model-id-2",
      "object": "model",
      "owned_by": "openai",
      "permission": [...]
    }
  ],
  "object": "list"
}
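A minimal sketch of pulling just the model IDs out of the list response, assuming the shape shown above (sample payload inlined, Python used for illustration):

```python
import json

# Sample /v1/models response body (shape as shown above, "permission" omitted).
response_body = """
{
  "data": [
    {"id": "model-id-0", "object": "model", "owned_by": "organization-owner"},
    {"id": "model-id-1", "object": "model", "owned_by": "organization-owner"},
    {"id": "model-id-2", "object": "model", "owned_by": "openai"}
  ],
  "object": "list"
}
"""

models = json.loads(response_body)
# Each entry in "data" is one model object; "id" is what you pass as "model" later.
model_ids = [m["id"] for m in models["data"]]
print(model_ids)  # ['model-id-0', 'model-id-1', 'model-id-2']
```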
- Chat completions: POST https://api.openai.com/v1/chat/completions
curl https://api.openai.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer $OPENAI_API_KEY" \
-d '{
"model": "gpt-3.5-turbo",
"messages": [{"role": "user", "content": "Say this is a test!"}],
"temperature": 0.7
}'
Response JSON:
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1677858242,
  "model": "gpt-3.5-turbo-0301",
  "usage": {
    "prompt_tokens": 13,
    "completion_tokens": 7,
    "total_tokens": 20
  },
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "\n\nThis is a test!"
      },
      "finish_reason": "stop",
      "index": 0
    }
  ]
}
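Given that shape, the reply text and token usage can be read out of the parsed response like this (a sketch with the sample response inlined as a dict):

```python
# Sample chat completion response (fields as shown above).
completion = {
    "id": "chatcmpl-abc123",
    "object": "chat.completion",
    "model": "gpt-3.5-turbo-0301",
    "usage": {"prompt_tokens": 13, "completion_tokens": 7, "total_tokens": 20},
    "choices": [
        {
            "message": {"role": "assistant", "content": "\n\nThis is a test!"},
            "finish_reason": "stop",
            "index": 0,
        }
    ],
}

# The reply text lives in choices[0].message.content.
reply = completion["choices"][0]["message"]["content"].strip()
total_tokens = completion["usage"]["total_tokens"]
print(reply)         # This is a test!
print(total_tokens)  # 20
```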
- Streaming: POST https://api.openai.com/v1/chat/completions
curl https://api.openai.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer $OPENAI_API_KEY" \
-d '{
"model": "gpt-3.5-turbo",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Hello!"
}
],
"stream": true
}'
Response: a stream of `chat.completion.chunk` objects, e.g.:
{"id":"chatcmpl-123","object":"chat.completion.chunk","created":1694268190,"model":"gpt-3.5-turbo-0125", "system_fingerprint": "fp_44709d6fcb", "choices":[{"index":0,"delta":{"role":"assistant","content":""},"logprobs":null,"finish_reason":null}]}
{"id":"chatcmpl-123","object":"chat.completion.chunk","created":1694268190,"model":"gpt-3.5-turbo-0125", "system_fingerprint": "fp_44709d6fcb", "choices":[{"index":0,"delta":{"content":"Hello"},"logprobs":null,"finish_reason":null}]}
....
{"id":"chatcmpl-123","object":"chat.completion.chunk","created":1694268190,"model":"gpt-3.5-turbo-0125", "system_fingerprint": "fp_44709d6fcb", "choices":[{"index":0,"delta":{},"logprobs":null,"finish_reason":"stop"}]}
from llm-inference.
Should work once the OpenSDK endpoint is set.
from llm-inference.
done.
from llm-inference.