heyCASEy.io

We support streaming for the chat completions API.  Limitations: We only support text prompts and completions.  We haven't tested every model, but are confident that all models work. We have NOT tests on the Assistants API.

We support streaming for chat completions API. Limitations: We only support text prompts and completions. We estimate token usage (the Microsoft API doesn't yet send token usage in streaming responses)

We support the Anthropic streaming API. We only support text prompts using the messages api. We have tested the endpoint <a href="https://api.anthropic.com/v1/messages" target="_blank" rel="nofollow noopener noreferrer">https://api.anthropic.com/v1/messages</a>. Anthropic support token usage in their streaming api. 

We support the Gemini streaming api. We have tested on the  <a href="https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-flash-latest:streamGenerateContent?alt=sse&amp;key={{API-KEY}}" target="_blank" rel="nofollow noopener noreferrer">https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-flash-latest:streamGenerateContent?alt=sse&amp;key={{API-KEY}}</a>. Gemeni supports token usage in their API. 

We will expand the tested scenarios as we achieve them here.

Coming soon: Amazon Bedrock. What others do you need? Send us a message and we'll add support.

The current status of different LLM Providers and limitations

What LLM Providers Do You Support

Find answers and get help from Intercom Support and Community Experts

Empty Help Center

Uh oh. That page doesn’t exist.

Disappointed

Neutral

Smiley

Title

Track the progress of all tickets related to your company.

Tickets portal.

{assigneeName} needs more information from you

OpenAI

Azure OpenAI

Anthropic Claude

Google Gemini