llm

Large Language Models

Streaming Chat

Streaming Chat is a large language models capability available through Anthropic, OpenAI, Groq and 2 more on Aweb. Real-time streaming text generation. Access it through a single unified API with automatic failover and intelligent routing.

Try Streaming ChatAPI docs

Best for

Highest quality

Anthropic, OpenAI

Premium tier

Most affordable

Groq, Google Gemini

Economy tier

Contract

Max Latency500ms
Streaming RequiredYes

Providers (5)

ProviderScoreQualityPricing
AnthropicDEFAULT
95premiumpremium
OpenAI
85premiumpremium
Groq
95premiumeconomy
Mistral AI
88premiumstandard
Google Gemini
88premiumeconomy

Quick start

Call Streaming Chat through Alfred — automatic provider selection, failover, and load balancing included.

cURL

curl -X POST https://api.alfred-ai.app/v1/execute \
  -H "Authorization: Bearer $ALFRED_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "capability": "llm.stream",
    "input": { "prompt": "Hello world" }
  }'

TypeScript

import { Alfred } from '@alfred/core';

const alfred = new Alfred({ apiKey: process.env.ALFRED_API_KEY });

// Alfred automatically selects the best provider
const result = await alfred.execute({
  capability: 'llm.stream',
  input: { prompt: 'Hello world' },
});

console.log(result.output);

Orchestration pipeline

import { Alfred } from '@alfred/core';

const alfred = new Alfred({ apiKey: process.env.ALFRED_API_KEY });

// Multi-step pipeline with automatic failover
const result = await alfred.orchestrate({
  steps: [
    { id: 'step1', capability: 'llm.stream', input: { prompt: 'Hello world' } },
    { id: 'step2', capability: 'llm.chat', dependsOn: ['step1'],
      input: { prompt: 'Summarize: $step1.output' } },
  ],
});

Related Large Language Models capabilities

Chat Completion

llm

Vision Analysis

llm

Structured Output

llm

Fast LLM Inference

llm

Code Completion

llm

Getting started →API reference →All providers →All capabilities →