Streaming
OpenAI SSE Format
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://gateway.vibecc.dev/v1",
apiKey: process.env.OPENAI_API_KEY,
});
const stream = await client.chat.completions.create({
model: "gpt-4o",
messages: [{ role: "user", content: "Hello" }],
stream: true,
});
for await (const chunk of stream) {
process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
}
Raw SSE Format
data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","choices":[{"delta":{"content":"Hello"}}]}
data: [DONE]
Anthropic Streaming
import Anthropic from "@anthropic-ai/sdk";
const client = new Anthropic({
baseURL: "https://gateway.vibecc.dev",
apiKey: process.env.ANTHROPIC_API_KEY,
});
const stream = await client.messages.stream({
model: "gpt-4o",
max_tokens: 1024,
messages: [{ role: "user", content: "Hello" }],
});
for await (const chunk of stream) {
if (chunk.type === "content_block_delta" && chunk.delta.type === "text_delta") {
process.stdout.write(chunk.delta.text);
}
}
Connection Behavior
- Heartbeat keep-alive sent every 15s on idle streams
- Client disconnect immediately cancels upstream provider request
- Max stream duration: 60 seconds (configurable per service class)