AI Glossary
Streaming
Displaying AI output token-by-token as it is generated
Definition
Streaming is the technique of sending AI model output to the client incrementally as tokens are generated, rather than waiting for the full response. This dramatically improves perceived responsiveness — users see text appearing word-by-word rather than waiting for a long pause followed by a complete response. Almost all modern AI chat interfaces use streaming.