Home/Glossary/Streaming

AI Glossary

Streaming

Displaying AI output token-by-token as it is generated

Definition

Streaming is the technique of sending AI model output to the client incrementally as tokens are generated, rather than waiting for the full response. This dramatically improves perceived responsiveness — users see text appearing word-by-word rather than waiting for a long pause followed by a complete response. Almost all modern AI chat interfaces use streaming.

Related Terms

Back to Glossary