
How fast is 10 tokens per second really?

simonwillison.net · 2026-05-20 · Excerpt

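Mike Veerman built an interactive HTML app that simulates what text generation feels like at different LLM output speeds (5–800 tokens/second). The tool gives users an intuitive sense of how an advertised figure such as "30 tokens/second" translates into latency in a real conversation, which is particularly useful for judging how fluid a real-time chatbot will feel.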
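The arithmetic behind such a simulation can be sketched in a few lines of Python. This is not Mike Veerman's code (his tool is an HTML app); it is a minimal illustration that assumes a constant output rate and ignores time-to-first-token. The function names, the injectable sleep and emit parameters, and the 300-token reply length are all hypothetical choices made for the example.

```python
import time


def seconds_to_generate(num_tokens, tokens_per_second):
    """Time to stream num_tokens at a constant rate (assumption: no startup latency)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


def stream_tokens(tokens, tokens_per_second, sleep=time.sleep, emit=None):
    """Emit tokens one at a time, pausing 1/rate seconds after each.

    sleep and emit are injectable (hypothetical parameters) so the pacing
    can be checked without actually waiting.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if emit is None:
        # Default: print tokens on one line, flushing so the pacing is visible.
        emit = lambda tok: print(tok, end=" ", flush=True)
    delay = 1.0 / tokens_per_second
    for tok in tokens:
        emit(tok)
        sleep(delay)


if __name__ == "__main__":
    # Compare how long a hypothetical 300-token reply takes at several rates.
    for rate in (5, 10, 30, 800):
        total = seconds_to_generate(300, rate)
        print(f"{rate:>4} tokens/s: {total:.2f} s for 300 tokens")
    # Watch a short sentence arrive at 10 tokens per second
    # (words stand in for tokens here, which is only an approximation).
    stream_tokens("this is roughly what ten tokens per second feels like".split(), 10)
    print()
```

Under these assumptions a 300-token reply takes 30 seconds at 10 tokens per second and 10 seconds at 30 tokens per second, which is the kind of gap the simulator lets you feel rather than calculate.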

Simon Willison

This is a link post by Simon Willison, posted on 20th May 2026.

