fix: avoid quadratic memory growth in streamText text() without breaking chunk streaming #14119
Closed
okxint wants to merge 1 commit into vercel:main from
Conversation
…le preserving chunk streaming

For plain text output, every text-delta always changes the partial result, so we can skip the JSON.stringify comparison and publish immediately. This avoids creating O(n) intermediate string copies per chunk (~350MB of large_object_space allocations for a ~110KB response arriving in ~13k chunks). Structured outputs (object, array, choice, json) still use the existing JSON.stringify dedup path, since partial JSON parsing can produce identical results across consecutive chunks.
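The quadratic cost the commit message describes can be illustrated with a small standalone simulation (the chunk counts and sizes below are illustrative, not taken from the SDK; the loop mimics the old behavior of serializing the full accumulated text on every delta):

```typescript
// Sketch: stringifying the accumulated text on every chunk is O(n^2) in
// total bytes copied. Numbers here are small for illustration; the PR
// reports ~13k chunks for a ~110KB response.
const chunkSize = 8;      // bytes appended per text-delta (illustrative)
const numChunks = 1_000;  // number of deltas (illustrative)

let accumulated = "";
let bytesCopied = 0;
for (let i = 0; i < numChunks; i++) {
  accumulated += "x".repeat(chunkSize);
  // Old path: serialize the entire partial result on each chunk.
  const serialized = JSON.stringify(accumulated);
  bytesCopied += serialized.length; // each call copies the full string so far
}
// bytesCopied grows as ~chunkSize * numChunks^2 / 2, i.e. quadratically,
// while the useful output is only chunkSize * numChunks bytes.
```

With these numbers, roughly 4MB of intermediate strings are produced to stream only 8KB of text, which is the same shape of blow-up as the ~350MB reported for a ~110KB response.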
Collaborator
this issue has a PR in place #14123 - will be closing this as well. Would love to know how you verified this is the right fix? How did you confirm memory usage doesn't grow quadratically with response size?
Summary
Follows up on #13878, which was closed because returning `undefined` from `parsePartialOutput()` broke incremental chunk publishing. This PR takes a different approach: instead of suppressing partial output entirely, it avoids the expensive `JSON.stringify(result.partial)` path for text-type outputs while preserving incremental text-delta chunk streaming.

The original issue: for a ~110KB response arriving in ~13,000 chunks, `JSON.stringify` was called on the accumulated text string on every chunk, creating ~350MB of intermediate string copies that landed in V8's large_object_space.

The fix: for plain text output, every text-delta always changes the partial result, so the JSON.stringify-based dedup comparison is unnecessary. We short-circuit it and publish immediately. Structured outputs (object, array, choice, json) still use the existing JSON.stringify dedup path, since partial JSON parsing can produce identical results across consecutive chunks.

How did you test this?
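The short-circuit described in the summary might look like the following sketch. The names (`OutputType`, `shouldPublish`, `lastSerialized`) are hypothetical and do not reflect the actual AI SDK internals; the sketch only demonstrates the publish/dedup decision:

```typescript
// Hypothetical output kinds, mirroring the types named in this PR.
type OutputType = "text" | "object" | "array" | "choice" | "json";

// Decide whether a new partial result should be published downstream.
function shouldPublish(
  outputType: OutputType,
  partial: unknown,
  lastSerialized: { value: string | undefined },
): boolean {
  if (outputType === "text") {
    // Every text-delta changes the accumulated string, so the
    // JSON.stringify dedup comparison is redundant: publish immediately.
    return true;
  }
  // Structured outputs keep the dedup, since partial JSON parsing can
  // yield identical results across consecutive chunks.
  const serialized = JSON.stringify(partial);
  if (serialized === lastSerialized.value) return false;
  lastSerialized.value = serialized;
  return true;
}
```

The key design point is that the dedup state is only ever read or written on the structured-output path, so the text path does no serialization work at all.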