ptx: use char counts for before-chunk sizing in get_output_chunks by sylvestre · Pull Request #12685 · uutils/coreutils

sylvestre · 2026-06-07T10:03:08Z

The assert on max_before_size compared it against before.len(), a byte length, although max_before_size is measured in chars, so multibyte input such as 'éé word' caused a panic. The tail-chunk budget (max_tail_size) had the same byte/char mismatch: it shrank the tail too much and dropped a word that would have fit. Use char counts in both places, matching the after chunk.
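To illustrate the mismatch described above, here is a minimal sketch. It is not the ptx code itself: the helper names char_len and fits_budget are hypothetical, and the real logic in get_output_chunks may be structured differently. It only shows why a byte length can exceed a budget expressed in chars when the input contains multibyte characters.

```rust
/// Hypothetical helper: width of `s` in chars (Unicode scalar values),
/// as opposed to `s.len()`, which is the length in bytes.
fn char_len(s: &str) -> usize {
    s.chars().count()
}

/// Hypothetical helper: does `chunk` fit in a budget measured in chars?
/// Comparing `chunk.len()` here instead would wrongly reject multibyte text.
fn fits_budget(chunk: &str, max_chars: usize) -> bool {
    char_len(chunk) <= max_chars
}

/// Hypothetical helper: keep at most `max_chars` chars from the end of `s`,
/// slicing on a char boundary so the result is always valid UTF-8.
fn tail_within(s: &str, max_chars: usize) -> &str {
    let total = char_len(s);
    if total <= max_chars {
        return s;
    }
    // Byte offset of the first char we want to keep.
    let start = s
        .char_indices()
        .nth(total - max_chars)
        .map_or(s.len(), |(byte_idx, _)| byte_idx);
    &s[start..]
}
```

With the input from the description, "éé word" is 7 chars but 9 bytes in UTF-8, so a check written against the byte length fails for a budget of 7 even though the text fits.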

Fixes #10893

cakebaker · 2026-06-07T14:35:16Z

Hm, the linked issue is already fixed. Is this PR still necessary?

sylvestre · 2026-06-07T18:03:52Z

It is a follow-up.


sylvestre force-pushed the ptx-fix-panic-multibyte-before branch from ced2a38 to 17ead12 Compare June 7, 2026 19:54

sylvestre wants to merge 1 commit into uutils:main from sylvestre:ptx-fix-panic-multibyte-before