
Add cache_write_tokens mapping to UsageExtractor #330

Open
ali-corpo wants to merge 1 commit into pydantic:main from ali-corpo:patch-1

Conversation

@ali-corpo

Add cache_write_tokens mapping to UsageExtractor

Copilot AI review requested due to automatic review settings March 30, 2026 13:39

Copilot AI left a comment


Pull request overview

Adds extraction support for cache-write token counts from OpenAI-compatible usage.prompt_tokens_details payloads.

Changes:

  • Added a UsageExtractorMapping from prompt_tokens_details.cache_write_tokens to cache_write_tokens across multiple chat-oriented extractors.
  • Enables downstream pricing/usage calculations to see cache-write token counts when present in provider responses.


Comment on lines 5054 to +5058
```python
    path=['prompt_tokens_details', 'cached_tokens'], dest='cache_read_tokens', required=False
),
UsageExtractorMapping(
    path=['prompt_tokens_details', 'cache_write_tokens'], dest='cache_write_tokens', required=False
),
```

Copilot AI Mar 30, 2026


cache_write_tokens is treated as a separately-priced bucket in ModelPrice.calc_price() (it is subtracted from uncached input and priced via cache_write_mtok). OpenAI model entries in this data file do not define cache_write_mtok, so if responses start including usage.prompt_tokens_details.cache_write_tokens with a non-zero value, price calculation will undercharge. Either avoid extracting this field for providers/models without a cache_write_mtok, or ensure OpenAI model pricing defines an appropriate cache_write_mtok (often equal to input_mtok if cache writes are billed at the normal input rate).

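The undercharging concern can be sketched as follows. This is a hypothetical simplification of `ModelPrice.calc_price()`, not the library's actual implementation; field names beyond `cache_write_tokens`, `cache_write_mtok`, and `cache_read_mtok` are assumptions based on the review comment:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ModelPrice:
    """Hypothetical sketch of per-million-token (mtok) pricing."""
    input_mtok: float
    cache_read_mtok: Optional[float] = None
    cache_write_mtok: Optional[float] = None  # undefined for OpenAI entries in the data file

    def calc_price(self, input_tokens: int, cache_read_tokens: int = 0,
                   cache_write_tokens: int = 0) -> float:
        # Cache-read and cache-write tokens are subtracted from normally-priced input.
        uncached = input_tokens - cache_read_tokens - cache_write_tokens
        price = uncached * self.input_mtok / 1_000_000
        if self.cache_read_mtok is not None:
            price += cache_read_tokens * self.cache_read_mtok / 1_000_000
        # If cache_write_mtok is undefined, cache-write tokens are effectively
        # priced at zero, even though they were removed from the uncached bucket.
        if self.cache_write_mtok is not None:
            price += cache_write_tokens * self.cache_write_mtok / 1_000_000
        return price

# With no cache_write_mtok, a fully cache-written prompt costs nothing:
p = ModelPrice(input_mtok=2.5)
print(p.calc_price(1_000_000, cache_write_tokens=1_000_000))  # 0.0
```

Defining `cache_write_mtok` equal to `input_mtok` (if cache writes are billed at the normal input rate) would restore the expected charge in this sketch.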
Comment on lines +5056 to +5058
```python
UsageExtractorMapping(
    path=['prompt_tokens_details', 'cache_write_tokens'], dest='cache_write_tokens', required=False
),
```

Copilot AI Mar 30, 2026


This change adds support for extracting usage.prompt_tokens_details.cache_write_tokens, but there isn't currently a regression test asserting the extraction behavior (and downstream pricing impact) for an OpenAI-style response that actually includes this field. Please add/extend an existing OpenAI extraction test (e.g. in tests/test_end_to_end.py or tests/test_extract_usage.py) to cover a non-zero cache_write_tokens value.

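A regression test along these lines could cover the new mapping. The test name, payload shape, and `extract_usage` stand-in are assumptions; the real suite's extraction entry point in `tests/test_extract_usage.py` may differ:

```python
def extract_usage(usage: dict) -> dict:
    """Stand-in for the library's OpenAI-style usage extraction (hypothetical)."""
    details = usage.get('prompt_tokens_details') or {}
    return {
        'input_tokens': usage.get('prompt_tokens', 0),
        'cache_read_tokens': details.get('cached_tokens', 0),
        'cache_write_tokens': details.get('cache_write_tokens', 0),
    }

def test_cache_write_tokens_extraction():
    # OpenAI-style usage payload including a non-zero cache_write_tokens value.
    usage = {
        'prompt_tokens': 1200,
        'prompt_tokens_details': {'cached_tokens': 200, 'cache_write_tokens': 300},
    }
    extracted = extract_usage(usage)
    assert extracted['cache_write_tokens'] == 300
    assert extracted['cache_read_tokens'] == 200
    assert extracted['input_tokens'] == 1200
```

A pricing assertion on the same payload would also catch the undercharging case flagged in the previous comment.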
Comment on lines 1317 to +1321
```python
    path=['prompt_tokens_details', 'cached_tokens'], dest='cache_read_tokens', required=False
),
UsageExtractorMapping(
    path=['prompt_tokens_details', 'cache_write_tokens'], dest='cache_write_tokens', required=False
),
```

Copilot AI Mar 30, 2026


Deepseek models in this file appear to have cache_read_mtok pricing but no cache_write_mtok. Since cache_write_tokens are priced via cache_write_mtok (and subtracted from uncached input), extracting prompt_tokens_details.cache_write_tokens here can lead to undercharging if the API returns a non-zero value. Consider only mapping this field for providers/models that have a defined cache_write_mtok, or set an explicit cache_write_mtok that matches the provider’s billing semantics.

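The reviewer's first suggestion, only mapping the field where a cache-write rate exists, could look like this. The helper and the `UsageExtractorMapping` stand-in are hypothetical; only the field names come from the diff:

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class UsageExtractorMapping:
    """Stand-in for the library's mapping type (fields inferred from the diff)."""
    path: List[str]
    dest: str
    required: bool = False

def prompt_detail_mappings(cache_write_mtok: Optional[float]) -> List[UsageExtractorMapping]:
    # Hypothetical guard: emit the cache-write mapping only when the model's
    # pricing defines a cache-write rate, so extraction never fills a bucket
    # that calc_price() would silently price at zero.
    mappings = [
        UsageExtractorMapping(
            path=['prompt_tokens_details', 'cached_tokens'],
            dest='cache_read_tokens',
        ),
    ]
    if cache_write_mtok is not None:
        mappings.append(
            UsageExtractorMapping(
                path=['prompt_tokens_details', 'cache_write_tokens'],
                dest='cache_write_tokens',
            )
        )
    return mappings
```

For Deepseek entries with only `cache_read_mtok` defined, this guard would leave the mapping list unchanged from its pre-PR shape.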
Comment on lines 10091 to +10095
```python
    path=['prompt_tokens_details', 'cached_tokens'], dest='cache_read_tokens', required=False
),
UsageExtractorMapping(
    path=['prompt_tokens_details', 'cache_write_tokens'], dest='cache_write_tokens', required=False
),
```

Copilot AI Mar 30, 2026


X AI model pricing in this file includes cache_read_mtok but not cache_write_mtok. Because cache_write_tokens are subtracted from uncached input tokens and priced via cache_write_mtok, extracting prompt_tokens_details.cache_write_tokens can undercharge if present and non-zero. Consider removing this mapping for providers/models without cache_write_mtok, or define cache_write_mtok where appropriate.

Comment on lines 929 to +933
```python
    path=['prompt_tokens_details', 'cached_tokens'], dest='cache_read_tokens', required=False
),
UsageExtractorMapping(
    path=['prompt_tokens_details', 'cache_write_tokens'], dest='cache_write_tokens', required=False
),
```

Copilot AI Mar 30, 2026


Azure’s chat extractor now maps usage.prompt_tokens_details.cache_write_tokens into cache_write_tokens, but Azure/OpenAI-priced models in this file don’t define cache_write_mtok. Since cache_write_tokens are subtracted from uncached input and priced via cache_write_mtok, a non-zero value from the API would undercharge. Consider only extracting this field when a cache_write_mtok is defined (or ensure the pricing model includes the appropriate cache-write rate).


2 participants