
Fix mtp #4517

Open
RunningLeon wants to merge 3 commits into InternLM:main from RunningLeon:fix-mtp

Conversation

@RunningLeon
Collaborator

Motivation

Modification

Fix mtp:

  • Remove a duplicate `+ 1` increment of mrope_ids in lmdeploy/pytorch/spec_decode/spec_agent.py.
  • Use target_hidden_states for the second speculative-decoding step in lmdeploy/pytorch/spec_decode/proposers/deepseek_mtp.py (it was previously not used).
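The duplicate-increment bug can be illustrated with a minimal, hypothetical sketch (these names and shapes are illustrative only, not the actual spec_agent.py code):

```python
import numpy as np

def step_mrope_ids(mrope_pos_ids: np.ndarray, buggy: bool = False) -> np.ndarray:
    """Advance 3-axis mRoPE position ids by one decoded token.

    Position ids must advance exactly once per token; a second
    (duplicated) `+ 1` shifts every subsequent position by one.
    """
    new_ids = mrope_pos_ids + 1      # the single, correct increment
    if buggy:
        new_ids = new_ids + 1        # duplicated `+ 1`: positions drift by one
    return new_ids

pos = np.zeros((3, 4), dtype=np.int64)   # (temporal/height/width axes, seq_len)
assert (step_mrope_ids(pos) == 1).all()
assert (step_mrope_ids(pos, buggy=True) == 2).all()
```

Because the drift compounds on every decoding step, the buggy variant produces rotary embeddings that diverge further from the correct positions the longer the sequence runs.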

Improvement:

  • Update the logits-sampling-related code.
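For context, the logits-sampling path implements standard speculative rejection sampling. The sketch below shows the generic algorithm (not lmdeploy's exact implementation): accept draft token x with probability min(1, p_target(x) / p_draft(x)), and on the first rejection resample from the normalized residual distribution.

```python
import numpy as np

def rejection_sample(draft_tokens, p_draft, p_target, rng):
    """Generic speculative rejection sampling over per-position
    probability rows p_draft[i] / p_target[i]."""
    accepted = []
    for i, tok in enumerate(draft_tokens):
        accept_prob = min(1.0, p_target[i, tok] / p_draft[i, tok])
        if rng.random() < accept_prob:
            accepted.append(tok)
            continue
        # First rejection: resample from the residual and stop.
        residual = np.clip(p_target[i] - p_draft[i], 0.0, None)
        residual /= residual.sum()
        accepted.append(int(rng.choice(len(residual), p=residual)))
        break
    return accepted

rng = np.random.default_rng(0)
p = np.array([[0.5, 0.5], [0.9, 0.1]])
# When p_target == p_draft, every draft token is accepted.
assert rejection_sample([0, 1], p, p, rng) == [0, 1]
```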

OpenCompass results

| dataset | version | metric | mode | eval_qwen35-mtp3 |
| --- | --- | --- | --- | --- |
| GPQA_diamond_repeat_4 | 772ea0 | accuracy (4 runs average) | gen | 82.70 |
| Math Calculation | - | - | - | - |
| aime2025_repeat_32 | 5e9f4f | accuracy (32 runs average) | gen | 89.06 |

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
  3. If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

Copilot AI review requested due to automatic review settings April 10, 2026 06:50
Contributor

Copilot AI left a comment


Pull request overview

This PR fixes correctness issues in DeepSeek MTP speculative decoding and refactors speculative sampling/logits processing to reduce duplicated logic and align shapes and control flow between prefill and decoding.

Changes:

  • Simplifies AR-spec extra-input slicing so target_logits is passed through in the model’s native flattened form.
  • Refactors speculative rejection sampling to run FusedLogitsProcessor inline (and removes the dedicated async_sampling_logits helper), plus fixes a duplicate mrope_pos_ids += 1 increment.
  • Fixes DeepSeek MTP proposer to propagate target_hidden_states for subsequent speculative decoding steps (instead of reusing draft hidden states).
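The hidden-state fix can be caricatured with a toy control-flow sketch. All names here are hypothetical and the "hidden states" are plain lists; the real proposer slices `target_hidden_states` by last-token indices rather than reusing them verbatim:

```python
def draft_step(hidden):
    # Toy draft model: the "token" is the sum of the state, and the
    # draft's own next hidden state is a shifted copy (illustrative only).
    return sum(hidden), [h + 1 for h in hidden]

def run_spec_steps(target_hidden_states, draft_step, num_steps, fixed=True):
    tokens = []
    hidden = target_hidden_states
    for _ in range(num_steps):
        token, draft_hidden = draft_step(hidden)
        tokens.append(token)
        # Fixed behaviour: continuation steps keep conditioning on the
        # target model's hidden states; the bug fed the draft model's
        # own previous hidden states back in instead.
        hidden = target_hidden_states if fixed else draft_hidden
    return tokens

# Step 1 sees [1, 2, 3] either way; step 2 differs between the two paths.
assert run_spec_steps([1, 2, 3], draft_step, num_steps=2, fixed=True) == [6, 6]
assert run_spec_steps([1, 2, 3], draft_step, num_steps=2, fixed=False) == [6, 9]
```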

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File Description
lmdeploy/pytorch/strategies/ar_spec/model_agent.py Adjusts target_logits slicing to pass through flattened logits directly.
lmdeploy/pytorch/spec_decode/spec_agent.py Refactors logits processing within _rejection_sampling, removes duplicate mRoPE increment, and adds profiling scope.
lmdeploy/pytorch/spec_decode/proposers/deepseek_mtp.py Uses model_inputs.target_hidden_states (sliced by last-token indices) for MTP continuation steps.


@lvhan028 lvhan028 requested a review from grimoire April 10, 2026 09:06
