Commit b1a3776

kathyyu-google authored and mzusman committed
[Bugfix] Free cross attention block table for preempted-for-recompute sequence group. (vllm-project#10013)
Signed-off-by: Kathy Yu <[email protected]>
1 parent: 9111a70

1 file changed: 1 addition, 0 deletions

vllm/core/scheduler.py

Lines changed: 1 addition & 0 deletions
@@ -1579,6 +1579,7 @@ def _preempt_by_recompute(
             seq.status = SequenceStatus.WAITING
             self.free_seq(seq)
             seq.reset_state_for_recompute()
+        self._free_seq_group_cross_attn_blocks(seq_group)
 
     def _preempt_by_swap(
         self,
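
The effect of the added call can be illustrated with a toy model. The sketch below is not vLLM code: `ToyBlockManager`, `allocate`, `free`, `free_cross`, and the block counts are all hypothetical names and numbers chosen for illustration. It assumes that a cross-attention block table is tracked per sequence group, separately from the per-sequence block tables, which is what the name `_free_seq_group_cross_attn_blocks` suggests. Under that assumption, freeing only the sequence's own blocks on preempt-by-recompute leaves the cross-attention blocks allocated.

```python
from dataclasses import dataclass, field


@dataclass
class ToyBlockManager:
    """Hypothetical stand-in for a KV-cache block manager (not vLLM's class)."""

    num_free_blocks: int
    # seq_id -> number of decoder (self-attention) blocks held
    block_tables: dict = field(default_factory=dict)
    # request_id -> number of encoder (cross-attention) blocks held
    cross_block_tables: dict = field(default_factory=dict)

    def allocate(self, request_id, seq_id, decoder_blocks, encoder_blocks):
        needed = decoder_blocks + encoder_blocks
        if needed > self.num_free_blocks:
            raise MemoryError("not enough free blocks")
        self.num_free_blocks -= needed
        self.block_tables[seq_id] = decoder_blocks
        if encoder_blocks:
            self.cross_block_tables[request_id] = encoder_blocks

    def free(self, seq_id):
        # Return the sequence's own blocks to the pool.
        self.num_free_blocks += self.block_tables.pop(seq_id, 0)

    def free_cross(self, request_id):
        # Return the group's cross-attention blocks to the pool.
        self.num_free_blocks += self.cross_block_tables.pop(request_id, 0)


def preempt_by_recompute(manager, request_id, seq_id, free_cross):
    """Toy version of the preemption path; `free_cross` toggles the fix."""
    manager.free(seq_id)
    if free_cross:
        manager.free_cross(request_id)


def free_blocks_after_preemption(free_cross):
    manager = ToyBlockManager(num_free_blocks=10)
    manager.allocate("req-0", seq_id=0, decoder_blocks=4, encoder_blocks=3)
    preempt_by_recompute(manager, "req-0", seq_id=0, free_cross=free_cross)
    return manager.num_free_blocks


leaked = free_blocks_after_preemption(free_cross=False)
fixed = free_blocks_after_preemption(free_cross=True)
print(leaked, fixed)  # prints "7 10"
```

In this toy model, skipping the cross-attention free leaves 3 of the 10 blocks unavailable after preemption, while freeing both tables returns the pool to its full size.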
