Enable CDPruner #2714

yangwang201911 · 2025-09-09T05:35:49Z

Integrating a CDPruner module for visual token pruning into VLM pipeline.
Paper: CDPruner (arXiv)
Code: GitHub Repository

Tickets: CVS-173220

…e configuration.

…ased models

…yDPP

…r requests for performance optimization

…e configuration. Refactor CDPruner to use visual tokens percentage instead of count for pruning configuration

…ode and remove unused visual token pruning methods

…e arguments and update GenerationConfig structure

…onfig

…e vision config handling

…genai into ywang2/vlm-cdpruner

…ross codebase for consistency in CDPruner configuration

…ructor

…in_percentage" - Updated Python scripts to reflect the corrected parameter name in argument parsing and configuration settings. 2. Added unit tests for the FastGreedyDPP class to ensure proper functionality and selection behavior based on the visual tokens retention percentage.

… FastGreedyDPP

…genai into ywang2/vlm-cdpruner

…ions

…elated configurations

…ove performance

…ameter

…r improved cache performance

…nhance cache performance with manual loop unrolling

…mproved readability

…lves for improved relevance computation and DPP selection

…ing invalid values Refactor performance logging in CDPruner: consolidate timing summary and metrics output

…ent vector subtraction and multiplication functions

…n for improved performance

…uration for performance metrics

…model compilation and enhance token selection with visual token splitting for improved performance Add support for OpenVINO ops in CDPruner configuration: introduce 'use_ops_model' flag for enhanced pruning computation

…r function for efficient token selection across kernel matrices

liangali and others added 30 commits August 1, 2025 10:06

[POC] implement cdpruner for qwen2.5-vl

3afb35b

Enhance CDPruner and RelevanceCalculator to support negative relevanc…

38879b5

…e configuration.

Update CDPruner configuration to enable negative relevance for CLIP-b…

5bedef4

…ased models

Add support for subgraph in CDPruner and ConditionalKernelBuilder

c81af98

Update L2 normalization function

4c7e1c0

Skip updating marginal gains for already selected tokens in FastGreed…

5c1b678

…yDPP

Enhance ConditionalKernelBuilder to precompile models and create infe…

4ac2a1c

…r requests for performance optimization

Enhance CDPruner and RelevanceCalculator to support negative relevanc…

1d2ff66

…e configuration. Refactor CDPruner to use visual tokens percentage instead of count for pruning configuration

Add CDPruner configuration parameters to GenerationConfig

95c243f

Implement GPU model compilation in constructor.

221456b

Refactor CDPruner configuration: rename debug_mode to pruning_debug_m…

79d7955

…ode and remove unused visual token pruning methods

Enhance CDPruner configuration: add pruning parameters to command-lin…

79529fa

…e arguments and update GenerationConfig structure

Merge remote-tracking branch 'upstream' into ywang2/enable_cdpruner_c…

99f55b0

…onfig

Refactor CDPruner configuration: remove unused settings and streamlin…

6a4a332

…e vision config handling

update format

5ba4d7d

Merge branch 'master' of https://github.com/openvinotoolkit/openvino.…

b618463

…genai into ywang2/vlm-cdpruner

Merge branch 'ywang2/enable_cdpruner_config' into ywang2/vlm-cdpruner

e63d071

Refactor pruning debug mode checks and enable ops model by default

ebf1a18

Add logging for CDPruner configuration

087d1c8

Add logging for CDPruner configuration settings

81fcf68

Rename visual_tokens_percentage to viusal_tokens_retain_percentage ac…

cc89a26

…ross codebase for consistency in CDPruner configuration

Initialize CDPruner with default configuration in VisionEncoder const…

05e7e65

…ructor

Add debug logging for conditional kernel matrix and marginal gains in…

c1e1f45

… FastGreedyDPP

update.

2cb1e8f

Merge branch 'master' of https://github.com/openvinotoolkit/openvino.…

572b251

…genai into ywang2/vlm-cdpruner

[visual_language_chat] Add CDPruner options and update usage instruct…

26b29f7

…ions

Enhance CDPruner functionality with new ops model option and update r…

c0280eb

…elated configurations

Refactor CDPruner debug output for consistency and clarity in logging

b2f2601

Optimize orthogonal vector computation: reduce memory access and impr…

9452f2f

…ove performance

yangwang201911 added 11 commits August 28, 2025 17:56

Refactor update_marginal_gains method: remove unused selected_idx par…

4d5b4d3

…ameter

optimize projection calculations and improve performance

96039d0

Optimize orthogonal vector update: implement manual loop unrolling fo…

acbec3d

…r improved cache performance

Optimize orthogonal vector update: improve zero-check precision and e…

1777f71

…nhance cache performance with manual loop unrolling

Optimize orthogonal vector update: remove manual loop unrolling for i…

9627723

…mproved readability

Optimize token selection in CDPruner: split visual tokens into two ha…

ad90d91

…lves for improved relevance computation and DPP selection

Fix NaN handling in update_marginal_gains: improve stability by skipp…

c85c1a6

…ing invalid values Refactor performance logging in CDPruner: consolidate timing summary and metrics output

Add SIMD optimizations for vector operations in FastGreedyDPP: implem…

f59b04f

…ent vector subtraction and multiplication functions

Optimize token selection in CDPruner: implement parallel DPP selectio…

ffd8dfd

…n for improved performance

Enhance DPP timing in CDPruner: initialize and report DPP selection d…

abcda0a

…uration for performance metrics

yangwang201911 requested a review from peterchen-intel September 9, 2025 05:35

yangwang201911 added 2 commits September 9, 2025 19:22

Add parallel DPP selection functionality in CDPruner: implement helpe…

dd6d9cf

…r function for efficient token selection across kernel matrices

update.

867adaf

peterchen-intel requested a review from xipingyan September 9, 2025 23:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable CDPruner #2714

Enable CDPruner #2714

yangwang201911 commented Sep 9, 2025 •

edited by peterchen-intel

Loading

Uh oh!

Uh oh!

Enable CDPruner #2714

Are you sure you want to change the base?

Enable CDPruner #2714

Conversation

yangwang201911 commented Sep 9, 2025 • edited by peterchen-intel Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

yangwang201911 commented Sep 9, 2025 •

edited by peterchen-intel

Loading