Skip to content
Open
Changes from all commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
8417d8f
Add sampler transform test
quic-sanising Jun 18, 2025
27d8dd5
Merge branch 'main' into ods-unit-tests
Jun 30, 2025
067f9b5
Add example script
Jun 30, 2025
931860f
Update docs
Jun 30, 2025
79b6c95
Enable On Device Sampling for _continuous_batching_execution()
Jun 30, 2025
75eac30
Disable On Device Sampling for _regular_model_execution()
Jul 1, 2025
eb6e2eb
Use same sampling parameters for each sequence in a batch
Jul 1, 2025
48b35e3
Enable On Device Sampling for _regular_model_execution()
Jul 2, 2025
c83a631
Add test for greedy sampling
Jul 3, 2025
f698a24
Add test for random sampling
Jul 3, 2025
7b34a07
Remove else block
Jul 3, 2025
5fa7269
Merge branch 'main' into ods-unit-tests
Jul 3, 2025
0ee201a
Reformat code
Jul 3, 2025
c074768
Merge branch 'quic:main' into ods-unit-tests
quic-sanising Jul 24, 2025
115505e
Move sampling operations, inputs, and validation functions to utils
Aug 4, 2025
3ac7503
Change model to TinyLlama
Aug 4, 2025
02669e0
Add header
Aug 4, 2025
137cc4a
Reformat code
Aug 4, 2025
54a926a
Merge branch 'quic:main' into ods-unit-tests
quic-sanising Aug 4, 2025
6acf446
Update linter
Aug 4, 2025
6083f5b
Merge branch 'quic:main' into ods-unit-tests
quic-sanising Aug 6, 2025
c2d7e83
Remove device_id
Aug 6, 2025
1069109
Remove redundant line
Aug 6, 2025
7d67132
Merge branch 'quic:main' into ods-unit-tests
quic-sanising Aug 19, 2025
0e3f257
Merge branch 'main' into ods-unit-tests
Aug 20, 2025
908e67e
Remove redundant reinitialization of output buffers
Aug 20, 2025
a8e55da
Merge branch 'main' into ods-unit-tests
Aug 22, 2025
f3f89d3
Add qaic_config to model hash
Aug 22, 2025
81ae15a
Merge branch 'main' into ods-unit-tests
Aug 25, 2025
c485bfd
Change config
Aug 25, 2025
0e3b383
Remove pretrained_model_name_or_path from qaic_config
Aug 25, 2025
7d91470
Revert changes to model hash
Aug 25, 2025
e36add0
Added qaic_config to hash parameters via inclusion list.
quic-dhirajku Aug 26, 2025
127ec74
Added qaic_config in manual hash tests for causal_lm dummy models.
quic-dhirajku Aug 26, 2025
dad96ca
Use different config for each test
Aug 26, 2025
af854e8
Add On Device Sampling support for more CausalLM models
Sep 4, 2025
538c69f
Merge branch 'main' into ods-extend
Sep 18, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 9 additions & 1 deletion QEfficient/transformers/models/pytorch_transforms.py
Original file line number Diff line number Diff line change
Expand Up @@ -548,8 +548,16 @@ class SamplerTransform:

# supported architectures
_module_mapping = {
# Llama
QEffFalconForCausalLM,
QEffGemmaForCausalLM,
QEffGPT2LMHeadModel,
QEffGPTJForCausalLM,
QEffGraniteForCausalLM,
QEffGraniteMoeForCausalLM,
QEffLlamaForCausalLM,
QEffMptForCausalLM,
QEffPhi3ForCausalLM,
QEffQwen2ForCausalLM,
}

@classmethod
Expand Down
Loading