1 file changed: +4 -3 lines changed
@@ -41,17 +41,18 @@ th:not(:first-child) {
| [LoRA](lora.md) | ✅ | ✅ | ✅ | | | | | | | | | | | |
| [SD](spec_decode.md) | ✅ | ✅ | ❌ | ✅ | | | | | | | | | | |
| CUDA graph | ✅ | ✅ | ✅ | ✅ | ✅ | | | | | | | | | |
- | [pooling](../models/pooling_models.md) | ✅ \* | ✅ \* | ✅ | ❌ | ✅ | ✅ | | | | | | | | |
+ | [pooling](../models/pooling_models.md) | 🟠 \* | 🟠 \* | ✅ | ❌ | ✅ | ✅ | | | | | | | | |
| <abbr title="Encoder-Decoder Models">enc-dec</abbr> | ❌ | [❌](gh-issue:7366) | ❌ | [❌](gh-issue:7366) | ✅ | ✅ | ✅ | | | | | | | |
| <abbr title="Logprobs">logP</abbr> | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | | | | | | |
| <abbr title="Prompt Logprobs">prmpt logP</abbr> | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | ✅ | | | | | |
| <abbr title="Async Output Processing">async output</abbr> | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ | ❌ | ✅ | ✅ | ✅ | | | | |
| multi-step | ❌ | ✅ | ❌ | ❌ | ✅ | ❌ | ❌ | ✅ | ✅ | ✅ | ✅ | | | |
- | [mm](multimodal_inputs.md) | ✅ | ✅ | [🟠](gh-pr:4194) | ❔ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❔ | ✅ | | |
+ | [mm](multimodal_inputs.md) | ✅ | ✅ | [🟠](gh-pr:4194)<sup>^</sup> | ❔ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❔ | ✅ | | |
| best-of | ✅ | ✅ | ✅ | [❌](gh-issue:6137) | ✅ | ❌ | ✅ | ✅ | ✅ | ❔ | [❌](gh-issue:7968) | ✅ | ✅ | |
| beam-search | ✅ | ✅ | ✅ | [❌](gh-issue:6137) | ✅ | ❌ | ✅ | ✅ | ✅ | ❔ | [❌](gh-issue:7968) | ❔ | ✅ | ✅ |

- \* Chunked prefill and prefix caching are only applicable to last-token pooling.
+ \* Chunked prefill and prefix caching are only applicable to last-token pooling.
+ <sup>^</sup> LoRA is only applicable to the language backbone of multimodal models.

[](){ #feature-x-hardware }