We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e7c0663 commit 9f0b328Copy full SHA for 9f0b328
README.md
@@ -354,7 +354,7 @@ We will soon:
354
- [x] Add `I4` format to simplify the deployment of 4-bit models.
355
- [x] Embed T-MAC GEMM kernels into llama.cpp to accelerate prefill/prompt.
356
- [x] Android cross-compilation guidance
357
-- [ ] Merge latest llama.cpp for more functionalities
+- [ ] Merge latest llama.cpp for better multi-threading performance and more functionalities
358
- [ ] Optimize for ARMv9 CPU with SME2 through LUTI4
359
360
## Techniques
0 commit comments