[doc] Update README.md with News

kaleid-liner · web-flow · commit ebb370a1dbc5 · 2024-08-14T20:17:11.000+08:00
diff --git a/README.md b/README.md
@@ -7,6 +7,8 @@
 
 ## News
 
+- 08/14/2024 🚀: The T-MAC GEMM (N>1) kernels are now integrated into llama.cpp to accelerate prefill.
+
 - 08/06/2024 🚀: Support 1/2/3/4-bit quantized Llama models in GPTQ format. Test it using the pretrained models released by [EfficientQAT](https://github.com/OpenGVLab/EfficientQAT).
 
 - 07/27/2024 ✨: We've noted that T-MAC is even faster than the NPU in token generation speed on the latest Snapdragon X Elite chipset! Check [Compared to NPU](#compared-to-npu) for more details.