Pinned Loading
Repositories
Showing 10 of 48 repositories
- NeMo Public Forked from NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
krai/NeMo’s past year of commit activity - axs2stg Public
krai/axs2stg’s past year of commit activity - vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
krai/vllm’s past year of commit activity - axs2kiss Public
Automated KRAI-X workflows for inference engines on selected backends: vLLM and SGLang on CUDA and ROCm, NIM/TensorRT-LLM on CUDA, using an OpenAI API compatible LoadGen client
krai/axs2kiss’s past year of commit activity - kilt4qaic Public
krai/kilt4qaic’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…