Repositories list
28 repositories
- Community-maintained hardware plugin for vLLM on Ascend
- A framework for efficient model inference with omni-modality models
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- Intelligent Router for Mixture-of-Models
rfcs (Public)