# AIBrix KVCache-Enabled vLLM & SGLang Images

This directory contains Dockerfiles that build **vLLM** and **SGLang** images
enhanced with AIBrix capabilities:

- **aibrix_kvcache** - Built from source for KV cache disaggregation
- **nixl + nixl-cu12** - UCX-based high-performance networking libraries
- **UCX tooling** - Pre-installed debugging and performance testing utilities

## Image Naming Convention

AIBrix image tags follow the pattern
`<engine-version>-aibrix-<aibrix-version>-nixl-<nixl-version>-<build-date>`,
where the build date is formatted as `YYYYMMDD`.

**Upstream vs. AIBrix Images:**

| Upstream Image | AIBrix Enhanced Image | Description |
|----------------|-----------------------|-------------|
| `vllm/vllm-openai:v0.10.2` | `aibrix/vllm-openai:v0.10.2-aibrix-v0.5.0-nixl-0.7.1-20251123` | vLLM base + AIBrix KVCache + UCX/NIXL networking |
| `lmsysorg/sglang:v0.5.5.post3` | `aibrix/sglang:v0.5.5.post3-aibrix-v0.5.0-nixl-0.7.1-20251123` | SGLang base + AIBrix KVCache + UCX/NIXL networking |

**AIBrix images** extend upstream inference engines with:
- Distributed KV cache support via `aibrix_kvcache`
- RDMA-capable networking through NIXL/UCX for disaggregated inference
- Torch compatibility: the torch version is derived automatically from the base image

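A tag of this form can be assembled from its parts with a small shell helper. The sketch below is illustrative only: the function name `aibrix_image_tag` is hypothetical and not part of this repository, and the tag layout is read off the tags listed above.

```bash
# Compose an AIBrix image reference from its components (hypothetical helper).
#   $1 repository, $2 engine version, $3 AIBrix version, $4 NIXL version,
#   $5 build date as YYYYMMDD (optional, defaults to today)
aibrix_image_tag() {
  printf '%s:%s-aibrix-%s-nixl-%s-%s\n' \
    "$1" "$2" "$3" "$4" "${5:-$(date +'%Y%m%d')}"
}

aibrix_image_tag aibrix/vllm-openai v0.10.2 v0.5.0 0.7.1 20251123
# prints: aibrix/vllm-openai:v0.10.2-aibrix-v0.5.0-nixl-0.7.1-20251123
```

Omitting the fifth argument stamps the tag with the current date, which matches the `$(date +'%Y%m%d')` suffix used in the build commands.
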
## Compatibility Matrix

Default build arguments produce the following component versions:

| Component | vLLM Image | SGLang Image |
|-----------|------------|--------------|
| Engine version | v0.10.2 | v0.5.5.post3 |
| Torch version | 2.8 | 2.9 |
| aibrix_kvcache | v0.5.0 | v0.5.0 |
| NIXL / CUDA plugin | 0.7.1 | 0.7.1 |
| UCX | 1.19.0 | 1.19.0 |

**Version Compatibility:**
- Torch version is automatically extracted from the upstream base image to ensure compatibility
- AIBrix KVCache is built against the exact torch version from the base image
- NIXL and UCX versions are pinned for stable RDMA networking

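To check which torch version a base image ships, one option is to query the image directly and reduce the result to `major.minor`, as listed in the matrix. This is a minimal sketch, not the mechanism the Dockerfiles use: both function names are hypothetical, the sample version string is illustrative, and the query assumes `python3` is on the image's `PATH`.

```bash
# Reduce a full torch version string (e.g. "2.8.0+cu128") to "major.minor".
torch_major_minor() {
  printf '%s\n' "$1" | cut -d. -f1,2
}

# Ask a base image for its torch version. Requires Docker and pulls the image.
# Assumption: python3 is on PATH inside the image. The entrypoint is overridden
# so that the image's default command is not started.
base_image_torch_version() {
  docker run --rm --entrypoint python3 "$1" \
    -c 'import torch; print(torch.__version__)'
}

torch_major_minor "2.8.0+cu128"
# prints: 2.8
```

With Docker available, the two can be combined as `torch_major_minor "$(base_image_torch_version vllm/vllm-openai:v0.10.2)"`.
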
## Building the Images

### vLLM Image

```bash
docker build \
  -f Dockerfile.vllm \
  --build-arg VLLM_VERSION=v0.10.2 \
  --build-arg AIBRIX_BRANCH=v0.5.0 \
  --build-arg NIXL_VERSION=0.7.1 \
  -t aibrix/vllm-openai:v0.10.2-aibrix-v0.5.0-nixl-0.7.1-$(date +'%Y%m%d') \
  .
```

### SGLang Image

```bash
docker build \
  -f Dockerfile.sglang \
  --build-arg SGLANG_VERSION=v0.5.5.post3 \
  --build-arg AIBRIX_BRANCH=v0.5.0 \
  --build-arg NIXL_VERSION=0.7.1 \
  -t aibrix/sglang:v0.5.5.post3-aibrix-v0.5.0-nixl-0.7.1-$(date +'%Y%m%d') \
  .
```

### Build Arguments

All build arguments are optional and have sensible defaults:

| Argument | Default | Description |
|----------|---------|-------------|
| `VLLM_VERSION` | `v0.10.2` | vLLM upstream version to use as base |
| `SGLANG_VERSION` | `v0.5.5.post3` | SGLang upstream version to use as base |
| `AIBRIX_BRANCH` | `v0.5.0` | AIBrix release tag or branch to build from |
| `NIXL_VERSION` | `0.7.1` | NIXL networking library version |
| `AIBRIX_REPO` | `https://github.com/vllm-project/aibrix` | AIBrix repository URL |

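Because every argument has a default, a build only needs to pass the ones it overrides. The sketch below prints (without running) a `docker build` command that includes a `--build-arg` only for variables set in the environment. The helper name `print_build_cmd` is hypothetical, and the sketch assumes values contain no whitespace.

```bash
# Print a docker build command; nothing is executed.
#   $1 Dockerfile, $2 image tag
# The function body runs in a subshell, so its variables do not leak.
print_build_cmd() (
  cmd="docker build -f $1"
  for name in VLLM_VERSION SGLANG_VERSION AIBRIX_BRANCH NIXL_VERSION AIBRIX_REPO; do
    # Look up the variable called $name; empty if it is unset.
    eval "value=\${$name:-}"
    [ -n "$value" ] && cmd="$cmd --build-arg $name=$value"
  done
  printf '%s -t %s .\n' "$cmd" "$2"
)

AIBRIX_BRANCH=main print_build_cmd Dockerfile.vllm aibrix/vllm-openai:dev
# With no other build-arg variables set, prints:
# docker build -f Dockerfile.vllm --build-arg AIBRIX_BRANCH=main -t aibrix/vllm-openai:dev .
```

Arguments left unset are simply omitted, so the defaults declared in the Dockerfile apply.
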
## Release History

AIBrix maintains stable image releases with tested component combinations:

### v0.5.0 (Current)

| Component | vLLM | SGLang | Notes |
|----------------|---------|--------------|---------------------------------|
| Engine | v0.10.2 | v0.5.5.post3 | Stable inference engines |
| CUDA | 12.8 | 12.9 | CUDA version |
| Torch | 2.8 | 2.9 | PyTorch version |
| AIBrix KVCache | v0.5.0 | v0.5.0 | KV cache disaggregation support |
| NIXL | 0.7.1 | 0.7.1 | UCX-based RDMA networking |
| UCX | 1.19.0 | 1.19.0 | Pre-installed for debugging |

**Recommended Tags:**
- `aibrix/vllm-openai:v0.10.2-aibrix-v0.5.0-nixl-0.7.1-20251123`
- `aibrix/sglang:v0.5.5.post3-aibrix-v0.5.0-nixl-0.7.1-20251123`