Skip to content

v1.1.0

Choose a tag to compare

@joerunde joerunde released this 10 Oct 23:28
· 55 commits to main since this release
dff277b

v1.1.0

  • ⬆️ Adds support for vllm v0.11.0
  • 🔥 Drops support for vllm v0.10.1.1
  • ✨ Writes performance metrics to file when VLLM_SPYRE_PERF_METRIC_LOGGING_ENABLED is set
  • 🐛 Fixes a bug where incorrect logits processors were applied to requests under load
  • 🐛 Fixes a bug where /chat/completions required a user-specified max_tokens param to function

What's Changed

Full Changelog: v1.0.2...v1.1.0