Skip to content
Change the repository type filter

All

    Repositories list

    • Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping
      Go
      Apache License 2.0
      1411379Updated Apr 21, 2026Apr 21, 2026
    • Asynchronous Processor for Inference Gateway. Orchestrator of queues
      Go
      Apache License 2.0
      108148Updated Apr 21, 2026Apr 21, 2026
    • The batch gateway is an llm-d implementation of the OpenAI batch inference API
      Go
      Apache License 2.0
      158237Updated Apr 20, 2026Apr 20, 2026
    • llm-d-skills

      Public
      Apache License 2.0
      5215Updated Apr 20, 2026Apr 20, 2026
    • secure-inference

      Public
      Go
      Apache License 2.0
      33103Updated Apr 18, 2026Apr 18, 2026
    • llm-d-rl

      Public
      Python
      2011Updated Apr 17, 2026Apr 17, 2026
    • hermes

      Public
      Hermes is a cluster configuration scanning and self-test generation tool for llm-d inference workloads
      Rust
      4072Updated Apr 17, 2026Apr 17, 2026
    • Python
      Apache License 2.0
      0300Updated Apr 16, 2026Apr 16, 2026
    • llm-d-planner

      Public
      Python
      Apache License 2.0
      914265Updated Apr 16, 2026Apr 16, 2026
    • Python based inference-scheduler for Reinforcement Learning
      Python
      Apache License 2.0
      4850Updated Apr 16, 2026Apr 16, 2026
    • llm-d helm charts and deployment examples
      Go Template
      Apache License 2.0
      56551311Updated Apr 16, 2026Apr 16, 2026
    • helm charts for deploying models with llm-d
      Go Template
      Apache License 2.0
      5830157Updated Apr 12, 2026Apr 12, 2026
    • ig-wva

      Public
      Workload Variant Autoscaler is a service to compute the cost-optimal provisioning of heterogeneous accelerators for inference workloads with varying request lat…
      Jupyter Notebook
      22330Updated Mar 10, 2026Mar 10, 2026
    • llm-d-ci

      Public
      Shell
      2290Updated Mar 10, 2026Mar 10, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.