    Repositories list

    • Mooncake

      Public
      Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
      C++
      Apache License 2.0
      685000 Updated Dec 17, 2025
    • llama.cpp

      Public
      LLM inference in C/C++
      C++
      MIT License
      17k300 Updated Nov 28, 2025
    • Docker Model Runner
      Go
      Apache License 2.0
      116000 Updated Oct 29, 2025
    • MAD

      Public
      MAD (Model Automation and Dashboarding)
      Shell
      MIT License
      45000 Updated Oct 28, 2025
    • gpustack

      Public
      Manage GPU clusters for running LLMs
      Python
      Apache License 2.0
      504000 Updated Aug 4, 2025
    • ramalama

      Public
      Ramalama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, …
      Python
      MIT License
      333000 Updated Jul 28, 2025
    • cozeloop

      Public
      Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from d…
      Go
      Apache License 2.0
      749000 Updated Jul 26, 2025
    • octotools

      Public
      OctoTools: An agentic framework with extensible tools for complex reasoning
      Python
      MIT License
      187000 Updated Jul 24, 2025
    • llama-box

      Public
      LLM inference server implementation based on llama.cpp.
      C++
      MIT License
      28000 Updated Jul 24, 2025
    • Stable Diffusion and Flux in pure C/C++
      C++
      MIT License
      593000 Updated Jul 24, 2025
    • Port of OpenAI's Whisper model in C/C++
      C++
      MIT License
      5.4k000 Updated Jul 24, 2025
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      3.5k000 Updated Jul 21, 2025
    • ollama

      Public
      Get up and running with Llama 3, Mistral, Gemma, and other large language models.
      Go
      MIT License
      16k000 Updated Jun 18, 2025
    • A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
      Python
      Apache License 2.0
      1.3k100 Updated Mar 20, 2025
    • exo

      Public
      Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
      Python
      GNU General Public License v3.0
      3.1k000 Updated Nov 28, 2024
    • Python bindings for llama.cpp
      Python
      MIT License
      1.4k000 Updated Nov 26, 2024
    • fastfetch

      Public
      Like neofetch, but much faster because written mostly in C.
      C
      MIT License
      744000 Updated Nov 19, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      16k000 Updated Oct 16, 2024
    • k8sgpt

      Public
      Giving Kubernetes Superpowers to everyone
      Go
      Apache License 2.0
      979000 Updated Sep 24, 2024
    • Automatic SRE Superpowers within your Kubernetes cluster
      Go
      Apache License 2.0
      133000 Updated Jul 31, 2024
    • llm.c

      Public
      LLM training in simple, raw C/CUDA
      Cuda
      MIT License
      3.5k000 Updated Jul 22, 2024
    • A proxy that allows you to host ollama images in your local environment
      Go
      MIT License
      8000 Updated Jul 2, 2024
    • LLM Benchmark for Throughput via Ollama (Local LLMs)
      Python
      MIT License
      41000 Updated Jun 11, 2024
    • makllama

      Public
      MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.
      Go
      Apache License 2.0
      34600 Updated May 22, 2024
    • An open and reliable container runtime
      Go
      Apache License 2.0
      3.9k100 Updated May 22, 2024
    • cri

      Public
      Go
      17100 Updated May 21, 2024