Best Open-Source LLM Inference Servers on GitHub (2026)
This list covers the top 12 open-source LLM inference servers on GitHub, ranked by the RepoRadar scoring engine across five quality dimensions. The top-ranked repo is jundot/omlx with 16.8k stars. Projects are written in Python, Shell, Go, and C++. Data last updated 2026-06-19.
csghub-server is the backend server for CSGHub, which helps users manage datasets and models, and also run Model Inference, Finetune, and Application Spaces.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal su…
StatsPAI is the first agent-native Python library for causal inference and applied econometrics — unified API, broad cross-method coverage, structured result objects, machine-read…
What is the best open-source LLM inference server on GitHub?
Based on the RepoRadar scoring engine, jundot/omlx is currently the top-ranked option with 16.8k stars and a score of 85/100.
How are open-source LLM inference server repositories ranked?
Repositories are ranked by the RepoRadar score — a composite of five dimensions: Popularity (35%), Freshness (25%), Maintenance (20%), Community (10%), and Completeness (10%). Scores range from 0–100.
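To make the weighting concrete, here is a minimal sketch of how such a composite could be computed. Only the five weights come from the description above. The function name `composite_score`, the assumption that each dimension is itself scored 0–100, the plain weighted average, and the sample values are all hypothetical; the actual RepoRadar formula may differ.

```python
# Weights in percent, as stated in the text above.
WEIGHTS = {
    "popularity": 35,
    "freshness": 25,
    "maintenance": 20,
    "community": 10,
    "completeness": 10,
}


def composite_score(sub_scores):
    """Weighted average of per-dimension scores.

    Assumption (not from the source): every dimension is scored 0-100,
    so the weighted average also falls in 0-100.
    """
    missing = WEIGHTS.keys() - sub_scores.keys()
    if missing:
        raise ValueError(f"missing dimensions: {sorted(missing)}")
    for name in WEIGHTS:
        if not 0 <= sub_scores[name] <= 100:
            raise ValueError(f"{name} must be between 0 and 100")
    # Weights sum to 100, so dividing by 100 yields a 0-100 result.
    return sum(WEIGHTS[name] * sub_scores[name] for name in WEIGHTS) / 100


# Hypothetical sub-scores for an imaginary repository (not real data).
example = {
    "popularity": 80,
    "freshness": 60,
    "maintenance": 70,
    "community": 50,
    "completeness": 90,
}
print(composite_score(example))  # 71.0
```

Under these assumptions, Popularity and Freshness together account for 60% of the result, so a heavily starred, recently updated repository can outrank one with a stronger community or more complete documentation.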
When was this list last updated?
This list was last updated on 2026-06-19. Data is sourced directly from GitHub's public API. No cached or fabricated repositories are used.