LiteLLM Alternatives
22 alternatives to LiteLLM — comparable AI coding tools for building with AI agents.
AutoGen (AG2)
A multi-agent conversation framework where agents collaborate via message-passing, with group chat and code execution.
BAML
A domain-specific language for type-safe LLM functions, with generated clients and schema-aligned parsing.
Chonkie
A lightweight, fast chunking library for RAG with many splitting strategies in one API.
Chroma
An open-source, Python-first vector database that runs in-process — the fastest path from pip install to a working retrieval prototype.
Claude Agent SDK
A toolkit for building custom agents on the same harness that powers Claude Code.
CrewAI
A Python framework for orchestrating role-playing AI agents as collaborating 'crews', plus event-driven flows.
DSPy
Program language models instead of prompting them: declare tasks as typed signatures and let optimizers compile the prompts and few-shot examples for you.
FastMCP
A Pythonic framework for building Model Context Protocol servers and clients — decorator-based tools, resources, and prompts, with auth and deployment built in.
Instructor
Get structured, validated output from LLMs using plain type definitions, with automatic retries on validation failure.
LanceDB
An open-source embedded vector database built on the Lance columnar format — serverless, multimodal, and designed to scale on local disk or object storage.
LangGraph
A low-level library for building stateful, controllable agents as graphs, with checkpointing and human-in-the-loop.
LLM Guard
An open-source security toolkit of input and output scanners for LLM apps — prompt injection, PII/anonymize, secrets, toxicity, and more, from Protect AI.
Mem0
A memory layer for AI agents and apps — persistent, personalized long-term memory across sessions.
NeMo Guardrails
NVIDIA's open-source toolkit for adding programmable guardrails to LLM apps — input, dialog, retrieval, and output rails defined in the Colang language.
OpenAI Agents SDK
OpenAI's lightweight, open-source framework for agents — handoffs, guardrails, sessions, and built-in tracing.
OpenRouter
A hosted unified API to hundreds of models from many providers, with one key, one bill, and automatic fallbacks.
pgvector
An open-source Postgres extension that adds a vector type and HNSW/IVFFlat indexes for similarity search inside your existing database.
Pipecat
An open-source Python framework for real-time voice and multimodal conversational AI — it orchestrates streaming STT, LLM, and TTS into composable pipelines.
Portkey
An AI gateway and LLMOps platform: route to many LLMs through one API with caching, retries, fallbacks, load balancing, guardrails, and full observability.
Unsloth
An open-source library that makes LoRA/QLoRA fine-tuning of LLMs roughly 2x faster and far more memory-efficient, so you can fine-tune on a single GPU.
Vercel AI SDK
An open-source TypeScript toolkit for building AI apps — unified model API, streaming, structured output, tool calling, and UI hooks.
vLLM
A high-throughput, memory-efficient inference and serving engine for LLMs, with PagedAttention, continuous batching, and an OpenAI-compatible API server.