A unified Elixir client for interfacing with multiple Large Language Model (LLM) providers.
ExLLM provides a single, consistent API to interact with a growing list of LLM providers. It abstracts away the complexities of provider-specific request formats, authentication, and error handling, allowing you to focus on building features.
Release Candidate: This library is approaching its 1.0.0 stable release. The API is stable and ready for production use.
- Unified API: Use a single `ExLLM.chat/2` interface for all supported providers, dramatically reducing boilerplate code
- Broad Provider Support: Seamlessly switch between models from 14+ major providers
- Streaming Support: Handle real-time responses for chat completions using Elixir's native streaming
- Standardized Error Handling: Get predictable `{:error, reason}` tuples for common failure modes (see the sketch after this list)
- Session Management: Built-in conversation state tracking and persistence
- Function Calling: Unified tool use interface across providers that support it
- Multimodal Support: Vision, audio, and document processing capabilities where available
- Minimal Overhead: Designed as a thin, efficient client layer with focus on performance
- Extensible Architecture: Adding new providers is straightforward through clean delegation patterns
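As a minimal sketch of working with the standardized result tuples, you can pattern match on the `{:ok, _}` / `{:error, _}` shapes shown throughout this README (the specific error reasons below are illustrative; actual reason terms vary by provider and failure mode):

```elixir
# Minimal sketch: handling the standardized {:ok, response} / {:error, reason} tuples.
case ExLLM.chat(:anthropic, [%{role: "user", content: "Hello!"}]) do
  {:ok, response} ->
    IO.puts(response.content)

  {:error, reason} ->
    # reason is provider-dependent; inspect it or match on the cases you care about
    IO.puts("Chat failed: #{inspect(reason)}")
end
```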
Production Ready: Core chat, streaming, sessions, providers, function calling, cost tracking

Under Development: Context management, model capabilities API, configuration validation
See FEATURE_STATUS.md for detailed testing results and API status.
ExLLM supports 14 providers with access to hundreds of models:
- Anthropic Claude - Claude 4, 3.7, 3.5, and 3 series models
- OpenAI - GPT-4.1, o1 reasoning models, GPT-4o, and GPT-3.5 series
- AWS Bedrock - Multi-provider access (Anthropic, Amazon Nova, Meta Llama, etc.)
- Google Gemini - Gemini 2.5, 2.0, and 1.5 series with multimodal support
- OpenRouter - Access to hundreds of models from multiple providers
- Groq - Ultra-fast inference with Llama 4, DeepSeek R1, and more
- X.AI - Grok models with web search and reasoning capabilities
- Mistral AI - Mistral Large, Pixtral, and specialized code models
- Perplexity - Search-enhanced language models
- Ollama - Local model runner (any model in your installation)
- LM Studio - Local model server with OpenAI-compatible API
- Bumblebee - Local model inference with Elixir/Nx (optional dependency)
- Mock Adapter - For testing and development
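Because every provider is reached through the same `ExLLM.chat/2` call, switching providers is a one-line change. A small sketch, assuming the relevant API keys or local runtimes are already configured:

```elixir
messages = [%{role: "user", content: "Summarize the plot of Hamlet in one sentence."}]

# The call shape stays the same; only the provider atom changes.
{:ok, claude} = ExLLM.chat(:anthropic, messages)
{:ok, fast}   = ExLLM.chat(:groq, messages)
{:ok, local}  = ExLLM.chat(:ollama, messages)
```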
Add `ex_llm` to your list of dependencies in `mix.exs`:
```elixir
def deps do
  [
    {:ex_llm, "~> 1.0.0-rc1"},

    # Optional: For local model inference via Bumblebee
    {:bumblebee, "~> 0.6.2", optional: true},
    {:nx, "~> 0.7", optional: true},

    # Optional hardware acceleration backends (choose one):
    {:exla, "~> 0.7", optional: true},
    # Optional: For Apple Silicon Metal acceleration
    {:emlx, github: "elixir-nx/emlx", branch: "main", optional: true}
  ]
end
```
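Then fetch the dependencies with the standard Mix workflow:

```bash
mix deps.get
```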
Set your API keys as environment variables:
```bash
export ANTHROPIC_API_KEY="your-anthropic-key"
export OPENAI_API_KEY="your-openai-key"
export GROQ_API_KEY="your-groq-key"
# ... other provider keys as needed
```
```elixir
# Single completion
{:ok, response} = ExLLM.chat(:anthropic, [
  %{role: "user", content: "Explain quantum computing in simple terms"}
])

IO.puts(response.content)
# Cost automatically tracked: response.cost

# Streaming response
ExLLM.chat_stream(:openai, [
  %{role: "user", content: "Write a short story"}
], fn chunk ->
  IO.write(chunk.delta)
end)

# With session management
{:ok, session} = ExLLM.Session.new(:groq)
{:ok, session, response} = ExLLM.Session.chat(session, "Hello!")
{:ok, session, response} = ExLLM.Session.chat(session, "How are you?")

# Multimodal with vision
{:ok, response} = ExLLM.chat(:gemini, [
  %{role: "user", content: [
    %{type: "text", text: "What's in this image?"},
    %{type: "image", image: %{data: base64_image, media_type: "image/jpeg"}}
  ]}
])
```
You can configure providers in your `config/config.exs`:
```elixir
import Config

config :ex_llm,
  default_provider: :openai,
  providers: [
    openai: [api_key: System.get_env("OPENAI_API_KEY")],
    anthropic: [api_key: System.get_env("ANTHROPIC_API_KEY")],
    gemini: [api_key: System.get_env("GEMINI_API_KEY")]
  ]
```
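If you prefer to resolve API keys when the release boots rather than at compile time, the same settings can live in `config/runtime.exs`. This is standard Elixir runtime configuration, not an ExLLM-specific mechanism; the sketch below assumes ExLLM reads the same `:ex_llm` application environment shown above:

```elixir
# config/runtime.exs -- keys are read from the environment at boot time.
import Config

config :ex_llm,
  providers: [
    openai: [api_key: System.fetch_env!("OPENAI_API_KEY")]
  ]
```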
The test suite includes both unit tests and integration tests. Integration tests that make live API calls are tagged and excluded by default.
To run unit tests only:

```bash
mix test
```

To run integration tests (requires API keys):

```bash
mix test --include integration
```

To run tests with intelligent caching for faster development:

```bash
mix test.live  # Runs with test response caching enabled
```
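The integration/unit split relies on standard ExUnit tags. A minimal sketch of how a live-API test is typically tagged (the module and test names here are illustrative, not taken from the ExLLM test suite):

```elixir
defmodule MyApp.AnthropicIntegrationTest do
  use ExUnit.Case, async: false

  # Excluded by a plain `mix test`; included with `mix test --include integration`.
  @tag :integration
  test "performs a live chat completion" do
    assert {:ok, response} =
             ExLLM.chat(:anthropic, [%{role: "user", content: "ping"}])

    assert is_binary(response.content)
  end
end
```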
ExLLM uses a clean, modular architecture that separates concerns while maintaining a unified API:
- `ExLLM` - Main entry point with unified API
- `ExLLM.API.Delegator` - Central delegation engine for provider routing
- `ExLLM.API.Capabilities` - Provider capability registry
- `ExLLM.Pipeline` - Phoenix-style pipeline for request processing
- `ExLLM.Embeddings` - Vector operations and similarity calculations
- `ExLLM.Assistants` - OpenAI Assistants API for stateful agents
- `ExLLM.KnowledgeBase` - Document management and semantic search
- `ExLLM.Builder` - Fluent interface for chat construction
- `ExLLM.Session` - Conversation state management
- Clean Separation: Each module has a single, focused responsibility
- Easy Extension: Adding providers requires changes in just 1-2 files
- Performance: Delegation adds minimal overhead
- Maintainability: Clear boundaries between components
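To illustrate why adding a provider only touches a couple of files, here is a conceptual sketch of delegation-based routing. This is not ExLLM's actual implementation; the module and function names are invented purely for illustration:

```elixir
# Conceptual sketch only -- not ExLLM's real code. A unified call is routed to a
# provider-specific module looked up in a single registry, so supporting a new
# provider means adding one registry entry and one provider module.
defmodule MyDelegator do
  @providers %{
    openai: MyProviders.OpenAI,
    anthropic: MyProviders.Anthropic
  }

  def chat(provider, messages, opts \\ []) do
    case Map.fetch(@providers, provider) do
      {:ok, module} -> module.chat(messages, opts)
      :error -> {:error, {:unsupported_provider, provider}}
    end
  end
end
```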
- Quick Start Guide - Get up and running in 5 minutes
- User Guide - Comprehensive documentation of all features
- Architecture Guide - Clean layered architecture and namespace organization
- Pipeline Architecture - Phoenix-style plug system and extensibility
- Logger Guide - Debug logging and troubleshooting
- Provider Capabilities - Feature comparison across providers
- Testing Guide - Comprehensive testing system with semantic tagging and caching
- Configuration: Environment variables, config files, and provider setup
- Chat Completions: Messages, parameters, and response handling
- Streaming: Real-time responses with error recovery and stream coordination
- Session Management: Conversation state and persistence
- Function Calling: Tool use and structured interactions across providers
- Vision & Multimodal: Image, audio, and document processing
- Cost Tracking: Automatic cost calculation and token estimation
- Error Handling: Retry logic and error recovery strategies
- Test Caching: Intelligent response caching for faster development
- Model Discovery: Query available models and capabilities
- OAuth2 Integration: Complete OAuth2 flow for Gemini and other providers
- Unified API Guide - Complete unified API documentation
- Migration Guide - Upgrading to v1.0.0
- Release Checklist - Automated release process
- API Reference - Detailed API documentation on HexDocs
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- Documentation: User Guide
- Issues: GitHub Issues
- Discussions: GitHub Discussions