Lightweight vector search with 85-96% storage savings.
LEANN stores only the graph structure, not the embeddings. During search, it recomputes embeddings on the fly for visited nodes only. This achieves massive storage reduction while maintaining search quality.
# Build an index - embeddings are computed but NOT stored
Leann.build("knowledge_base") do
add "LEANN saves 85-96% storage compared to traditional vector databases."
add "It uses graph-only storage with on-demand embedding recomputation."
add_directory "docs/", extensions: [".md", ".txt"]
end
# Search semantically
results = Leann.search("knowledge_base", "storage efficiency")
puts results.first.text

At 1 million documents:
| Embedding Model | Traditional HNSW | LEANN | Savings |
|---|---|---|---|
| Ollama (384 dim) | 1.67 GB | 251.8 MB | 85% |
| OpenAI (1536 dim) | 5.96 GB | 251.8 MB | 96% |
Full benchmark across scales:
| Documents | Traditional HNSW | LEANN | Savings |
|---|---|---|---|
| 1,000 | 1.7 MB | 257 KB | 85% |
| 10,000 | 17 MB | 2.5 MB | 85% |
| 100,000 | 171 MB | 25 MB | 85% |
| 1,000,000 | 1.67 GB | 252 MB | 85% |
Measured with Ollama all-minilm (384 dimensions). OpenAI models achieve ~96% savings due to larger embeddings.
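The savings follow directly from the size of the vectors being dropped. A quick back-of-the-envelope check (assuming the usual 4-byte float32 components; the numbers below are arithmetic, not measurements):

docs = 1_000_000
{ "all-minilm" => 384, "text-embedding-3-small" => 1536 }.each do |model, dims|
  gb = docs * dims * 4 / 1024.0**3 # 4 bytes per float32 component
  puts format("%-24s %.2f GB of raw embeddings", model, gb)
end
# all-minilm               1.43 GB of raw embeddings
# text-embedding-3-small   5.72 GB of raw embeddings

Dropping those vectors leaves only the roughly 252 MB graph, which is where the 85% and 96% figures come from.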
Traditional vector databases:
[Documents] + [Embeddings] + [Index] → Large storage (1.67 GB for 1M docs)
LEANN:
[Documents] + [Graph-only] → Tiny storage (252 MB for 1M docs)
↓
Embeddings recomputed on the fly during search
The trade-off: search is slightly slower (requires API calls to recompute embeddings for visited nodes), but storage is dramatically smaller.
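To make the recompute-on-visit idea concrete, here is a hypothetical sketch (toy code, not LEANN's internal API): a greedy best-first walk over the stored graph that embeds a passage's text only the first time the walk reaches it.

def cosine(a, b)
  dot = a.zip(b).sum { |x, y| x * y }
  dot / (Math.sqrt(a.sum { |x| x * x }) * Math.sqrt(b.sum { |x| x * x }))
end

# graph: node id => neighbor ids (the only index data on disk)
# texts: node id => passage text
# embed: callable turning text into a vector (this is the API call)
def lazy_search(graph, texts, query_vec, embed, entry: 0, ef: 8, limit: 3)
  cache = {}
  score = ->(n) { cache[n] ||= cosine(query_vec, embed.call(texts[n])) }
  visited    = { entry => true }
  candidates = [entry]
  results    = [entry]
  until candidates.empty?
    best = candidates.max_by { |n| score.(n) }
    candidates.delete(best)
    # stop once the frontier cannot improve the current top-ef set
    break if results.size >= ef && score.(best) < results.map { |n| score.(n) }.min
    graph[best].each do |n|
      next if visited[n]
      visited[n] = true
      candidates << n
      results << n
    end
    results = results.max_by(ef) { |n| score.(n) }
  end
  results.max_by(limit) { |n| score.(n) }
end

Only nodes the walk actually visits are ever embedded, and the early stop keeps that set small relative to the whole index; that is the entire storage-versus-latency trade.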
gem 'leann'
gem 'ruby_llm' # Recommended - LEANN uses RubyLLM for embeddings automatically

# If you have RubyLLM, just configure it - LEANN uses it automatically
RubyLLM.configure do |config|
  config.openai_api_key = ENV["OPENAI_API_KEY"]
end
# Or configure LEANN directly (without RubyLLM)
Leann.configure do |config|
  config.embedding_provider = :openai # or :ollama
  config.openai_api_key = ENV["OPENAI_API_KEY"]
end

# Index some documents
Leann.build("my_index") do
  add "Ruby is a dynamic programming language."
  add "Rails is a web framework written in Ruby."
  add "Sinatra is a lightweight Ruby web framework."
end
# Search
results = Leann.search("my_index", "web frameworks")
results.each { |r| puts "#{r.score.round(3)}: #{r.text}" }

# Index your documentation
Leann.build("docs") do
  add_directory "docs/", extensions: [".md"]
  add_directory "guides/", extensions: [".md"]
end
# Search API (e.g. a Sinatra route; the json helper comes from sinatra-contrib)
get "/search" do
  results = Leann.search("docs", params[:q], limit: 10)
  json results.map(&:to_h)
end

# Index your codebase
Leann.build("codebase", embedding: :ollama, model: "nomic-embed-text") do
add_directory "app/", extensions: [".rb"]
add_directory "lib/", extensions: [".rb"]
end
# Find relevant code
results = Leann.search("codebase", "user authentication", limit: 5)# No API keys needed - runs entirely local
Leann.configure do |config|
config.embedding_provider = :ollama
end
Leann.build("local_index") do
add "Your private data stays on your machine."
end
results = Leann.search("local_index", "privacy")LEANN works seamlessly with RubyLLM:
- Embeddings: If RubyLLM is present, LEANN uses it automatically for embeddings
- RAG: Use LEANN as a tool for retrieval-augmented generation
# Just configure RubyLLM - LEANN detects and uses it automatically
RubyLLM.configure do |config|
  config.openai_api_key = ENV["OPENAI_API_KEY"]
end
# Build uses RubyLLM.embed under the hood
Leann.build("docs") do
add "Your documents here..."
endrequire "leann"
require "leann/ruby_llm/search"
# Build your index
Leann.build("knowledge_base") do
add "LEANN saves 85-96% storage compared to traditional vector databases."
add "It uses graph-only storage for massive space savings."
end
# Create a RubyLLM chat with LEANN as a tool
chat = RubyLLM.chat(model: "gpt-4o")
  .with_tool(Leann::RubyLLM::Search.new("knowledge_base"))
# The LLM will automatically search when needed
answer = chat.ask("How much storage does LEANN save?")
puts answer
# => "Based on my search, LEANN saves 85-96% storage compared to traditional vector databases..."require "leann/ruby_llm/search"
docs_tool = Leann::RubyLLM::Search.new("docs", name: "search_docs")
code_tool = Leann::RubyLLM::Search.new("codebase", name: "search_code")
chat = RubyLLM.chat(model: "gpt-4o")
  .with_tools(docs_tool, code_tool)
chat.ask("How does authentication work in this project?")
# => LLM searches both indexes and synthesizes an answer# DSL style (recommended)
Leann.build("name") do
add "text", key: "value" # Add text with metadata
add_file "path/to/file.txt" # Add file contents
add_directory "docs/" # Add all files from directory
end
# Programmatic style
builder = Leann::Builder.new("name", embedding: :openai)
builder.add("text one")
builder.add("text two", category: "example")
builder.save

# Simple search
results = Leann.search("index", "query")
# With options
results = Leann.search("index", "query",
limit: 10, # Max results
threshold: 0.7, # Min similarity
filters: { category: "docs" } # Metadata filter
)
# Working with results
results.each do |r|
  puts r.id       # Document ID
  puts r.text     # Document text
  puts r.score    # Similarity score (0-1)
  puts r.metadata # Metadata hash
end
puts results.combined_text # All texts joined

Leann.configure do |config|
  # Embedding provider (defaults to :ruby_llm if available, otherwise :openai)
  config.embedding_provider = :ruby_llm # :ruby_llm, :openai, :ollama, or :fastembed

  # Provider-specific settings (only needed if not using RubyLLM)
  config.openai_api_key = "sk-..."
  config.ollama_host = "http://localhost:11434"
  config.default_embedding_model = "text-embedding-3-small"

  # Index settings
  config.hnsw_m = 16                # Graph connectivity (higher = better recall, bigger graph)
  config.hnsw_ef_construction = 200 # Build quality (higher = better graph, slower builds)
end

LEANN provides ActiveRecord integration for storing indexes in your database instead of files.
# Generate the migration
rails generate leann:install
rails db:migrate

This creates two tables:

- leann_indexes - Stores index metadata and graph configuration
- leann_passages - Stores documents with their text, metadata, and graph neighbors
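As a rough mental model of what those tables hold (hypothetical columns inferred from the description above; the generated migration is the source of truth):

class CreateLeannTables < ActiveRecord::Migration[7.1]
  def change
    create_table :leann_indexes do |t|
      t.string :name    # index name, e.g. "products"
      t.json   :config  # graph settings (hnsw_m, ef_construction, ...)
      t.timestamps
    end
    create_table :leann_passages do |t|
      t.references :leann_index
      t.text :text      # document text
      t.json :metadata  # arbitrary metadata hash
      t.json :neighbors # graph edges to other passages
      t.timestamps
    end
  end
end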
require "leann/rails"
# Build an index (stored in database)
Leann::Rails.build("products") do
add "Red running shoes for athletes", category: "shoes", price: 89.99
add "Blue denim jeans, slim fit", category: "pants", price: 59.99
add "White cotton t-shirt", category: "tops", price: 24.99
end
# Search
results = Leann::Rails.search("products", "comfortable footwear")
results.each { |r| puts "#{r.score.round(3)}: #{r.text}" }
# Other operations
Leann::Rails.exists?("products") # => true
Leann::Rails.list # => ["products"]
Leann::Rails.delete("products") # => trueclass SearchController < ApplicationController
def index
@results = Leann::Rails.search("products", params[:q], limit: 10)
end
end# Direct access to index records
index = Leann::Rails::Index.find_by(name: "products")
index.document_count # => 3
index.search("shoes") # Search directly on the index
# Access passages
index.passages.each do |passage|
  puts passage.text
  puts passage.metadata
end

- Transactions: Index updates are ACID-compliant
- Backups: Indexes are included in database backups
- Scaling: Use read replicas for search-heavy workloads
- Deployment: No need to manage separate index files
| Provider | Setup | Best For |
|---|---|---|
| RubyLLM (default) | gem 'ruby_llm' | Unified API, multiple backends |
| OpenAI | OPENAI_API_KEY | Direct API access |
| Ollama | Install Ollama | Local, privacy-first |
| FastEmbed | gem 'fastembed' | Fast local, no server needed |
When RubyLLM is present, LEANN uses it automatically. This gives you access to all embedding providers RubyLLM supports (OpenAI, Ollama, and more) through a single configuration.
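For example, pointing RubyLLM at a local Ollama server is enough to route LEANN's embeddings through Ollama, with no LEANN-specific settings (a sketch assuming RubyLLM's ollama_api_base option and a locally running Ollama):

RubyLLM.configure do |config|
  config.ollama_api_base = "http://localhost:11434/v1" # local Ollama endpoint
end

# LEANN detects RubyLLM and embeds through whatever backend it is configured for
Leann.build("local_docs") do
  add "Embedded via RubyLLM's configured provider."
end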
FastEmbed provides fast local embeddings using ONNX Runtime - no API keys or running servers required:
gem 'fastembed'

Leann.configure do |config|
  config.embedding_provider = :fastembed
end

# Or specify model explicitly
Leann.build("index", embedding: :fastembed, model: "BAAI/bge-small-en-v1.5") do
  add "Your documents here..."
end

Supported models:
- BAAI/bge-small-en-v1.5 (384 dim, default) - Fast English
- BAAI/bge-base-en-v1.5 (768 dim) - Higher accuracy English
- intfloat/multilingual-e5-small (384 dim) - 100+ languages
- nomic-ai/nomic-embed-text-v1.5 (768 dim) - Long context (8192 tokens)
- Ruby 3.0+
- RubyLLM gem (recommended) OR direct API access:
- OpenAI API key (for cloud embeddings)
- Ollama running locally (for local embeddings)
MIT