Package gai provides a unified interface for interacting with various large language model (LLM) providers.
The package abstracts away provider-specific implementations, allowing you to write code that works with multiple AI providers (OpenAI, Anthropic, Google Gemini) without changing your core logic. Key features include:
- Unified API across different LLM providers
- Support for text, image, audio, and PDF modalities (provider dependent)
- Tool integration with JSON Schema-based parameters
- Callback-based tool execution
- Automatic fallback strategies for reliability
- Standardized error types for better error handling
- Detailed usage metrics
- Model Context Protocol (MCP) client support
To install the package:

go get github.com/spachava753/gai

Generator: The core interface that all providers implement. It takes a Dialog and generates a Response.
type Generator interface {
Generate(ctx context.Context, dialog Dialog, options *GenOpts) (Response, error)
}

Each LLM provider (OpenAI, Anthropic, Gemini) has its own implementation of the Generator interface.
Dialog: A conversation with a language model, represented as a slice of Message objects.
type Dialog []Message

Message: A single exchange in the conversation, with a Role (User, Assistant, or ToolResult) and a collection of Blocks.
type Message struct {
Role Role
Blocks []Block
ToolResultError bool
ExtraFields map[string]interface{}
}

Block: A self-contained piece of content within a message, which can be text, image, audio, or a tool call.
type Block struct {
ID string
BlockType string
ModalityType Modality
MimeType string
Content fmt.Stringer
ExtraFields map[string]interface{}
}

Common block types include:
- Content - Regular content like text or images
- Thinking - Reasoning/thinking from the model
- ToolCall - A request to call a tool
Modalities: gai supports multiple modalities for input and output.
type Modality uint
const (
Text Modality = iota
Image
Audio
Video
)

Support for specific modalities depends on the underlying model provider.
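To make the relationship between these types concrete, the sketch below walks the blocks of a message and branches on BlockType and ModalityType. It uses only the identifiers introduced above; the printed strings are illustrative.

for _, block := range message.Blocks {
	switch block.BlockType {
	case gai.Content:
		if block.ModalityType == gai.Text {
			// Plain text content
			fmt.Println("text:", block.Content)
		} else {
			// Binary content such as an image or audio clip
			fmt.Println("non-text content, MIME type:", block.MimeType)
		}
	case gai.Thinking:
		fmt.Println("model reasoning:", block.Content)
	case gai.ToolCall:
		fmt.Println("tool call requested, ID:", block.ID)
	}
}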
Tool: A function that can be called by the language model during generation.
type Tool struct {
Name string
Description string
InputSchema *jsonschema.Schema
}

The InputSchema defines the parameters the tool accepts using JSON Schema conventions:
&jsonschema.Schema{
Type: "object",
Properties: map[string]*jsonschema.Schema{...},
Required: []string{...},
}
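For example, a hypothetical get_weather tool (not part of the package) could be declared as follows; the Description field on the property schema is an assumption about the jsonschema package gai uses:

// Hypothetical tool definition following the schema shape shown above
weatherTool := gai.Tool{
	Name:        "get_weather",
	Description: "Get the current weather for a location",
	InputSchema: &jsonschema.Schema{
		Type: "object",
		Properties: map[string]*jsonschema.Schema{
			"location": {
				Type:        "string",
				Description: "City name, e.g. Paris", // assumed field
			},
		},
		Required: []string{"location"},
	},
}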
Basic usage with OpenAI:
package main
import (
"context"
"fmt"
"github.com/openai/openai-go/v3"
"github.com/spachava753/gai"
)
func main() {
// Create an OpenAI client
client := openai.NewClient()
// Create a generator with a specific model
generator := gai.NewOpenAiGenerator(
&client.Chat.Completions,
openai.ChatModelGPT4,
"You are a helpful assistant.",
)
// Create a dialog with a user message
dialog := gai.Dialog{
{
Role: gai.User,
Blocks: []gai.Block{
{
BlockType: gai.Content,
ModalityType: gai.Text,
Content: gai.Str("What is the capital of France?"),
},
},
},
}
// Generate a response
response, err := generator.Generate(context.Background(), dialog, &gai.GenOpts{
Temperature: gai.Ptr(0.7),
})
if err != nil {
fmt.Printf("Error: %v\n", err)
return
}
// Print the response
if len(response.Candidates) > 0 && len(response.Candidates[0].Blocks) > 0 {
fmt.Println(response.Candidates[0].Blocks[0].Content)
}
// Get usage metrics
if inputTokens, ok := gai.InputTokens(response.UsageMetadata); ok {
fmt.Printf("Input tokens: %d\n", inputTokens)
}
if outputTokens, ok := gai.OutputTokens(response.UsageMetadata); ok {
fmt.Printf("Output tokens: %d\n", outputTokens)
}
}

The OpenAI Responses generator supports explicit prompt cache routing through [GenOpts.ExtraArgs]. Set [ResponsesPromptCacheKeyParam] to a stable key for requests that share the same long static prompt prefix. Keep repeated instructions, schemas, and tool definitions at the beginning of the prompt, and put request-specific content near the end.
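// ctx and dialog are assumed to be in scope, as in the earlier examples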
client := openai.NewClient()
gen := gai.NewResponsesGenerator(
&client.Responses,
openai.ChatModelGPT5Mini,
"You are a helpful assistant that summarizes support incidents.",
)
opts := &gai.GenOpts{ExtraArgs: map[string]any{
gai.ResponsesPromptCacheKeyParam: "support-incident-summary:v1",
}}
resp, err := gen.Generate(ctx, dialog, opts)
if err != nil {
fmt.Printf("Error: %v\n", err)
return
}
if cached, ok := gai.CacheReadTokens(resp.UsageMetadata); ok {
fmt.Printf("cached tokens: %d\n", cached)
}

Using tools with a language model:
package main
import (
"context"
"encoding/json"
"fmt"
"time"
"github.com/openai/openai-go/v3"
"github.com/spachava753/gai"
)
// Define a tool callback for getting the current time
type TimeToolCallback struct{}
func (t TimeToolCallback) Call(ctx context.Context, parametersJSON json.RawMessage, toolCallID string) (gai.Message, error) {
return gai.ToolResultMessage(toolCallID, gai.TextBlock(time.Now().Format(time.RFC1123))), nil
}
func main() {
client := openai.NewClient()
// Create an OpenAI generator
baseGen := gai.NewOpenAiGenerator(
&client.Chat.Completions,
openai.ChatModelGPT4,
"You are a helpful assistant.",
)
// Create a tool generator that wraps the base generator
toolGen := &gai.ToolGenerator{
G: &baseGen,
}
// Define a time tool
timeTool := gai.Tool{
Name: "get_current_time",
Description: "Get the current server time",
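// InputSchema is omitted because this tool takes no parameters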
}
// Register the tool with its callback
if err := toolGen.Register(timeTool, &TimeToolCallback{}); err != nil {
fmt.Printf("Error registering tool: %v\n", err)
return
}
// Create a dialog
dialog := gai.Dialog{
{
Role: gai.User,
Blocks: []gai.Block{
{
BlockType: gai.Content,
ModalityType: gai.Text,
Content: gai.Str("What time is it now?"),
},
},
},
}
// Generate a response with tool usage
completeDialog, err := toolGen.Generate(context.Background(), dialog, func(d gai.Dialog) *gai.GenOpts {
return &gai.GenOpts{
ToolChoice: gai.ToolChoiceAuto,
}
})
if err != nil {
fmt.Printf("Error: %v\n", err)
return
}
// Print the final result
finalMsg := completeDialog[len(completeDialog)-1]
if len(finalMsg.Blocks) > 0 {
fmt.Println(finalMsg.Blocks[0].Content)
}
}

Implementing a fallback strategy between providers:
package main
import (
"context"
"fmt"
"github.com/anthropics/anthropic-sdk-go"
"github.com/openai/openai-go/v3"
"github.com/spachava753/gai"
)
func main() {
// Create clients for both providers
openaiClient := openai.NewClient()
anthropicClient := anthropic.NewClient()
// Create generators for each provider
openaiGen := gai.NewOpenAiGenerator(
&openaiClient.Chat.Completions,
openai.ChatModelGPT4,
"You are a helpful assistant.",
)
anthropicGen := gai.NewAnthropicGenerator(
&anthropicClient.Messages,
"claude-3-opus-20240229",
"You are a helpful assistant.",
)
// Create a fallback generator that tries OpenAI first, then falls back to Anthropic
fallbackGen, err := gai.NewFallbackGenerator(
[]gai.Generator{&openaiGen, &anthropicGen},
&gai.FallbackConfig{
// Custom fallback condition: fall back on rate limits and 5xx errors
ShouldFallback: gai.NewHTTPStatusFallbackConfig(429, 500, 502, 503, 504).ShouldFallback,
},
)
if err != nil {
fmt.Printf("Error creating fallback generator: %v\n", err)
return
}
// Create a dialog
dialog := gai.Dialog{
{
Role: gai.User,
Blocks: []gai.Block{
{
BlockType: gai.Content,
ModalityType: gai.Text,
Content: gai.Str("What is the meaning of life?"),
},
},
},
}
// Generate a response using the fallback strategy
response, err := fallbackGen.Generate(context.Background(), dialog, &gai.GenOpts{
Temperature: gai.Ptr(0.7),
})
if err != nil {
fmt.Printf("Error: %v\n", err)
return
}
// Print the response
if len(response.Candidates) > 0 && len(response.Candidates[0].Blocks) > 0 {
fmt.Println(response.Candidates[0].Blocks[0].Content)
}
}

Many LLM providers support "thinking" or "reasoning" output, where the model shows its internal reasoning process. gai normalizes these into Thinking blocks (BlockType == Thinking).
To identify which generator produced a thinking block, check the ThinkingExtraFieldGeneratorKey in the block's ExtraFields. This allows you to handle provider-specific features:
for _, block := range message.Blocks {
if block.BlockType == gai.Thinking {
generator := block.ExtraFields[gai.ThinkingExtraFieldGeneratorKey]
fmt.Printf("Thinking from %s: %s\n", generator, block.Content)
// Handle provider-specific fields
switch generator {
case gai.ThinkingGeneratorAnthropic:
// Anthropic requires signatures for extended thinking
if sig, ok := block.ExtraFields[gai.AnthropicExtraFieldThinkingSignature]; ok {
fmt.Printf("Signature: %s\n", sig)
}
case gai.ThinkingGeneratorGemini:
// Gemini may include thought signatures
if sig, ok := block.ExtraFields[gai.GeminiExtraFieldThoughtSignature]; ok {
fmt.Printf("Thought signature: %s\n", sig)
}
case gai.ThinkingGeneratorOpenRouter:
// OpenRouter includes reasoning metadata
reasonType := block.ExtraFields[gai.OpenRouterExtraFieldReasoningType]
fmt.Printf("Reasoning type: %s\n", reasonType)
}
}
}

Available generator constants:
- ThinkingGeneratorAnthropic - Anthropic Claude models with extended thinking
- ThinkingGeneratorCerebras - Cerebras models with reasoning
- ThinkingGeneratorGemini - Google Gemini models with thinking
- ThinkingGeneratorOpenRouter - OpenRouter with reasoning models
- ThinkingGeneratorResponses - OpenAI Responses API with reasoning
- ThinkingGeneratorZai - Zai generator with reasoning
Note: The OpenAI Chat Completions generator (OpenAiGenerator) does not support thinking blocks.
gai supports PDF documents as a special case of the Image modality. PDFs are automatically converted to images at the model provider's API level:
package main
import (
"context"
"fmt"
"os"
"github.com/openai/openai-go/v3"
"github.com/spachava753/gai"
)
func main() {
// Read a PDF file
pdfData, err := os.ReadFile("document.pdf")
if err != nil {
fmt.Printf("Error reading PDF: %v\n", err)
return
}
// Create an OpenAI client and generator
client := openai.NewClient()
generator := gai.NewOpenAiGenerator(
&client.Chat.Completions,
openai.ChatModelGPT4o,
"You are a helpful document analyst.",
)
// Create a dialog with PDF content
dialog := gai.Dialog{
{
Role: gai.User,
Blocks: []gai.Block{
gai.TextBlock("Please summarize this PDF document:"),
gai.PDFBlock(pdfData, "document.pdf"),
},
},
}
// Generate a response
response, err := generator.Generate(context.Background(), dialog, nil)
if err != nil {
fmt.Printf("Error: %v\n", err)
return
}
// Print the response
if len(response.Candidates) > 0 && len(response.Candidates[0].Blocks) > 0 {
fmt.Println(response.Candidates[0].Blocks[0].Content)
}
}

PDF support notes:
- OpenAI: PDF token counting is not supported and returns an error when using the TokenCounter interface
- When creating a PDF block, you must provide both the PDF data and a filename, e.g. PDFBlock(data, "paper.pdf")
- All providers: PDFs are converted to images server-side, so exact page dimensions are not known
The package supports multiple LLM providers with varying capabilities:
OpenAI: The OpenAI implementation supports text generation, image inputs (including PDFs), audio inputs, and tool calling.
import (
"github.com/openai/openai-go/v3"
"github.com/spachava753/gai"
)
client := openai.NewClient()
generator := gai.NewOpenAiGenerator(
&client.Chat.Completions,
openai.ChatModelGPT4,
"System instructions here.",
)

Anthropic: The Anthropic implementation supports text generation, image inputs (including PDFs with special handling), and tool calling.
import (
"github.com/anthropics/anthropic-sdk-go"
"github.com/spachava753/gai"
)
client := anthropic.NewClient()
generator := gai.NewAnthropicGenerator(
&client.Messages,
"claude-3-opus-20240229",
"System instructions here.",
)

Gemini: The Gemini implementation supports text generation, image inputs (including PDFs), audio inputs, and tool calling.
import (
"google.golang.org/genai"
"github.com/spachava753/gai"
)
client, err := genai.NewClient(ctx, &genai.ClientConfig{
APIKey: "your-api-key",
})
generator, err := gai.NewGeminiGenerator(
client,
"gemini-1.5-pro",
"System instructions here.",
)

The package provides standardized error types for consistent error handling across providers:
- ErrMaxGenerationLimit - Maximum token generation limit reached
- UnsupportedInputModalityErr - Model doesn't support the requested input modality
- UnsupportedOutputModalityErr - Model doesn't support the requested output modality
- InvalidToolChoiceErr - Invalid tool choice specified
- InvalidParameterErr - Invalid generation parameter
- ErrContextLengthExceeded - Input dialog exceeds model's context length
- ContentPolicyErr - Content violates usage policies
- ErrEmptyDialog - No messages provided
- ApiErr - Provider/server errors with normalized provider, kind, status, and message fields
Example error handling:
response, err := generator.Generate(ctx, dialog, options)
if err != nil {
switch {
case errors.Is(err, gai.ErrMaxGenerationLimit):
fmt.Println("Maximum generation limit reached")
case errors.Is(err, gai.ErrContextLengthExceeded):
fmt.Println("Context length exceeded")
case errors.Is(err, gai.ErrEmptyDialog):
fmt.Println("Empty dialog provided")
case errors.As(err, &gai.ContentPolicyErr{}):
fmt.Println("Content policy violation:", err)
default:
var apiErr *gai.ApiErr
if errors.As(err, &apiErr) {
fmt.Printf("API error: provider=%s kind=%s status=%d message=%s\n", apiErr.Provider, apiErr.Kind, apiErr.StatusCode, apiErr.Message)
} else {
fmt.Println("Unexpected error:", err)
}
}
return
}

Tool Generator: The ToolGenerator provides advanced functionality for working with tools. It automatically handles registering tools with the underlying generator, executing tool callbacks when tools are called, managing the conversation flow during tool use, and handling parallel tool calls.
type ToolGenerator struct {
G ToolCapableGenerator
toolCallbacks map[string]ToolCallback
}

Example:
// Create a base generator (OpenAI or Anthropic)
baseGen := gai.NewOpenAiGenerator(...)
// Create a tool generator
toolGen := &gai.ToolGenerator{
G: &baseGen,
}
// Register tools with callbacks
toolGen.Register(weatherTool, &WeatherAPI{})
toolGen.Register(stockPriceTool, &StockAPI{})
// Generate with tool support
completeDialog, err := toolGen.Generate(ctx, dialog, func(d gai.Dialog) *gai.GenOpts {
return &gai.GenOpts{
ToolChoice: gai.ToolChoiceAuto,
Temperature: gai.Ptr(0.7),
}
})

Fallback Generator: The FallbackGenerator provides automatic fallback between different providers. It automatically tries each generator in sequence, falls back based on configurable conditions, and preserves the original error if all generators fail.
type FallbackGenerator struct {
generators []Generator
config FallbackConfig
}

Configuration options:
- NewHTTPStatusFallbackConfig() - Fallback on specific HTTP status codes
- NewRateLimitOnlyFallbackConfig() - Fallback only on rate limit errors
- Custom fallback logic via ShouldFallback function
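For instance, the rate-limit-only preset can supply the ShouldFallback function directly (a minimal sketch, assuming the preset returns a config value with a ShouldFallback field, as NewHTTPStatusFallbackConfig does below):

// Fall back to the next generator only on rate limit errors
rateLimitCfg := gai.NewRateLimitOnlyFallbackConfig()
fallbackGen, err := gai.NewFallbackGenerator(
	[]gai.Generator{primaryGen, backupGen},
	&gai.FallbackConfig{ShouldFallback: rateLimitCfg.ShouldFallback},
)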
Example with a custom ShouldFallback function:
primaryGen := gai.NewOpenAiGenerator(...)
backupGen := gai.NewAnthropicGenerator(...)
fallbackGen, err := gai.NewFallbackGenerator(
[]gai.Generator{primaryGen, backupGen},
&gai.FallbackConfig{
ShouldFallback: func(err error) bool {
// Custom fallback logic
var apiErr *gai.ApiErr
return errors.As(err, &apiErr) && apiErr.Retryable()
},
},
)

The package includes MCP (Model Context Protocol) client support for connecting to external tools and data sources. The MCP client allows you to connect to MCP servers via stdio, HTTP, or other transports and use their tools within the gai framework.
Note: This MCP implementation does not support JSON-RPC batch requests/responses. All messages are sent and received individually for simplicity and forward compatibility with planned protocol changes.
Example MCP usage:
import "github.com/spachava753/gai/mcp"
// Create MCP client
transport := mcp.NewStdio(mcp.StdioConfig{
Command: "python",
Args: []string{"mcp_server.py"},
})
client, err := mcp.NewClient(ctx, transport, mcp.ClientInfo{
Name: "gai-client",
Version: "1.0.0",
}, mcp.ClientCapabilities{}, mcp.DefaultOptions())
// Register MCP tools with a tool generator
err = mcp.RegisterMCPToolsWithGenerator(ctx, client, toolGen)

For more information and examples, see the example files in the repository.
This project is licensed under the MIT License.