A unified .NET client library for running large language models (LLMs) locally. LocalAI.NET provides a single, consistent API for interacting with popular local LLM providers like KoboldCpp, Ollama, LM Studio, and Text Generation WebUI. Perfect for developers who want to:
- Run AI models locally without cloud dependencies
- Switch between different local LLM providers without changing code
LocalAI.NET acts as a universal wrapper around local LLM providers, offering:
- Single API: Use the same code regardless of the underlying LLM provider
- Provider Flexibility: Easily switch between KoboldCpp, Ollama, LM Studio, or Text Generation WebUI (see the example after this list)
- Production Ready: Built-in retry policies, circuit breakers, and error handling
- Modern .NET: Async/await, streaming support, and comprehensive logging
- Privacy First: All AI operations run locally - your data never leaves your machine
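For example, swapping providers is just a matter of passing different options; the completion call itself does not change. A minimal sketch using the types shown later in this README (endpoints and option values are illustrative):

using LocalAI.NET.Client;
using LocalAI.NET.Models.Configuration;

// The same completion code works against either provider;
// only the options object changes.
var koboldOptions = new LocalAIOptions
{
    BaseUrl = "http://localhost:5000",
    ProviderOptions = new KoboldCppNativeOptions { ContextSize = 2048 }
};
var ollamaOptions = new LocalAIOptions
{
    BaseUrl = "http://localhost:11434",
    ProviderOptions = new OllamaOptions()
};

using var client = new LocalAIClient(ollamaOptions); // or koboldOptions
string reply = await client.CompleteAsync("Hello from a local model!");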
Feature | LocalAI.NET | OpenAI.NET | LLamaSharp | OllamaSharp |
---|---|---|---|---|
Local LLM Support | ✅ | ❌ | ✅ | ✅ |
Multiple Providers | ✅ | ❌ | ❌ | ❌ |
KoboldCpp Support | ✅ | ❌ | ❌ | ❌ |
Ollama Support | ✅ | ❌ | ❌ | ✅ |
LM Studio Support | ✅ | ❌ | ❌ | ❌ |
Text Gen WebUI Support | ✅ | ❌ | ❌ | ❌ |
Streaming | ✅ | ✅ | ✅ | ✅ |
OpenAI Compatible | ✅ | ✅ | ❌ | ✅ |
Progress Tracking | ✅ | ❌ | ❌ | ❌ |
Retry Policies | ✅ | ❌ | ❌ | ❌ |
Circuit Breaker | ✅ | ❌ | ❌ | ❌ |
.NET Standard 2.0 | ❌ | ✅ | ✅ | ✅ |
.NET 6.0+ | ✅ | ✅ | ✅ | ✅ |
- KoboldCpp: Both native and OpenAI-compatible modes
- Ollama: Run Llama 2, Code Llama, and other models locally (using OllamaSharp).
- LM Studio: Local deployment of various open-source models
- Text Generation WebUI: Popular web interface for running local models
Install LocalAI.NET via NuGet:
dotnet add package LocalAI.NET
using LocalAI.NET.Client;
using LocalAI.NET.Models.Configuration;
// Create client with KoboldCpp provider
var options = new LocalAIOptions
{
BaseUrl = "http://localhost:5000",
ProviderOptions = new KoboldCppNativeOptions
{
ContextSize = 2048,
UseGpu = true,
RepetitionPenalty = 1.1f
}
};
using var client = new LocalAIClient(options);
// Generate text completion
string response = await client.CompleteAsync("Write a short story about a robot:");
// Stream completion tokens
await foreach (var token in client.StreamCompletionAsync("Once upon a time..."))
{
Console.Write(token);
}
// List available models
var models = await client.GetAvailableModelsAsync();
foreach (var model in models)
{
Console.WriteLine($"Model: {model.Name} (Provider: {model.Provider})");
Console.WriteLine($"Context Length: {model.Capabilities.MaxContextLength}");
}
// KoboldCpp native API configuration
var options = new LocalAIOptions
{
BaseUrl = "http://localhost:5000",
ProviderOptions = new KoboldCppNativeOptions
{
ContextSize = 2048,
UseGpu = true,
RepetitionPenalty = 1.1f,
RepetitionPenaltyRange = 320,
TrimStop = true,
        Mirostat = new MirostatSettings
        {
            Mode = 2,    // Mirostat version (2 = Mirostat 2.0)
            Tau = 5.0f,  // Target entropy
            Eta = 0.1f   // Learning rate
        }
}
};
// KoboldCpp OpenAI-compatible API configuration
var options = new LocalAIOptions
{
BaseUrl = "http://localhost:5000",
ProviderOptions = new KoboldCppOpenAiOptions
{
ContextSize = 2048,
UseGpu = true,
ModelName = "koboldcpp",
UseChatCompletions = true
}
};
// Ollama configuration
var options = new LocalAIOptions
{
BaseUrl = "http://localhost:11434",
ProviderOptions = new OllamaOptions
{
ConcurrentRequests = 1
}
};
// LM Studio configuration
var options = new LocalAIOptions
{
BaseUrl = "http://localhost:1234",
ProviderOptions = new LMStudioOptions
{
UseOpenAIEndpoint = true
}
};
// Text Generation WebUI configuration
var options = new LocalAIOptions
{
BaseUrl = "http://localhost:7860",
ProviderOptions = new TextGenWebOptions
{
UseOpenAIEndpoint = true
}
};
var options = new CompletionOptions
{
ModelName = "wizardLM", // Optional model name
MaxTokens = 200, // Max tokens to generate
Temperature = 0.7f, // Randomness (0.0-1.0)
TopP = 0.9f, // Nucleus sampling threshold
StopSequences = new[] { "\n" } // Sequences that stop generation
};
string response = await client.CompleteAsync("Your prompt here", options);
client.OnProgress += (progress) =>
{
switch (progress.State)
{
case LocalAIProgressState.Starting:
Console.WriteLine("Starting completion...");
break;
case LocalAIProgressState.Processing:
Console.WriteLine($"Processing: {progress.Message}");
break;
case LocalAIProgressState.Streaming:
Console.WriteLine("Receiving tokens...");
break;
case LocalAIProgressState.Complete:
Console.WriteLine("Completion finished!");
break;
case LocalAIProgressState.Failed:
Console.WriteLine($"Error: {progress.Message}");
break;
}
};
try
{
var response = await client.CompleteAsync("Test prompt");
}
catch (LocalAIException ex)
{
Console.WriteLine($"LocalAI API error: {ex.Message}");
if (ex.StatusCode.HasValue)
{
Console.WriteLine($"Status code: {ex.StatusCode}");
}
if (ex.Provider != null)
{
Console.WriteLine($"Provider: {ex.Provider}");
}
}
catch (Exception ex)
{
Console.WriteLine($"General error: {ex.Message}");
}
// Advanced client configuration
var options = new LocalAIOptions
{
    BaseUrl = "http://localhost:5000",              // Provider endpoint
    ApiKey = "optional_api_key",                    // Optional API key, if the provider requires one
    Timeout = TimeSpan.FromMinutes(2),              // Per-request timeout
    MaxRetryAttempts = 3,                           // Retry policy: number of attempts
    RetryDelay = TimeSpan.FromSeconds(2),           // Retry policy: delay between attempts
    Logger = loggerInstance,                        // Optional logger for diagnostics
    JsonSettings = new JsonSerializerSettings(),    // Custom JSON serializer settings
    ProviderOptions = new KoboldCppNativeOptions()  // Provider-specific options
};
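The snippet above references a loggerInstance without defining it. A minimal sketch of creating one, assuming the Logger property accepts a Microsoft.Extensions.Logging ILogger (the console provider comes from the Microsoft.Extensions.Logging.Console package):

using Microsoft.Extensions.Logging;

// Hypothetical logger setup for the Logger option above
using var loggerFactory = LoggerFactory.Create(builder => builder.AddConsole());
ILogger loggerInstance = loggerFactory.CreateLogger("LocalAI.NET");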
This project is licensed under the MIT License - see the LICENSE file for details.
Contributions are welcome! Please feel free to submit a Pull Request.
Please see CONTRIBUTING.md for details on:
- How to publish to NuGet
- Development guidelines
- Code style
- Testing requirements
- Pull request process
For issues and feature requests, please use the GitHub issues page.