swift-llama-cpp

Run any LLM locally on iOS or MacOS. Powered by llama.cpp

Coverage

This wrapper covers:

Model loading (single file and splits), save, metadata, size, params, encoder/decoder flags
Vocab API (token text, score, attrs, special tokens), tokenize/detokenize
Context creation/free, threads, embeddings/attention/warmup toggles
Memory API (sequence remove/copy/keep/add/div/min/max/canShift)
Encode/decode, logits/embeddings getters, synchronize
State/session save-load, per-sequence state
Chat templates (apply and list built-ins)
Sampler chain (grammar, top-k, top-p, temp, penalties, dist), sample/accept/reset/clone
LoRA adapter load/apply/remove/clear, control vectors
Backend init/free, capability queries, system info, logging hook

Basic usage

Here is a quick example of how to use SwiftLlama to generate text from a model.

First, make sure you have a GGUF model file accessible in your project. You can download models from sources like Hugging Face.

import SwiftLlama
import Foundation

// 1. Get the model URL
// Make sure to add a GGUF model to your project and get its URL.
guard let modelUrl = Bundle.main.url(forResource: "your-model-name", withExtension: "gguf") else {
    print("Model file not found")
    return
}

// 2. Initialize the LlamaService
// This service manages the model and context.
let llamaService = LlamaService(modelUrl: modelUrl, config: .init(batchSize: 256, maxTokenCount: 4096, useGPU: true))

// 3. Prepare your messages
// The conversation history can be provided as an array of messages.
let messages = [
    LlamaChatMessage(role: .system, content: "You are a helpful assistant."),
    LlamaChatMessage(role: .user, content: "Tell me a short story."),
]

// 4. Generate text
// The `streamCompletion` method returns an `AsyncThrowingStream` of tokens.
do {
    let stream = try await llamaService.streamCompletion(of: messages, samplingConfig: .init(temperature: 0.8, seed: 42))
    var generatedText = ""
    for try await token in stream {
        generatedText += token
        print("Generated token: \(token)")
    }
    print("Generated text: \(generatedText)")
} catch {
    print("Error generating text: \(error.localizedDescription)")
}

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.cursor/rules		.cursor/rules
.swiftpm		.swiftpm
Sources/SwiftLlama		Sources/SwiftLlama
Tests/SwiftLlamaTests		Tests/SwiftLlamaTests
.gitignore		.gitignore
LICENSE		LICENSE
Package.swift		Package.swift
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

swift-llama-cpp

Coverage

Basic usage

About

Uh oh!

Releases 1

Packages

Languages

License

pgorzelany/swift-llama-cpp

Folders and files

Latest commit

History

Repository files navigation

swift-llama-cpp

Coverage

Basic usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages