@terminals-tech/embeddings

Ultra-lightweight semantic embeddings for event graphs. Browser-first with swappable providers.

Zero dependencies in the core, under 3MB total including the embedding model. Runs entirely in the browser.

Installation

npm install @terminals-tech/embeddings

Quick Start

import { EmbeddingProviderFactory, SemanticSearch } from '@terminals-tech/embeddings'

// Create the best available provider (auto-detects)
const provider = await EmbeddingProviderFactory.createBest({
  cache: true,
  quantizeCache: true  // Save 75% memory
})

// Create semantic search
const search = new SemanticSearch(provider)

// Add documents
await search.addDocuments([
  { text: 'User logged in successfully' },
  { text: 'Authentication failed due to invalid password' },
  { text: 'Session expired after timeout' }
])

// Search semantically
const results = await search.search('login problems', { topK: 2 })
// Returns documents about authentication failures and session issues

Features

🎯 Ultra-Lightweight

  • TransformersJS Provider: all-MiniLM-L6-v2 (22MB model, 384 dimensions)
  • Mock Provider: Deterministic embeddings for testing (0 dependencies)
  • Memory-Efficient Cache: LRU with optional int8 quantization

🔄 Swappable Providers

// Use TransformersJS (browser-optimized)
const transformer = await EmbeddingProviderFactory.create('transformers', {
  modelId: 'Xenova/all-MiniLM-L6-v2'
})

// Use mock for testing
const mock = await EmbeddingProviderFactory.create('mock')

// Auto-detect best available
const best = await EmbeddingProviderFactory.createBest()

💾 Memory-Efficient Caching

import { EmbeddingCache } from '@terminals-tech/embeddings'

const cache = new EmbeddingCache({
  maxSize: 1000,       // Maximum entries
  ttlMs: 3600000,      // 1 hour TTL
  quantize: true       // Int8 quantization (75% memory savings)
})

// Use with any provider
const provider = new TransformersEmbeddingProvider({
  cache: true,
  cacheSize: 500,
  quantizeCache: true
})

πŸ” Semantic Search

const search = new SemanticSearch(provider)

// Add documents with metadata
await search.addDocuments([
  { 
    text: 'Critical error in payment processing',
    metadata: { severity: 'high', timestamp: Date.now() }
  }
])

// Search with threshold
const results = await search.search('payment issues', {
  topK: 5,
  threshold: 0.7  // Minimum similarity score
})

// Find semantic clusters
const clusters = await search.findClusters({
  minClusterSize: 3,
  similarityThreshold: 0.8
})

Integration with @terminals-tech/graph

Enhance your event graphs with semantic understanding:

import { TextGraph } from '@terminals-tech/graph'
import { enhanceGraphWithSemantics } from '@terminals-tech/embeddings'

const graph = new TextGraph()
await enhanceGraphWithSemantics(graph)

// Now extract semantic relationships
const relations = await graph.extractSemanticRelations(
  'The server crashed because memory usage exceeded the limit'
)
// Understands causal relationship even without explicit keywords

// Find semantic clusters in events
const clusters = await graph.findSemanticClusters(events)
// Groups events by meaning, not just structure

Provider Comparison

| Provider       | Model Size | Dimensions | Speed   | Quality | Dependencies              |
|----------------|------------|------------|---------|---------|---------------------------|
| TransformersJS | 22MB       | 384        | Fast    | High    | @huggingface/transformers |
| Mock           | 0KB        | 64         | Instant | Test    | None                      |

Memory Optimization

With quantization enabled:

  • Float32: 384 dimensions × 4 bytes = 1,536 bytes per embedding
  • Int8: 384 dimensions × 1 byte = 384 bytes per embedding
  • Savings: 75% memory reduction with ~5% accuracy loss
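The arithmetic above can be sketched as simple scale-based int8 quantization. This is an illustrative implementation of the general technique, not the library's internal representation; function names here are hypothetical.

```typescript
// Quantize a float32 embedding to int8 by scaling so the
// largest-magnitude component maps to 127. Stores one extra
// float (the scale) alongside the Int8Array.
function quantize(v: Float32Array): { q: Int8Array; scale: number } {
  let max = 0
  for (const x of v) max = Math.max(max, Math.abs(x))
  const scale = max / 127 || 1  // avoid division by zero for all-zero vectors
  const q = new Int8Array(v.length)
  for (let i = 0; i < v.length; i++) q[i] = Math.round(v[i] / scale)
  return { q, scale }
}

// Recover an approximate float32 vector for similarity computation.
function dequantize(q: Int8Array, scale: number): Float32Array {
  const v = new Float32Array(q.length)
  for (let i = 0; i < q.length; i++) v[i] = q[i] * scale
  return v
}
```

A 384-dimension embedding shrinks from 1,536 bytes to 384 bytes plus one scale factor, matching the 75% figure above; the rounding step is where the small accuracy loss comes from.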

Browser Support

  • ✅ Chrome 90+
  • ✅ Firefox 89+
  • ✅ Safari 14.1+
  • ✅ Edge 90+

WebAssembly and SIMD support recommended for best performance.

API Reference

EmbeddingProvider Interface

interface EmbeddingProvider {
  embed(text: string): Promise<EmbeddingVector>
  embedBatch(texts: string[]): Promise<EmbeddingVector[]>
  similarity(a: EmbeddingVector, b: EmbeddingVector): number
  dispose?(): void
}
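Any object with this shape can act as a provider. As a sketch of what a custom implementation might look like, here is a hypothetical hash-based provider in the spirit of the Mock provider (deterministic, dependency-free); it is not the library's actual Mock implementation.

```typescript
type EmbeddingVector = Float32Array

// Toy deterministic provider: hashes characters into 64 buckets,
// then L2-normalizes so a plain dot product is cosine similarity.
class ToyProvider {
  embed(text: string): Promise<EmbeddingVector> {
    const v = new Float32Array(64)
    for (let i = 0; i < text.length; i++) {
      v[(text.charCodeAt(i) * 31 + i) % 64] += 1
    }
    const norm = Math.hypot(...v) || 1
    return Promise.resolve(v.map(x => x / norm))
  }

  embedBatch(texts: string[]): Promise<EmbeddingVector[]> {
    return Promise.all(texts.map(t => this.embed(t)))
  }

  similarity(a: EmbeddingVector, b: EmbeddingVector): number {
    let dot = 0
    for (let i = 0; i < a.length; i++) dot += a[i] * b[i]
    return dot  // cosine similarity, since vectors are unit-length
  }
}
```

Because `dispose?()` is optional, a provider with no resources to release (like this one) can simply omit it.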

SemanticSearch Methods

  • addDocuments(docs) - Add documents to search index
  • search(query, options) - Search for similar documents
  • findClusters(options) - Find document clusters
  • clear() - Clear the search index
  • stats() - Get memory usage statistics

Performance

  • Embedding Generation: ~10ms per sentence (CPU)
  • Similarity Search: <1ms for 1000 documents
  • Memory Usage: ~400KB for 1000 cached embeddings (quantized)
  • Model Load Time: ~2s first load (cached after)
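The sub-millisecond search figure is plausible because at this scale similarity search is just a linear scan of dot products. A minimal sketch of such a brute-force top-K scan over normalized vectors (illustrative only; names are not the library's API):

```typescript
// Score every stored embedding against the query with a dot product
// (equals cosine similarity for unit-length vectors), then keep the
// k highest-scoring documents.
function topK(
  query: Float32Array,
  docs: Float32Array[],
  k: number
): { index: number; score: number }[] {
  const scored = docs.map((d, index) => {
    let score = 0
    for (let i = 0; i < d.length; i++) score += query[i] * d[i]
    return { index, score }
  })
  return scored.sort((a, b) => b.score - a.score).slice(0, k)
}
```

For 1,000 documents at 384 dimensions this is ~384k multiply-adds per query, which modern JS engines handle well under a millisecond.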

Examples

See the examples/ directory for:

  • full-system.ts - Complete integration with @terminals-tech suite
  • More examples coming soon!

License

MIT © Intuition Labs


Built with ❤️ for developers who want semantic understanding without the complexity.
