
Conversation

@ncylich (Contributor) commented on Oct 13, 2025

Add Nomic Embedding Model Support

Summary

Adds foundational support for Nomic BERT embedding models (e.g., nomic-embed-text-v2-moe),
covering weight loading and HuggingFace conversion. The model architecture is defined, but the
forward pass implementation is deferred to future PRs.

Key Changes

Model Architecture

  • New NomicModel class with weight loading for:
    • Embedding layer normalization
    • Transformer layers (Q/K/V attention, FFN, layer norms)
    • Mixture-of-Experts (MoE) layers
  • Placeholder methods for attention, MLP, and forward pass (to be implemented; see the sketch below)
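
Since the forward pass is deferred, here is a rough PyTorch sketch of the layer structure the C++ NomicModel weight loader targets. Module names, shapes, and the dense/MoE alternation are illustrative assumptions for readability, not the cactus API.

```python
# Illustrative PyTorch sketch of the structure NomicModel's weight loader expects.
# Names, shapes, and the MoE cadence are assumptions, not the cactus C++ API.
import torch.nn as nn

class NomicLayerSketch(nn.Module):
    def __init__(self, hidden: int, ffn: int, num_experts: int, use_moe: bool):
        super().__init__()
        # Separate Q/K/V projections (the converter splits the fused QKV weight).
        self.q_proj = nn.Linear(hidden, hidden)
        self.k_proj = nn.Linear(hidden, hidden)
        self.v_proj = nn.Linear(hidden, hidden)
        self.o_proj = nn.Linear(hidden, hidden)
        self.attn_norm = nn.LayerNorm(hidden)
        self.ffn_norm = nn.LayerNorm(hidden)
        if use_moe:
            # Router plus per-expert MLPs, matching the MoE weights the converter exports.
            self.router = nn.Linear(hidden, num_experts, bias=False)
            self.experts = nn.ModuleList(
                nn.Sequential(nn.Linear(hidden, ffn), nn.GELU(), nn.Linear(ffn, hidden))
                for _ in range(num_experts)
            )
        else:
            self.mlp = nn.Sequential(nn.Linear(hidden, ffn), nn.GELU(), nn.Linear(ffn, hidden))

    def forward(self, x):
        # Placeholder, mirroring the deferred attention/MLP/forward implementation.
        raise NotImplementedError("implemented in a follow-up PR")

class NomicModelSketch(nn.Module):
    def __init__(self, vocab: int, hidden: int, ffn: int, layers: int, num_experts: int):
        super().__init__()
        self.embed = nn.Embedding(vocab, hidden)  # word + token-type embeddings already fused
        self.embed_norm = nn.LayerNorm(hidden)    # embedding layer normalization
        # Alternating dense/MoE layers is an assumption; see the config sketch below.
        self.layers = nn.ModuleList(
            NomicLayerSketch(hidden, ffn, num_experts, use_moe=(i % 2 == 1))
            for i in range(layers)
        )
```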

Configuration

  • Added MoE config parameters (semantics sketched below): num_experts, num_shared_experts,
    num_top_experts, moe_every_n_layers
  • New ModelType::NOMIC enum and factory support
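
For context, a hypothetical sketch of how the four new MoE parameters could interact. The authoritative definitions live in the C++ engine config; the defaults and layer cadence below are guesses, not values from this PR.

```python
# Hypothetical illustration of the new MoE config fields; actual definitions and
# defaults live in the C++ engine config, so treat these values as placeholders.
from dataclasses import dataclass

@dataclass
class MoeConfigSketch:
    num_experts: int = 8          # routed experts available in each MoE layer
    num_shared_experts: int = 0   # experts applied to every token regardless of routing
    num_top_experts: int = 2      # top-k routed experts selected per token
    moe_every_n_layers: int = 2   # cadence of MoE layers among the transformer layers

def layer_uses_moe(cfg: MoeConfigSketch, layer_idx: int) -> bool:
    # One plausible reading: with moe_every_n_layers == 2, odd layers use MoE
    # and even layers keep a dense FFN.
    return cfg.moe_every_n_layers > 0 and layer_idx % cfg.moe_every_n_layers == 1

def experts_per_token(cfg: MoeConfigSketch) -> int:
    # Each token is processed by its top-k routed experts plus any shared experts.
    return cfg.num_top_experts + cfg.num_shared_experts
```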

HuggingFace Conversion

  • Enhanced convert_hf.py for Nomic BERT models (see the sketch after this list):
    • Fuses word + token type embeddings
    • Splits combined QKV weight matrices
    • Exports MoE weights (router, per-expert MLPs)
    • Handles embedding layer norms
  • Fallback to AutoModel for non-causal models
  • Improved quantization handling for embeddings/norms/biases
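
To make the conversion steps concrete, here is a rough Python sketch of the logic described above. The HuggingFace tensor names and the exported names are approximations of the Nomic BERT layout, not the exact strings used in convert_hf.py.

```python
# Rough sketch of the convert_hf.py changes described above. Tensor names are
# illustrative approximations of the HuggingFace nomic-bert layout, and the
# exported names are placeholders rather than the real converter output.
import torch
from transformers import AutoModel, AutoModelForCausalLM

def load_model(model_id: str):
    try:
        return AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    except ValueError:
        # Non-causal models (e.g., Nomic BERT embedders) fall back to AutoModel.
        return AutoModel.from_pretrained(model_id, trust_remote_code=True)

def convert_nomic(state_dict: dict) -> dict:
    out = {}

    # 1) Fuse word + token-type embeddings: with a single segment, the token-type
    #    row is a constant offset that can be folded into the word embeddings.
    word = state_dict["embeddings.word_embeddings.weight"]
    tok_type = state_dict["embeddings.token_type_embeddings.weight"]
    out["token_embd.weight"] = word + tok_type[0]

    # 2) Split each fused QKV projection into separate Q/K/V matrices.
    for name, tensor in state_dict.items():
        if name.endswith("attn.Wqkv.weight"):
            q, k, v = torch.chunk(tensor, 3, dim=0)
            prefix = name.rsplit("attn.Wqkv.weight", 1)[0]
            out[prefix + "attn_q.weight"] = q
            out[prefix + "attn_k.weight"] = k
            out[prefix + "attn_v.weight"] = v

    # MoE router / per-expert MLP export, embedding layer norms, and the
    # unquantized handling of embeddings/norms/biases are omitted here.
    return out
```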

Files Changed

  • cactus/models/model.h (+62 lines)
  • cactus/models/model_nomic.cpp (+82 lines, new)
  • cactus/engine/engine.h (+6 lines)
  • cactus/engine/engine_model.cpp (+7 lines)
  • tools/convert_hf.py (+110/-22 lines)

Next Steps

Operation and kernel updates will follow in origin/moe-ops-for-nomic-embed; the forward pass implementation is complete on the implemented-nomic-model branch.

@ncylich force-pushed the load-nomic-embed branch 14 times, most recently from e2dea5a to 5575d24 on October 15, 2025 at 20:48
… support for it

Signed-off-by: Noah Cylich <noahcylich@gmail.com>