Share single model across channels and add 2-channel e2e tests #4142
Merged
Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
Summary
Moves model creation (`hypr_cactus::Model::builder().build()`) out of the per-channel loop so that a single model is loaded once and shared (via `Arc::clone()`) across all channel streams. Previously, for 2-channel interleaved audio, two full model instances (weights + VAD) were loaded into memory.

This is safe because each `cactus_stream_transcribe_process` call runs a complete encode→decode cycle and then calls `cactus_reset()`, leaving no model state between calls. The per-channel state (audio buffer, confirmation logic) lives entirely in the separate `CactusStreamTranscribeHandle` instances. Concurrent access is serialized by the existing `model_mutex` on the C++ side.

Tradeoff: channels now serialize on the model mutex instead of running in parallel. With ~50-150ms inference per 300ms chunk, this adds at most one inference duration of latency when both channels need the model simultaneously.
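The hoisting described above can be sketched in miniature. This is a std-only illustration, not the actual `hypr_cactus` API: `Model`, `build`, and `process_chunk` are hypothetical stand-ins, and the `Mutex<()>` plays the role of the C++ `model_mutex` that serializes inference.

```rust
use std::sync::{Arc, Mutex};
use std::thread;

// Hypothetical stand-in for the real model type. In the real crate the
// builder loads weights + VAD, which is the expensive step the PR hoists
// out of the per-channel loop.
struct Model {
    // Serializes inference across channels, mirroring the C++ model_mutex.
    mutex: Mutex<()>,
}

impl Model {
    fn build() -> Arc<Model> {
        Arc::new(Model { mutex: Mutex::new(()) })
    }

    fn process_chunk(&self, channel: usize, chunk: &[i16]) -> String {
        let _guard = self.mutex.lock().unwrap(); // one channel at a time
        format!("ch{}: {} samples", channel, chunk.len())
    }
}

fn main() {
    // Before: Model::build() inside the loop -> one full model per channel.
    // After: build once, Arc::clone() per channel stream.
    let model = Model::build();

    let handles: Vec<_> = (0..2usize)
        .map(|ch| {
            let model = Arc::clone(&model);
            thread::spawn(move || model.process_chunk(ch, &[0i16; 4800]))
        })
        .collect();

    for h in handles {
        println!("{}", h.join().unwrap());
    }
}
```

The key property is that all per-channel mutable state lives outside the shared model (in the real code, in the `CactusStreamTranscribeHandle` instances), so cloning the `Arc` is safe.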
Updates since last revision
Added two new e2e tests and a CI workflow to exercise 2-channel inference:
- `e2e_streaming_dual_channel`: creates a single shared `Model` with two `transcribe_stream` handles, feeds `english_1` to ch0 and `english_2` to ch1, and asserts that both channels produce events.
- `e2e_websocket_dual_channel`: spins up the full `TranscribeService`, connects via WebSocket with `channels=2`, sends interleaved stereo PCM (`english_1` + `english_2`), and asserts that `Results` messages arrive for both `channel_index` 0 and 1.
- `.github/workflows/local_stt_e2e.yaml`: new workflow running the above tests (plus the existing `e2e_streaming`) on `depot-ubuntu-24.04-arm-8` with the moonshine-base model. Triggers on changes to `transcribe-cactus`, `cactus`, or `cactus-sys`.

Review & Testing Checklist for Human
- The `e2e_streaming_dual_channel` test uses `tokio::select!` over two event streams. If one stream finishes before the other (e.g. shorter audio), the loop breaks and remaining events from the other stream are silently dropped. Verify this doesn't cause false passes or flaky failures.
- Verify the `local_stt_e2e` workflow passes end-to-end on the first run.
- Verify that `cactus_reset()` fully clears all model state (KV-cache, encoder output, persistent nodes) between channel inferences; any missed state would cause one channel's audio context to bleed into the other.
- Suggested manual test plan: run a real 2-channel transcription session on device and compare transcription quality and latency against the previous 2-model approach.
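The first checklist item can be made concrete with a std-only sketch. `Event` and `drain_both` are illustrative names, not project types, and blocking `mpsc` iteration drains sequentially rather than interleaved like `tokio::select!`; but it demonstrates the property the test needs: the loop must end only when *both* streams have ended, otherwise trailing events from the longer stream are lost.

```rust
use std::sync::mpsc;

// Hypothetical stand-in for a transcribe stream event.
#[derive(Debug)]
struct Event {
    channel: usize,
    text: String,
}

// Drains both receivers to completion, instead of stopping when the first
// one closes (the silent-drop failure mode the checklist warns about).
fn drain_both(rx0: mpsc::Receiver<Event>, rx1: mpsc::Receiver<Event>) -> Vec<Event> {
    let mut events = Vec::new();
    // Iterating a Receiver yields items until its sender is dropped, so
    // this only returns once both streams are fully consumed.
    for ev in rx0 {
        events.push(ev);
    }
    for ev in rx1 {
        events.push(ev);
    }
    events
}

fn main() {
    let (tx0, rx0) = mpsc::channel();
    let (tx1, rx1) = mpsc::channel();

    // ch0's audio is shorter: its stream ends after one event.
    tx0.send(Event { channel: 0, text: "hello".into() }).unwrap();
    drop(tx0);
    // ch1 still has events pending when ch0 ends.
    tx1.send(Event { channel: 1, text: "hi".into() }).unwrap();
    tx1.send(Event { channel: 1, text: "there".into() }).unwrap();
    drop(tx1);

    let events = drain_both(rx0, rx1);
    // All three events survive, including ch1's trailing one.
    assert_eq!(events.len(), 3);
    println!("{} events", events.len());
}
```

A `select!`-based loop can achieve the same guarantee by tracking a done-flag per stream and breaking only when both are set.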
Notes