fix: improve cactus API ergonomics and consistency (#4150)
Merged
- Wrap `complete_stream` return type in a `CompletionStream` struct with a `Stream` impl, a `cancel()` method, and `Drop`-based cleanup
- Wrap `transcribe_stream` return type in a `TranscriptionSession` struct with a `Stream` impl, `audio_tx()`/`cancel()` accessors, and `Drop` cleanup
- Unify token count types: `CompletionResult` u32→u64, `StreamResult` f64→u64 (now consistent with `TranscriptionResult`, which already used u64)
- Add `tracing::warn` when recovering from a poisoned inference mutex
- Update callers in `llm-cactus` and `transcribe-cactus`

Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
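For reference, the wrapper shape can be sketched in plain std Rust. This is an illustrative stand-in, not the PR's code: the real `CompletionStream` wraps a futures `Stream` plus a `CancellationToken`, while this sketch uses an `mpsc` channel and `Iterator`, and `spawn_completion` is a hypothetical constructor invented for the example.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::{mpsc, Arc};
use std::thread::JoinHandle;

/// Illustrative stand-in for the PR's `CompletionStream`: owns the
/// worker handle and a cancel flag instead of exposing a raw 3-tuple.
pub struct CompletionStream {
    rx: mpsc::Receiver<String>,
    cancelled: Arc<AtomicBool>,
    handle: Option<JoinHandle<()>>,
}

impl CompletionStream {
    /// Signal the worker to stop producing tokens.
    pub fn cancel(&self) {
        self.cancelled.store(true, Ordering::SeqCst);
    }
}

// The real type implements `futures::Stream`; `Iterator` keeps the
// sketch dependency-free while showing the same consumption shape.
impl Iterator for CompletionStream {
    type Item = String;
    fn next(&mut self) -> Option<String> {
        self.rx.recv().ok()
    }
}

impl Drop for CompletionStream {
    fn drop(&mut self) {
        // Cancel, then reap the worker so nothing leaks.
        self.cancel();
        if let Some(h) = self.handle.take() {
            let _ = h.join();
        }
    }
}

/// Hypothetical constructor for the sketch: streams tokens from a
/// background worker until done or cancelled.
fn spawn_completion(tokens: Vec<&'static str>) -> CompletionStream {
    let (tx, rx) = mpsc::channel();
    let cancelled = Arc::new(AtomicBool::new(false));
    let flag = Arc::clone(&cancelled);
    let handle = std::thread::spawn(move || {
        for t in tokens {
            if flag.load(Ordering::SeqCst) || tx.send(t.to_string()).is_err() {
                break;
            }
        }
    });
    CompletionStream { rx, cancelled, handle: Some(handle) }
}
```

Callers then hold one value that cancels and cleans up on drop, instead of juggling a `(Stream, CancellationToken, JoinHandle)` tuple.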
…rop impls Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
…ncy of transcribe-cactus) Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
The C++ side stores token counts as `double` and serialises them via `operator<<`, which may emit `42.0` instead of `42`. `serde_json` rejects `42.0` when deserialising into `u64`, so we accept both forms. Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
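The idea of accepting both forms can be shown with a plain-Rust helper. `parse_token_count` is a hypothetical name invented for this sketch, not code from the PR; the actual fix would go through a serde `deserialize_with` attribute rather than string parsing.

```rust
/// Parse a JSON-style number that may arrive as `42` or `42.0`
/// into a token count. Rejects negatives and true fractions.
fn parse_token_count(raw: &str) -> Option<u64> {
    let value: f64 = raw.trim().parse().ok()?;
    // Accept only non-negative values with no fractional part,
    // so "42.0" maps to 42 but "42.5" is refused.
    if value >= 0.0 && value.fract() == 0.0 && value <= u64::MAX as f64 {
        Some(value as u64)
    } else {
        None
    }
}
```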
Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
CompletionStream and TranscriptionSession Drop impls were calling handle.join() which blocks the current thread. When dropped on a tokio worker thread (e.g. SSE client disconnect), this starves the async runtime. Now we spawn a lightweight background thread to join and log panics without blocking the caller. Co-Authored-By: yujonglee <yujonglee.dev@gmail.com>
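The fix amounts to the following pattern. `WorkerGuard` is a hypothetical stand-in for the two `Drop` impls, and `eprintln!` stands in for the tracing call; the point is that the drop itself returns immediately.

```rust
use std::thread;

/// Illustrative guard mirroring the fix: dropping it must not block
/// the dropping thread on `handle.join()`.
struct WorkerGuard {
    handle: Option<thread::JoinHandle<()>>,
}

impl Drop for WorkerGuard {
    fn drop(&mut self) {
        if let Some(handle) = self.handle.take() {
            // Join on a detached helper thread so a drop on a tokio
            // worker thread never stalls the async runtime. The helper
            // also surfaces worker panics instead of swallowing them.
            thread::spawn(move || {
                if let Err(panic) = handle.join() {
                    eprintln!("worker thread panicked: {panic:?}");
                }
            });
        }
    }
}
```

The trade-off is that the worker may outlive the guard briefly, which is acceptable here because cancellation has already been signalled before the join.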
fix: improve cactus API ergonomics and consistency
Summary
Addresses three low-priority issues from a cactus FFI wrapper review:
1. Stream return type ergonomics: `complete_stream` previously returned a 3-tuple `(Stream, CancellationToken, JoinHandle)` and `transcribe_stream` returned a 4-tuple. Both are now wrapped in proper structs (`CompletionStream` and `TranscriptionSession`) that implement `Stream`, expose `cancel()` methods, and handle cleanup via `Drop`.
2. Token count type consistency: Unified token count fields across result structs: `CompletionResult` changed from `u32` to `u64`, `StreamResult` changed from `f64` to `u64` (now consistent with `TranscriptionResult`, which already used `u64`).
3. Mutex poisoning observability: Added `tracing::warn` when recovering from a poisoned inference mutex in `Model::lock_inference`.

Callers in `llm-cactus` and `transcribe-cactus` are updated accordingly. The `drop_guard` + `unfold` pattern in `llm-cactus` streaming is replaced by `CompletionStream`'s own `Drop` impl, and the manual `worker_handles` join loop in `transcribe-cactus` is replaced by `TranscriptionSession::Drop`. Worker panic logging is preserved via `tracing::error!` in both `Drop` impls.
Review & Testing Checklist for Human
- `f64` → `u64` deserialization for `StreamResult` token fields: If the C++ `build_stream_response` emits JSON numbers as floats (e.g., `"prefill_tokens": 12.0`), `serde_json` will fail to deserialize them into `u64`. Verify the C++ side emits integer-typed JSON for these fields, or add a `deserialize_with` helper to handle both. This is a runtime-only failure that CI cannot catch.
- `Drop` on async runtime: Both `CompletionStream::drop()` and `TranscriptionSession::drop()` call `handle.join()`, which blocks the current thread. Verify this isn't called on a tokio worker thread (it should be fine since SSE streams and websocket sessions run on their own tasks, but worth confirming).
- The `drop_guard` + `unfold` pattern in `llm-cactus` was replaced by relying on `CompletionStream`'s `Drop`. Verify that client disconnect still cancels inference promptly; the new path is: SSE stream dropped → `FilterMap` dropped → `CompletionStream` dropped → `cancel()` + `join()`.
- Suggested test plan: Run an LLM streaming completion and a live transcription session end-to-end. Verify (1) streaming tokens arrive normally, (2) client disconnect cancels inference promptly, and (3) token count fields in metrics/responses are populated as integers.
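For the mutex-poisoning item from the Summary, the recovery in `Model::lock_inference` looks roughly like this sketch (`eprintln!` stands in for `tracing::warn!`, and the function is generic here for illustration):

```rust
use std::sync::{Mutex, MutexGuard};

/// Recover a usable guard even if a previous holder panicked,
/// logging the recovery instead of silently swallowing it.
fn lock_inference<T>(m: &Mutex<T>) -> MutexGuard<'_, T> {
    m.lock().unwrap_or_else(|poisoned| {
        // A poisoned mutex only means a holder panicked; the data
        // may still be consistent, so we recover and warn.
        eprintln!("warn: inference mutex was poisoned; recovering");
        poisoned.into_inner()
    })
}
```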
CI status: All functional checks pass (cactus, desktop_ci on linux-x86_64/linux-aarch64/macos, local-stt-e2e). The `fmt` check failed due to a transient network timeout downloading rustfmt, unrelated to these changes.
Notes