Added structs and functions necessary for GPU inferencing support #8
Conversation
Wonderful work! Just some days ago I was thinking of adding exactly this, so perfect timing! In the first version of the crate I didn't wrap GPU-related code because I don't have an NVIDIA GPU myself.
Yeah, as mentioned in this issue, you have to build from source with the `HAVE_CUDA` flag. Also, since the GPU code is only available if you compiled your libs with the `HAVE_CUDA` flag, it is misleading if the Rust code related to CUDA is always available. I suggest putting it behind a crate feature flag, probably called `cuda`. As for the specifics of the code, I'll review it more thoroughly as soon as possible :D
Yes, that is the one! I'll look into the feature flag and probably get that added today.
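(For reference, the gating being discussed might look something like the sketch below. The feature name `cuda` is an assumption based on the `#[cfg(feature = "cuda")]` attribute that appears later in this thread; the exact dependency wiring is hypothetical and up to the crate author.)

```toml
# Cargo.toml — hypothetical sketch of the suggested feature flag.
# An empty feature is enough to gate Rust-side code; consumers opt in
# with `vosk = { version = "...", features = ["cuda"] }`.
[features]
cuda = []
```

Items would then be compiled only when the feature is enabled by annotating them with `#[cfg(feature = "cuda")]`.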
Sorry for the delay! I've been really busy these last weeks. Anyway, I only added some minor comments on some things that probably slipped your mind, totally fair.
Another thing I want to comment on: I couldn't help but notice how many times `#[cfg(feature = "cuda")]` is repeated. I think it'd be best to group things in a `batch` module, and then import everything into its parent if the feature is enabled. Maybe the cleanest way to do that would be to split `recognition/mod.rs` into `recognition/batch.rs` and `recognition/<normal, default, singlethread, something like that...>`, then `mod.rs` would always import the normal recognizer but only import the batch recognizer if the feature is enabled. Then do a similar thing with the models, with a `models/` folder, a `models/mod.rs`, `models/batch.rs`, and `models/<normal, ... whatever you decided to name the recognizer file>`.
vosk/src/models.rs (outdated)

```rust
/// The same as [`Model`], but uses a CUDA enabled Nvidia GPU and dynamic batching to enable higher throughput.
#[cfg(feature = "cuda")]
pub struct  BatchModel(pub(crate) NonNull<VoskBatchModel>);
```
Double space here, between `struct` and `BatchModel`.
I added the batch modules in-file to allow for backwards compatibility. If we moved the non-batch code to a different file, I think users would have to change the way they import the non-batch functions. This way is a little messier, but not that much messier.
I think the batch module part could be organized better, but I have to think about it, so I'll handle it myself before publishing the release that will include this.
Let me know what you think about the comment, and when that's changed I can proceed with the merge :P
Btw, the clippy CI pipeline failed because of a CI issue I have to fix (GitHub is fetching the old vosk-sys crate, not the one you pushed), so don't worry about it.
Everything looks good to me, merging ;)
Hey @lightningpwr28, I'm organizing and cleaning up PRs to prepare the crate for a new release, and I was wondering: would you happen to have some examples of the JSON returned by
I basically just made some classes in the style of the stuff that was already there, with the exception of `gpu_init()` and `gpu_thread_init()`, which I put in the root of the `vosk` crate.

I know that the recommended dynamic library binaries don't support it, but I need the support for the back end of a project I'm working on.
I think it should be possible to get binaries that do, either by following the compilation steps or by copying the compiled result out of the GPU containers in vosk-server, at least for Ubuntu-based distros. I'm working on a Windows binary, but I'm not entirely sure I'll be able to do it.
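(Putting the pieces of this PR together, usage might look roughly like the sketch below. `gpu_init()`, `gpu_thread_init()`, and the feature-gated `BatchModel` are the names from this PR; the `BatchModel::new` signature and the model path are assumptions for illustration, not the confirmed API. This requires vosk libraries compiled with `HAVE_CUDA` and the `cuda` crate feature enabled.)

```rust
// Hypothetical usage sketch — not the definitive API.
#[cfg(feature = "cuda")]
fn run() {
    // Per this PR, the GPU init functions live at the crate root.
    vosk::gpu_init();        // initialize CUDA once, before loading models
    vosk::gpu_thread_init(); // initialize CUDA for the current thread

    // Assumed constructor shape; the actual signature may differ.
    let _model = vosk::BatchModel::new("path/to/model");
}
```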