pizzadesk's AzureAI UiPath Toolkit

Overview

This solution contains custom UiPath activities for Windows framework for the use of Azure AI SDK, including but not limited to Speech-to-Text (STT) operations, and other Azure AI functionality using the Microsoft Cognitive Services Speech SDK.

Components

1. Windows_PerformSTTFromFile

This project focuses on the the SpeechToTextActivity class, which leverages the Microsoft Cognitive Services Speech SDK to perform STT on a provided WAV file.

Key Methods:

BeginExecute: Starts the asynchronous speech recognition operation. It retrieves input arguments, validates them, and initiates the speech recognition task using TaskCompletionSource<T> to ensure compatibility with UiPath's AsyncCodeActivity.
EndExecute: Handles the completion of the asynchronous operation. It retrieves the result or handles exceptions if any occurred.
PerformSpeechRecognition: Executes the speech recognition using Azure Cognitive Services Speech SDK and returns the recognized text or an error message.
HandleSpeechRecognitionResult: Interprets the Speech SDK's results and formats them for output.

Setup and Usage

Prerequisites

Visual Studio (or compatible IDE)
Microsoft Cognitive Services Speech SDK (version 1.30.0 or newer) - comes as built-in dependency
.NET Framework 4.6.1 or newer / .NET 6.0-windows
A valid Azure subscription with access to Azure Speech Services

Installation

Clone or download the repository.
Open the solution in Visual Studio.
Restore NuGet packages by right-clicking on the solution in Solution Explorer and selecting "Restore NuGet Packages."
Build the project to generate the necessary assemblies.

Running the Application

Compile the project to create the assembly or deploy it as a custom activity in UiPath.
Use UiPath Studio to create a workflow that utilizes the custom SpeechToTextActivity activity.
Provide the required input arguments in UiPath Studio:
- SubscriptionKey: Your Microsoft Cognitive Services subscription key.
- ServiceRegion: The region where your Speech Service is hosted (e.g., westus).
- AudioFilePath: The full path to the WAV file you want to transcribe.
- Locale: The language locale for speech recognition (e.g., en-US).

Output

The activity will output the result of the speech recognition process:

RECOGNIZED: Displays the recognized text from the audio file.
NOMATCH: Indicates that speech could not be recognized from the audio file.
CANCELED: Provides details if the operation was canceled, including the reason and error details.
UNKNOWN ERROR: Displays an error message if the recognition fails for unspecified reasons.

Example

SubscriptionKey: <your_subscription_key>
ServiceRegion: <your_service_region>
AudioFilePath: <path_to_audio_file>
Locale: <language_locale>

Speech Recognition Result:

RECOGNIZED: Recognized Text: <recognized_text>
NOMATCH: Speech could not be recognized.
CANCELED: Reason=<cancellation_reason>, ErrorCode=<error_code>, Details=<error_details>.
UNKNOWN ERROR: Unable to process speech.

Error Handling

FileNotFoundException: If the specified audio file does not exist, an error will be thrown.
ArgumentException: Thrown if invalid arguments are provided.
Exception: Any other unexpected errors will be captured and returned with details.

Notes

Ensure that your WAV file is properly formatted for Azure Speech SDK.
Check the Azure Speech SDK documentation for supported locales and regions.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Windows_PerformSTTFromFile		Windows_PerformSTTFromFile
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
pizzadesk's Azure AI UiPath Toolkit.sln		pizzadesk's Azure AI UiPath Toolkit.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pizzadesk's AzureAI UiPath Toolkit

Overview

Components

1. Windows_PerformSTTFromFile

Key Methods:

Setup and Usage

Prerequisites

Installation

Running the Application

Output

Example

Error Handling

Notes

License

About

Releases

Packages

Languages

License

pizzadesk/pizzadesk-s-Azure-AI-UiPath-Toolkit

Folders and files

Latest commit

History

Repository files navigation

pizzadesk's AzureAI UiPath Toolkit

Overview

Components

1. Windows_PerformSTTFromFile

Key Methods:

Setup and Usage

Prerequisites

Installation

Running the Application

Output

Example

Error Handling

Notes

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages