A Python-based audio transcription tool with Whisper ASR, document integration, and a customized UI that lets doctors write prescriptions quickly.

arjamand/QuickDoc


QuickDoc

This project provides a set of tools for transcribing audio with Whisper ASR so that prescriptions can be created in a matter of seconds, with a focus on integration with document editing workflows. It features a user-friendly interface built with customtkinter and includes functionality for recording audio, transcribing it, and embedding the transcriptions into a document template.

Prerequisites

  • Python 3.10 or higher
  • Git

Installation

  1. Clone the repository:
    git clone https://github.com/arjamand/QuickDoc.git
    cd QuickDoc
  2. Create a virtual environment (optional but recommended):
    python -m venv venv
    source venv/bin/activate  # On Windows use `venv\Scripts\activate`
  3. Install dependencies:
    pip install -r requirements.txt

Running the Application

To run the application, execute the following command in the project root folder:

python app.py

Usage

  1. Select Microphone: Choose your desired microphone from the dropdown.
  2. Start/Stop Recording: Click the "Start Recording" button to begin audio capture, and click "Stop Recording" to end it.
  3. View Transcription: The transcribed text will appear in the "Transcription" text box.
  4. Embed Transcription: Click "Embed Transcription" to add the transcribed text to the document.
  5. Print/Save Document: Use the "Print Document" or "Save Document" buttons to manage the final document.
  6. Adjust UI: Use the "Appearance Mode" and "UI Scaling" dropdowns to customize the user interface.
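The "Embed Transcription" step can be pictured as filling a placeholder in a document template. The snippet below is only a minimal sketch of that idea using the standard library. It is not QuickDoc's actual code: the template text, the field names, and the `embed_transcription` function are all hypothetical, and the real application may use a different document format.

```python
from string import Template

# Hypothetical template; the fields and layout are assumptions for
# illustration and may not match the template QuickDoc ships with.
PRESCRIPTION_TEMPLATE = Template(
    "Patient: $patient\n"
    "Date: $date\n"
    "\n"
    "Prescription:\n"
    "$transcription\n"
)


def embed_transcription(transcription: str, patient: str, date: str) -> str:
    """Return the template text with the transcription filled in."""
    # Collapse runs of whitespace and newlines that speech-to-text output
    # often contains, so the text sits cleanly in the document.
    cleaned = " ".join(transcription.split())
    return PRESCRIPTION_TEMPLATE.substitute(
        patient=patient, date=date, transcription=cleaned
    )


if __name__ == "__main__":
    text = "  Amoxicillin 500 mg,   three times daily for seven days "
    print(embed_transcription(text, "Jane Doe", "2024-01-01"))
```

Keeping the template separate from the transcription logic means the document layout can change without touching the recording or transcription code.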

Contributing

Feel free to contribute to the project by submitting issues or pull requests.

License

This project is licensed under the MIT License.
