Touch: Video & Audio to Braille Converter

Convert spoken content from video files, MP3 audio, or online platforms (YouTube, Vimeo, Dailymotion) to literal Unicode Braille or Braille-optimized text using AWS services.

Features

🎥 Extract audio from video files (MP4, AVI, MOV, etc.)
🎵 Process MP3 audio files directly
📺 Download and process YouTube, Vimeo, and Dailymotion videos
🎤 Transcribe audio using AWS Transcribe
🔤 Convert to literal Unicode Braille (U+2800–U+28FF) or Braille-optimized text using AWS Bedrock (Claude)
🗂️ Automatic cleanup of temporary files
📝 Comprehensive logging and error handling

Prerequisites

Python: 3.8–3.12 (Python 3.13+ is not supported due to audio library limitations)
AWS Account with access to:
- Amazon S3
- Amazon Transcribe
- Amazon Bedrock (Claude model)
FFmpeg (for audio processing)

Installation

Clone the repository:
```
git clone <repository-url>
cd touch
```

Create and activate a Python 3.12 virtual environment:

python3.12 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Install FFmpeg:
- macOS: brew install ffmpeg
- Ubuntu/Debian: sudo apt install ffmpeg
- Windows: Download from https://ffmpeg.org/download.html

AWS Setup

Option 1: Automated Setup (Recommended)

Use the provided script to automatically create an IAM user with the necessary permissions:

Configure your AWS credentials (if not already done):
```
aws configure
```
Run the automated setup script:
```
python setup_aws_iam.py --bucket-name my-touch-bucket --output-env
```
This script will:
- Create an IAM user (touch-app-user by default)
- Create an S3 bucket for audio files
- Attach the required AWS managed policies:
  - AmazonS3FullAccess
  - AmazonTranscribeFullAccess
  - AmazonBedrockFullAccess
- Generate access keys
- Output the .env file content
Create the .env file with the output from the script
Test the setup:
```
python cli.py --test-aws
```

Advanced options:

Use custom policy (more secure): --use-custom-policy
Specify different username: --username my-user
Specify different region: --region us-west-2

Option 2: Manual Setup

If you prefer to set up AWS manually:

Create an IAM user in the AWS Console
Attach these managed policies:
- AmazonS3FullAccess
- AmazonTranscribeFullAccess
- AmazonBedrockFullAccess
Create access keys for the user
Create an S3 bucket for audio files

Configuration

Create a .env file in the project root:

# AWS Configuration
AWS_ACCESS_KEY_ID=your_access_key
AWS_SECRET_ACCESS_KEY=your_secret_key
AWS_REGION=us-east-1
# S3 Configuration
TOUCH_S3_BUCKET=your-s3-bucket-name

Usage

Basic Examples

Convert a local video file to Unicode Braille:
```
python cli.py video.mp4
```
Convert a local MP3 file to Unicode Braille:
```
python cli.py audio.mp3
```

Convert a YouTube video to Unicode Braille:

python cli.py 'https://www.youtube.com/watch?v=example'

Note: Always wrap video URLs in single or double quotes to avoid shell issues. MP3 files are processed directly and do not require extraction from video.

Testing and Validation

Test AWS connectivity:
```
python cli.py --test-aws
```
Check environment configuration:
```
python cli.py --check-env
```

Output Modes

By default, output is literal Unicode Braille (U+2800–U+28FF).
To get plain Braille-optimized text instead, use:
```
python cli.py video.mp4 --braille-mode optimized
```
--braille-mode unicode (default): Output is literal Unicode Braille (⠞⠑⠭⠞ ...)
--braille-mode optimized: Output is plain text, optimized for Braille translation software

Viewing and Using Unicode Braille Output

View Unicode Braille in any Unicode-aware text editor (VSCode, Sublime, Notepad++, etc.)
Copy/paste the output into Braille embosser software, or use it with digital Braille displays that support Unicode Braille.
For physical Braille, use translation software or embosser tools that accept Unicode Braille input.

Advanced Usage

Specify output file:

python cli.py video.mp4 --output my_output.txt

Enable verbose logging:
```
python cli.py video.mp4 --verbose
```

Monitoring & Debugging in AWS Console

S3 (Audio Storage):
- Uploaded audio files are stored in your configured S3 bucket.
- View your S3 bucket.
Transcribe (Speech-to-Text):
- Transcription jobs are visible in the AWS Transcribe Console.
- Look for jobs named touch-....
Bedrock (AI Model):
- Bedrock model invocations are not directly visible, but you can monitor usage and logs in the Bedrock Console.
CloudWatch (Logs & Errors):
- For detailed logs and errors, check CloudWatch Logs.

Architecture

Video/Audio Input → Audio Extraction → S3 Upload → AWS Transcribe → AWS Bedrock → Braille Text

Audio Extraction: MoviePy for video files, pydub for MP3 files
S3 Storage: Temporary audio storage for AWS Transcribe
Transcription: AWS Transcribe converts speech to text
Braille Conversion: AWS Bedrock (Claude) outputs literal Unicode Braille or Braille-optimized text
Cleanup: Automatic removal of temporary files and S3 objects

Error Handling

Invalid input files/URLs
Network connectivity issues
AWS service failures
Audio extraction problems
Transcription timeouts

Troubleshooting

"TOUCH_S3_BUCKET environment variable is required"
- Ensure your .env file is properly configured
- Check that the S3 bucket exists and is accessible
"Video file has no audio track"
- Verify the video file contains audio
- Try a different video file
"Transcription job failed"
- Check AWS credentials and permissions
- Ensure the audio file is not corrupted
- Verify AWS Transcribe service is available in your region
"audioop not found" or MP3 extraction errors
- Ensure you are using Python 3.12 or lower (Python 3.13+ is not supported)
AWS permission errors
- Run python cli.py --test-aws to diagnose specific service issues
- Verify your IAM user has the required policies attached

Cost Estimation

Service	Estimated Cost
S3	<$0.01
Transcribe	$0.12
Bedrock (Claude)	$0.01–$0.02
Total	$0.13–$0.15

Costs scale linearly with file length.
Using more advanced Claude models may increase Bedrock costs.
Local compute and YouTube download are free (except for your own bandwidth/electricity).
AWS Free Tier may cover some or all costs for new accounts.

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

For issues and questions, please open an issue on GitHub.

Output Files

All output files are placed in the output/ directory by default.
Unicode Braille output (default) will be in .txt files, and BRF output (embossable) will be in .brf files.
Example: output/test_output.brf

Testing Example

This project was tested using the following YouTube video:

https://www.youtube.com/watch?v=WLQ6HyFbfKU

The pipeline was run as follows (be sure to quote the URL):

python cli.py --input-url "https://www.youtube.com/watch?v=WLQ6HyFbfKU" --braille-mode unicode

The resulting Braille output was saved in the output/ directory as test_output.brf (for BRF) and as .txt for Unicode Braille.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
app		app
output		output
.gitignore		.gitignore
LICENSE		LICENSE
PERFORMANCE_OPTIMIZATIONS.md		PERFORMANCE_OPTIMIZATIONS.md
README.md		README.md
SHIPPING_CHECKLIST.md		SHIPPING_CHECKLIST.md
cli.py		cli.py
env.example		env.example
quickstart.sh		quickstart.sh
requirements.txt		requirements.txt
setup_aws_iam.py		setup_aws_iam.py
setup_example.sh		setup_example.sh
test_setup.py		test_setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Touch: Video & Audio to Braille Converter

Features

Prerequisites

Installation

AWS Setup

Option 1: Automated Setup (Recommended)

Option 2: Manual Setup

Configuration

Usage

Basic Examples

Testing and Validation

Output Modes

Viewing and Using Unicode Braille Output

Advanced Usage

Monitoring & Debugging in AWS Console

Architecture

Error Handling

Troubleshooting

Cost Estimation

Contributing

License

Support

Output Files

Testing Example

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

kyhw/touch

Folders and files

Latest commit

History

Repository files navigation

Touch: Video & Audio to Braille Converter

Features

Prerequisites

Installation

AWS Setup

Option 1: Automated Setup (Recommended)

Option 2: Manual Setup

Configuration

Usage

Basic Examples

Testing and Validation

Output Modes

Viewing and Using Unicode Braille Output

Advanced Usage

Monitoring & Debugging in AWS Console

Architecture

Error Handling

Troubleshooting

Cost Estimation

Contributing

License

Support

Output Files

Testing Example

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages