KlarText Project Overview

A text simplification library that transforms complex German and English text into easy-to-understand language with optional text-to-speech output.

Introduction

KlarText is an accessibility-focused application designed to help users who struggle with complex text. This includes people with:

Reading or cognitive difficulties
Dyslexia
Non-native language speakers
Anyone who needs simpler, clearer text

The system takes dense bureaucratic, legal, medical, or technical text and transforms it into plain language while preserving the original meaning.

Important: KlarText produces "easy language" simplifications. It is not certified "Leichte Sprache" (official German easy language standard) and does not guarantee legal, medical, or financial accuracy.

Core Features

Text Simplification

Transforms complex text into easy-to-understand language
Supports German (de) and English (en)
Three simplification levels:
- very_easy: 8-10 word sentences, defines uncommon terms, bullet points
- easy: 12-15 word sentences, clear structure, minimal jargon
- medium: Plain language with normal sentence length

PDF Ingestion

Upload PDF documents for text extraction
Automatic header/footer removal
Handles multi-page documents
Text cleanup and normalization

Text-to-Speech (TTS)

Converts simplified text to audio using gTTS
Supports German and English voices
Returns base64-encoded MP3 audio
Text preprocessing for better punctuation handling

Accessibility-First UI

Large, readable fonts (18-20px base)
High contrast mode support
Keyboard navigation
Screen reader compatible
Dyslexia-friendly font option
Reduced motion support

Architecture

┌──────────────────────────────────────────────────────────────────┐
│                         Frontend                                 │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐        │
│  │   web-mvp    │    │     demo     │    │  extension   │        │
│  │   (React)    │    │   (Gradio)   │    │   (Chrome)   │        │
│  └──────┬───────┘    └──────┬───────┘    └──────┬───────┘        │
└─────────┼───────────────────┼───────────────────┼────────────────┘
          │                   │                   │
          └───────────────────┼───────────────────┘
                              │ REST API
                              ▼
┌──────────────────────────────────────────────────────────────────┐
│                      Backend API (FastAPI)                       │
│  ┌────────────────────────────────────────────────────────────┐  │
│  │                      Endpoints                             │  │
│  │ /v1/simplify(/batch) │ /v1/ingest/pdf │ /v1/tts │ /log-run │  │
│  └────────────────────────────────────────────────────────────┘  │
│                             │                                    │
│  ┌────────────────────────────────────────────────────────────┐  │
│  │                    Core Logic & Assets                     │  │
│  │  llm_adapter.py  │  pdf_extractor.py  │  tts_adapter.py    │  │
│  │  run_logger.py   │  prompts.py        │  Live Prompts      │  │
│  └──────────────────────────┬─────────────────────────────────┘  │
└─────────────────────────────┼────────────────────────────────────┘
          ┌───────────────────┼───────────────────┐
          ▼                   ▼                   ▼
    ┌───────────┐       ┌───────────┐       ┌───────────┐
    │   Groq    │       │  PyMuPDF  │       │   gTTS    │
    │   (LLM)   │       │   (PDF)   │       │  (Audio)  │
    └─────┬─────┘       └───────────┘       └───────────┘
          │
    ┌─────▼──────┐
    │ Logs (JSONL)│
    └─────────────┘

Project Structure

klartext/
├── Requirements.txt          # Development & notebook dependencies
├── apps/
│   ├── web-mvp/              # Production React frontend (Vite + TypeScript)
│   │   ├── src/
│   │   │   ├── App.tsx       # Main application
│   │   │   ├── components/   # UI components
│   │   │   └── contexts/     # React contexts (language, accessibility)
│   │   └── package.json
│   ├── demo/                 # Testing/staging Gradio application
│   │   ├── app.py
│   │   └── requirements.txt
│   ├── extension/            # Chrome extension
│   └── deprecated/           # Previous frontend experiments
│
├── services/
│   └── api/                  # FastAPI backend
│       ├── app/
│       │   ├── main.py       # API endpoints
│       │   └── core/
│       │       ├── llm_adapter.py     # LLM integration (Groq)
│       │       ├── pdf_extractor.py   # PDF text extraction
│       │       ├── tts_adapter.py     # Text-to-speech
│       │       ├── run_logger.py      # Telemetry & performance logging
│       │       └── prompts.py         # Prompt template loading
│       ├── prompts/          # Live/Production Prompts (manual deployment)
│       │   └── templates/    # Active system/user prompts
│       ├── requirements.txt  # API deployment dependencies
│       └── Dockerfile
│
├── prompts/               # Central Prompt Library (Exploration & Versioning)
│   └── templates/
│       ├── v1/               # Legacy prompts
│       └── v2/               # Latest prompts (system, user, version_notes)
│
├── data/                     # Project data storage
│   ├── logs/                 # API run logs (JSONL)
│   ├── benchmarks/           # Evaluation datasets
│   └── samples/              # Test input documents
│
├── notebooks/                # Development & research
│   ├── README.md             # Notebook documentation
│   ├── evaluation/           # Accuracy & scoring notebooks
│   └── feedback_loop/        # Analytics & improvement workflows
│
├── scripts/                  # Automation
│   ├── metrics_reporter.py    # Report generation
│   └── scheduled_metrics.py   # CRON jobs for telemetry
│
└── docs/                     # Documentation (API, UI, Deployment)

Note on Requirements Files:

Root Requirements.txt: Used for notebooks and development/evaluation work (includes data science packages)
services/api/requirements.txt: Used for API deployment (production dependencies only)

API Reference

Base URL

http://localhost:8000

Endpoints

`POST /v1/simplify` | `/v1/simplify/batch`

Core feature — Transform complex text into easy language. The batch endpoint allows parallel processing for multiple snippets (optimized for browser extensions).

Request:

{
  "text": "Der Antragsteller muss die Unterlagen einreichen.",
  "target_lang": "de",
  "level": "easy"
}

Response:

{
  "simplified_text": "Sie müssen Papiere abgeben.",
  "key_points": [],
  "warnings": []
}

`POST /v1/tts`

Accessibility feature — Convert simplified text to MP3 audio (returned as Base64).

Request:

{
  "text": "Hallo, das ist ein Test.",
  "lang": "de"
}

Response:

{
  "audio_base64": "//OExAAAA...",
  "audio_url": null,
  "format": "mp3"
}

`POST /v1/log-run`

Analytics feature — Log performance data and user feedback for the continuous improvement loop.

`GET /healthz`

Monitoring — Standard health check. Returns {"ok": true}.

Quick Start

Prerequisites

Python 3.11+
Node.js 18+ (for web-mvp frontend)
Groq API key

1. Start the API Server

cd services/api
cp env.example .env
# Edit .env and add your GROQ_API_KEY

pip install -r requirements.txt
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

2. Start the Web Frontend

cd apps/web-mvp
npm install
npm run dev

3. Open in Browser

Web App: http://localhost:5173 (or 5174)
API Docs: http://localhost:8000/docs

Environment Variables

Backend (`services/api/.env`)

Variable	Required	Description
`GROQ_API_KEY`	Yes	Groq API key for LLM
`APP_PASSWORD`	No	Password for production access
`API_KEY`	No	API key for production auth
`ALLOWED_ORIGINS`	No	CORS origins (comma-separated)
`ENVIRONMENT`	No	`development` or `production`

Technology Stack

Component	Technology
Frontend	React, TypeScript, Vite, Tailwind CSS
Backend	Python, FastAPI, Uvicorn
LLM	Groq (llama-3.1-8b-instant)
PDF Extraction	PyMuPDF
TTS	gTTS (v2), OpenAI TTS (optional)
Telemetry	JSONL, python-json-logger, Metrics Scripts
Deployment	Docker, Fly.io (backend), Vercel (frontend)

Key Design Decisions

No File Storage for TTS

Audio is generated in-memory and returned as base64. No audio files are stored on disk, simplifying deployment and avoiding storage management.

Prompt Templates

System and user prompts are stored as separate text files in prompts/templates/. This allows:

Easy iteration on prompts without code changes
Version control for prompt evolution
Language-specific prompts (DE/EN)

Accessibility First

The UI is designed with accessibility as a core requirement:

Semantic HTML with proper ARIA labels
Visible focus indicators
Keyboard navigation support
Configurable text size and contrast

Graceful Degradation

TTS falls back to browser speech synthesis if API fails
PDF extraction handles corrupted/password-protected files gracefully
LLM errors return helpful error messages

License

Non-commercial use only. See LICENSE for details.

Attribution required: "Based on work from the KlarText Team (2025)."

For commercial licensing inquiries, contact a repository administrator.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KlarText Project Overview

Introduction

Core Features

Text Simplification

PDF Ingestion

Text-to-Speech (TTS)

Accessibility-First UI

Architecture

Project Structure

API Reference

Base URL

Endpoints

`POST /v1/simplify` | `/v1/simplify/batch`

`POST /v1/tts`

`POST /v1/log-run`

`GET /healthz`

Quick Start

Prerequisites

1. Start the API Server

2. Start the Web Frontend

3. Open in Browser

Environment Variables

Backend (`services/api/.env`)

Technology Stack

Key Design Decisions

No File Storage for TTS

Prompt Templates

Accessibility First

Graceful Degradation

Related Documentation

License

FilesExpand file tree

PROJECT_OVERVIEW.md

Latest commit

History

PROJECT_OVERVIEW.md

File metadata and controls

KlarText Project Overview

Introduction

Core Features

Text Simplification

PDF Ingestion

Text-to-Speech (TTS)

Accessibility-First UI

Architecture

Project Structure

API Reference

Base URL

Endpoints

POST /v1/simplify | /v1/simplify/batch

POST /v1/tts

POST /v1/log-run

GET /healthz

Quick Start

Prerequisites

1. Start the API Server

2. Start the Web Frontend

3. Open in Browser

Environment Variables

Backend (services/api/.env)

Technology Stack

Key Design Decisions

No File Storage for TTS

Prompt Templates

Accessibility First

Graceful Degradation

Related Documentation

License

`POST /v1/simplify` | `/v1/simplify/batch`

`POST /v1/tts`

`POST /v1/log-run`

`GET /healthz`

Backend (`services/api/.env`)