This project will allow uploading PDF files to Azure Blob Storage. We'll then queue up processing and use Adobe PDF Services to grade the file for accessibility, auto-tag them, use AI to generate summaries/titles/images alt, etc, score again, and store the results back in a database for reporting.
- Backend: .NET 8 Web API with ASP.NET Core
- Frontend: React 19 with Vite, TypeScript, and TanStack Router/Query/Table
- Authentication: OIDC with Microsoft Entra ID (Azure AD)
- Styling: Tailwind CSS
- Development: Hot reload for both frontend and backend
- Diagrams: See
docs/architecture.md
-
Clone the repository
git clone https://github.com/ucdavis/readable cd readable -
Open In DevContainer
- Open the project folder in Visual Studio Code.
- Click the prompt to open in container (or manually select from the command palette).
Using the DevContainer is optional, but it will get you the right version of dotnet + node, plus install all dependencies and setup a local SQL instance for you
The DevContainer also includes Azure Functions Core Tools v4 so you can run functions locally (func). See https://learn.microsoft.com/en-us/azure/azure-functions/functions-run-local.
-
Start the application
npm start
This command automatically installs dependencies (if needed) and starts both the .NET backend and Vite frontend with hot reload enabled.
Optional: If dependencies change, you can manually reinstall with
npm install && cd client && npm install && cd ..but you shouldn't have to, thenpm startshould handle it. -
Access the application
The application will auto launch in your browser (to http://localhost:5175).
If you want to access endpoints individually, you can do so at the following URLs:
- Frontend: http://localhost:5175
- Backend API: http://localhost:5166 (nothing to see, but /api/* has the direct API endpoints)
- API Documentation (Swagger): http://localhost:5166/swagger/index.html
- Health check: http://localhost:5166/health
At a high level, PDFs flow through:
- Upload to Blob Storage
- Queue/background processing (ingest worker)
- Accessibility evaluation + (optional) auto-tagging via Adobe PDF Services
- Remediation pass (metadata + tag-tree fixes)
- Persist results for reporting
- PDF title: extracts text from the first pages and writes a descriptive title into PDF metadata. If there isn't enough text, the existing title is kept (or a placeholder is used when missing).
- Alt text (tagged PDFs only): fills missing
Alttext for taggedFigureandLinkelements, using AI when configured and a fallback otherwise.
The backend requires a SQL Server connection string. By default appsettings.Development.json has a connection string configured for the local SQL Server instance.
When you want to specify your own DB connection, provide it by setting the DB_CONNECTION environment variable (for example in a .env file) or by updating ConnectionStrings:DefaultConnection in appsettings.*.json (.env is recommended)
AI-backed remediation is enabled when OPENAI_API_KEY is set. If it is not set, the app uses local “sample” implementations (useful for development/tests).
OPENAI_API_KEY: enables OpenAI-backed servicesOPENAI_ALT_TEXT_MODEL: model for image/link alt text generation (default:gpt-4o-mini)OPENAI_PDF_TITLE_MODEL: model for PDF title generation (default:gpt-4o-mini)
The Web API and the function_ingest worker are configured to export logs/traces/metrics via OTLP using standard OTEL_* environment variables.
- Web API: copy
server/.env.exampletoserver/.envand setOTEL_EXPORTER_OTLP_ENDPOINT/OTEL_EXPORTER_OTLP_HEADERS/OTEL_SERVICE_NAME. - Azure Functions (local): set the same keys in
workers/function.ingest/local.settings.jsonunderValues.
We use OIDC with Microsoft Entra ID (Azure AD) for authentication. The auth flow doesn't use any secrets and the settings in appsettings.*.json are sufficient for local development.
When you are ready to get your own, go to Microsoft Entra ID and create a new application registration. Set the redirect url to http://localhost:5166/signin-oidc and check the box for "ID tokens".
You might also want to set the publisher domain to ucdavis.edu and fill in the other general branding info.
The health check endpoint (/health) is configured to return the status of the application and its dependencies. It includes a database health check to ensure the SQL Server connection is healthy. See Health Checks.
The backend is configured with hot reload via dotnet watch. Any changes to C# files will automatically restart the server.
The frontend uses Vite's hot module replacement (HMR). Changes to React components, TypeScript files, and CSS will be reflected immediately.
- Frontend routes requiring authentication redirect to the backend's login endpoint
- Backend handles OIDC flow with Microsoft Entra ID
- Upon successful authentication, a same-site cookie is set
- Frontend API calls automatically include the authentication cookie
- Backend validates the cookie for protected endpoints
- Run
cd client && npm testto execute the Vitest suite once. - Use
npm run test:watchinsideclient/for red/green feedback while you work. - Tests run against a jsdom environment with Testing Library so you do not need the backend running.
- Run
dotnet testfrom the repository root to execute the .NET test project included inapp.sln. - Alternatively, target the project directly with
dotnet test tests/server.tests/server.tests.csproj. - The tests use EF Core's in-memory provider (see
tests/server.tests/TestDbContextFactory.cs) so no SQL Server instance is required. - PDF remediation tests live under
tests/server.tests/Integration/Remediate/and may use PDFs fromtests/server.tests/Fixtures/pdfs/or generate simple PDFs on the fly.
- JavaScript/TypeScript packages: run
npm outdatedat the repository root and insideclient/to see what can be updated. Usenpm updatein each location for compatible updates, ornpm install <package>@latestwhen you need to jump to a new major version. - After updating Node packages, reinstall if needed (
npm install,cd client && npm install) and rerun key checks likenpm run lint,cd client && npm test, anddotnet test.
.Net is a bit more complicated, but we're going to use the dotnet-outdated tool to help.
Run the following command from the repository root:
dotnet-outdated
and it'll show you a nice table of what can be updated. Be careful when updating major versions, especially with packages that are pinned to the .net version.
You can update individual packages or you can use the --upgrade flag to update all at once. Here's a nice way to do it and only update minor/patch versions:
dotnet-outdated --upgrade --version-lock Major
If you update Microsoft.EntityFrameworkCore.Design or another package that a tool depends on, you'll want to update that tool as well to match, ex: dotnet tool update dotnet-ef --local --version 8.0.21. That will update it for you but also set the value in our dotnet-tools.json so it's consistent for everyone.
And as always, after updating dependencies, make sure to run dotnet build and dotnet test to verify everything is working.
├── client/ # React frontend
│ ├── src/
│ │ ├── routes/ # TanStack Router routes
│ │ ├── queries/ # TanStack Query hooks
│ │ ├── lib/ # API client and utilities
│ │ └── shared/ # Shared components
│ ├── package.json
│ └── vite.config.ts
├── server/ # .NET backend
│ ├── Controllers/ # API controllers
│ ├── Helpers/ # Utility classes
│ ├── Properties/ # Launch settings
│ ├── Program.cs # Application entry point
│ └── server.csproj
├── package.json # Root package.json with start script
└── app.sln # Visual Studio solution file
npm start- Starts both backend and frontend with hot reload
npm run dev- Start Vite development servernpm run build- Build for productionnpm run lint- Run ESLintnpm run preview- Preview production buildnpm test- Run tests
dotnet run- Start the .NET applicationdotnet watch- Start with hot reloaddotnet build- Build the applicationdotnet test- Run tests