DataTalks.Club FAQ

A static site generator for DataTalks.Club course FAQs with automated AI-powered FAQ maintenance.

Features

Static Site Generation: Converts markdown FAQs to a beautiful, searchable HTML site
Automated FAQ Management: AI-powered bot that processes new FAQ proposals
Intelligent Triage: Automatically determines if proposals should create new entries, update existing ones, or are duplicates
GitHub Integration: Seamless workflow via GitHub Issues and Pull Requests
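
To make the three triage outcomes concrete, here is a minimal sketch. It is not the project's agent: the repository uses an AI-powered agent (rag_agent.py), whereas this stand-in uses plain string similarity from Python's difflib. The names Action and triage, the dict shapes, and the 0.8 thresholds are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from enum import Enum


class Action(Enum):
    NEW = "new"
    UPDATE = "update"
    DUPLICATE = "duplicate"


def similarity(a, b):
    """Return a 0..1 similarity ratio between two strings, ignoring case."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def triage(proposal, existing, question_threshold=0.8, answer_threshold=0.8):
    """Classify a proposal against existing entries.

    Hypothetical stand-in for the AI agent: string similarity only.
    """
    best, best_score = None, 0.0
    for entry in existing:
        score = similarity(proposal["question"], entry["question"])
        if score > best_score:
            best, best_score = entry, score

    # No sufficiently similar question: treat the proposal as a new entry.
    if best is None or best_score < question_threshold:
        return Action.NEW, None
    # Same question and nearly the same answer: nothing new to add.
    if similarity(proposal["answer"], best["answer"]) >= answer_threshold:
        return Action.DUPLICATE, best
    # Same question, different answer: candidate update to the entry.
    return Action.UPDATE, best


existing = [
    {
        "question": "How do I check my Python version?",
        "answer": "Run `python --version` in your terminal.",
    }
]
action, match = triage(dict(existing[0]), existing)
print(action.value)  # duplicate
```

String similarity misses paraphrases, which is presumably why the project relies on an AI agent for this decision.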

Project Structure

faq/
├── _questions/              # FAQ content organized by course
│   ├── machine-learning-zoomcamp/
│   │   ├── _metadata.yaml   # Course configuration
│   │   ├── general/         # General course questions
│   │   ├── module-1/        # Module-specific questions
│   │   └── ...
│   ├── data-engineering-zoomcamp/
│   └── ...
├── _layouts/                # Jinja2 HTML templates
│   ├── base.html
│   ├── course.html
│   └── index.html
├── assets/                  # CSS and static assets
├── faq_automation/          # FAQ automation module
│   ├── core.py              # Core FAQ processing functions
│   ├── rag_agent.py         # AI-powered decision agent
│   ├── actions.py           # GitHub Actions integration
│   └── cli.py               # Command-line interface
├── tests/                   # Test suite
├── generate_website.py      # Main site generator
└── Makefile                 # Build commands

Contributing FAQ Entries

See CONTRIBUTING.md for detailed instructions.

Development

Setup

# Install dependencies
uv sync --dev

For testing the FAQ automation locally, you'll need to set your OpenAI API key:

export OPENAI_API_KEY='your-api-key-here'

Or add it to your shell configuration file (e.g., ~/.bashrc, ~/.zshrc).
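
When experimenting from a Python session, it can help to fail early if the key is missing. A minimal sketch, assuming only that the key is read from the OPENAI_API_KEY environment variable as described above; the helper name require_api_key is hypothetical and not part of faq_automation.

```python
import os


def require_api_key(env=None):
    """Return the OpenAI API key or raise a clear error if it is not set.

    Hypothetical helper for local experiments; not part of faq_automation.
    """
    # Accept any mapping so the check can be exercised without touching os.environ.
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; export it before running the automation"
        )
    return key
```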

Running Locally

To test the FAQ automation locally, create a test_issue.txt file:

cat > test_issue.txt << 'EOF'
### Course
machine-learning-zoomcamp

### Question
How do I check my Python version?

### Answer
Run `python --version` in your terminal.
EOF

Then process the FAQ proposal:

uv run python -m faq_automation.cli \
  --issue-body "$(cat test_issue.txt)" \
  --issue-number 42
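
The issue body above follows a simple layout: a "### " heading names each field and the text below it is the value. As an illustration of how such a body could be split into fields, here is a minimal sketch; parse_issue_body is a hypothetical helper, and the actual parsing inside faq_automation may differ.

```python
import re


def parse_issue_body(body):
    """Split an issue body into {heading: text} using '### ' headings.

    Hypothetical helper for illustration; the real parser may differ.
    """
    fields = {}
    current = None
    for line in body.splitlines():
        match = re.match(r"^###\s+(.+?)\s*$", line)
        if match:
            # A new heading starts a new field; keys are lowercased.
            current = match.group(1).lower()
            fields[current] = []
        elif current is not None:
            fields[current].append(line)
    return {name: "\n".join(lines).strip() for name, lines in fields.items()}


issue = """### Course
machine-learning-zoomcamp

### Question
How do I check my Python version?

### Answer
Run `python --version` in your terminal.
"""

parsed = parse_issue_body(issue)
print(parsed["course"])    # machine-learning-zoomcamp
print(parsed["question"])  # How do I check my Python version?
```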

Testing

# Generate static website
make website

# Run all tests
make test

# Run unit tests only
make test-unit

# Run integration tests only
make test-int

See testing documentation for detailed information about the test suite, including how to run specific test files or methods, test coverage details, and guidelines for adding new tests.

Architecture

Site Generation Pipeline

1. Collection (collect_questions()):
   - Reads all markdown files from _questions/
   - Parses YAML frontmatter
   - Loads course metadata for section ordering
2. Processing (process_markdown()):
   - Converts markdown to HTML
   - Applies syntax highlighting (Pygments)
   - Auto-links plain text URLs
   - Handles image placeholders
3. Sorting (sort_sections_and_questions()):
   - Orders sections per _metadata.yaml
   - Sorts questions by sort_order field
4. Rendering (generate_site()):
   - Applies Jinja2 templates
   - Generates course pages and index
   - Copies assets to _site/
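
The sorting stage can be illustrated in isolation. The sketch below is not the project's sort_sections_and_questions() implementation. It assumes each question is a dict with section and sort_order keys and that _metadata.yaml yields an ordered list of section names (these shapes, and the function name sort_sections, are assumptions). Sections missing from the metadata are placed last.

```python
from collections import defaultdict


def sort_sections(questions, section_order):
    """Group questions by section, order sections per metadata, then
    order questions inside each section by sort_order.

    Illustrative only; data shapes are assumed, not taken from the repo.
    """
    grouped = defaultdict(list)
    for question in questions:
        grouped[question["section"]].append(question)

    rank = {name: index for index, name in enumerate(section_order)}

    # Sections not listed in the metadata go last, alphabetically.
    ordered_names = sorted(
        grouped, key=lambda name: (rank.get(name, len(rank)), name)
    )

    return [
        (name, sorted(grouped[name], key=lambda q: q.get("sort_order", 0)))
        for name in ordered_names
    ]


questions = [
    {"section": "module-1", "question": "B", "sort_order": 2},
    {"section": "general", "question": "A", "sort_order": 1},
    {"section": "module-1", "question": "C", "sort_order": 1},
]
result = sort_sections(questions, ["general", "module-1"])
for name, items in result:
    print(name, [q["question"] for q in items])
# general ['A']
# module-1 ['C', 'B']
```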

