
Downwell.AI

This is a side project I'm working on while learning about AI at university. I'm trying to teach a DQN (Deep Q-Network) agent how to play Downwell.

Overview

The approach combines memory reading (via pymem) to extract game state, screen capture for visual input, and keyboard automation (pyautogui) to control the game. The AI learns to play through trial and error using deep reinforcement learning.

Platform Requirements: Windows only (uses Windows-specific memory reading and window management)
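
In single-threaded form, the perceive-think-act loop looks roughly like this (an illustrative sketch only; the real project splits these stages across threads, as described under Architecture, and pick_action below is a placeholder for the DQN policy):

# Conceptual perceive-think-act loop (illustrative; the real project
# splits these stages across threads).

import random

import mss            # screen capture
import pyautogui      # keyboard automation
import pymem          # memory reading

pm = pymem.Pymem("downwell.exe")              # attach to the running game

def pick_action(frame):                       # placeholder for the DQN policy
    return random.choice(["noop", "jump", "left", "right"])

with mss.mss() as screen:
    while True:
        frame = screen.grab(screen.monitors[1])   # 1. perceive (plus memory reads)
        action = pick_action(frame)               # 2. think
        if action == "jump":                      # 3. act
            pyautogui.press("space")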

Getting Started

Prerequisites

  • uv
  • Python 3.10 or higher
  • NVIDIA GPU with CUDA 12.8 support
  • Downwell

Setup

On Windows (for running the AI):

# Install all dependencies including Windows-specific packages
uv sync --extra windows

# Verify CUDA is available
uv run python -c "import torch; print(f'CUDA available: {torch.cuda.is_available()}')"

On Linux (for development only):

# Install core dependencies for code editing
uv sync

# Install dev tools (linter, type checker, test framework)
uv sync --extra dev

Training the AI

uv run python main.py

Before running:

  • Downwell must be running (downwell.exe)
  • The game window must be visible on screen

The AI automatically detects and connects to the game process.
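
You can sanity-check both of these by hand with a short script (a hedged sketch; pygetwindow is an assumption here, and the project may locate the window differently):

# Pre-flight check: is the process running, and is the window there?
# pygetwindow is an assumption; the project may use another method.

import pymem
import pygetwindow as gw

pm = pymem.Pymem("downwell.exe")          # raises if the process isn't running
print(f"Attached to PID {pm.process_id}")

windows = gw.getWindowsWithTitle("Downwell")
print("Window found" if windows else "Window not found")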

Training Output

  • Models are saved to models/ directory:
    • downwell_ai_best.pth: Best performing model (highest reward)
    • downwell_ai_<episode>.pth: Periodic checkpoints (every 25 episodes by default)
    • downwell_ai_final_<episode>.pth: Final model at end of training
  • Training history is saved to training_history.csv (episode rewards, steps, combos, etc.)

Visualizing Training Progress

uv run python plotter.py

This generates training_progress.png with reward trends, episode duration, combos, and gems over time.
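
A minimal version of that plot could be produced like this (the CSV column names here are assumptions based on the metrics listed above; check training_history.csv for the actual headers):

# Minimal training-curve plot from training_history.csv.
# Column names ("episode", "reward", ...) are assumptions, not the real schema.

import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_csv("training_history.csv")

fig, axes = plt.subplots(2, 2, figsize=(10, 8))
for ax, col in zip(axes.flat, ["reward", "steps", "combos", "gems"]):
    ax.plot(df["episode"], df[col])
    ax.set_title(col)
    ax.set_xlabel("episode")

fig.tight_layout()
fig.savefig("training_progress.png")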

Architecture

Three-Thread Pipeline (Orchestrator Pattern)

The core AI system uses a three-threaded architecture coordinated by DownwellAI (src/core/orchestrator.py); a stripped-down sketch of the pattern follows the list:

  1. PerceptorThread (src/threaders/perceptor.py) - Captures game state at 60 FPS

    • Reads memory values (HP, position, gems, combo, ammo)
    • Captures and preprocesses screenshots
    • Maintains a frame stack (4 frames) for temporal awareness
    • Writes to shared state_buffer (deque with thread lock)
  2. ThinkerThread (src/threaders/thinker.py) - Makes decisions at 15 FPS

    • Reads latest state from state_buffer
    • Computes rewards using RewardCalculator
    • Trains the DQN agent (experience replay)
    • Selects actions using epsilon-greedy policy
    • Writes actions to action_queue
  3. ActorThread (src/threaders/actor.py) - Executes actions

    • Reads actions from action_queue
    • Manages keyboard state (press/release keys)
    • Uses pyautogui for input simulation
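
The coordination boils down to a bounded deque guarded by a lock for states, and a queue for actions. Here is a runnable miniature of the pattern (the capture/decide/execute functions are placeholders, not the project's actual classes):

# Three-thread pipeline in miniature: the perceptor produces states, the
# thinker consumes states and produces actions, the actor consumes actions.

import queue
import random
import threading
import time
from collections import deque

state_buffer = deque(maxlen=4)          # latest states; old ones drop off
buffer_lock = threading.Lock()
action_queue = queue.Queue()

def capture_game_state():               # placeholder: memory read + screenshot
    return random.random()

def choose_action(state):               # placeholder: DQN epsilon-greedy policy
    return random.randrange(6)

def execute(action):                    # placeholder: pyautogui press/release
    print("action:", action)

def perceptor():                        # ~60 FPS producer
    while True:
        with buffer_lock:
            state_buffer.append(capture_game_state())
        time.sleep(1 / 60)

def thinker():                          # ~15 FPS decision maker
    while True:
        with buffer_lock:
            state = state_buffer[-1] if state_buffer else None
        if state is not None:
            action_queue.put(choose_action(state))
        time.sleep(1 / 15)

def actor():                            # executes actions as they arrive
    while True:
        execute(action_queue.get())     # blocks until an action is available

for fn in (perceptor, thinker, actor):
    threading.Thread(target=fn, daemon=True).start()
time.sleep(1)                           # let the demo run briefly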

DQN Agent (src/agents/dqn_agent.py)

  • Network Architecture: Convolutional neural network (src/agents/dqn_network.py); a hedged sketch follows this list
    • Input: 4-frame stack (84x84 grayscale images)
    • Output: Q-values for 6 discrete actions
  • Actions: No-op, Jump, Left, Right, Left+Jump, Right+Jump
  • Training: Uses target network, experience replay buffer (100k capacity)
  • Hardware: Requires NVIDIA GPU with CUDA support
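
Given the 4×84×84 input and 6 actions, the network is plausibly close to the classic Atari DQN topology. A hedged PyTorch sketch consistent with those shapes (the layer sizes are assumptions, not the repo's actual definition):

# Hedged sketch of a DQN for a 4x84x84 input and 6 actions. Layer sizes
# follow the classic Atari DQN; the repo's network may differ.

import torch
import torch.nn as nn

class DQN(nn.Module):
    def __init__(self, n_actions: int = 6):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(4, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 512), nn.ReLU(),
            nn.Linear(512, n_actions),            # one Q-value per action
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x / 255.0)                # scale pixels to [0, 1]

q_values = DQN()(torch.zeros(1, 4, 84, 84))       # -> shape (1, 6)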

Memory Reading (src/environment/mem_extractor.py)

The Player class uses pymem to read game state from memory via pointer chains (defined in utils/game_attributes.py).

Note: Memory addresses are Windows-specific and may break with game updates.
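
Resolving a multi-level pointer chain with pymem generally looks like the following (the base offset and offsets list are placeholders for illustration; the real chains live in utils/game_attributes.py, and a 32-bit process would read 4-byte pointers with read_uint instead):

# Generic pointer-chain resolution with pymem. The offsets below are
# placeholders, NOT the real values from utils/game_attributes.py.

import pymem
import pymem.process

pm = pymem.Pymem("downwell.exe")
module = pymem.process.module_from_name(pm.process_handle, "downwell.exe")

def resolve(base_offset: int, offsets: list[int]) -> int:
    addr = pm.read_longlong(module.lpBaseOfDll + base_offset)  # 64-bit pointers
    for off in offsets[:-1]:
        addr = pm.read_longlong(addr + off)
    return addr + offsets[-1]

# hp = pm.read_int(resolve(0x123456, [0x10, 0x48]))   # placeholder chain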

Reward System (src/core/reward_calculator.py)

  • Primary reward: Depth (2 points per unit of best y-level reached)
  • Bonuses:
    • Level completion (+100)
    • Gems (+1 each)
    • Combos (+5 × combo value, threshold of 4)
  • Penalties:
    • Death (-50)
    • Step penalty (-0.01 per step)
    • Damage (-2 per HP lost)
    • Boundary penalty (for staying near edges)

Reward weights can be adjusted in src/config.py (RewardConfig).
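
Put together, a single step's reward under these weights might be computed like this (a sketch; the field names on prev/curr are illustrative, and the boundary penalty is omitted for brevity):

# Sketch of one reward step using the weights listed above. Field names
# on prev/curr are illustrative, not the real state schema.

def step_reward(prev, curr, best_y: float) -> float:
    r = 0.0
    r += 2.0 * max(0.0, curr.y - best_y)      # depth: the primary signal
    if curr.level > prev.level:
        r += 100.0                            # level completion bonus
    r += 1.0 * (curr.gems - prev.gems)        # +1 per gem collected
    if curr.combo >= 4:
        r += 5.0 * curr.combo                 # combo bonus past the threshold
    if curr.hp <= 0:
        r -= 50.0                             # death penalty
    r -= 0.01                                 # per-step penalty
    r -= 2.0 * max(0, prev.hp - curr.hp)      # -2 per HP lost
    return r                                  # boundary penalty omitted here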

Configuration

All hyperparameters are in src/config.py:

  • AgentConfig: Learning rate, gamma, epsilon decay, batch size
  • RewardConfig: Reward weights and clipping
  • TrainConfig: Episodes, memory size, save frequency
  • EnvConfig: Image size, frame stack, thread FPS

To modify training behavior, edit these dataclasses.
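
Since they are plain dataclasses, a tweak is a one-line edit. Illustrative shape only (field names are assumptions; check src/config.py for the real ones):

# Illustrative shape of one config dataclass; field names are assumptions.

from dataclasses import dataclass

@dataclass
class AgentConfig:
    learning_rate: float = 1e-4
    gamma: float = 0.99          # discount factor
    epsilon_decay: float = 0.995
    batch_size: int = 32         # lower this on CUDA out-of-memory errors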

Common Issues

  • Memory read errors: Memory offsets in utils/game_attributes.py may need updating after game patches
  • CUDA out of memory: Reduce batch_size in AgentConfig or memory_size in TrainConfig
  • "Downwell window not found": Make sure the game is running and the window is visible
  • Import errors on Linux: This is expected - the Windows-only dependencies aren't needed for code editing

Contributing

This is a learning project, but feel free to open issues or PRs if you find bugs or have suggestions!

License

MIT - see LICENSE file for details
