PokemonPlatinum.AI 👾

This project develops a reinforcement learning model that learns to play Pokémon Platinum, driving the game through a desktop emulator such as DeSmuME. The model is then trained further with RLHF (Reinforcement Learning from Human Feedback).

Project Overview ✅

Develop a YOLOv9-based computer vision model to annotate gameplay states
Build the gameplay model and tune its hyperparameters (PyTorch DQN)
Pretrain the gameplay model on gameplay states, actions, and annotations
Continue training the gameplay model with reinforcement learning from human feedback (RLHF)
Containerize and deploy the model (Docker, FastAPI)
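
As a rough illustration of how a DQN-style gameplay model typically turns its predicted Q-values into a button press, here is a minimal epsilon-greedy sketch. It uses only the standard library. The `ACTIONS` list and the `select_action` helper are assumptions made for this example, not names taken from the repository.

```python
import random
from typing import Optional, Sequence

# Hypothetical action set (assumption): buttons the agent could press.
ACTIONS = ["up", "down", "left", "right", "a", "b"]


def select_action(
    q_values: Sequence[float],
    epsilon: float,
    rng: Optional[random.Random] = None,
) -> int:
    """Return an action index using epsilon-greedy selection.

    With probability `epsilon` a random action is explored;
    otherwise the action with the highest Q-value is exploited.
    """
    source = rng if rng is not None else random
    if source.random() < epsilon:
        return source.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])


# Example: one Q-value per entry in ACTIONS, no exploration.
best = select_action([0.1, 0.9, 0.3, 0.0, 0.2, 0.4], epsilon=0.0)
chosen_button = ACTIONS[best]
```

A common design choice is to start with a high `epsilon` and decay it over training, so the agent explores early and relies on its learned Q-values later.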

RLHF Training Phases 🏋🏽

The overall goal of this project is to train the model to travel from Jubilife City to Oreburgh City, beating at least one NPC trainer along the way. For context, to get from Jubilife City to Oreburgh City in Pokémon Platinum, the player leaves Jubilife City, passes through Route 203, goes through Oreburgh Cave (named Oreburgh Gate in-game), and then enters Oreburgh City.

For a human, this process should take no more than a few minutes. For the AI... we'll find out!

Phase 1: Train the model to leave Jubilife City via Route 203
Phase 2: Once the model has left Jubilife City and reached Route 203, train it to challenge and beat one NPC trainer
Phase 3: Finally, train the model to travel through Oreburgh Cave to Oreburgh City

Once the model has entered Oreburgh City, the run is considered successful and training ends.
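
The phased goals can be sketched as a small curriculum table with a completion check per phase. This is only an illustration under stated assumptions: the `Phase` dataclass, the location labels (`"route_203"`, `"oreburgh_city"`), and the `trainer_wins` counter are hypothetical, and how the real scripts detect the agent's location is not described here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Phase:
    number: int
    goal_location: str       # hypothetical label for where the phase ends
    needs_trainer_win: bool  # whether one trainer battle must be won


# Assumed encoding of the three phases described above.
PHASES = [
    Phase(1, "route_203", False),
    Phase(2, "route_203", True),
    Phase(3, "oreburgh_city", False),
]


def phase_complete(phase: Phase, location: str, trainer_wins: int) -> bool:
    """Return True when the agent has met the goal of the given phase."""
    if phase.needs_trainer_win and trainer_wins < 1:
        return False
    return location == phase.goal_location


def next_phase(phase: Phase) -> Optional[Phase]:
    """Return the following phase, or None after the final phase."""
    idx = PHASES.index(phase)
    return PHASES[idx + 1] if idx + 1 < len(PHASES) else None
```

Keeping the goals in data rather than in each training script would let all three phases share one training loop, with only the active `Phase` changing.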

Directory 📍

📁 Annotated_images: Folder containing the actions, states, and labels for model pretraining before RL
    📄 action_map.py:
    📄 action_prep.py: Script that displays an image, prompts for an action, then repeats until all images have been processed
    📄 annotations_errorcheck.py: Script that runs through both the original and annotated file paths to ensure the metadata matches
    📄 file_prep.py: Script to prep the files for usage in the models (renaming to a standardised naming system)
    📁 Actions: Folder that contains the actions (.json) for model pretraining
    📁 Images: Folder that contains the images (.png) for model pretraining
    📁 Labels: Folder that contains the labels (.txt) for model pretraining
📁 models: Folder that contains the model architecture, and the .pth files for each phase's trained model
    📄 PokemonModelLSTM.py: Modular script that contains the model architecture
📁 RLHF_Scripts: Folder that contains scripts and modular code for RLHF Model
    📁 human_review_logs: Folder that holds the final state of each training episode for human review
    📄 rlhf_phase1.py: Script for training initial model for Phase 1 goal
    📄 rlhf_phase2.py: Script for training Phase 1 model for Phase 2 goal
    📄 rlhf_phase3.py: Script for training Phase 2 model for Phase 3, final goal
    📁 modular_scripts: Folder for modular scripts used across the different RLHF training phases
        📄 load_model.py: Loads the model and its state dict (dependent on phase) for use in RLHF training
        📄 rlhf_utils.py: Collection of functions used throughout training (emulator connection, actions, rewards, etc.)
📁 runs: Folder which contains the trained annotation models
📄 annotation_model.ipynb: Notebook for training the annotation model from YOLO
📄 model_pretraining.ipynb: Notebook to pretrain the model based on gameplay screenshots before RLHF
📄 requirements.txt
📄 screenshots.py: Script to take screenshots of gameplay and store them for training
📄 video_extraction.py: Script to extract gameplay footage from videos of gameplay
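
Because pretraining relies on each image having a matching label and action file, a quick consistency check across the three data folders can catch gaps early. The sketch below is not the repository's `annotations_errorcheck.py`. It is a simplified check that assumes files in `Images`, `Labels`, and `Actions` share a common file stem, and `find_unmatched` is a hypothetical helper name.

```python
from pathlib import Path
from typing import Dict, Set

# Sub-folder name -> file extension, as listed in the directory above.
EXPECTED = {"Images": ".png", "Labels": ".txt", "Actions": ".json"}


def find_unmatched(root: Path) -> Dict[str, Set[str]]:
    """Map each sub-folder to the file stems that are missing from it.

    Assumes (for this sketch) that an image, its label, and its action
    share the same stem, e.g. frame_0001.png / .txt / .json.
    """
    stems = {
        folder: {p.stem for p in (root / folder).glob(f"*{ext}")}
        for folder, ext in EXPECTED.items()
    }
    all_stems = set().union(*stems.values())
    return {
        folder: all_stems - found
        for folder, found in stems.items()
        if all_stems - found
    }
```

An empty result means every stem appears in all three folders; otherwise the returned mapping lists which stems each folder lacks.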

scottpitcher/PokemonPlatinum.AI
