Skip to content
View Dev1nW's full-sized avatar

Block or report Dev1nW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Dev1nW/README.md

πŸ‘‹ Hi, I'm Devin!

I'm a Machine Learning Researcher with the Army Educational Outreach Program (AEOP). With over three years of combined professional and academic research experience, I specialize in investigating the emergent capabilities of Large Language Models (LLMs), particularly in interactive environments (like Atari!), alongside expertise in Reinforcement Learning (RL) and Reinforcement Learning from Human Feedback (RLHF), specifically Rating-based RL (RbRL). My focus is on leveraging human insights to enhance AI system learning and alignment.


🧠 Skills

  • Programming & Libraries: Python, PyTorch, TensorFlow, Stable Baselines3, Apple MLX, NumPy, Pandas, Matplotlib, Gymnasium
  • APIs & Tools: OpenAI API, Google Gemini API, Hugging Face API, Git & GitHub
  • AI/ML Concepts: Reinforcement Learning (RL), Reinforcement Learning from Human Feedback (RLHF), Rating-based RL (RbRL), Large Language Models (LLMs), Natural Language Processing (NLP)

πŸ”­ Research Interests

  • Reinforcement Learning from Human Feedback (RLHF) and Alignment: Specifically focusing on using ratings, as in Rating-Based Reinforcement Learning.
  • Large Language Models (LLMs): Exploring emergent capabilities, agentic behavior, and applications, including using LLMs in playing Atari games, as in Atari-GPT.
  • Human-AI Interaction: Designing and studying systems where human input guides AI learning.

πŸ“š Publications

  • RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning (Accepted, AAAI 2025 Bridge Program - Collaborative AI and Modeling of Humans) [ArXiv]
  • Performance Optimization of Ratings-Based Reinforcement Learning (Accepted, AAAI 2025 Bridge Program - Collaborative AI and Modeling of Humans) [ArXiv]
  • Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games (Accepted, AAAI 2025 Workshop - Toward Knowledgable Foundation Models) [ArXiv] [Code]
  • Rating-Based Reinforcement Learning (AAAI 2024) [Paper] [Code] (Also presented at ICML 2023 Workshop - Many Facets of Preference-Based Learning [Workshop Paper])
  • Deep Reinforcement Learning-based Optimal Time-constrained Intercept Guidance (AIAA GNC 2024) [Paper]
  • Reinforcement Learning From Human Ratings (Master's Thesis) [Thesis]

πŸ’» Key Projects


πŸ“« Connect with me:

LinkedIn Twitter Google Scholar Website

Pinned Loading

  1. Simplified-Rating-and-Preference-RL Simplified-Rating-and-Preference-RL Public

    Simplified, modern implementation of Rating and Preference-based Reinforcement Learning.

    Python

  2. atari-gpt atari-gpt Public

    Forked from nwayt001/atari-gpt

    Official Codebase for Atari-GPT

    Python

  3. Rating-based-Reinforcement-Learning Rating-based-Reinforcement-Learning Public

    Official Codebase for Rating-Based Reinforcement Learning.

    Python 3

  4. Sign_language_recognition_final_project Sign_language_recognition_final_project Public

    Sign Language Recognition code using GRU, LSTM and Simple RNN.

    Jupyter Notebook 1

  5. Gemini_Research_Reviewer Gemini_Research_Reviewer Public

    Gemini Research Reviewer is an AI-powered tool that provides instant, constructive feedback on research papers, helping researchers improve their work with actionable insights and refined writing s…

    Python 1

  6. ASCII_Breakout ASCII_Breakout Public

    An ASCII version of Breakout aimed at having Large Language Models play Breakout.

    Python 1