LAMDA-RL

We are a group of reinforcement learning researchers from LAMDA Group @ Nanjing University.

LAMDA-RL Lab

LAMDA-RL Lab advances the field of reinforcement learning and its application to general decision-making intelligence, pushing the boundaries of what is possible with RL techniques.

We focus on developing novel algorithms and architectures that enable RL systems to learn and make decisions in increasingly general and adaptable ways. Some key areas we are exploring include:

  • Imitation learning;
  • Offline reinforcement learning;
  • Model-based RL and world model learning;
  • Multi-agent and collaborative RL;
  • Planning and learning with large models.

Through both fundamental and applied research, we aim to create RL-based systems that exhibit truly intelligent and general decision-making capabilities. For more information about our lab and research, please visit our website: https://lamda-rl.nju.edu.cn/.

Pinned

  1. OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    Python 73 7

  2. ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 40 6

  3. PRDC Public

    Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    Python 18 3

  4. ACT Public

    Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)

    Python 13 3

  5. Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    Python 9

  6. CPR Public

    Forked from LyndonKong/CPR

    Python 3

Repositories

Showing 10 of 37 repositories
  • RIMRO Public

    A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.

    Python 1 0 0 0 Updated Apr 13, 2025
  • CoLA Public
    Python 4 0 0 0 Updated Mar 26, 2025
  • GMAIL Public Forked from chaoningjing/GMAIL

    Author's official implementation of TPAMI paper "Generalizable Multi-modal Adversarial Imitation Learning for Non-stationary Dynamics"

    Python 0 1 0 0 Updated Mar 14, 2025
  • OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    Python 73 MIT 7 2 2 Updated Mar 4, 2025
  • Q-Adapter Public Forked from mansicer/Q-Adapter

    Author's implementation of ICLR'25 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"

    Python 1 Apache-2.0 1 0 0 Updated Feb 28, 2025
  • DORA Public Forked from Xinyuz26/DORA

    Code for ICML'24 paper "Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics"

    Python 0 1 0 0 Updated Feb 19, 2025
  • ADMPO Public Forked from HxLyn3/ADMPO

    Any-step Dynamics Model for Policy Optimization

    Python 5 MIT 6 0 0 Updated Feb 1, 2025
  • KALM Public Forked from CharlieBrown-v1/KALM

    [NeurIPS'24] KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

    1 3 0 0 Updated Jan 20, 2025
  • WiseRL Public Forked from typoverflow/WiseRL

    PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

    Python 1 MIT 2 0 0 Updated Dec 6, 2024
  • PRDC Public Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    Python 18 6 0 0 Updated Nov 8, 2024
