This project trains two agents to collaboratively play a table tennis game while, at the same time, competing with each other to achieve their best scores. The environment used is Tennis.
In this environment, two agents control rackets to bounce a ball over a net. If an agent hits the ball over the net, it receives a reward of +0.1. If an agent lets a ball hit the ground or hits the ball out of bounds, it receives a reward of -0.01. Thus, the goal of each agent is to keep the ball in play.
The observation space consists of 8 variables corresponding to the position and velocity of the ball and racket. Each agent receives its own, local observation. Two continuous actions are available, corresponding to movement toward (or away from) the net, and jumping.
The task is episodic, and the best score achieved in this solution is 0.9. The environment is considered solved when the average of those scores over 100 episodes is at least +0.5.
In this project the two agents collaborate by sharing each other's perspective of the environment state and by storing experience in a common replay buffer.
Each agent is trained individually on the rewards it receives in its own context, and each agent has its own actor and critic networks that are trained for its best performance.
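A minimal sketch of how this layout could be wired up (hypothetical helper names and sizes; the real networks and agent logic live in `model.py` and `ddpg_agent.py`):

```python
import random
from collections import deque, namedtuple

import torch.nn as nn

# Tiny stand-ins for the real actor/critic networks defined in model.py.
# The per-agent observation size of 24 (stacked frames of the 8 variables) is assumed.
def make_actor(state_size=24, action_size=2):
    return nn.Sequential(nn.Linear(state_size, 128), nn.ReLU(),
                         nn.Linear(128, action_size), nn.Tanh())

def make_critic(state_size=24, action_size=2):
    return nn.Sequential(nn.Linear(state_size + action_size, 128), nn.ReLU(),
                         nn.Linear(128, 1))

# Each agent gets its own actor and critic ...
agents = [{"actor": make_actor(), "critic": make_critic()} for _ in range(2)]

# ... but both agents push their experience into one shared replay buffer,
# so each also learns from the other's perspective of the environment.
Experience = namedtuple("Experience", "state action reward next_state done")
shared_buffer = deque(maxlen=100000)

def store(state, action, reward, next_state, done):
    shared_buffer.append(Experience(state, action, reward, next_state, done))

def sample(batch_size=128):
    return random.sample(shared_buffer, k=batch_size)
```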
- Install Anaconda using the installer at https://www.anaconda.com/products/individual for your operating system.
- Create (and activate) a new environment with Python 3.6.
`conda create --name drl python=3.6`
`conda activate drl`
- Clone the repository, navigate to the `python/` folder, and install the dependencies.
`git clone https://github.com/KanikaGera/Multi-Agent-Collabrative-Comparitve-Environment.git`
`cd python`
`conda install pytorch=0.4.1 cuda90 -c pytorch`
`pip install .`
- `Train_Tennis.py` is the Python script that interacts with the Unity environment and trains the agents.
- `nohup.out` is the output file containing the training logs produced while training the agents.
- `Train_Tennis.ipynb` is the Jupyter-notebook version of `Train_Tennis.py`, for easier access to the code. The output of the cell that calls the training function shows scores up to 10000 episodes; the actual environment was solved in 12000 episodes.
- `model.py` contains the structure of the RL model, coded in PyTorch.
- `ddpg_agent.py` contains the implementation of the DDPG algorithm.
- `saved/solution_actor_local_[1 or 2].pth` are the saved weights of the trained local actor networks for agents 1 and 2 respectively.
- `saved/solution_actor_target_[1 or 2].pth` are the saved weights of the trained target actor networks for agents 1 and 2 respectively.
- `saved/solution_critic_local_[1 or 2].pth` are the saved weights of the trained local critic networks for agents 1 and 2 respectively.
- `saved/solution_critic_target_[1 or 2].pth` are the saved weights of the trained target critic networks for agents 1 and 2 respectively.
- `saved/scores.list` contains the scores saved while training the model.
- The `videos` folder contains video clips of the trained agents playing.
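For a quick look at the training history, the saved scores can be reloaded and checked against the +0.5 target roughly as follows (a minimal sketch; it assumes `saved/scores.list` was written with `pickle`, which may differ from how `Train_Tennis.py` actually stores it):

```python
import pickle

import numpy as np

# Assumes the scores were pickled as a plain Python list of per-episode scores;
# adjust the loading step if the training script stores them differently.
with open("saved/scores.list", "rb") as f:
    scores = pickle.load(f)

# 100-episode moving average used for the +0.5 "solved" criterion.
moving_avg = [np.mean(scores[max(0, i - 99):i + 1]) for i in range(len(scores))]
print("Best episode score:", max(scores))
print("Best 100-episode average:", max(moving_avg))
```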
- Install the dependencies by following the commands in Getting Started.
- Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis_Linux.zip)
- Mac OSX: [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis.app.zip)
- Windows (32-bit): [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis_Windows_x86.zip)
- Windows (64-bit): [click here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis_Windows_x86_64.zip)
(_For Windows users_) Check out [this link](https://support.microsoft.com/en-us/help/827218/how-to-determine-whether-a-computer-is-running-a-32-bit-version-or-64) if you need help with determining if your computer is running a 32-bit version or 64-bit version of the Windows operating system.
(_For AWS_) If you'd like to train the agent on AWS (and have not [enabled a virtual screen](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-on-Amazon-Web-Service.md)), then please use [this link](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis_Linux_NoVis.zip) to obtain the "headless" version of the environment. You will **not** be able to watch the agent without enabling a virtual screen, but you will be able to train the agent. (_To watch the agent, you should follow the instructions to [enable a virtual screen](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Training-on-Amazon-Web-Service.md), and then download the environment for the **Linux** operating system above._)
- Place the file in the GitHub repository, in the main folder, and unzip (or decompress) it; the path to the unzipped build is what you open from Python (see the sketch after this list).
- Create an [IPython kernel](http://ipython.readthedocs.io/en/stable/install/kernel_install.html) for the `drl` environment.
`python -m ipykernel install --user --name drl --display-name "drl"`
- Before running code in a notebook, change the kernel to match the `drl` environment by using the drop-down `Kernel` menu.
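Once the environment archive is unzipped into the main folder, it can be opened from Python roughly as follows (a minimal sketch using the `unityagents` package installed during setup; the `file_name` depends on which build you downloaded and where you placed it):

```python
from unityagents import UnityEnvironment

# Point file_name at the build you downloaded, e.g. "Tennis.app" on macOS or
# "Tennis_Windows_x86_64/Tennis.exe" on 64-bit Windows (path assumed here).
env = UnityEnvironment(file_name="Tennis_Linux/Tennis.x86_64")

brain_name = env.brain_names[0]
brain = env.brains[brain_name]

env_info = env.reset(train_mode=True)[brain_name]
print("Number of agents:", len(env_info.agents))
print("Observation size per agent:", env_info.vector_observations.shape[1])
print("Action size:", brain.vector_action_space_size)
env.close()
```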
- Launch Jupyter Notebook and open `Train_Tennis.ipynb`.
- Run the cells to train the model.
- Activate the `drl` environment.
`conda activate drl`
- Use `nohup` to run the training in the background.
`nohup python -u Train_Tennis.py &`
- The output can be followed in real time in the `nohup.out` file.
`tail -f nohup.out`
- Open `Evaluate_Tennis.ipynb`. The first half of the notebook plots the scores and the average score during training.
- Run the cells to plot the graph and analyze the rewards achieved during training.
- Test the trained agents by running the cells marked for testing in the notebook; a rough sketch of such a test loop is shown below.
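The testing cells roughly amount to the following loop (a simplified sketch, not the exact notebook code; the `Actor` constructor arguments, the environment path, and the observation size of 24, i.e. stacked frames of the 8 observation variables, are assumptions to check against `model.py` and the environment):

```python
import numpy as np
import torch
from unityagents import UnityEnvironment
from model import Actor  # network definition from this repository

env = UnityEnvironment(file_name="Tennis_Linux/Tennis.x86_64")  # adjust to your build
brain_name = env.brain_names[0]

# Restore the trained local actors (constructor arguments assumed;
# the real class may also take a random seed -- see model.py).
actors = []
for i in (1, 2):
    actor = Actor(24, 2)
    actor.load_state_dict(torch.load(f"saved/solution_actor_local_{i}.pth",
                                     map_location="cpu"))
    actor.eval()
    actors.append(actor)

env_info = env.reset(train_mode=False)[brain_name]  # train_mode=False to watch the match
states = env_info.vector_observations
scores = np.zeros(2)
while True:
    with torch.no_grad():
        actions = np.vstack([actor(torch.from_numpy(s).float().unsqueeze(0)).numpy()
                             for actor, s in zip(actors, states)])
    env_info = env.step(np.clip(actions, -1, 1))[brain_name]
    scores += env_info.rewards
    states = env_info.vector_observations
    if np.any(env_info.local_done):
        break
print("Episode score (max over both agents):", scores.max())
env.close()
```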
The Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm is used to train the agents. A report with a detailed analysis is attached in the main folder.
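For reference, the core update each agent performs on a batch sampled from the shared buffer looks roughly like this (a simplified sketch, not the exact code in `ddpg_agent.py`; the attribute names on `agent` and the `gamma`/`tau` values are assumptions):

```python
import torch
import torch.nn.functional as F

def ddpg_learn(agent, experiences, gamma=0.99, tau=1e-3):
    """One critic/actor update from a sampled batch (simplified sketch)."""
    states, actions, rewards, next_states, dones = experiences

    # Critic: regress Q(s, a) toward the one-step TD target built with the target networks.
    with torch.no_grad():
        next_actions = agent.actor_target(next_states)
        q_targets = rewards + gamma * agent.critic_target(next_states, next_actions) * (1 - dones)
    critic_loss = F.mse_loss(agent.critic_local(states, actions), q_targets)
    agent.critic_optimizer.zero_grad()
    critic_loss.backward()
    agent.critic_optimizer.step()

    # Actor: ascend the critic's estimate of Q(s, actor(s)).
    actor_loss = -agent.critic_local(states, agent.actor_local(states)).mean()
    agent.actor_optimizer.zero_grad()
    actor_loss.backward()
    agent.actor_optimizer.step()

    # Soft-update the target networks toward the local networks.
    for target, local in ((agent.actor_target, agent.actor_local),
                          (agent.critic_target, agent.critic_local)):
        for t_param, l_param in zip(target.parameters(), local.parameters()):
            t_param.data.copy_(tau * l_param.data + (1.0 - tau) * t_param.data)
```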