Updated report

frangente · Feb 11, 2024 · 7cc723a · 7cc723a
1 parent c85dee2
commit 7cc723a
Show file tree

Hide file tree

Showing 4 changed files with 21 additions and 21 deletions.
diff --git a/report/src/sections/background.tex b/report/src/sections/background.tex
@@ -19,13 +19,14 @@ \subsection{Monte Carlo Tree Search}
               \item $c$ is the exploration parameter, usually equal to $\sqrt{2}$
           \end{itemize}
 
-    \item \textbf{Expansion}: if $L$ isn't a terminal node (i.e. there are valid moves that can be performed starting from the game state in $L$ ) pick a node $C$ among its children that haven't been yet expanded
+    \item \textbf{Expansion}: if $L$ isn't a terminal node (i.e. there are valid moves that can be performed starting from the game state in $L$ ), pick a node $C$ among its children that haven't been yet expanded
 
     \item \textbf{Simulation}: starting from $C$, randomly choose valid moves until a terminal state is reached and the game is decided (i.e. win/loss/draw)
 
     \item \textbf{Backpropagation}: the result of the simulation is used to update the statistics (number of wins and number of visits) for all nodes along the path from $C$ to $R$ that are then used to compute the UCTs for the following iterations
 \end{enumerate}
 
+
 \begin{figure}
     \centering
     \includegraphics[width=\linewidth]{figures/mcts.png}

diff --git a/report/src/sections/introduction.tex b/report/src/sections/introduction.tex
@@ -2,7 +2,7 @@ \section{Introduction}
 
 In artificial intelligence, an agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators \parencite{russel2010}. However, such a broad definition is not particularly useful. Much more interesting is the idea of an agent (or system of agents) that operates with the objective to maximize its expected performance measure given its available knowledge and resources. Still, intelligent agents need not only to be rational, but also autonomous, that is, they should act independently by external control and learn to compensate for their partial or incorrect prior knowledge of the world.
 
-Based on these ideas, the hereby presented project involved creating an autonomous and rational (multi-) agent system to play the game of Deliveroo. The task is inspired by the real-world problem of courier services, where parcels have to be delivered to different locations in a timely and efficient manner. In more details, the game is played on a two-dimensional map, where parcels are randomly generated and spawns with a random reward value that may decay over time. The objective of the agents is to collect and deliver parcels to the designated delivery locations in order to maximize their total reward.
+Based on these ideas, the hereby presented project involved creating an autonomous and rational (multi-) agent system to play the game of Deliveroo. The task is inspired by the real-world problem of courier services, where parcels have to be delivered to different locations in a timely and efficient manner. In more details, the game is played on a two-dimensional map, where parcels are randomly generated and spawn with a random reward value that may decay over time. The objective of the agents is to collect and deliver parcels to the designated delivery locations in order to maximize their total reward.
 
 To make the problem more challenging, the agents have to deal with a number of constraints and limitations. First, while the underlying mechanics of the game are known, the environment is only partially observable to the agents which can only perceive their surroundings within a certain radius. Second, the agents have to deal with the stochastic and dynamic nature of the game, as the parameters of the game (e.g. the number of parcels, their reward distribution, the movement speed of the agents) can change from one game to another. As an additional challenge, other agents can be present in the environment and compete for the same resources. Such agents can be either adversarial or cooperative, thus requiring the agents to adapt their strategies accordingly.