TL;DR
- Stop worrying about algorithms, just change the network architecture to SimbaV2
+ Stop worrying about algorithms, just change the network architecture to SimbaV2
@@ -151,7 +152,7 @@ Scaling Network Size & UTD Ratio
Empiricial Analysis: Training Stability
- We track $4$ metrics during training to understand the learning dynamics of SimbaV2 and Simba:
+ We track average return and $4$ metrics during training to understand the learning dynamics of SimbaV2 and Simba:
- (a) Average normalized return across tasks
- (b) Weighted sum of $\ell_2$-norms of all intermediate features in critics
@@ -281,7 +282,7 @@ SimbaV2 with Online RL
Paper
SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee*, Youngdo Lee*, Takuma Seno, Donghu Kim, Peter Stone, Jaegul Choo
- arXiv preprint
+ arXiv preprint
@@ -290,21 +291,31 @@ Paper
-
-
-
- Citation
+
+ Citation
- If you find our work useful, please consider citing the paper as follows:
+ If you find our work useful, please consider citing the paper as follows:
-
-
-
+ @article{lee2025simbav2,
+ title={Hyperspherical Normalization for Scalable Deep Reinforcement Learning},
+ author={Hojoon Lee and Youngdo Lee and Takuma Seno and Donghu Kim and Peter Stone and Jaegul Choo},
+ journal={arXiv preprint arXiv:2502.15280},
+ year={2025},
+}
+
+
TL;DR
- Stop worrying about algorithms, just change the network architecture to SimbaV2 + Stop worrying about algorithms, just change the network architecture to SimbaV2
- We track $4$ metrics during training to understand the learning dynamics of SimbaV2 and Simba: + We track average return and $4$ metrics during training to understand the learning dynamics of SimbaV2 and Simba:
- (a) Average normalized return across tasks
- (b) Weighted sum of $\ell_2$-norms of all intermediate features in critics
@@ -281,7 +282,7 @@
SimbaV2 with Online RL
Paper
SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement LearningHojoon Lee*, Youngdo Lee*, Takuma Seno, Donghu Kim, Peter Stone, Jaegul Choo
- arXiv preprint
+ arXiv preprint
@@ -290,21 +291,31 @@
Paper
- -Citation
+Citation
- If you find our work useful, please consider citing the paper as follows: + If you find our work useful, please consider citing the paper as follows:
-@article{lee2025simbav2,
+ title={Hyperspherical Normalization for Scalable Deep Reinforcement Learning},
+ author={Hojoon Lee and Youngdo Lee and Takuma Seno and Donghu Kim and Peter Stone and Jaegul Choo},
+ journal={arXiv preprint arXiv:2502.15280},
+ year={2025},
+}
+