-
Will Our Tactile Sensors Shape the Direction of My Research?
A collection of thoughts lingering on my mind about the possible scope of my research.
-
Teaching PPO Agent to Play Flappy Bird with SB3/Gymnasium
In this post, I explored a toy example to learn Stable Baselines3 (SB3) and Gymnasium.
-
Bi-Weekly Advisor Meeting 18th of December 2025
-
Bi-Weekly Advisor Meeting 27 of November 2025
-
Why does the Policy Improvement Theorem not hold true in function approximation