I am fortunately supervised by professor Csaba Szepesvári. My goal is to have a better theoretical understanding about policy gradient and actor-critic methods!