Haitham Bou AmmarFeb 14, 201915 min readReinforcement Learning as Probabilistic Modelling: A Variational Inference Formulation (Part I) WIPReinforcement Learning is concerned with an agent attempting to acquire optimal behaviour in unknown environments that typically exhibit...