TensorNetworks

Last updated on Jun 7, 2022

This document contains only the notes taken as the research progressed. The notes are structured in order of progression and are unfiltered, raw thoughts and steps, with no regard for grammar.

We begin by creating a baseline: a simple CNN actor-critic policy from Stable-Baselines3, trained with the PPO algorithm. Once it is trained, we transfer the parameters from PyTorch to JAX and build a model, changing the policy network from a multilayer perceptron to a tensor network. (This work is in progress, so only the notes are available.)
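The two steps described above (moving parameters into JAX, then replacing a dense layer with a tensor network) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code used in this research: the helper names (`to_jax_params`, `dense_forward`, `init_tt_cores`, `tt_forward`, `tt_to_dense`), the two-core factorization, the shapes, and the rank are all hypothetical, and plain NumPy arrays stand in for a PyTorch `state_dict`.

```python
import numpy as np
import jax
import jax.numpy as jnp


def to_jax_params(state_dict):
    """Convert a name -> array mapping into JAX arrays.

    Assumption: with PyTorch, each value would first be converted with
    tensor.detach().cpu().numpy(); plain NumPy arrays stand in here.
    """
    return {name: jnp.asarray(np.asarray(v)) for name, v in state_dict.items()}


def dense_forward(params, x):
    """Linear layer in PyTorch's convention: weight has shape (out, in)."""
    return x @ params["weight"].T + params["bias"]


def init_tt_cores(key, in_dims, out_dims, rank, scale=0.1):
    """Two random cores of a hypothetical tensor-train style layer."""
    k1, k2 = jax.random.split(key)
    (i1, i2), (o1, o2) = in_dims, out_dims
    core_a = scale * jax.random.normal(k1, (i1, o1, rank))
    core_b = scale * jax.random.normal(k2, (rank, i2, o2))
    return core_a, core_b


def tt_forward(cores, x, in_dims):
    """Contract the reshaped input with both cores in a single einsum."""
    core_a, core_b = cores
    xb = x.reshape(x.shape[0], *in_dims)              # (batch, i1, i2)
    y = jnp.einsum("bij,ipr,rjq->bpq", xb, core_a, core_b)
    return y.reshape(x.shape[0], -1)                  # (batch, o1 * o2)


def tt_to_dense(cores):
    """Rebuild the equivalent (in, out) matrix, for checking only."""
    core_a, core_b = cores
    w = jnp.einsum("ipr,rjq->ijpq", core_a, core_b)
    i1, i2, o1, o2 = w.shape
    return w.reshape(i1 * i2, o1 * o2)


# Step 1: parameters arriving from another framework as NumPy arrays.
torch_like = {
    "weight": np.ones((3, 5), dtype=np.float32),
    "bias": np.zeros(3, dtype=np.float32),
}
params = to_jax_params(torch_like)
y_lin = dense_forward(params, jnp.ones((2, 5)))

# Step 2: a 64 -> 64 layer expressed as two small cores instead of one matrix.
key = jax.random.PRNGKey(0)
k_x, k_w = jax.random.split(key)
x = jax.random.normal(k_x, (4, 64))
cores = init_tt_cores(k_w, in_dims=(8, 8), out_dims=(8, 8), rank=4)
y_tt = tt_forward(cores, x, in_dims=(8, 8))
y_dense = x @ tt_to_dense(cores)  # same result via the full matrix
```

With these illustrative shapes the two cores hold 512 numbers, against 4096 for the equivalent dense 64 x 64 matrix; the rank controls that trade-off between parameter count and expressiveness.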

Mustafa Omar

Physics graduate and a Machine learning MSc student
