I am pleased to announce that the 15th revision of the paper is now available on arXiv! It includes various minor updates and a new section on multi-agent training, which I'll be discussing in the next paper. Click here for the arXiv page and download!