Use more ideas from the prioritised replay paper: https://arxiv.org/pdf/1511.05952.pdf - Replay data on different device - use sum tree data structure - Include temperature parameter
Use more ideas from the prioritised replay paper: https://arxiv.org/pdf/1511.05952.pdf