Finding good policies in average-reward Markov Decision Processes without prior knowledge | Read Paper on Bytez