Exploration And Value Function Factorisation In Single And Multi-Agent Reinforcement Learning