Hierarchical Sampling For Least-Squares Policy Iteration