Learning deep policies for physics-based robotic manipulation in cluttered real-world environments