Efficient similarity computations on parallel machines using data shaping