Data-Centric Machine Learning for Speech and Audio