Trustworthy Machine Learning Under Imperfect Data: Data Collection, Robust Learning, And Real-World Considerations