Recent studies have found that even highly curated machine learning benchmark datasets contain label errors, which can significantly impact model performance. The open-source cleanlab library provides a standard framework for identifying and addressing these issues in real-world data. In this hands-on blog, the authors demonstrate how to use cleanlab to find label problems in the IMDb movie review text classification dataset and improve models without changing them. They also provide code examples for implementing the workflow on other datasets.