MediaPipe is Google's open-source framework for customizable multi-modal processing of data streams (e.g., video, audio, depth). It provides easy deployment on mobile and edge devices, with capabilities like pose estimation, face detection, tracking and augmented reality, among others. While it does not provide any new models per se, it simplifies the process of using state-of-the-art models in real time applications by providing a unified interface for various tasks.