Automatically redact PII from audio and video with Python
This tutorial teaches how to redact Personal Identifiable Information (PII) from audio and video files using the AssemblyAI Python SDK. The process involves setting up an environment, transcribing the file, printing the redacted transcript, fetching the redacted audio file, and running the program. PII categories such as medical conditions, email addresses, and credit card numbers can be redacted from both audio/video files and their textual transcripts. The AssemblyAI API key is required for this process, which can be obtained for free.
Company
AssemblyAI
Date published
March 18, 2024
Author(s)
Ryan O'Connor
Word count
1092
Hacker News points
None found.
Language
English