"""
Data Processing Module
======================

Data preprocessing utilities and an MLP probe pipeline for repeat detection.
"""