[a] **Quotation:**  
"Training, validation and testing data sets shall be subject to data governance and management practices appropriate for the intended purpose of the high-risk AI system. Those practices shall concern in particular: (b) data collection processes and the origin of data, and in the case of personal data, the original purpose of the data collection;"  

[b] **Guideline:**  
Organizations must ensure that all personal data included in training datasets were originally collected with purposes compatible with AI training use cases, fully documented, and compliant with data protection regulations. Secondary uses of data, such as repurposing historical recruitment records for model training, must respect candidate consent and legal bases. Data provenance should be transparent and verifiable.  

[c] **Violation:**  
Meridian Analytics Solutions incorporated legacy candidate data into training without confirming that the original data collection explicitly covered usage for AI-driven recruitment scoring or analytics. Some of the data was collected for administrative hiring purposes but repurposed for automated decision-making without refreshed consent or clear documentation, creating a mismatch between original data use and training objectives.  

[d] **Justification:**  
This violates the requirement to align data collection and origin with intended AI training purposes under Article 10(2)(b). It is a subtle non-compliance because legacy data reuse is common in enterprise settings and may appear lawful superficially; however, failing to secure explicit, purpose-compatible permissions risks privacy infringements and undermines lawful data governance foundational to trustworthy and compliant high-risk AI systems.