R2D2: Reducing Redundancy and Duplication in Data Lakes

Raunak Shah, Koyel Mukherjee, Atharv Tyagi, Sai Keerthana Karnam, Dhruv Joshi, Shivam Pravin Bhosale, Subrata Mitra

Published: 08 Dec 2023, Last Modified: 16 Mar 2026Proceedings of the ACM on Management of DataEveryoneRevisionsCC BY-SA 4.0
External IDs:doi:10.1145/3626762
Loading