Causal Inference in Data Analysis with Applications to Fairness and Explanations

Sudeepa Roy; Babak Salimi

Published: 01 Jan 2022, Last Modified: 09 Feb 2025, RW 2022, CC BY-SA 4.0

Abstract: Causal inference is a fundamental concept that goes beyond simple correlation and model-based prediction, and is highly relevant in domains such as health, medicine, and the social sciences. It enables estimation of the impact of an intervention or treatment on the world, making it critical for sound and robust policy making. However, randomized controlled experiments, typically considered the gold standard for drawing causal conclusions, are often infeasible due to ethical, cost, or other constraints. Fortunately, there is a rich literature in Artificial Intelligence (AI), Machine Learning (ML), and Statistics on observational studies, which are methods for causal inference on observed or collected data under certain assumptions. In this paper, we provide an overview of popular formal and rigorous techniques for causal inference on observed data from the AI and Statistics literature. Furthermore, we discuss how concepts from causal inference can be used to reason about fairness and enable explainability in machine learning models, both of which are critical in responsible data science when ML is used to make high-stakes decisions in various contexts. Our discussion highlights the importance of using causal inference in ML models and provides insights into how to develop more transparent and responsible AI systems.
