\label{sec:conclusions}

In this paper, we provide a non-asymptotic analysis of IRM with sparsity constraints. 
First, we generalize the data model, relaxing the data model to allow for varying correlation between spurious features and the label.
Next, we provide the non-asymptotic results for sparse IRM, 
% general to an arbitrary number of environments. 
including a refinement and correction of previous work in sparse IRM, including 
theoretical guarantees for $L_1$- and $L_0$-constrained IRM, resulting in a sparse representation that selects invariant features. 
Finally, we demonstrate that these methods can be computed in a fast and efficient matter using projected gradient descent-based methods, 
and we provide experimental results that demonstrate improved test accuracy and time savings on domain generalization datasets.


