
exps:
- mnli->snli
    - diagonal
    - lrm
    - maybe lrm + expansion
- snli->snli [maybe not include this one at all?]
    - lrm
    - maybe lrm + expansion
- qqp
    - lrm?
    - lrm + expansion
- imagenet
    - diagonal
    - lrm


- perturbations to run (-r means ready to run, -R means running)
    x mnli->snli diagonal
    x mnli->snli lrm
    x mnli->snli lrm + expansion?
    x snli->snli lrm?
    x snli->snli lrm + expansion?
    x qqp lrm?
    -r qqp lrm + expansion?
    x imagenet diagonal
    x imagenet lrm


- baselines
    - mnli->snli + imagenet
    - pca + k-means
    - need tcav t-test for this and others
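
Rough sketch of the t-test the TCAV item above asks for. Assumption: the test compares TCAV scores from several concept-CAV training runs against scores from random CAVs, using Welch's t statistic. The function name `welch_t` and all score values are made-up placeholders, not results.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom) for two samples."""
    na, nb = len(a), len(b)
    # Squared standard errors of each sample mean (sample variance / n).
    va, vb = variance(a) / na, variance(b) / nb
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df


# Placeholder TCAV scores (made up), one per CAV training run.
concept_scores = [0.81, 0.78, 0.85, 0.80, 0.83]
random_scores = [0.52, 0.47, 0.55, 0.49, 0.50]

t_stat, dof = welch_t(concept_scores, random_scores)
print(f"t = {t_stat:.2f}, df = {dof:.1f}")
```

This only gives the statistic and degrees of freedom. If SciPy is available, `scipy.stats.ttest_ind(a, b, equal_var=False)` runs the same Welch test and also returns a p-value.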


- sparsity check stuff
    - diagonal
    - probably lrm too


- First present lrm mnli->snli (and probably lrm qqp) and diagonal imagenet
    - Look at components
    - Look at component perturbation selectivities and TCAVs
- Discuss expansion experiments (on wrongly predicted examples)
    - qqp, maybe mnli->snli
    - see if the expansion components have a higher fraction of incorrect predictions
    - Mention different reasons why a component might have a high number of incorrect predictions
        - labelling issue
        - completely wrong heuristic
        - flawed heuristic
    - "Applications" involving incorrectly predicted components:
        - Inconsistent labelling
            - probably only qqp
            - I think the version where we look at top examples from the data used to fit the npeff
              had cleaner components; the train/test split is not an issue for detecting inconsistent labelling
            - Maybe baselines of detection?
                - probably, at least in appendix, the method from one paper is simple and only uses logits
        - Incorrect heuristic fixing
            - only snli->mnli
            - single comps
            - idk whether to use expansion or not
            - multiple comps
            - discuss limitations and what we are doing by showing this
- Baselines
    - Have a look at diagonal mnli->snli and lrm imagenet
        - Discuss why they are worse
    - Maybe perturbation without the semi-orthogonalization, maybe relegate to appendix.
    - Sparsity check stuff
    - pca + k-means
        - Maybe discuss components a bit, try to use TCAV to quickly explain stuff
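
Rough sketch for the expansion item above ("see if the expansion components have a higher fraction of incorrect predictions"). Assumptions: each example has a non-negative coefficient per component, and a component is summarized by its top-k examples. The function name `top_k_incorrect_fraction` and the toy data are hypothetical.

```python
def top_k_incorrect_fraction(coeffs, correct, k):
    """For each component, fraction of incorrect predictions among its top-k examples.

    coeffs:  list of per-example coefficient lists, coeffs[i][c]
    correct: list of bools, correct[i] is True if example i was predicted correctly
    """
    n_components = len(coeffs[0])
    fractions = []
    for c in range(n_components):
        # Rank examples by their coefficient on component c, largest first.
        top = sorted(range(len(coeffs)), key=lambda i: coeffs[i][c], reverse=True)[:k]
        fractions.append(sum(not correct[i] for i in top) / len(top))
    return fractions


# Toy data (made up): 6 examples, 2 components.
coeffs = [[0.9, 0.0], [0.8, 0.1], [0.7, 0.0], [0.0, 0.9], [0.1, 0.8], [0.0, 0.7]]
correct = [True, True, False, False, False, True]

fractions = top_k_incorrect_fraction(coeffs, correct, k=3)
# Error rate over all examples, as the reference point for each component.
overall = sum(not c for c in correct) / len(correct)
print(fractions, overall)
```

Comparing each component's fraction against `overall` is one simple way to flag components dominated by incorrect predictions; whether top-k or a coefficient threshold is the better selection rule is left open here.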
