With Winograd, it can be a bit hard to tell whether a component is not working due to either
the model incorrectly applying specific knowledge it has or whether the model does not have
the prerequisite knowledge in the first place.

I'll try to address this by using the components in a data selection processing in addition
to affecting the training procedure on that data.

I'm thinking something like we see a component associated with temperature. We filter
a big text dataset (say wikipedia) based on containing temperature-specific words. We then
need to get this knowledge into the model. I can maybe try to preferentially update parameters
with a high value for that (or those) components. IDK about the task.


Maybe "re-route" examples using "incorrect" components to "correct" components for similar
examples.
