Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals
AbstractA growing body of work makes use of probing in order to investigate the working of neural models, often considered black boxes.Recently, an ongoing debate emerged surrounding the limitations of the probing paradigm.In this work, we point out the inability to infer behavioral conclusions from probing results, and offer an alternative method