Thermodynamics-inspired explanations of artificial intelligence
Interpretation unfaithfulness (\(\mathcalU\)) for surrogate model construction Our starting point is some given dataset \(\mathcalX\) and corresponding predictions g coming from a black-box model. For a particular element \(x\in {\mathcalX}\),…