On a Method to Measure Supervised Multiclass Model’s Interpretability: Application to Degradation Diagnosis (Short Paper)

2024 Dagstuhl Research Online Publication Server 12,892 citations

Abstract

In an industrial maintenance context, degradation diagnosis is the problem of determining the current level of degradation of operating machines based on measurements. With the emergence of Machine Learning techniques, such a problem can now be solved by training a degradation model offline and by using it online. While such models are more and more accurate and performant, they are often black-box and their decisions are therefore not interpretable for human maintenance operators. On the contrary, interpretable ML models are able to provide explanations for the model’s decisions and consequently improves the confidence of the human operator about the maintenance decision based on these models. This paper proposes a new method to quantitatively measure the interpretability of such models that is agnostic (no assumption about the class of models) and that is applied on degradation models. The proposed method requires that the decision maker sets up some high level parameters in order to measure the interpretability of the models and then can decide whether the obtained models are satisfactory or not. The method is formally defined and is fully illustrated on a decision tree degradation model and a model trained with a recent neural network architecture called Multiclass Neural Additive Model.

Keywords

InterpretabilityComputer scienceUnificationMachine learningClass (philosophy)Artificial intelligenceConsistency (knowledge bases)IntuitionUnified ModelShapley valueFeature (linguistics)Theoretical computer scienceMathematicsGame theory

Affiliated Institutions

Related Publications

Publication Info

Year
2024
Type
preprint
Citations
12892
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

12892
OpenAlex
0
Influential

Cite This

Scott Lundberg, Su‐In Lee (2024). On a Method to Measure Supervised Multiclass Model’s Interpretability: Application to Degradation Diagnosis (Short Paper). Dagstuhl Research Online Publication Server . https://doi.org/10.4230/oasics.dx.2024.27

Identifiers

DOI
10.4230/oasics.dx.2024.27

Data Quality

Data completeness: 77%