Model Prediction Accuracy Versus Interpretation in Machine Learning

阿新 • • 發佈：2019-01-12

In their book Applied Predictive Modeling, Kuhn and Johnson comment early on the trade-off of model prediction accuracy versus model interpretation.

For a given problem, it is critical to have a clear idea of the which is a priority, accuracy or explainability so that this trade-off can be made explicitly rather than implicitly.

In this post you will discover and consider this important trade-off.

Model Accuracy vs Explainability
Photo by Donald Hobern, some rights reserved

Accuracy and Explainability

Model performance is estimated in terms of its accuracy to predict the occurrence of an event on unseen data. A more accurate model is seen as a more valuable model.

Model interpretability provides insight into the relationship between in the inputs and the output. An interpreted model can answer questions as to why the independent features predict the dependent attribute.

The issue arises because as model accuracy increases so does model complexity, at the cost of interpretability.

Model Complexity

A model with higher the accuracy can mean more opportunities, benefits, time or money to a company. And as such prediction accuracy is optimized.

The optimization of accuracy leads to further increases in the complexity of models in the form of additional model parameters (and resources required to tune those parameters).

“Unfortunately, the predictive models that are most powerful are usually the least interpretable.“

A model with fewer parameters is easier to interpret. This is intuitive. A linear regression model has a coefficient per input feature and an intercept term. For example, you can look at each term and understand how they contribute to the output. Moving to logistic regression gives more power in terms of the underlying relationships that can be modeled at the expense of a function transform to the output that now too must be understood along with the coefficients.

A decision tree (of modest size) may be understandable, a bagged decision tree requires a different perspective to interpret why an event is predicted to occur. Pushing further, the optimized blend of multiple models into a single prediction may beyond meaningful or timely interpretation.

Accuracy Trumps Explainability

In their book, Kuhn and Johnson are concerned with model accuracy at the expense of interpretation.

They comment:

“As long as complex models are properly validated, it may be improper to use a model that is built for interpretation rather than predictive performance.“

Interpretation is secondary to model accuracy and they site examples such as discriminating email into spam and non-spam and the evaluation of a house as examples of problems where this is the case. Medical examples are touched on twice and in both cases are used to defend the absolute need and desirability for accuracy of explainability, as long as the models are appropriately validated.

I’m sure that “but I validated my model” would be no defense at an inquest when a model makes predictions that result in loss of life. Nevertheless, there is do doubt that this is an important issue that requires careful consideration.

Summary

Whenever you are modeling a problem, you are making a decision on the trade-off between model accuracy and model interpretation.

You can use knowledge of this trade-off in the selection of methods you use to model your problem and be clear of your objectives when presenting results.

Model Prediction Accuracy Versus Interpretation in Machine Learning

Accuracy and Explainability

Model Complexity

Accuracy Trumps Explainability

Summary

Model Prediction Accuracy Versus Interpretation in Machine Learning

機器學習筆記1 - Hello World In Machine Learning

Data Leakage in Machine Learning 機器學習訓練中的資料洩漏

Top 4 Steps for Data Preprocessing in Machine Learning

How Facebook Uses Bayesian Optimization to Conduct Better Experiments in Machine Learning Models

[Research] Help relating to a theorem in machine learning | AITopics

Regularization in Machine Learning: Connect the dots

Restoring balance in machine learning datasets

Vectorization Implementation in Machine Learning

Algorithmia Survey: Large Enterprises Have Taken the Lead in Machine Learning

Report: Large organizations are finding success in machine learning

Five steps for getting started in machine learning: Top data scientists share their tips

A new course to teach people about fairness in machine learning

A Quick Introduction to Text Summarization in Machine Learning

Evolutionary Algorithms: the Next Big Thing in Machine Learning?

conversations in machine learning

Embrace Randomness in Machine Learning

How Beginners Get It Wrong In Machine Learning

Common Pitfalls In Machine Learning Projects

5 Mistakes Programmers Make when Starting in Machine Learning

Model Prediction Accuracy Versus Interpretation in Machine Learning

Accuracy and Explainability

Model Complexity

Accuracy Trumps Explainability

Summary

相關推薦