[Submitted on 1 Sep 2024]
From Predictive Importance to Causality: Which Machine Learning Model Reflects Reality?
By Muhammad Arbab Arshad and 2 other authors
Abstract: This study analyzes the Ames Housing Dataset using CatBoost and LightGBM models to explore feature importance and causal relationships in housing price prediction. The models achieve high accuracy in price forecasting, and we examine the correlation between SHAP values and EconML predictions. Our analysis reveals a moderate Spearman rank correlation of 0.48 between SHAP-based feature importance and causally significant features, highlighting the complexity of aligning predictive modeling with causal understanding in housing market analysis. Through extensive causal analysis, including heterogeneity exploration and policy tree interpretation, we provide insights into how specific features like porches impact housing prices across various scenarios. This work underscores the need for integrated approaches that combine predictive power with causal insights in real estate valuation, offering valuable guidance for stakeholders in the industry.
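The Spearman rank correlation cited in the abstract compares how two scoring schemes order the same set of features. The following is a minimal standard-library sketch of that comparison, not the authors' pipeline: the feature names and both sets of scores are hypothetical placeholders, standing in for mean absolute SHAP values and absolute estimated causal effects.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical scores for illustration only; these are not results from the paper.
features = ["living_area", "overall_quality", "porch_area", "garage_size", "year_built"]
shap_importance = [0.42, 0.35, 0.05, 0.11, 0.07]  # assumed mean |SHAP| per feature
causal_effect = [0.30, 0.10, 0.25, 0.08, 0.02]    # assumed |estimated effect| per feature

rho = spearman(shap_importance, causal_effect)
print(f"Spearman rank correlation: {rho:.2f}")
```

A value near 1 would mean the two methods rank features almost identically, while a moderate value such as the reported 0.48 indicates that features important for prediction are only partly the ones with large estimated causal effects.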
Submission history
From: Muhammad Arbab Arshad
[v1]
Sun, 1 Sep 2024 22:37:47 UTC (5,517 KB)