PREDICTIVE ANALYTICS FOR USED CAR PRICING USING R AND REGRESSION METHODS
DOI:
https://doi.org/10.53555/ephijer.v9i1.155Keywords:
Automobile Price Prediction, Multiple Linear Regression (MLR), Predictive modelling, Ridge regularizationAbstract
This study investigates the pricing dynamics of used cars using Multiple Linear Regression (MLR) and Ridge Regression techniques in R. The MLR model achieved a strong explanatory performance (R² = 0.8961), identifying key variables such as brand category, car age, and geographic region as significant predictors of price. However, issues like multicollinearity and overfitting limited its robustness. To address these challenges, Ridge Regression was employed, incorporating regularization to stabilize coefficient estimates and enhance predictive accuracy. Optimal lambda selection through cross-validation further improved model generalizability. The Ridge model confirmed key market trends, including depreciation with car age and premium valuation of electric and luxury vehicles. This dual-model approach not only demonstrates the comparative strengths of regression techniques but also provides actionable insights for stakeholders, offering a data-driven foundation for pricing strategy and policy development in the used car market.
References
Chen, X., & Lin, Y. (2023). Hybrid predictive models for used car pricing: A synthesis of regression and machine learning techniques. Journal of Data Science, 21(3), 456–472.
Das, S., Roy, T., & Mandal, S. (2020). Predictive modeling of car resale values using regression techniques. International Journal of Data Science and Analytics, 8(2), 123–137.
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer.
Hoerl, A. E., & Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12(1), 55–67.
Iftikhar, M., Nazir, T., & Saeed, A. (2016). Factors influencing used car prices: An empirical investigation. Journal of Applied Economics, 9(1), 45–62.
Kundu, A., Sengupta, P., & Basu, R. (2019). A comparative study of regression techniques for predicting used car prices. Machine Learning and Applications, 5(2), 89–104.
Li, H., Zhang, Q., & Wu, Y. (2021). Regression-based dynamic pricing strategies in the used car market. Economic Modeling Review, 27(4), 321–339.
Park, S., Lee, J., & Kim, H. (2022). Enhancing predictive accuracy in used car markets: A case for Ridge Regression. Journal of Business Analytics, 10(3), 211–227.
Samad, Z., & Rahman, F. (2020). The application of Ridge Regression in used car price prediction. Journal of Statistical Methods, 12(4), 67–78.
Yu, C., & Deng, X. (2011). Predicting car resale values using linear regression models. Automotive Market Insights, 3(1), 14–25.