Performance of Prediction Interval Estimators based on Random Forest Models with Correlated Predictors
DOI:
https://doi.org/10.25077/jmua.14.4.320-332.2025Keywords:
Prediction interval, Random Forest, Quantile Regression Forest, Out of Bag Interval Prediction, Split Conformal Prediction, MulticollinearityAbstract
Uncertainty in prediction results is a crucial aspect that needs to be taken into account in regression modeling, especially when there is a high correlation between explanatory variables. This study aims to evaluate the performance of three prediction interval formation approaches, namely Out-of-Bag Prediction Interval (OOB-PI), Quan tile Regression Forest (QRF), and Split Conformal Prediction (SC), in Random Forest modeling. The evaluation was conducted through a simulation study with a variety of data structures, including the level of correlation between variables, the shape of the mean function, and the type of error distribution. Further validation was conducted using data from the National Socio-Economic Survey (SUSENAS) of West Java Province in 2023. The results show that increasing the correlation between explanatory variables can improve the efficiency and accuracy of prediction interval estimation. Overall, OOB-PI showed the most balanced performance compared to the other two methods, with a prediction coverage rate close to 90% and a narrower interval width than QRF and SC. This finding indicates that OOB-PI is an adaptive and efficient approach for various data structures, including socioeconomic data with highly correlated predictors.
References
[1] James, G., Witten, D., Hastie, T., Tibshirani, R., 2021, An Introduction to Statistical Learning, Springer Texts in Statistics, Springer, New York.
[2] Gregorich, M., Strohmaier, S., Dunkler, D., Heinze, G., 2021, Regression with Highly Correlated Predictors: Variable Omission is Not the Solution, International Journal of Environmental Research and Public Health, Vol. 18(8): 1–15.
[3] Bickel, P., Diggle, P., Fienberg, S., Gather, U., Olkin, I., Zeger, S., 2017, Springer Series in Statistics, Springer, Berlin.
[4] Fife, D. A., D’Onofrio, J., 2023, Common, Uncommon, and Novel Applications of Random Forest in Psychological Research, Behavior Research Methods, Vol. 55(5): 2447–2466.
[5] Johnson, R. A., 2024, Quantile-Forest: A Python Package for Quantile Regression Forests, Journal of Open Source Software, Vol. 9(93): 5976.
[6] Oliveira, R. I., Orenstein, P., Ramos, T., Romano, J. V., 2024, Split Conformal Prediction and Non-Exchangeable Data, Proceedings of Machine Learning Research, Vol. 25: 1–15.
[7] Zhang, H., Zimmerman, J., Nettleton, D., Nordman, D. J., 2020, Random Forest Prediction Intervals, Statistical Modelling, Vol. 20(5): 481–500.
[8] Harum, N. S., Aini, M., Risxi, M. A., Kartiasih, F., 2023, Pengaruh Sosial Ekonomi dan Kesehatan terhadap Pengeluaran Konsumsi Pangan Rumah Tangga Provinsi Jawa Tengah Tahun 2020, Seminar Nasional Official Statistics, Vol. 1: 899–908.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Departemen Matematika dan Sains Data FMIPA UNAND

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All articles published in Jurnal Matematika UNAND (JMUA) are open access and licensed under the Creative Commons Attribution-ShareAlike (CC BY-SA) license. This ensures that the content is freely available to all users and can be shared and adapted, provided appropriate credit is given and any adaptations are distributed under the same license.
Copyright Holder
The copyright of all articles published in Jurnal Matematika UNAND is held by the Departemen Matematika dan Sains Data, Fakultas Matematika dan Ilmu Pengetahuan Alam (FMIPA), Universitas Andalas (UNAND). This applies to all published versions, including the HTML and PDF formats of the articles.
Author Rights
While the Departemen Matematika dan Sains Data FMIPA UNAND holds the copyright for all published content, authors retain important rights under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA). This license grants authors and users the following rights:
- Reuse: Authors can reuse and distribute their work for any lawful purpose, including sharing on personal websites, institutional repositories, or in subsequent publications.
- Attribution and Adaptation: Authors and others may remix, adapt, and build upon the published work for any purpose, even commercially, as long as proper credit is given to the original authors, and any derivative works are distributed under the same CC BY-SA license.
Creative Commons License (CC BY-SA)
Under the terms of the CC BY-SA license, users are free to:
- Share: Copy and redistribute the material in any medium or format.
- Adapt: Remix, transform, and build upon the material for any purpose, even commercially.
However, the following conditions apply:
- Attribution: Users must give appropriate credit to the original author(s) and Departemen Matematika dan Sains Data FMIPA UNAND, provide a link to the license, and indicate if changes were made. Attribution must not imply endorsement by the author or the journal.
- ShareAlike: If users remix, transform, or build upon the material, they must distribute their contributions under the same license as the original.
For more information about the CC BY-SA license, please visit the Creative Commons website.
Third-Party Content
If authors include third-party material (such as figures, tables, or images) that is not covered by a Creative Commons license, they must obtain the necessary permissions for reuse and provide proper attribution. Authors are required to ensure that any third-party content complies with open-access licensing requirements or includes permissions for redistribution under similar terms.
Copyright and Licensing Information Display
The copyright and licensing terms will be clearly displayed on each article's landing page, as well as within the full-text versions (HTML and PDF) of all published articles.
No "All Rights Reserved"
As an open-access journal, JMUA does not use "All Rights Reserved" policies. Instead, the CC BY-SA license ensures that the works remain accessible and reusable for a wide audience while still protecting both the authors' and the copyright holder's rights.
Â









