Optimasi Software Effort Estimation Menggunakan Random Forest

Maria Rosario Borroek; Jasmir Jasmir; Fachruddin Fachruddin; Marrylinteri Istoningtyas; Yosefina Venus

doi:10.61132/prosemnasproit.v2i2.156

Authors

Maria Rosario Borroek Universitas Dinamika Bangsa
Jasmir Jasmir Universitas Dinamika Bangsa
Fachruddin Fachruddin Universitas Dinamika Bangsa
Marrylinteri Istoningtyas Universitas Dinamika Bangsa
Yosefina Venus Universitas Dinamika Bangsa

DOI:

https://doi.org/10.61132/prosemnasproit.v2i2.156

Keywords:

effort, estimasi perangkat lunak, Random Forest, Dataset Cina, dataset Derhanais

Abstract

Software development effort estimation is crucial as it is one of the key factors for successful software development. This research employs Random Forest to estimate software development effort. To achieve better results, the study combines the Random Forest method with Genetic Algorithm. The results show that the China dataset provides more accurate estimation compared to the Desharnais dataset, because the China dataset uses relevant feature selection for estimation.

References

A G, P. V., K, A. K., & Varadarajan, V. (2021). Estimating Software Development Efforts Using a Random Forest-Based Stacked Ensemble Approach. Electronics, 10(10), 1195. https://doi.org/10.3390/electronics10101195

Abdelali, Z., Mustapha, H., & Abdelwahed, N. (2019). Investigating the use of random forest in software effort estimation. Procedia Computer Science, 148, 343–352. https://doi.org/10.1016/j.procs.2019.01.042

Abdu, A., Zhai, Z., Abdo, H. A., Lee, S., Al-masni, M. A., Gu, Y. H., & Algabri, R. (2025). Cross-project software defect prediction based on the reduction and hybridization of software metrics. Alexandria Engineering Journal, 112, 161–176. https://doi.org/10.1016/j.aej.2024.10.034

Ali, S. S., Ren, J., Zhang, K., Wu, J., & Liu, C. (2023). Heterogeneous Ensemble Model to Optimize Software Effort Estimation Accuracy. IEEE Access, 11, 27759–27792. https://doi.org/10.1109/ACCESS.2023.3256533

Chahar, V., & Bhatia, P. K. (2022). Performance Analysis of Software Test Effort Estimation using Genetic Algorithm and Neural Network. International Journal of Advanced Computer Science and Applications, 13(10). https://doi.org/10.14569/IJACSA.2022.0131045

Dai, H., Xi, J., & Dai, H.-L. (2025). Improve cross-project just-in-time defect prediction with dynamic transfer learning. Journal of Systems and Software, 219, 112214. https://doi.org/10.1016/j.jss.2024.112214

Denard, S., Ertas, A., Mengel, S., & Ekwaro-Osire, S. (2020). Development Cycle Modeling: Resource Estimation. Applied Sciences, 10(14), 5013. https://doi.org/10.3390/app10145013

Fernandez-Diego, M., Mendez, E. R., Gonzalez-Ladron-De-Guevara, F., Abrahao, S., & Insfran, E. (2020). An Update on Effort Estimation in Agile Software Development: A Systematic Literature Review. IEEE Access, 8, 166768–166800. https://doi.org/10.1109/ACCESS.2020.3021664

Hameed, S., Elsheikh, Y., & Azzeh, M. (2023). An optimized case-based software project effort estimation using genetic algorithm. Information and Software Technology, 153, 107088. https://doi.org/10.1016/j.infsof.2022.107088

Jadhav, A., Kaur, M., & Akter, F. (2022). Evolution of Software Development Effort and Cost Estimation Techniques: Five Decades Study Using Automated Text Mining Approach. Mathematical Problems in Engineering, 2022, 1–17. https://doi.org/10.1155/2022/5782587

Karimi, A., & Gandomani, T. J. (2021). Software development effort estimation modeling using a combination of fuzzy-neural network and differential evolution algorithm. International Journal of Electrical and Computer Engineering (IJECE), 11(1), 707. https://doi.org/10.11591/ijece.v11i1.pp707-715

Khan, M. S., Jabeen, F., Ghouzali, S., Rehman, Z., Naz, S., & Abdul, W. (2021). Metaheuristic Algorithms in Optimizing Deep Neural Network Model for Software Effort Estimation. IEEE Access, 9, 60309–60327. https://doi.org/10.1109/ACCESS.2021.3072380

Liu, H., Li, M., Cheng, J. C. P., Anumba, C. J., & Xia, L. (2025). Actual construction cost prediction using hypergraph deep learning techniques. Advanced Engineering Informatics, 65, 103187. https://doi.org/10.1016/j.aei.2025.103187

Liu, Y., Meng, Q., Chen, K., & Shen, Z. (2025). ALB-TP: Adaptive Load Balancing based on Traffic Prediction using GRU-Attention for Software-Defined DCNs. Journal of Network and Computer Applications, 236, 104103. https://doi.org/10.1016/j.jnca.2024.104103

Mahmood, Y., Kama, N., & Azmi, A. (2020). A systematic review of studies on use case points and expert‐based estimation of software development effort. Journal of Software: Evolution and Process, 32(7), e2245. https://doi.org/10.1002/smr.2245

Mahmood, Y., Kama, N., Azmi, A., Khan, A. S., & Ali, M. (2022). Software effort estimation accuracy prediction of machine learning techniques: A systematic performance evaluation. Software: Practice and Experience, 52(1), 39–65. https://doi.org/10.1002/spe.3009

Malhotra, R., & Singh, P. (2025). DHG-BiGRU: Dual-attention based hierarchical gated BiGRU for software defect prediction. Information and Software Technology, 179, 107646. https://doi.org/10.1016/j.infsof.2024.107646

Nashaat, M., & Miller, J. (2025). Refining software defect prediction through attentive neural models for code understanding. Journal of Systems and Software, 220, 112266. https://doi.org/10.1016/j.jss.2024.112266

Nassif, A. B., Azzeh, M., Idri, A., & Abran, A. (2019). Software Development Effort Estimation Using Regression Fuzzy Models. Computational Intelligence and Neuroscience, 2019, 1–17. https://doi.org/10.1155/2019/8367214

Nassif, A. B., Ho, D., & Capretz, L. F. (2013). Towards an early software estimation using log-linear regression and a multilayer perceptron model. Journal of Systems and Software, 86(1), 144–160. https://doi.org/10.1016/j.jss.2012.07.050

Nguyen, X. D. J., & Liu, Y. A. (2025). Methodology for hyperparameter tuning of deep neural networks for efficient and accurate molecular property prediction. Computers & Chemical Engineering, 193, 108928. https://doi.org/10.1016/j.compchemeng.2024.108928

Nhung, H. L. T. K., Van Hai, V., Silhavy, R., Prokopova, Z., & Silhavy, P. (2022). Parametric Software Effort Estimation Based on Optimizing Correction Factors and Multiple Linear Regression. IEEE Access, 10, 2963–2986. https://doi.org/10.1109/ACCESS.2021.3139183

Park, B. K., & Kim, R. Y. C. (2020). Effort Estimation Approach through Extracting Use Cases via Informal Requirement Specifications. Applied Sciences, 10(9), 3044. https://doi.org/10.3390/app10093044

Parmar, A., Katariya, R., & Patel, V. (2019). A Review on Random Forest: An Ensemble Classifier. Lecture Notes on Data Engineering and Communications Technologies, 26, 758–763. https://doi.org/10.1007/978-3-030-03146-6_86

PDF. (n.d.).

Phannachitta, P. (2020). On an optimal analogy-based software effort estimation. Information and Software Technology, 125, 106330. https://doi.org/10.1016/j.infsof.2020.106330

Probst, P., Wright, M. N., & Boulesteix, A. L. (2019). Hyperparameters and tuning strategies for random forest. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 9(3), 1–15. https://doi.org/10.1002/widm.1301

Rankovic, N., Rankovic, D., Ivanovic, M., & Lazic, L. (2021). A New Approach to Software Effort Estimation Using Different Artificial Neural Network Architectures and Taguchi Orthogonal Arrays. IEEE Access, 9, 26926–26936. https://doi.org/10.1109/ACCESS.2021.3057807

Rashid, C. H., Shafi, I., Khattak, B. H. A., Safran, M., Alfarhood, S., & Ashraf, I. (2025). ANN-based software cost estimation with input from COCOMO: CANN model. Alexandria Engineering Journal, 113, 681–694. https://doi.org/10.1016/j.aej.2024.11.042

Rosa, W., & Jardine, S. (2025). A novel method to early agile effort estimation through functional initiatives. Journal of Systems and Software, 223, 112302. https://doi.org/10.1016/j.jss.2024.112302

Sakhrawi, Z., Sellami, A., & Bouassida, N. (2022). Software enhancement effort estimation using correlation-based feature selection and stacking ensemble method. Cluster Computing, 25(4), 2779–2792. https://doi.org/10.1007/s10586-021-03447-5

Saljoughinejad, R., & Khatibi, V. (n.d.). A New Optimized Hybrid Model Based on COCOMO to Increase the Accuracy of Software Cost Estimation.

Shah, M. A., Jawawi, D. N. A., Isa, M. A., Younas, M., Abdelmaboud, A., & Sholichin, F. (2020). Ensembling Artificial Bee Colony With Analogy-Based Estimation to Improve Software Development Effort Prediction. IEEE Access, 8, 58402–58415. https://doi.org/10.1109/ACCESS.2020.2980236

Sharma, A., & Chaudhary, N. (2023). Prediction of Software Effort by Using Non-Linear Power Regression for Heterogeneous Projects Based on Use case Points and Lines of code. Procedia Computer Science, 218, 1601–1611. https://doi.org/10.1016/j.procs.2023.01.138

Sharma, A., & Kushwaha, D. S. (2012). Estimation of Software Development Effort from Requirements Based Complexity. Procedia Technology, 4, 716–722. https://doi.org/10.1016/j.protcy.2012.05.116

Shi, J., Lian, Y., Salzmann, C., & Jones, C. N. (2025). Adaptive data-driven prediction in a building control hierarchy: A case study of demand response in Switzerland. Energy and Buildings, 333, 115498. https://doi.org/10.1016/j.enbuild.2025.115498

Singal, P., Kumari, A. C., & Sharma, P. (2020). Estimation of Software Development Effort: A Differential Evolution Approach. Procedia Computer Science, 167, 2643–2652. https://doi.org/10.1016/j.procs.2020.03.343

Suresh Kumar, P., Behera, H. S., K, A. K., Nayak, J., & Naik, B. (2020). Advancement from neural networks to deep learning in software effort estimation: Perspective of two decades. Computer Science Review, 38, 100288. https://doi.org/10.1016/j.cosrev.2020.100288

Tan, A. J. J., Chong, C. Y., & Aleti, A. (2024). REARRANGE: Effort estimation approach for software clustering-based remodularisation. Information and Software Technology, 176, 107567. https://doi.org/10.1016/j.infsof.2024.107567

Valero-Carreras, D., Alcaraz, J., & Landete, M. (2023). Comparing two SVM models through different metrics based on the confusion matrix. Computers and Operations Research, 152(April 2022), 106131. https://doi.org/10.1016/j.cor.2022.106131

Vanathi, D., Anusha, K., Ahilan, A., & Salinda Eveline Suniram, A. (2024). Software cost and effort estimation using dragonfly whale optimized multilayer perceptron neural network. Alexandria Engineering Journal, 103, 30–37. https://doi.org/10.1016/j.aej.2024.04.043

Vescan, A., & Barac-Antonescu, D. (2025). Software maintainability prediction based on change metric using neural network models. Engineering Applications of Artificial Intelligence, 144, 110032. https://doi.org/10.1016/j.engappai.2025.110032

Villalobos-Arias, L., Quesada-López, C., Guevara-Coto, J., Martínez, A., & Jenkins, M. (2020). Evaluating hyper-parameter tuning using random search in support vector machines for software effort estimation. Proceedings of the 16th ACM International Conference on Predictive Models and Data Analytics in Software Engineering, 31–40. https://doi.org/10.1145/3416508.3417121

Optimasi Software Effort Estimation Menggunakan Random Forest

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Menu new new