Optimasi Software Effort Estimation Menggunakan Random Forest
DOI:
https://doi.org/10.61132/prosemnasproit.v2i2.156Keywords:
effort, estimasi perangkat lunak, Random Forest, Dataset Cina, dataset DerhanaisAbstract
Software development effort estimation is crucial as it is one of the key factors for successful software development. This research employs Random Forest to estimate software development effort. To achieve better results, the study combines the Random Forest method with Genetic Algorithm. The results show that the China dataset provides more accurate estimation compared to the Desharnais dataset, because the China dataset uses relevant feature selection for estimation.
References
A G, P. V., K, A. K., & Varadarajan, V. (2021). Estimating Software Development Efforts Using a Random Forest-Based Stacked Ensemble Approach. Electronics, 10(10), 1195. https://doi.org/10.3390/electronics10101195
Abdelali, Z., Mustapha, H., & Abdelwahed, N. (2019). Investigating the use of random forest in software effort estimation. Procedia Computer Science, 148, 343–352. https://doi.org/10.1016/j.procs.2019.01.042
Abdu, A., Zhai, Z., Abdo, H. A., Lee, S., Al-masni, M. A., Gu, Y. H., & Algabri, R. (2025). Cross-project software defect prediction based on the reduction and hybridization of software metrics. Alexandria Engineering Journal, 112, 161–176. https://doi.org/10.1016/j.aej.2024.10.034
Ali, S. S., Ren, J., Zhang, K., Wu, J., & Liu, C. (2023). Heterogeneous Ensemble Model to Optimize Software Effort Estimation Accuracy. IEEE Access, 11, 27759–27792. https://doi.org/10.1109/ACCESS.2023.3256533
Chahar, V., & Bhatia, P. K. (2022). Performance Analysis of Software Test Effort Estimation using Genetic Algorithm and Neural Network. International Journal of Advanced Computer Science and Applications, 13(10). https://doi.org/10.14569/IJACSA.2022.0131045
Dai, H., Xi, J., & Dai, H.-L. (2025). Improve cross-project just-in-time defect prediction with dynamic transfer learning. Journal of Systems and Software, 219, 112214. https://doi.org/10.1016/j.jss.2024.112214
Denard, S., Ertas, A., Mengel, S., & Ekwaro-Osire, S. (2020). Development Cycle Modeling: Resource Estimation. Applied Sciences, 10(14), 5013. https://doi.org/10.3390/app10145013
Fernandez-Diego, M., Mendez, E. R., Gonzalez-Ladron-De-Guevara, F., Abrahao, S., & Insfran, E. (2020). An Update on Effort Estimation in Agile Software Development: A Systematic Literature Review. IEEE Access, 8, 166768–166800. https://doi.org/10.1109/ACCESS.2020.3021664
Hameed, S., Elsheikh, Y., & Azzeh, M. (2023). An optimized case-based software project effort estimation using genetic algorithm. Information and Software Technology, 153, 107088. https://doi.org/10.1016/j.infsof.2022.107088
Jadhav, A., Kaur, M., & Akter, F. (2022). Evolution of Software Development Effort and Cost Estimation Techniques: Five Decades Study Using Automated Text Mining Approach. Mathematical Problems in Engineering, 2022, 1–17. https://doi.org/10.1155/2022/5782587
Karimi, A., & Gandomani, T. J. (2021). Software development effort estimation modeling using a combination of fuzzy-neural network and differential evolution algorithm. International Journal of Electrical and Computer Engineering (IJECE), 11(1), 707. https://doi.org/10.11591/ijece.v11i1.pp707-715
Khan, M. S., Jabeen, F., Ghouzali, S., Rehman, Z., Naz, S., & Abdul, W. (2021). Metaheuristic Algorithms in Optimizing Deep Neural Network Model for Software Effort Estimation. IEEE Access, 9, 60309–60327. https://doi.org/10.1109/ACCESS.2021.3072380
Liu, H., Li, M., Cheng, J. C. P., Anumba, C. J., & Xia, L. (2025). Actual construction cost prediction using hypergraph deep learning techniques. Advanced Engineering Informatics, 65, 103187. https://doi.org/10.1016/j.aei.2025.103187
Liu, Y., Meng, Q., Chen, K., & Shen, Z. (2025). ALB-TP: Adaptive Load Balancing based on Traffic Prediction using GRU-Attention for Software-Defined DCNs. Journal of Network and Computer Applications, 236, 104103. https://doi.org/10.1016/j.jnca.2024.104103
Mahmood, Y., Kama, N., & Azmi, A. (2020). A systematic review of studies on use case points and expert‐based estimation of software development effort. Journal of Software: Evolution and Process, 32(7), e2245. https://doi.org/10.1002/smr.2245
Mahmood, Y., Kama, N., Azmi, A., Khan, A. S., & Ali, M. (2022). Software effort estimation accuracy prediction of machine learning techniques: A systematic performance evaluation. Software: Practice and Experience, 52(1), 39–65. https://doi.org/10.1002/spe.3009
Malhotra, R., & Singh, P. (2025). DHG-BiGRU: Dual-attention based hierarchical gated BiGRU for software defect prediction. Information and Software Technology, 179, 107646. https://doi.org/10.1016/j.infsof.2024.107646
Nashaat, M., & Miller, J. (2025). Refining software defect prediction through attentive neural models for code understanding. Journal of Systems and Software, 220, 112266. https://doi.org/10.1016/j.jss.2024.112266
Nassif, A. B., Azzeh, M., Idri, A., & Abran, A. (2019). Software Development Effort Estimation Using Regression Fuzzy Models. Computational Intelligence and Neuroscience, 2019, 1–17. https://doi.org/10.1155/2019/8367214
Nassif, A. B., Ho, D., & Capretz, L. F. (2013). Towards an early software estimation using log-linear regression and a multilayer perceptron model. Journal of Systems and Software, 86(1), 144–160. https://doi.org/10.1016/j.jss.2012.07.050
Nguyen, X. D. J., & Liu, Y. A. (2025). Methodology for hyperparameter tuning of deep neural networks for efficient and accurate molecular property prediction. Computers & Chemical Engineering, 193, 108928. https://doi.org/10.1016/j.compchemeng.2024.108928
Nhung, H. L. T. K., Van Hai, V., Silhavy, R., Prokopova, Z., & Silhavy, P. (2022). Parametric Software Effort Estimation Based on Optimizing Correction Factors and Multiple Linear Regression. IEEE Access, 10, 2963–2986. https://doi.org/10.1109/ACCESS.2021.3139183
Park, B. K., & Kim, R. Y. C. (2020). Effort Estimation Approach through Extracting Use Cases via Informal Requirement Specifications. Applied Sciences, 10(9), 3044. https://doi.org/10.3390/app10093044
Parmar, A., Katariya, R., & Patel, V. (2019). A Review on Random Forest: An Ensemble Classifier. Lecture Notes on Data Engineering and Communications Technologies, 26, 758–763. https://doi.org/10.1007/978-3-030-03146-6_86
PDF. (n.d.).
Phannachitta, P. (2020). On an optimal analogy-based software effort estimation. Information and Software Technology, 125, 106330. https://doi.org/10.1016/j.infsof.2020.106330
Probst, P., Wright, M. N., & Boulesteix, A. L. (2019). Hyperparameters and tuning strategies for random forest. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 9(3), 1–15. https://doi.org/10.1002/widm.1301
Rankovic, N., Rankovic, D., Ivanovic, M., & Lazic, L. (2021). A New Approach to Software Effort Estimation Using Different Artificial Neural Network Architectures and Taguchi Orthogonal Arrays. IEEE Access, 9, 26926–26936. https://doi.org/10.1109/ACCESS.2021.3057807
Rashid, C. H., Shafi, I., Khattak, B. H. A., Safran, M., Alfarhood, S., & Ashraf, I. (2025). ANN-based software cost estimation with input from COCOMO: CANN model. Alexandria Engineering Journal, 113, 681–694. https://doi.org/10.1016/j.aej.2024.11.042
Rosa, W., & Jardine, S. (2025). A novel method to early agile effort estimation through functional initiatives. Journal of Systems and Software, 223, 112302. https://doi.org/10.1016/j.jss.2024.112302
Sakhrawi, Z., Sellami, A., & Bouassida, N. (2022). Software enhancement effort estimation using correlation-based feature selection and stacking ensemble method. Cluster Computing, 25(4), 2779–2792. https://doi.org/10.1007/s10586-021-03447-5
Saljoughinejad, R., & Khatibi, V. (n.d.). A New Optimized Hybrid Model Based on COCOMO to Increase the Accuracy of Software Cost Estimation.
Shah, M. A., Jawawi, D. N. A., Isa, M. A., Younas, M., Abdelmaboud, A., & Sholichin, F. (2020). Ensembling Artificial Bee Colony With Analogy-Based Estimation to Improve Software Development Effort Prediction. IEEE Access, 8, 58402–58415. https://doi.org/10.1109/ACCESS.2020.2980236
Sharma, A., & Chaudhary, N. (2023). Prediction of Software Effort by Using Non-Linear Power Regression for Heterogeneous Projects Based on Use case Points and Lines of code. Procedia Computer Science, 218, 1601–1611. https://doi.org/10.1016/j.procs.2023.01.138
Sharma, A., & Kushwaha, D. S. (2012). Estimation of Software Development Effort from Requirements Based Complexity. Procedia Technology, 4, 716–722. https://doi.org/10.1016/j.protcy.2012.05.116
Shi, J., Lian, Y., Salzmann, C., & Jones, C. N. (2025). Adaptive data-driven prediction in a building control hierarchy: A case study of demand response in Switzerland. Energy and Buildings, 333, 115498. https://doi.org/10.1016/j.enbuild.2025.115498
Singal, P., Kumari, A. C., & Sharma, P. (2020). Estimation of Software Development Effort: A Differential Evolution Approach. Procedia Computer Science, 167, 2643–2652. https://doi.org/10.1016/j.procs.2020.03.343
Suresh Kumar, P., Behera, H. S., K, A. K., Nayak, J., & Naik, B. (2020). Advancement from neural networks to deep learning in software effort estimation: Perspective of two decades. Computer Science Review, 38, 100288. https://doi.org/10.1016/j.cosrev.2020.100288
Tan, A. J. J., Chong, C. Y., & Aleti, A. (2024). REARRANGE: Effort estimation approach for software clustering-based remodularisation. Information and Software Technology, 176, 107567. https://doi.org/10.1016/j.infsof.2024.107567
Valero-Carreras, D., Alcaraz, J., & Landete, M. (2023). Comparing two SVM models through different metrics based on the confusion matrix. Computers and Operations Research, 152(April 2022), 106131. https://doi.org/10.1016/j.cor.2022.106131
Vanathi, D., Anusha, K., Ahilan, A., & Salinda Eveline Suniram, A. (2024). Software cost and effort estimation using dragonfly whale optimized multilayer perceptron neural network. Alexandria Engineering Journal, 103, 30–37. https://doi.org/10.1016/j.aej.2024.04.043
Vescan, A., & Barac-Antonescu, D. (2025). Software maintainability prediction based on change metric using neural network models. Engineering Applications of Artificial Intelligence, 144, 110032. https://doi.org/10.1016/j.engappai.2025.110032
Villalobos-Arias, L., Quesada-López, C., Guevara-Coto, J., Martínez, A., & Jenkins, M. (2020). Evaluating hyper-parameter tuning using random search in support vector machines for software effort estimation. Proceedings of the 16th ACM International Conference on Predictive Models and Data Analytics in Software Engineering, 31–40. https://doi.org/10.1145/3416508.3417121
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Prosiding Seminar Nasional Ilmu Teknik

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.





