Mental Health Problems Prediction Using Machine Learning Techniques

Main Article Content

Jia-Pao Cheng
Su-Cheng Haw


Mental health problems encompass a range of conditions that can impact an individual's emotions and behaviors. The conventional methods of mental illness prediction often suffer from the issue of either over-detection or under-detection and the time-consuming manual review process of patients' data during screening sessions. Therefore, this research aims to utilize machine learning techniques to predict mental health problems, complementing the traditional clinical screening and diagnosis process. The proposed models in this project: Logistic Regression, K-Nearest Neighbors, and Random Forest leverage relevant factors from the dataset concerning mental health survey published by Open Source Mental Disorders in 2014 to predict mental health problems. Feature selection and hyperparameter fine-tuning are employed to identify the factors contributing to mental health problems and enhance the performance of the models. The evaluation of these models is measured using accuracy, recall, precision, F1 score, and AUROC. Experimental evaluation results indicated that the Random Forest model utilizing hyperparameters derived from the RandomizedSearchCV method outperforms during model selection using cross-validation. When predicting test set data, it exhibits a good generalization with an accuracy of 83.23%, recall of 89.87%, precision of 78.02%, F1 score of 83.53%, and AUROC of 83.57%.


(Manuscript received: 13 July 2023 | Accepted: 2nd August 2023 | Published: 30 September 2023)


Article Details

How to Cite
Cheng, J.-P., & Haw, S.-C. (2023). Mental Health Problems Prediction Using Machine Learning Techniques. International Journal on Robotics, Automation and Sciences, 5(2), 59–72.


Zhang, X., Ren, H., Gao, L., Shia, B.-C., Chen, M.-C., Ye, L., Wang, R., & Qin, L. (2023). Identifying the predictors of severe psychological distress by auto-machine learning methods. Informatics in Medicine Unlocked, 39, 101258.

World Health Organization. (2022, June 8). Mental disorders.

Minister of Health Malaysia. (2016, September 28). Mental Health Problems in Malaysia.

Henderson, C., Evans-Lacko, S., & Thornicroft, G. (2013). Mental illness stigma, help seeking, and public health programs. American Journal of Public Health, 103(5), 777–780.

Arora, A., Bojko, L., Kumar, S., Lillington, J., Panesar, S., & Petrungaro, B. (2023). Assessment of machine learning algorithms in national data to classify the risk of self-harm among young adults in hospital: A retrospective study. International Journal of Medical Informatics, 177, 105164.

Mitchell, A. J., Vaze, A., & Rao, S. (2009). Clinical diagnosis of depression in primary care: A meta-analysis. The Lancet, 374(9690), 609–619.

Colizzi, M., Lasalvia, A., & Ruggeri, M. (2020). Prevention and early intervention in youth mental health: Is it time for a multidisciplinary and trans-diagnostic model for care? International Journal of Mental Health Systems, 14(1).

Cho, G., Yim, J., Choi, Y., Ko, J., & Lee, S.-H. (2019). Review of machine learning algorithms for diagnosing mental illness. Psychiatry Investigation, 16(4), 262–269.

Katarya, R., Maan, S. (2020). Predicting mental health disorders using machine learning for employees in technical and non-technical companies. 2020 IEEE International Conference on Advances and Developments in Electrical and Electronics Engineering (ICADEE), 1–5.

Jain, T., Jain, A., Hada, P. S., Kumar, H., Verma, V. K., & Patni, A. (2021). Machine learning techniques for prediction of Mental Health. 2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA), 1606–1613.

Sujal, B. H., Neelima, K., Deepanjali, C., Bhuvanashree, P., Duraipandian, K., Rajan, S., & Sathiyanarayanan, M. (2022). Mental health analysis of employees using Machine Learning Techniques. 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS), 1–6.

Mutalib, S., Shafiee, N. S. M., & Rahman, S. A. (2021). Mental Health Prediction Models Using Machine Learning in Higher Education Institution. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(5), 1782-1792. 10.17762/turcomat.v12i5.2181.

Wshah, S., Skalka, C., & Price, M. (2019). Predicting posttraumatic stress disorder risk: A machine learning approach. JMIR Mental Health, 6(7).

Sumathi, M. R., & Poorna, B. (2016). Prediction of mental health problems among children using machine learning techniques. International Journal of Advanced Computer Science and Applications, 7(1).

Khattar, A., Jain, P. R., & Quadri, S. M. K. (2020) Effects of the Disastrous Pandemic COVID 19 on Learning Styles, Activities and Mental Health of Young Indian Students - A Machine Learning Approach, 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), 1190-1195, doi: 10.1109/ICICCS48265.2020.9120955.

Wang, X., Li, H., Sun, C., Zhang, X., Wang, T., Dong, C., & Guo, D. (2021). Prediction of mental health in medical workers during COVID-19 based on machine learning. Frontiers in Public Health, 9.

Hornstein, S., Hoffman, V. F., Nazander, A., Ranta, K., & Hilbert, K. (2021). Predicting therapy outcome in a digital mental health intervention for depression and anxiety: A machine learning approach. DIGITAL HEALTH, 7, 1–11.

Iorfino, F., Ho, N., Carpenter, J. S., Cross, S. P., Davenport, T. A., Hermens, D. F., Yee, H., Nichles, A., Zmicerevska, N., Guastella, A., Scott, E., & Hickie, I. B. (2020). Predicting self-harm within six months after initial presentation to youth mental health services: A machine learning study. PLOS ONE, 15(12).

Couronné, R., Probst, P., & Boulesteix, A.-L. (2018). Random Forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinformatics, 19(1), 270.

Urso, A., Fiannaca, A., La Rosa, M., Ravì, V., & Rizzo, R. (2019). Data Mining: Prediction Methods. Encyclopedia of Bioinformatics and Computational Biology, 1, 413–430.

Bansal, M., Goyal, A., & Choudhary, A. (2022). A comparative analysis of K-Nearest Neighbor, Genetic, Support Vector Machine, Decision Tree, and Long Short Term Memory algorithms in machine learning. Decision Analytics Journal, 3, 100071. cano

Amro, A., Al-Akhras, M., Hindi, K., Habib, M. & Shawar, B. (2021). Instance Reduction for Avoiding Overfitting in Decision Trees. Journal of Intelligent Systems, 30(1), 438-459.

G, S. G., & Sumathi, B. (2020). Grid search tuning of hyperparameters in random forest classifier for customer feedback sentiment prediction. International Journal of Advanced Computer Science and Applications, 11(9).

Open Sourcing Mental Illness, LTD. (2014). Mental Health in Tech Survey, Version 3. Retrieved February 13, 2023 from

Kourou, K. et al. (2015) ‘Machine learning applications in cancer prognosis and prediction’, Computational and Structural Biotechnology Journal, 13, pp. 8–17. doi:10.1016/j.csbj.2014.11.005.

Grant, J. E., & Chamberlain, S. R. (2020). Family history of substance use disorders: Significance for mental health in young adults who gamble. Journal of Behavioral Addictions, 9(2), 289–297.

Bergefurt, L., Weijs-Perrée, M., Appel-Meulenbroek, R., & Arentze, T. (2022). The physical office workplace as a resource for mental health – A systematic scoping review. Building and Environment, 207, 108505.

World Health Organization. (2023). Mental health in the Workplace.,to%20retain%20or%20gain%20work.

Prabadevi, B., Shalini, R., & Kavitha, B. R. (2023). Customer churning analysis using machine learning algorithms. International Journal of Intelligent Networks, 4, 145–154.