Mental Health Problems Prediction Using Machine Learning Techniques
Main Article Content
Abstract
Mental health problems encompass a range of conditions that can impact an individual's emotions and behaviors. The conventional methods of mental illness prediction often suffer from the issue of either over-detection or under-detection and the time-consuming manual review process of patients' data during screening sessions. Therefore, this research aims to utilize machine learning techniques to predict mental health problems, complementing the traditional clinical screening and diagnosis process. The proposed models in this project: Logistic Regression, K-Nearest Neighbors, and Random Forest leverage relevant factors from the dataset concerning mental health survey published by Open Source Mental Disorders in 2014 to predict mental health problems. Feature selection and hyperparameter fine-tuning are employed to identify the factors contributing to mental health problems and enhance the performance of the models. The evaluation of these models is measured using accuracy, recall, precision, F1 score, and AUROC. Experimental evaluation results indicated that the Random Forest model utilizing hyperparameters derived from the RandomizedSearchCV method outperforms during model selection using cross-validation. When predicting test set data, it exhibits a good generalization with an accuracy of 83.23%, recall of 89.87%, precision of 78.02%, F1 score of 83.53%, and AUROC of 83.57%.
Manuscript received: 13 May 2023 | Revised: 2 August 2023 | Accepted: 17 August 2023 |Published: 30 September 2023
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
X. Zhang, H. Ren, L. Gao, B.-C. Shia, M.-C. Chen, L. Ye, R. Wang, and L. Qin, “Identifying the predictors of severe psychological distress by auto-machine learning methods,” Informatics in Medicine Unlocked, vol. 39, p. 101258, 2023.
DOI: https://doi.org/10.1016/j.imu.2023.101258.
World Health Organization, “Mental disorders,” Jun. 8, 2022
URL:https://www.who.int/news-room/fact-sheets/detail/mental-disorders (Accessed: 10 July, 2023)
Minister of Health Malaysia, “Mental Health Problems in Malaysia,” Sept. 28, 2016.
URL:https://www.moh.gov.my/moh/modules_resources/english/database_stores/96/337_451.pdf (Accessed: 10 July 2023)
C. Henderson, S. Evans-Lacko, and G. Thornicroft, “Mental illness stigma, help seeking, and public health programs,” American Journal of Public Health, vol. 103, no. 5, pp. 777–780, 2013
DOI: https://doi.org/10.2105/ajph.2012.301056.
A. Arora, L. Bojko, S. Kumar, J. Lillington, S. Panesar, and B. Petrungaro, “Assessment of machine learning algorithms in national data to classify the risk of self-harm among young adults in hospital: A retrospective study,” International Journal of Medical Informatics, vol. 177, p. 105164, 2023
DOI: https://doi.org/10.1016/j.ijmedinf.2023.105164.
A. J. Mitchell, A. Vaze, and S. Rao, “Clinical diagnosis of depression in primary care: A meta-analysis,” The Lancet, vol. 374, no. 9690, pp. 609–619, 2009
DOI: https://doi.org/10.1016/s0140-6736(09)60879-5.
M. Colizzi, A. Lasalvia, and M. Ruggeri, “Prevention and early intervention in youth mental health: Is it time for a multidisciplinary and trans-diagnostic model for care?,” International Journal of Mental Health Systems, vol. 14, no. 1, 2020,
DOI: https://doi.org/10.1186/s13033-020-00356-9.
G. Cho, J. Yim, Y. Choi, J. Ko, and S.-H. Lee, “Review of machine learning algorithms for diagnosing mental illness,” Psychiatry Investigation, vol. 16, no. 4, pp. 262–269, 2019
DOI: https://doi.org/10.30773/pi.2018.12.21.2.
R. Katarya and S. Maan, “Predicting mental health disorders using machine learning for employees in technical and non-technical companies,” in 2020 IEEE International Conference on Advances and Developments in Electrical and Electronics Engineering (ICADEE), 2020, pp. 1–5
DOI: https://doi.org/10.1109/icadee51157.2020.9368923.
T. Jain, A. Jain, P. S. Hada, H. Kumar, V. K. Verma, and A. Patni, “Machine learning techniques for prediction of mental health,” in 2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA), 2021, pp. 1606–1613.
DOI: https://doi.org/10.1109/icirca51532.2021.9545061.
B. H. Sujal, K. Neelima, C. Deepanjali, P. Bhuvanashree, K. Duraipandian, S. Rajan, and M. Sathiyanarayanan, “Mental health analysis of employees using machine learning techniques,” in 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS), 2022, pp. 1–6.
DOI: https://doi.org/10.1109/comsnets53615.2022.9668526.
S. Mutalib, N. S. M. Shafiee, and S. A. Rahman, “Mental health prediction models using machine learning in higher education institution,” Turkish Journal of Computer and Mathematics Education (TURCOMAT), vol. 12, no. 5, pp. 1782–1792, 2021, DOI: https://doi.org/10.17762/turcomat.v12i5.2181.
S. Wshah, C. Skalka, and M. Price, “Predicting posttraumatic stress disorder risk: A machine learning approach,” JMIR Mental Health, vol. 6, no. 7, 2019
DOI: https://doi.org/10.2196/13946 .
M. R. Sumathi and B. Poorna, “Prediction of mental health problems among children using machine learning techniques,” International Journal of Advanced Computer Science and Applications, vol. 7, no. 1, 2016
DOI: https://doi.org/10.14569/ijacsa.2016.070176.
A. Khattar, P. R. Jain, and S. M. K. Quadri, “Effects of the disastrous pandemic COVID 19 on learning styles, activities and mental health of young Indian students – A machine learning approach,” in 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), 2020, pp. 1190–1195
DOI: https://doi.org/10.1109/ICICCS48265.2020.9120955.
X. Wang, H. Li, C. Sun, X. Zhang, T. Wang, C. Dong, and D. Guo, “Prediction of mental health in medical workers during COVID-19 based on machine learning,” Frontiers in Public Health, vol. 9, 2021.
DOI: https://doi.org/10.3389/fpubh.2021.697850.
S. Hornstein, V. F. Hoffman, A. Nazander, K. Ranta, and K. Hilbert, “Predicting therapy outcome in a digital mental health intervention for depression and anxiety: A machine learning approach,” DIGITAL HEALTH, vol. 7, pp. 1–11, 2021.
DOI: https://doi.org/10.1177/20552076211060659.
F. Iorfino, N. Ho, J. S. Carpenter, S. P. Cross, T. A. Davenport, D. F. Hermens, H. Yee, A. Nichles, N. Zmicerevska, A. Guastella, E. Scott, and I. B. Hickie, “Predicting self-harm within six months after initial presentation to youth mental health services: A machine learning study,” PLOS ONE, vol. 15, no. 12, 2020.
DOI: https://doi.org/10.1371/journal.pone.0243467.
R. Couronné, P. Probst, and A.-L. Boulesteix, “Random Forest versus logistic regression: A large-scale benchmark experiment,” BMC Bioinformatics, vol. 19, no. 1, p. 270, 2018.
DOI: https://doi.org/10.1186/s12859-018-2264-5.
A. Urso, A. Fiannaca, M. La Rosa, V. Ravì, and R. Rizzo, “Data mining: Prediction methods,” Encyclopedia of Bioinformatics and Computational Biology, vol. 1, pp. 413–430, 2019.
DOI: https://doi.org/10.1016/b978-0-12-809633-8.20462-7.
M. Bansal, A. Goyal, and A. Choudhary, “A comparative analysis of K-Nearest Neighbor, Genetic, Support Vector Machine, Decision Tree, and Long Short Term Memory algorithms in machine learning,” Decision Analytics Journal, vol. 3, p. 100071, 2022
DOI: https://doi.org/10.1016/j.dajour.2022.100071.
A. Amro, M. Al-Akhras, K. Hindi, M. Habib, and B. Shawar, “Instance reduction for avoiding overfitting in decision trees,” Journal of Intelligent Systems, vol. 30, no. 1, pp. 438–459, 2021, DOI: https://doi.org/10.1515/jisys-2020-0061.
S. G. G. and B. Sumathi, “Grid search tuning of hyperparameters in random forest classifier for customer feedback sentiment prediction,” International Journal of Advanced Computer Science and Applications, vol. 11, no. 9, 2020.
DOI: https://doi.org/10.14569/ijacsa.2020.0110920.
Open Sourcing Mental Illness, LTD, “Mental Health in Tech Survey, Version 3,” 2014.
URL:https://www.kaggle.com/datasets/osmi/mental-health-in-tech-survey (Accessed 13 July, 2023).
K. Kourou, et al., “Machine learning applications in cancer prognosis and prediction,” Computational and Structural Biotechnology Journal, vol. 13, pp. 8–17, 2015
DOI: https://doi.org/10.1016/j.csbj.2014.11.005.
J. E. Grant and S. R. Chamberlain, “Family history of substance use disorders: Significance for mental health in young adults who gamble,” Journal of Behavioral Addictions, vol. 9, no. 2, pp. 289–297, 2020.
DOI: https://doi.org/10.1556/2006.2020.00017.
L. Bergefurt, M. Weijs-Perrée, R. Appel-Meulenbroek, and T. Arentze, “The physical office workplace as a resource for mental health – A systematic scoping review,” Building and Environment, vol. 207, p. 108505, 2022.
DOI: https://doi.org/10.1016/j.buildenv.2021.108505.
World Health Organization, “Mental health in the Workplace,” 2023.
URL:https://www.who.int/teams/mental-health-and-substance-use/promotion-prevention/mental-health-in-the-workplace#:~:text=Without%20effective%20support%2C%20mental%20disorders,to%20retain%20or%20gain%20work. (Accessed: 10 july, 2023)
B. Prabadevi, R. Shalini, and B. R. Kavitha, “Customer churning analysis using machine learning algorithms,” International Journal of Intelligent Networks, vol. 4, pp. 145–154, 2023.