Mental Health Analysis

Machine learning
Logistic Regression
Random Forest Regressor
Python
Scikit-learn
Matplotlib
Pandas

This project delves into the critical issue of mental health, leveraging two distinct datasets to gain insights into its various facets.

Dataset 1: Health, Lifestyle, and Socio-Economic Factors
This dataset explores the correlation between mental health and factors such as age, education, employment, marital status, income, and lifestyle choices. Through visualizations like bar charts and heatmaps, we uncover trends such as: - Higher prevalence of depression among individuals with lower income and education levels. - The impact of lifestyle choices like smoking, alcohol consumption, diet, sleep, and physical activity on mental well-being.

Dataset 2: Country-wise Mental Health Prevalence
This dataset provides a global perspective on mental health, analyzing the prevalence of various mental disorders across different countries and years. Time series plots reveal trends in the prevalence of disorders like schizophrenia, bipolar disorder, eating disorders, anxiety, and depression. Additionally, we identify countries with the highest and lowest overall mental disorder prevalence.

Machine Learning for Prediction
To further our understanding, we employ machine learning algorithms to predict mental health outcomes. Using techniques like logistic regression, decision trees, and random forests, we assess the predictive power of different features. Notably, a Random Forest Regressor model demonstrates promising results in predicting mental fitness based on country-level data.

Conclusion
This project sheds light on the multifaceted nature of mental health, highlighting the influence of individual and societal factors. By combining data analysis and machine learning, we aim to contribute to a better understanding of mental health and inform potential interventions.

Feature Description
  • Name: The full name of the individual
  • Age: The age of the individual in years
  • Marital Status: The marital status of the individual. Possible values include Single, Married, Divorced, and Widowed
  • Education Level: The highest level of education attained by the individual. Possible values include High School, Associate Degree, Bachelor's Degree, Master's Degree, and PhD
  • Number of Children: The number of children the individual has
  • Smoking Status: Indicates whether the individual is a smoker or not. Possible values are Smoker, Former and Non-smoker
  • Physical Activity Level: The level of physical activity undertaken by the individual. Possible values include Sedentary, Moderate, and Active
  • Employment Status: The employment status of the individual. Possible values include Employed and Unemployed
  • Income: The annual income of the individual in USD
  • Alcohol Consumption: The level of alcohol consumption. Possible values include Low, Moderate, and High
  • Dietary Habits: The dietary habits of the individual. Possible values include Healthy, Moderate, and Unhealthy
  • Sleep Patterns: The quality of sleep. Possible values include Good, Fair, and Poor
  • History of Mental Illness: Whether the individual has a history of mental illness. Possible values are Yes and No
  • History of Substance Abuse: Whether the individual has a history of substance abuse. Possible values are Yes and No
  • Family History of Depression: Indicates if there is a family history of depression. Possible values are Yes and No
  • Chronic Medical Conditions: Whether the individual has chronic medical conditions. Possible values are Yes and No

Files

Random Forest Regressor

Try it out
Click Run for Results

Linear Regression

Try it out
Click Run for Results
Most Used Packages
logo
Scikit-learn
logo
Pandas
logo
Numpy
logo
Matplotlib
Other Projects