通过机器学习方法研究青少年抑郁研究进展：165篇文献的方向梳理与核心结论

通过机器学习方法研究青少年抑郁

本研究领域展示了机器学习在青少年抑郁识别与预防中的全方位应用。整体趋势表现为从依赖量表和人口学的静态预测，转向基于多模态数据（文、音、像）和生理指标（EEG、MRI）的动态监测。研究不仅在技术深度上追求多模态融合与深度学习优化，更在应用广度上拓展至数字表型长期监测和自杀风险早期预警。同时，模型的可解释性、决策支持系统及算法公平性成为近期研究的核心热点，旨在构建科学、精准且具有临床温度的青少年心理健康干预体系。

共 164 篇文献，6 个研究方向

基于社交媒体与自然语言处理的文本挖掘研究

这类研究侧重于利用社交媒体（Twitter, Reddit, Weibo）及论坛的文本数据，运用NLP技术、情感分析、修辞结构理论和大语言模型（LLM）捕捉青少年的心理压力、语言模式及抑郁征兆。相关文献: Prof. Saba Anjum Patel et. al, 2024 等 20 篇文献

基于音频、视频与面部表情的多模态情感计算

探讨通过分析语音（韵律、发音）、面部表情、眼神和头部动作等多维行为特征进行抑郁检测。重点在于特征融合技术、注意力机制（Attention）以及自监督学习模型在非接触式评估中的应用。相关文献: Lijuan Liang et. al, 2024 等 44 篇文献

基于生理信号与神经影像的客观生物学诊断

利用脑电图（EEG）、核磁共振（MRI/fMRI/DTI）、皮肤电（EDA/GSR）和遗传标记物等客观生物物理数据。通过图卷积网络（GCN）和深度迁移学习寻找与青少年抑郁相关的神经机制和生物表型。相关文献: N. Firoz et. al, 2025 等 18 篇文献

基于数字表型组学的日常行为被动监测

利用智能手机传感器和可穿戴设备（GPS、计步器、心率）收集的数字表型数据，进行长期、连续的监测。研究重点在于如何通过行为模式的变化识别抑郁风险及预测复发，并解决长程数据缺失填充问题。相关文献: Momoka Takeshige et. al, 2024 等 14 篇文献

社会人口学风险因子与青少年群体预警建模

基于大规模调查数据、临床量表（PHQ-9）及社会环境因素（如霸凌、学业、家庭）。利用机器学习分析青少年特有的风险因子，旨在实现自杀风险的早期预警和精准干预。相关文献: Molly M. Jacobs et. al, 2025 等 40 篇文献

算法框架优化、临床决策支持与模型可解释性

关注算法本身的创新与落地应用，包括混合模型（Hybrid Models）、不确定性量化、公平性研究、以及使用SHAP/LIME增强临床决策的可信度。同时探讨如何匹配个性化治疗方案。相关文献: D. Muthu et. al, 2025 等 28 篇文献

总计165篇相关文献

Apply Machine Learning to Predict Risk for Adolescent Depression in a Cohort of Kenyan Adolescents

应用机器学习预测肯尼亚青少年抑郁症风险

Hyungrok Do, Keng Huang, Sabrina Cheng 等, 2025-Healthcare

Background: Adolescent depression is highly prevalent in low- and middle-income countries (LMICs). Identifying top key risk factors is necessary to inform effective prevention program design. Machine learning (ML) offers a powerful approach to analyze complex multidomain of data to identify the most relevant predictors and estimate risks for mental health problems. This paper applies ML to study risks for adolescent depression to enhance adolescent depression prevention efforts in LMICs. Methods: Six ML approaches (e.g., Explainable Boosting Machine, random forests, and XGBoost) were applied to study the risks of depression. Data were drawn from a digital health intervention study conducted in Kenya (year 2024–2025, n = 269). Multiple domains of childhood and adolescent adversity and stress experiences were used to predict adolescent depression (using PHQ9-A). Findings: We found that ML was a valuable approach in the early identification of adolescents at risk for depression. Among the six ML approaches applied, the random forest approach outperformed other ML approaches, especially when multiple domains of risks were included. We also found that childhood adversity or home adversity alone were not strong predictors for depression. Adding adolescent stress experiences and community school adversity experiences significantly improves the accuracy and predictability of depression. Using the top-15 and top-20 ranking factors, we achieved 74.8% and 75.1% accuracy in depression prediction, which was similar to the accuracy when all 49 adverse/stress factors were included in the predictive model (78.3%). Conclusions: Innovative ML and modern predictive modeling approaches have the potential to transform modern preventive mental health care by better utilizing multidomain data to identify individuals at risk for developing depression early and identify top risk factors (for targeted individuals and/or populations). Findings from ML can inform tailored intervention design to better mitigate risks in order to prevent depression problem development. They can also inform the better utilization of resources to target high-need cases and key determinants, which is particularly useful for LMICs and low-resource settings. This paper illustrates an example of how to move toward this direction. Future research is needed to validate the approach.