  •   Repositorio Institucional ULima
  • Articles
  • 4. In conferences and other events
  • Systems Engineering
Model Comparison for the Classification of Comments Containing Suicidal Traits from Reddit via NLP and Supervised Learning

Date
2022
Author(s)
Mantilla Saavedra, Camila Stefany
Gutiérrez Cárdenas, Juan Manuel
Abstract
In recent years, suicide has become one of the most critical public health issues among teenagers and adults. At the same time, the growth and widespread adoption of social networks and mobile devices have made it possible to compile relevant information that helps us understand the thoughts, feelings, and emotions expressed on these platforms. The detection of suicidal traits on social media has become a relevant research topic, as it permits the identification of probable suicidal traits among users by examining their posts on well-known social networks such as Reddit. For that reason, the purpose of the present research is to compare different supervised classification models, namely Logistic Regression, Support Vector Machines (SVM), Random Forest, AdaBoost, Gradient Boosting, and XGBoost, together with feature extraction techniques such as TF-IDF and GloVe. The results of our experiments show that the best model is SVM with TF-IDF, which obtains 91.50% Accuracy, 92.40% Precision, 90.30% Recall, and a 91.50% F1-score. This study also shows that TF-IDF outperforms GloVe for feature extraction across the different models tested.
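The abstract reports Accuracy, Precision, Recall, and F1-score for a binary classifier. As a minimal sketch of how these four metrics are commonly derived from confusion-matrix counts, the Python below may help. The `ConfusionCounts` type, the `classification_metrics` function, and the counts themselves are hypothetical illustrations and are not taken from the paper, and the averaging scheme the authors used is not stated in this record.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts (hypothetical values, not the paper's data)."""
    tp: int  # positive-class comments correctly flagged
    fp: int  # negative-class comments incorrectly flagged
    fn: int  # positive-class comments that were missed
    tn: int  # negative-class comments correctly left unflagged


def classification_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Compute accuracy, precision, recall, and F1 for the positive class."""
    total = c.tp + c.fp + c.fn + c.tn
    if total == 0 or (c.tp + c.fp) == 0 or (c.tp + c.fn) == 0:
        raise ValueError("metrics are undefined for these counts")
    accuracy = (c.tp + c.tn) / total
    precision = c.tp / (c.tp + c.fp)
    recall = c.tp / (c.tp + c.fn)
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Assumed example counts for a 200-comment test set.
counts = ConfusionCounts(tp=80, fp=20, fn=10, tn=90)
metrics = classification_metrics(counts)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

Precision penalizes false alarms while recall penalizes missed cases, which is presumably why a study of this kind reports both alongside accuracy.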
URI
https://hdl.handle.net/20.500.12724/17555
DOI
https://doi.org/10.1007/978-3-031-04447-2_17
How to cite
Mantilla-Saavedra, C. & Gutiérrez-Cárdenas, J. (2022). Model Comparison for the Classification of Comments Containing Suicidal Traits from Reddit via NLP and Supervised Learning. In J. A. Lossio-Ventura, J. Valverde-Rebaza, E. Díaz, D. Muñante, C. Gavidia-Calderon, A. D. B. Valejo & H. Alatrista-Salas (Eds.), Information Management and Big Data: Eighth Annual International Conference, SIMBig 2021, Proceedings, Communications in Computer and Information Science (vol. 1577, pp. 253-263). Springer. https://doi.org/10.1007/978-3-031-04447-2_17
Publisher
Springer
Subjects
Suicide
Social networks
Neurolinguistic programming
ISSN
1865-0929
Event
Communications in Computer and Information Science
Collection(s)
  • Systems Engineering [73]


 

 
