dc.contributor.author | Gutiérrez Cárdenas, Juan Manuel | |
dc.contributor.author | Wang, Z. | |
dc.contributor.other | Gutiérrez Cárdenas, Juan Manuel | |
dc.date.accessioned | 2024-01-11T15:50:32Z | |
dc.date.available | 2024-01-11T15:50:32Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Gutiérrez-Cárdenas,J., & Wang, Z. (2021). Classification of Breast Cancer and Breast Neoplasm Scenarios Based on Machine Learning and Sequence Features from lncRNAs–miRNAs-Diseases Associations. Interdisciplinary Sciences – Computational Life Sciences, (13), 572-581. https://doi.org/10.1007/s12539-021-00451-6 | es_PE |
dc.identifier.issn | 1867-1462 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12724/19567 | |
dc.description.abstract | The influence of non-coding RNAs, such as lncRNAs (long non-coding RNAs) and miRNAs (microRNAs), is undeniable in several diseases, for example, in the formation of neoplasms and cancer scenarios. However, there are challenges due to the scarcity of validated datasets and the imbalance in the data. We found that the research of associations between miRNAs-lncRNAs and diseases is limited or done separately. In addition, those investigations, which use Machine Learning models joined with genomic sequence features extracted from miRNAs and lncRNAs, are few compared with using some methods such as genomic expression or Deep Learning techniques. In this paper, we propose a structure of using supervised and unsupervised machine learning models with genomic sequence features, such as k-mers, sequence alignments, and energy folding values, to validate miRNAs and lncRNAs association with breast cancer and neoplasms scenarios. Using One-Class SVM for outlier detection and comparing two supervised models such as SVM and Random Forest, we manage to obtain accuracy results of 95.44% for the One-class model, with 88.79% and 99.65% for the SVM and Random Forest models, respectively. The results showed a promising path for the study of sequence features interactions joined with Machine Learning models comparable to those found in the existing literature. | en_EN |
dc.format | application/html | |
dc.language.iso | eng | |
dc.publisher | Springer | |
dc.relation.ispartof | urn:issn: 1867-1462 | |
dc.rights | info:eu-repo/semantics/restrictedAccess | * |
dc.source | Repositorio Institucional Ulima | |
dc.source | Universidad de Lima | |
dc.subject | Breast cancer | en_EN |
dc.subject | Non-coding RNA | en_EN |
dc.subject | Supervised learning (Machine learning) | en_EN |
dc.subject | Cáncer de mama | es_PE |
dc.subject | ARN no codificante | es_PE |
dc.subject | Aprendizaje supervizado (Aprendizaje automático) | es_PE |
dc.subject.classification | Pendiente | es_PE |
dc.title | Classification of Breast Cancer and Breast Neoplasm Scenarios Based on Machine Learning and Sequence Features from lncRNAs–miRNAs-Diseases Associations | en_EN |
dc.type | info:eu-repo/semantics/article | |
dc.type.other | Artículo en Scopus | |
ulima.areas.lineasdeinvestigacion | Calidad de vida y bienestar / Salud | es_PE |
dc.identifier.journal | Interdisciplinary Sciences – Computational Life Sciences | |
dc.publisher.country | CH | |
dc.subject.ocde | https://purl.org/pe-repo/ocde/ford#3.02.21 | |
dc.identifier.doi | https://doi.org/10.1007/s12539-021-00451-6 | |
ulima.cat | 15 | |
ulima.autor.afiliacion | Universidad de Lima (Scopus) | |
ulima.autor.carrera | Ingeniería de Sistemas | |
dc.identifier.isni | 0000000121541816 | |
dc.identifier.scopusid | 2-s2.0-85117284645 | |