Fernandes, Marcelo Augusto CostaCâmara, Gabriel Bezerra Motta2025-01-242025-01-242024-09-05CÂMARA, Gabriel Bezerra Motta. Advanced convolutional neural network techniques for classification of SARS-CoV-2 variants and other viruses: a study using k-mers and chaos game representation. Orientador: Dr Marcelo Augusto Costa Fernandes. 2024. 115f. Tese (Doutorado em Bioinformática) - Instituto Metrópole Digital, Universidade Federal do Rio Grande do Norte, Natal, 2024.https://repositorio.ufrn.br/handle/123456789/61974Since December 2019, the global impact of the Covid-19 pandemic, caused by the SARS-CoV-2 virus, has been profound. Early identification of the virus’s taxonomic classification and genomic origin is critical for strategic planning, containment, and treatment. Deep learning techniques have proven successful in addressing various viral classification challenges, including diagnosis, metagenomics, phylogenetics, and genomic analysis. Motivated by these advances, this study introduces an effective viral genome classifier for SARS-CoV-2, utilizing a convolutional neural network (CNN) framework. This research employed image representations of complete genome sequences to train the CNN, leveraging two distinct datasets: one based on k-mers image representation and the other on Chaos Game Representation (CGR). The k-mers dataset was used for taxonomic classification experiments of the SARS-CoV-2 virus, while the CGR dataset focused on classifying variants of concern (VOCs) and variants of interest (VOIs) of SARS-CoV-2. The CNN achieved remarkable performance in taxonomic classification using k-mers, with accuracy rates about 92% on the validation set and equal to 100% on the test set containing SARS-CoV-2 samples. These results demonstrate the model’s adaptability for classifying other emerging viruses. For the classification of SARS-CoV-2 variants using CGR images, the CNN delivered even higher accuracy, reaching 99.9% on the validation set and 99.8% on the test set. The findings underscore the applicability of deep learning techniques in genome classification tasks, providing a robust tool for the early detection and classification of viral threats. The integration of CNNs with k-mer and CGR image representations presents a novel and effective method for viral genome analysis, supporting ongoing efforts in virology and public health.Acesso AbertoSARS-CoV-2Covid-19Deep learningConvolutional neural networkViral classificationAdvanced convolutional neural network techniques for classification of SARS-CoV-2 variants and other viruses: a study using k-mers and chaos game representationdoctoralThesisCNPQ::CIENCIAS BIOLOGICAS