Enzo Guerrero-Araya

Phylodynamics - Epidemiology - Computational Biology - Data Scientist

About Me

Hello, I’m Enzo, a Computational Biologist with a freshly earned Ph.D. from Andrés Bello University in Chile. My core focus is on the genomic epidemiology, evolution, adaptation, and phylogenetic relationship of Clostridioides difficile. I’ve been privileged to collaborate with international peers on research exploring evolution, bioinformatics, and data analysis, which has led to numerous scientific publications. In addition to my research experience, I also have a strong background in software development and data science.

Softwares

Multilocus Sequence Typing (MLST) is a precise microbial typing approach at the intra-species level for epidemiologic and evolutionary purposes. It operates by assigning a sequence type (ST) identifier to each specimen, based on a combination of alleles of multiple housekeeping genes included in a defined scheme. The use of MLST has multiplied due to the availability of large numbers of genomic sequences and epidemiologic data in public repositories. However, data processing speed has become problematic due to the massive size of modern datasets. Here, we present FastMLST, a tool that is designed to perform PubMLST searches using BLASTn and a divide-and-conquer approach that processes each genome assembly in parallel. The output offered by FastMLST includes a table with the ST, allelic profile, and clonal complex or clade (when available), detected for a query, as well as a multi-FASTA file or a series of FASTA files with the concatenated or single allele sequences detected, respectively. FastMLST was validated with 91 different species, with a wide range of guanine-cytosine content (%GC), genome sizes, and fragmentation levels, and a speed test was performed on 3 datasets with varying genome sizes. Compared with other tools such as mlst, CGE/MLST, MLSTar, and PubMLST, FastMLST takes advantage of multiple processors to simultaneously type up to 28 000 genomes in less than 10 minutes, reducing processing times by at least 3-fold with 100% concordance to PubMLST, if contaminated genomes are excluded from the analysis. The source code, installation instructions, and documentation of FastMLST are available at https://github.com/EnzoAndree/FastMLST

Experience

Fundación Ciencia & Vida

Data Scientist

February 2021 to February 2023

I was a data scientist. The responsibilities were performing data preparation, data mining, data visualization, and data analysis in COVID-19 pandemic-related data.

Andrés Bello University

Professor

March 2017 to July 2017

I was a professor of the Laboratory of Genetic Engineering in Microorganisms (BIT200).

Education

Andrés Bello University

Doctor of Philosophy

2018 to 2023

During my Ph.D. project I studied the evolution and genomic epidemiology of Clostridioides difficile in Latin America. Several international collaborations were carried out in research related to evolution, bioinformatics, and data analysis that led to scientific publications.

Andrés Bello University

Bachelor of Biotechnology

2012 to 2016

During my undergraduate degree I received training in microbiology, genetics and I was further specialized in the area of bacterial genome assembly, genomic data analysis, and bioinformatics in general.

Publications - Published

Enzo Guerrero-Araya, Cesar Ravello, Mario Rosemblatt, Tomas Perez-Acle. “Socioeconomic status correlates with COVID-19 vaccination coverage among primary and secondary students in the most populated city of Chile”. Scientific Reports (2025).

Anny Camargo, Enzo Guerrero-Araya, Sergio Castañeda, Laura Vega, María X Cardenas-Alvarez, César Rodríguez, Daniel Paredes-Sabja, Juan David Ramírez, and Marina Muñoz. “Intra-species diversity of Clostridium perfringens: A diverse genetic repertoire reveals its pathogenic potential”. Frontiers in Microbiology (2022).

Luz H Patiño, Sergio Castañeda, Marina Muñoz, Nathalia Ballesteros, Angie L Ramirez, Nicolas Luna, Enzo Guerrero-Araya, Julie Pérez, Camilo A Correa-Cárdenas, Maria Clara Duque, Claudia Méndez, Carolina Oliveros, Maryia V Shaban, Alberto E Paniz-Mondolfi, and Juan David Ramírez. “Epidemiological Dynamics of SARS-CoV-2 Variants During Social Protests in Cali, Colombia”. Frontiers in Medicine 9 (2022).

Enzo Guerrero-Araya, Marina Muñoz, César Rodríguez, and Daniel Paredes-Sabja. “FastMLST: A multi-core tool for multilocus sequence typing of draft genome assemblies”. Bioinformatics and biology insights 15 (2021): 11779322211059238.

Castañeda, Sergio, Luz H. Patiño, Marina Muñoz, Nathalia Ballesteros, Enzo Guerrero-Araya, Daniel Paredes-Sabja, Carolina Flórez, et al. “Evolution and epidemic spread of SARS-CoV-2 in Colombia: A year into the pandemic”. Vaccines 9, no 8 (2021): 837.

Knight, Daniel R., Korakrit Imwattana, Brian Kullin, Enzo Guerrero-Araya, Daniel Paredes-Sabja, Xavier Didelot, Kate E. Dingle, David W. Eyre, César Rodríguez, and Thomas V. Riley. “Major genetic discontinuity and novel toxigenic species in Clostridioides difficile taxonomy”. Elife 10 (2021).

Muñoz, Marina, Enzo Guerrero-Araya, Catalina Cortés-Tapia, Angela Plaza-Garrido, Trevor D. Lawley, and Daniel Paredes-Sabja. “Comprehensive genome analyses of Sellimonas intestinalis, a potential biomarker of homeostasis gut recovery”. Microbial genomics 6, no 12 (2020).

Romero-Rodríguez, Alba, Scarlett Troncoso-Cotal, Enzo Guerrero-Araya, and Daniel Paredes-Sabja. “The Clostridioides difficile Cysteine-Rich Exosporium Morphogenetic Protein, CdeC, Exhibits Self-Assembly Properties That Lead to Organized Inclusion Bodies in Escherichia coli”. Msphere 5, no 6 (2020): e01065-20.

Enzo Guerrero-Araya, Claudio Meneses, Eduardo Castro-Nallar, Ana M. Guzmán D., Manuel Álvarez-Lobos, Carlos Quesada-Gómez, Daniel Paredes-Sabja, and César Rodríguez. “Origin, genomic diversity and microevolution of the Clostridium difficile B1/NAP1/RT027/ST01 strain in Costa Rica, Chile, Honduras and Mexico”. Microbial genomics 6, no 5 (2020).

Calderon-Romero, Paulina, Pablo Castro-Cordova, Rodrigo Reyes-Ramirez, Mauro Milano-Cespedes, Enzo Guerrero-Araya, Marjorie Pizarro-Guajardo, Valeria Olguin-Araneda, Fernando Gil, and Daniel Paredes-Sabja. “Clostridium difficile exosporium cysteine-rich proteins are essential for the morphogenesis of the exosporium layer, spore resistance, and affect C. difficile pathogenesis”. PLoS pathogens 14, no 8 (2018): e1007199.

Enzo Guerrero-Araya, Angela Plaza-Garrido, Fernando Díaz-Yañez, Marjorie Pizaro-Guajardo, Sandro L. Valenzuela, Claudio Meneses, Fernando Gil, Eduardo Castro-Nallar, and Daniel Paredes-Sabja. “Genome sequence of Clostridium paraputrificum 373-A1 isolated in Chile from a patient infected with Clostridium difficile”. Genome Announcements 4, no 6 (2016): e01178-16.