Santiago is a software developer specialising in machine learning, who has a passion for metagenomics and mexican folk music.
santiago sanchez fragoso, software developer – machine learning at embl-ebi. Credit: jeff dowling/embl-ebi
“we’re incredibly fortunate to have access to this much data and it’s vital for the work i do to train machine learning algorithms.”
machine learning is embedded in different areas across embl’s european bioinformatics institute (embl-ebi) including our data resources. Using this technology allows us to input data into our resources more efficiently than ever before. Santiago sanchez fragoso is a software developer at embl-ebi specialising in machine learning. Here, we find out about how he got into this field and his work to improve mgnify – our metagenomics data resource.
what is your professional background?
i did my bachelor’s degree in environmental biology and for my dissertation project, i developed a tool to identify proteins in arthropods – a type of insect – and plants. This was my first experience using pattern recognition and machine learning techniques. I then did a master’s degree which allowed me to develop these methods further. I did this in cuernavaca, mexico. This region is a hub for research, there are many life science institutes based there.
what does your role entail?
i’m a software developer specialising in machine learning. I develop tools and software for natural products discovery in the mgnify data resource. I also help with the development and rolling out of the new mgnify releases. Finally, i have my own research within the mgnify team. For this, i am testing the different tools i develop with various datasets to optimise them and gain new insights from these data. We also collaborate with colleagues at cambridge university to help us validate our findings experimentally.
what motivated you to work at embl-ebi?
i was using embl-ebi resources and tools back in mexico so working at the institute was an exciting opportunity. Embl-ebi data resources are widely used in mexico. For example, i was frequently using interpro, uniprot and ontology search to name a few. Metagenomics is also a very dynamic and fast growing area of bioinformatics so i saw this position in mgnify as a fantastic career opportunity.
working in bioinformatics meant i often took for granted the huge amount of data available in open access databases like those at embl-ebi. We’re incredibly fortunate to have access to this much data and it’s vital for the work i do to train machine learning algorithms. This has a huge impact on accelerating life science research.
what are some of the challenges in your job?
working with big data always has its challenges; there’s often a trade-off between how fast you want your work done versus the quality of the results. You find yourself frequently juggling this. There’s an enormous amount of data freely available and at your fingertips.
what is one thing we couldn’t find out about you from an online search?
i love mexican folk music. I have been playing the guitar since i was 14 years old. The type of music i play is called corridos, and this is very popular in northern mexico. I particularly enjoy playing and singing along; this is always a good party trick.
want to work with us?
check out our vacancies page or sign up for a job alert .
at embl-ebi you can use your energy and skills to make a valuable contribution to science and technology and be part of a new generation: digital science.
behind its outstanding technical infrastructure and exceptional life science expertise, embl-ebi is a medium-sized organisation with a small-company feel.
#j-18808-ljbffr