My work and research interests focus on machine learning for genomics
and metaresearch, including deep learning method development, automated
architecture search, and practical tools that use DL and LLMs to make
research workflows more reproducible, scalable, and efficient.
Outside research and software, I am interested in Baroque and classical
music, Expressionist art, and twentieth-century European literature.
I also enjoy hiking in the Alps, fly fishing, and spending time with
dogs.
Projects
I am the main developer of these tools, spanning secure research
infrastructure, sequencing operations, and scientific writing support.
bioRxiv preprint, 2026.
Presents BAQLaVa for high-resolution viral profiling from
metagenomic and metatranscriptomic data. The framework supports
scalable virome epidemiology and virus-host interaction analysis.
bioRxiv preprint, 2025.
Evaluates how well large language models support structured
microbial phenotype annotation. The study compares model behavior
across traits and uses confidence to prioritize reliable outputs.
Communications Biology, 2024.
Presents GenomeNet-Architect, a framework for automated neural
architecture optimization on genomic sequence tasks. The approach
searches model layouts and training hyperparameters for compact,
efficient genomic classifiers.
Cell Host & Microbe, 2023.
Studies strain-level microbiome responses to repeated antibiotic
perturbations in gnotobiotic mice. The paper highlights how
treatment pulses can reshape community dynamics and resistance.
Communications Biology, 2023.
Introduces Self-GenomeNet, a self-supervised approach tailored to
genomic sequences. The method uses reverse-complement structure to
improve learning in data-scarce genomic prediction tasks.
Cell Host & Microbe, 2021.
Maps natural CRISPR-Cas systems across human microbiome
metagenomes. The work links CRISPR spacers, cas genes, body sites,
taxa, and putative viral or mobile-element targets.