About

My name is Bailee Egan, and I am a bioinformatician by training with additional interests in data science and software development.

I received my M.S. in biology in 2022 at the University of Nebraska-Lincoln. Under Dr. Etsuko Moriyama, I studied corn rootworm chemoreceptors, learning many of the tools and techniques of bioinformatics. In my undergrad, I worked in the lab of Dr. Hope Hollocher at the University of Notre Dame, studying variation in the oral and gut microbiomes of long-tailed macaques.

I have a passion for coding. In my work, I mainly use Bash, Python, and R for processing data. It is also no surprise that data visualization is one of my favorite parts of my work. Outside of work, I've also constructed multiple games, apps, and a website for myself.

Skills

  • Languages: Python, R, Rust, Javascript (Jquery, React), CSS, HTML
  • Linux environments, HPC clusters, and job systems (Slurm, PBS)
  • Python and R packages: pandas, numpy, Plotly, Dash, tidyverse/ggplots, Shiny, Rmarkdown
  • Workflow, package management, package deployment tools: Snakemake, Nextflow, Conda, Docker, git

Research

Graduate research, August 2019 - July 2022

Transcriptome assembly and characterization of chemoreceptors for corn rootworms
  • Produced transcriptome assemblies from RNA-seq data using a variety of de novo and genome-guided assembly methods and examined the most optimal methods of assembly based on basic metrics and assembly evaluation tools
  • Identified novel chemoreceptors in transcriptome assemblies using BLAST search and performed phylogenetic analysis of chemoreceptor genes across insects gathered from literature and online databases
  • Analyzed chemoreceptor expression by performing differential gene-expression analysis using kallisto, DESeq2, and edgeR
  • Developed custom visualizations to display statistical results and complex datasets using R libraries

Undergraduate research, 2016 - 2019

Investigating associations between the prokaryotic and eukaryotic symbiotic communities in long-tailed macaques
  • Prepared 16S rRNA sequences from fecal and saliva samples and processed Illumina HiSeq reads using custom USEARCH pipeline to generate family-level read counts
  • Characterized microbiome community data using alpha and beta diversity methods and identified taxa strongly associated with certain microbiome features in R