Liliana FloreaAssociate Professor
McKusick-Nathans Institute of Genetic Medicine
Department of Medicine and Department of Computer Science
Faculty Manager, Computational Biology Consulting Core
1900 E. Monument, Welch 113
Johns Hopkins University School of Medicine
Baltimore, MD 21205
Ph: (443) 287-5624
E-mail: florea [at] jhu.edu
Lab | Core | CV | Publications
Our online Coursera course Command Line Tools for Genomic Data Science was formulated for biologists and computational scientists alike who wish to enter the 'big data' genomics science. The course covers topics starting from basic Unix commands, to biological data types and formats, to tools and practical workflows for alignment, sequence variation and transcriptomics. For some of our earlier teaching at GWU, please see the curriculum here.
In recent years, next generation sequencing has revolutionized biomedical sciences. However, the vast amounts of short reads it generates are difficult to analyze, demanding new and sophisticated bioinformatics methods. Our lab is developing algorithms and associated software tools to analyze next generation sequencing data to answer a variety of questions in biology and medicine.
High throughput sequencing has made it possible to sequence and assemble the genome of virtually any organism. Most model species were sequenced and annotated as part of large genome sequencing projects, undertaken by broad international consortia. More recently, next generation sequencing has tremendously accelerated the pace of producing new genomes, but the short reads lead to fragmented assemblies that are difficult to analyze. We are developing algorithms and tools to aid in genome assembly and assembly curation, as well as to compare and annotate genomes.
Alternative splicing and RNA editing are important post-transcriptional regulatory mechanisms that contribute to creating functional diversity. In human and other species, dysregulations in splicing and editing have been associated with diseases. We are developing methods to discover and catalogue alternative splicing and RNA editing variations in humans and animals, by analyzing the vast amounts of gene sequence data produced with conventional and next generation sequencing.
Genome analyses of bacterial and viral pathogens have provided unique insights into their virulence, host adaptation and evolution, and as determinants and markers of disease. In collaborative work, we have developed algorithm to identify sequence features in the genomes of viruses and bacteria, such as virus encoded non-coding RNAs and selection signatures, as well as tools to compare and visualize microbial genomes for comparative and functional analyses.