How to Choose Your Metagenomics Classification Tool

Introduction

Authors: Jennifer Lu (JL), Florian P. Breitwieser (FB), Derrick E. Wood (DW), Li Song (LS), Daehwan Kim (DK), Ben Langmead (BL), Steven L. Salzberg (SLS)

From 2014 to 2018, the Center for Computational Biology released four metagenomics classification software packages: Kraken, KrakenUniq, Kraken 2, and Centrifuge. This page describes:
  1. The history of each software package
  2. The differences between each software package
  3. The best software package for users
  4. Additional software provided for post-processing/analyzing classification results.

Table of Contents

#1) Introduction
#2) Software Packages
#3) Links to Software Websites & Papers
#4) General Comparison Table
#5) How to Choose
#6) About the Authors

Page Updated: 2018/12/29 by Jennifer Lu
( jlu26 jhmi edu )

Software Packages: A Brief Description

Links to Software Websites & Papers

For the most comprehensive understanding of each software package, please refer to the individual websites and papers:

General Comparison

                                             Kraken      KrakenUniq  Kraken 2    Centrifuge
First Release Date (yyyy/mm/dd)              2014/01/04  2018/05/30  2018/06/26  2016/10/04
Latest Release Date (yyyy/mm/dd)             2017/12/05  2018/09/10  2018/08/11  2018/06/10
Paper Date                                   2014/03/03  2018/11/18  TBA         2016/10/17
Original Authors                             DW/SLS      FB/SLS      DW/JL/BL    DK/LS/FB
Currently Supported?                         No          Yes, FB     Yes, DW/JL  Yes, LS
Memory [A]                                   240.8 GB    240.8 GB    34.7 GB     25.2 GB
Database Build Time [A]                      16 hours    16 hours    4 hours     17 hours
Processing Time (per 10 Million reads) [A]   60 sec      55 sec      13 sec      70 sec
Abundance Estimation                         Bracken     Bracken     Bracken     Built-in

Supported Databases:
  Kraken: Refseq, GRCh38
  KrakenUniq: Refseq, GRCh38, microbial nt
  Kraken 2: Refseq, GRCh38, nt, 16S Greengenes, 16S Silva, 16S RDP, nr, protein (translated search)
  Centrifuge: Refseq, GRCh38, nt

[A] Memory and times measured for databases containing GRCh38 and Refseq bacterial/archaeal/viral sequences downloaded in Sept 2018. Database build speed measured using 32 threads on a 48 core machine with 512 GB memory. Processing speed measured using 16 threads during classification on the same machine. Memory and speed measured using each program's defaults (including default k-mer size).
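
As a rough illustration of how the memory and processing-time figures in the comparison table might be used when planning a run, the following Python sketch checks which tools would fit in a given amount of RAM and extrapolates classification time to a given number of reads. It rests on two assumptions that the table itself does not state: that classification time scales linearly with read count, and that the Memory row can be read as the RAM needed to run each tool. The function name estimate and the dictionary TOOLS are hypothetical names chosen for this example, and the results only apply to hardware and databases comparable to those in the table's footnote.

```python
# Figures copied from the General Comparison table (measured under the
# conditions described in the table's footnote).
TOOLS = {
    "Kraken":     {"memory_gb": 240.8, "sec_per_10m_reads": 60},
    "KrakenUniq": {"memory_gb": 240.8, "sec_per_10m_reads": 55},
    "Kraken 2":   {"memory_gb": 34.7,  "sec_per_10m_reads": 13},
    "Centrifuge": {"memory_gb": 25.2,  "sec_per_10m_reads": 70},
}


def estimate(num_reads, available_ram_gb):
    """Return {tool: (fits_in_ram, estimated_seconds)}.

    ASSUMPTION: time scales linearly with the number of reads, and the
    table's Memory value is the RAM required. Neither is claimed by the
    source; treat the output as a back-of-the-envelope estimate.
    """
    report = {}
    for name, spec in TOOLS.items():
        fits = spec["memory_gb"] <= available_ram_gb
        seconds = spec["sec_per_10m_reads"] * num_reads / 10_000_000
        report[name] = (fits, seconds)
    return report


if __name__ == "__main__":
    # Example: 50 million reads on a machine with 64 GB of RAM.
    for name, (fits, seconds) in estimate(50_000_000, 64).items():
        print(f"{name:<11} fits in 64 GB: {fits!s:<5}  est. time: {seconds:.0f} s")
```

With these figures, only Kraken 2 and Centrifuge would fit on a 64 GB machine, which is one practical way the table can narrow the choice before accuracy or database coverage is considered.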

How to Choose

Kraken 1 is no longer supported:

While many continue to use this software, we encourage all Kraken users to upgrade to either KrakenUniq or Kraken 2.


KrakenUniq and Kraken 2 are each useful for different project goals:
Kraken 2 and Centrifuge are distinctly different, each with its own advantages:

About the Authors

Jennifer Lu (JL) is a PhD Candidate in Biomedical Engineering at Johns Hopkins University, with a projected graduation date of 2020-2021. She is primarily advised by Steven Salzberg. She maintains the Bracken software and works alongside Derrick Wood and Ben Langmead to release and maintain Kraken 2. (Jennifer Lu's webpage)

Florian P Breitwieser (FB) is a post-doctoral researcher at Johns Hopkins University in Steven Salzberg's Lab. He is one of the original authors of Centrifuge and is the author of KrakenUniq and Pavian. (Florian Breitwieser's webpage)

Derrick E Wood (DW) received his PhD in 2014 from his work with Steven Salzberg on Kraken at the University of Maryland. For his post-doctoral work, Derrick worked with Ben Langmead in Johns Hopkins Computer Science to develop Kraken 2. He is currently working on publishing Kraken 2. (Derrick Wood's webpage)

Li Song (LS) received his PhD in 2018 working with Liliana Florea at Johns Hopkins University in the Computer Science Department. He is now a post-doctoral researcher at the Dana-Farber Cancer Institute in Shirley Liu’s lab. He is one of the original authors of Centrifuge and continues to maintain and update the software.

Daehwan Kim (DK) received his PhD at the University of Maryland and then completed his post-doctoral research with Steven Salzberg at Johns Hopkins University. He wrote Centrifuge alongside Florian Breitwieser and Li Song. He is now an Assistant Professor at the University of Texas Southwestern Medical Center. (Kim Lab webpage)

Ben Langmead (BL) is an Assistant Professor at Johns Hopkins University in the Department of Computer Science. He is the primary advisor to the Kraken 2 project. (Langmead Lab webpage)

Steven L Salzberg (SLS) is the Bloomberg Distinguished Professor of Biomedical Engineering, Computer Science, and Biostatistics at Johns Hopkins University. He is/was the primary advisor for the students and postdocs who developed Kraken 1, Centrifuge, KrakenUniq, Bracken, and Pavian. (Salzberg Lab webpage)