Skip to content

Introduction

Myloasm is a de novo metagenome assembler for long-read sequencing data. It takes sequencing reads and outputs polished contigs in a single command.

Myloasm works with:

  • Nanopore R10 simplex reads with > ~97% accuracy (basecalled in sup or hac mode)
  • PacBio HiFi reads

Installation and Usage: See the Install and Usage sections on the navigation side bar. Source code is available at the GitHub repo.

Why myloasm?

Results: Here are some preliminary results for myloasm until the preprint officially comes out.

Philosophy: Myloasm was designed to take advantage of modern long reads. Even the noisiest modern long reads (e.g., nanopore simplex R10) have become quite accurate—myloasm uses a new approach that enables high-resolution assembly from this data.

Strengths: myloasm can

  • often assemble similar (intraspecies) strains better than other nanopore assemblers
  • take advantage of very long reads better than de Bruijn graph approaches
  • obtain contiguous assemblies in even complex, highly heterogeneous metagenomes

Limitations: myloasm may

  • occasionally produce chimeric misassembled contigs due to its aggressiveness.
  • use more memory than other assemblers. Currently, a ~200 gigabase long-read human gut sample takes ~450 GB of RAM.

Issues, questions, and discussions

Citation

Jim Shaw and Heng Li. Forthcoming.