Sourmash is a command-line tool and Python library that calculates and compares MinHash signatures from sequence data. Sourmash "compare" and "gather" functionality enables comparison and characterization of signatures ...
I have been working on the assembly of big shotgun metagenomic data
from ARMO (Amazon Rain Forest Microbial Observatory) project. The
biggest challenge is the huge data size, 2TB in fastq and more than 6
billions reads after read trimming. One lucky thing ...
I spend so much of my time writing stuff down to cadge funding or
bruit about ideas, and much of that never really goes anywhere.
In the interests of slowing down any competitors by getting them
to take my old ideas seriously, here is an interesting set of
ideas that ...
I gave a talk last Wednesday at U. Michigan in the DCMB program where I included a slide
estimating how much DNA sequencing (in base pairs) was needed for good
de novo assembly of sequences from various biological environments or
problems. The slide was there to motivate the challenges of ...