Living in an Ivory Basement

Data Intensive Science, and Workflows

Published: Sun 11 December 2011
By C. Titus Brown

In python.

tags: bioinformatics python workflows metagenomics microsoftisstillevil

I'm writing this on my way back from Stockholm, where I attended a workshop on the 4th Paradigm. This is the idea (so named by Jim Gray, I gather?) that data-intensive science is a distinct paradigm from the first three paradigms of scientific investigation -- theory, experiment, and simulation. I was …
read more
There are comments.
Trying out 'cram'

Published: Sun 13 March 2011
By C. Titus Brown

In python.

tags: python testing pycon

I desperately need something to run and test things at the command line, both for course documentation (think "doctest" but with shell prompts) and for script testing (as part of scientific pipelines). At the 2011 testing-in-python BoF, Augie showed us cram, which is the mercurial project's internal test code ripped …
read more
There are comments.
My new data analysis pipeline code
Published: Fri 11 March 2011
By C. Titus Brown

In science.

tags: python bioinformatics science

First, I write a recipe file, 'metagenome.recipe', laying out my job description for, say, sequence trimming and assembly with Velvet:
```
fasta_file soil-data.fa

qc_filter min_length=50 remove_Ns=true

graph_filter min_length=400

velvet_assemble k=33 min_length=1000 scaffolding=True
```
Then I specify machine parameters, e.g. 'bigmem.conf':
```
[defaults]
n_threads …
```
read more
There are comments.
The sky is falling! The sky is falling!

Published: Thu 14 October 2010
By C. Titus Brown

In python.

tags: clowd bioinformatics python

I just parachuted in on (and heli'd out of?) the Beyond the Genome conference in Boston. I gave a very brief workshop on using EC2 for sequence analysis, which seemed well received. (Mind you, virtually everything possible went wrong, from lack of good network access to lack of attendee computers …
read more
There are comments.
A memory efficient way to remove low-abundance k-mers from large (metagenomic?) DNA data sets

Published: Wed 07 July 2010
By C. Titus Brown

In science.

tags: python bioinformatics biotools

I've spent the last few weeks working on a simple solution to a challenging problem in DNA sequence assembly, and I think we've got a nice simple theoretical solution with an actual implementation. I'd be interested in comments!

Introduction

Briefly, the algorithmic challenge is this:

We have a bunch of …
read more
There are comments.
Teaching scientists how to use computers - hub & spokes

Published: Mon 05 July 2010
By C. Titus Brown

In science.

tags: swc python bioinformatics

After my recent next-gen sequencing course, which was supposed to tie into the whole software carpentry (SWC) effort but didn't really succeed in doing so the first time through, I started thinking about the Right Way to tie in the SWC material. In particular, how do you both motivate scientists …
read more
There are comments.
Which functional programming language(s) should we teach?

Published: Thu 24 June 2010
By C. Titus Brown

In misc.

tags: python msu cse

Laurie Dillon just posted the SIGPLAN eduction board article on Why Undergraduates Should Learn the Principles of Programming Languages to our faculty mailing list at the MSU Computer Science department. One question that came up in the ensuing conversation was: what functional programming language(s) would/should we teach?

I …
read more
There are comments.

⇇ « Page 6 / 38 » ⇉

Introduction

social