[khmer] Duration of do-partition.py (very long !)

Alexis Groppi alexis.groppi at u-bordeaux2.fr
Tue Mar 19 02:41:45 PDT 2013


Hi Titus,

After digital normalization and filter-below-abund, on your advice I 
ran do-partition.py on 2 data sets (approx. 2.5 million reads, 75 nt):

/khmer-BETA/scripts/do-partition.py -k 20 -x 1e9 \
    /ag/khmer/Sample_174/174r1_prinseq_good_bFr8.fasta.keep.below.graphbase \
    /ag/khmer/Sample_174/174r1_prinseq_good_bFr8.fasta.keep.below
and
/khmer-BETA/scripts/do-partition.py -k 20 -x 1e9 \
    /ag/khmer/Sample_174/174r2_prinseq_good_1lIQ.fasta.keep.below.graphbase \
    /ag/khmer/Sample_174/174r2_prinseq_good_1lIQ.fasta.keep.below

For the first one I got a 
174r1_prinseq_good_bFr8.fasta.keep.below.graphbase.info file containing 
the line: 33 subsets total
After that, 33 .pmap files (0.pmap to 32.pmap) were created at a regular 
pace, and finally I got a single file, 
174r1_prinseq_good_bFr8.fasta.keep.below.part (all the .pmap files were 
deleted).
This run took approx. 56 hours.

For the second set (174r2), do-partition.py has been running for 32 
hours, but so far I only have the 
174r2_prinseq_good_1lIQ.fasta.keep.below.graphbase.info file containing 
the line: 35 subsets total
And nothing more...
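
A minimal sketch of how the progress of such a run could be checked while 
waiting: it reads the "N subsets total" line from the .info file and 
counts the .pmap files written so far. The function names are 
hypothetical, and it assumes the .pmap files sit next to the graphbase 
and share its prefix; the glob pattern would need adjusting if they are 
named differently.

```python
import glob
import os
import re


def subsets_total(info_path):
    """Return N from a line like '35 subsets total', or None if not found."""
    if not os.path.exists(info_path):
        return None
    with open(info_path) as fh:
        match = re.search(r"(\d+)\s+subsets total", fh.read())
    return int(match.group(1)) if match else None


def pmap_files(graphbase):
    """List .pmap files whose names start with the graphbase path.

    Assumption: the .pmap files share the graphbase prefix.
    """
    return sorted(glob.glob(glob.escape(graphbase) + "*.pmap"))


def progress(graphbase):
    """Return (expected subsets, .pmap files present, newest .pmap mtime)."""
    total = subsets_total(graphbase + ".info")
    files = pmap_files(graphbase)
    # Modification time of the most recent .pmap, or None if there are none yet.
    newest = max((os.path.getmtime(p) for p in files), default=None)
    return total, len(files), newest
```

Calling progress() with the graphbase path passed to do-partition.py 
gives the expected subset count, the number of .pmap files present, and 
the time the last one was written. Zero .pmap files after many hours 
would match the situation described above for the second set.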

Is this duration "normal"?
(The number of threads was left at the default, 4 threads.)
Is it expected to have 33 subsets and only one file at the end?
Should I stop do-partition.py on the second set and rerun it with more 
threads?

Thanks for your help

Alexis


