<html>

<head>

<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

</head>

<body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;">

Ah yes, I have seen that as well (though in the kmer-counting pipeline, as Titus found)&#16;. &nbsp;I recall knocking threads down to 1 also helped.

<div><br>

</div>

<div>chris</div>

<div><br>

<div>

<div>On Apr 14, 2014, at 7:08 AM, Alexis GROPPI &lt;<a href="mailto:alexis.groppi@u-bordeaux.fr">alexis.groppi@u-bordeaux.fr</a>&gt; wrote:</div>

<br class="Apple-interchange-newline">

<blockquote type="cite">

<div text="#000000" bgcolor="#FFFFFF">Hi Titus,<br>

<br>

Good guess !<br>

On the same files but by omitting the --threads 12 option (default = 1) , the script worked perfectly !<br>

Thanks.<br>

<br>

Alexis<br>

<br>

PS : I wrote a comment on github for the follow up<br>

<br>

<br>

<div class="moz-cite-prefix">Le 13/04/2014 17:42, C. Titus Brown a écrit&nbsp;:<br>

</div>

<blockquote cite="mid:20140413154212.GE17799@idyll.org" type="cite">

<pre wrap="">On Thu, Apr 10, 2014 at 11:08:03AM &#43;0200, Alexis GROPPI wrote:

</pre>

<blockquote type="cite">

<pre wrap="">Hi,

Thanks for these helpful advices.

The first step (digital normalisation) ran perfectly.

I'm now trying to compute the abundance histogram.

I launched  :

abundance-dist-single.py -N 4 -x 48e9 --threads 12  

$PBS_O_WORKDIR/LCBL_1/LCBL_1.fasta.normalized  

$PBS_O_WORKDIR/LCBL_1/LCBL_1.fasta.normalized.histogram

But after 24 hours, the script is still running but without any results.

Is that normal ?

</pre>

</blockquote>

<pre wrap="">This is a bug that I ran into, too.  Not sure when it popped up, my guess is

between 0.8 and 1.0.  Thanks for the report -- see

<a class="moz-txt-link-freetext" href="https://github.com/ged-lab/khmer/issues/384">https://github.com/ged-lab/khmer/issues/384</a> to track.

best,

--titus

</pre>

<blockquote type="cite">

<pre wrap="">Le 04/04/2014 17:01, C. Titus Brown a ?crit :

</pre>

<blockquote type="cite">

<pre wrap="">On Apr 4, 2014, at 10:49 AM, Alexis Groppi <a class="moz-txt-link-rfc2396E" href="mailto:alexis.groppi@u-bordeaux.fr">&lt;alexis.groppi@u-bordeaux.fr&gt;</a> wrote:

</pre>

<blockquote type="cite">

<pre wrap="">Hi,

We want to analyse 454 metagenomics data (570 000 reads of ~700 nt per sample).

My questions are :

1/ Given that khmer is rather short-read/Illumina oriented, are we mistaken to try and apply it to our long 454 reads?

2/ Is there an actual benefit in feeding .fastq files to khmer (in our case separate .qual files at the moment, but that can be changed), or does it really only consider the sequence data? ie. are the fasta files sufficient ?

How do you define what data needs pre-normalization or what data can go straight to artifact removal? In the Iowa corn example, you do not start by a normalize/filter pass, how come?

3/ Thus, should the pipeline for our data be like :

DIGINORM (normalize-by-median --  filter-abund -- normalize-by-median) -- ARTIFACT REMOVAL (load-graph -- partition-graph, ...etc)

  or

is the step DIGINORM  useless in our case ?

Thanks for your help

And thanks for your great job on khmer 1.0 !

</pre>

</blockquote>

<pre wrap="">Hi Alexis,

thanks!

Good questions?

I would suggest doing only a single pass of digital normalization, since the impact of both errors and low coverage will be different with longer reads.  So, something like normalizing to a coverage of 5, in a single pass.  Do not do any trimming (filter-abund) as this will potentially discard a lot of your sequences; trimming cuts off the ends of sequences, and is best applied to high-coverage short reads.  Your main diagnostic tool here will be a k-mer abundance histogram after normalization: do you see that a bunch of real coverage has been lowered to 1 or 2, or are you primarily seeing normalization of coverage to an average of 5 with no increase in the number of sequences with a coverage of 1 or 2?

Partitioning should work just fine, but I would not do artifact removal (filter-stoptags, etc.).  If you get a big blob where everything hangs together, there are some things you can do with stoptags: briefly, run the knot finding stuff, but then feed the resulting stoptags into partition-graph with the -S parameter.  This will prevent partitioning across highly connected k-mers.

HTH!  Please feel free to ask more ;)

cheers,

--titus

_______________________________________________

khmer mailing list

<a class="moz-txt-link-abbreviated" href="mailto:khmer@lists.idyll.org">khmer@lists.idyll.org</a>

<a class="moz-txt-link-freetext" href="http://lists.idyll.org/listinfo/khmer">http://lists.idyll.org/listinfo/khmer</a>

</pre>

</blockquote>

<pre wrap="">-- 

CBiB - Universit? de Bordeaux <a class="moz-txt-link-rfc2396E" href="http://www.u-bordeaux.fr/">&lt;http://www.u-bordeaux.fr&gt;</a>

Dr Alexis Groppi <a class="moz-txt-link-rfc2396E" href="mailto:alexis.groppi@u-bordeaux2.fr">&lt;mailto:alexis.groppi@u-bordeaux2.fr&gt;</a>

Directeur adjoint du CBiB - Charg? de mission du CGFB

146, rue L?o Saignat - Case 68 - 33076 Bordeaux Cedex

T. &#43;33 5 57 57 12 18

P. &#43;33 6 35 95 04 87

<a class="moz-txt-link-abbreviated" href="http://www.cbib.u-bordeaux2.fr/">www.cbib.u-bordeaux2.fr</a> <a class="moz-txt-link-rfc2396E" href="http://www.cbib.u-bordeaux2.fr/">&lt;http://www.cbib.u-bordeaux2.fr&gt;</a>

</pre>

</blockquote>

<pre wrap=""></pre>

<blockquote type="cite">

<pre wrap="">_______________________________________________

khmer mailing list

<a class="moz-txt-link-abbreviated" href="mailto:khmer@lists.idyll.org">khmer@lists.idyll.org</a>

<a class="moz-txt-link-freetext" href="http://lists.idyll.org/listinfo/khmer">http://lists.idyll.org/listinfo/khmer</a>

</pre>

</blockquote>

<pre wrap="">

</pre>

</blockquote>

<br>

<div class="moz-signature">-- <br>

<a href="http://www.u-bordeaux.fr/" title="Université de Bordeaux"><span>&lt;signature_mail_cbib_nub.jpg&gt;</span></a>

<p style="font-family: Arial, sans-serif; font-size: 13px; color:

        #009DE0;line-height:17px">

<span style="font-weight: bold"><a href="mailto:alexis.groppi@u-bordeaux2.fr" style="color:#009DE0; text-decoration:none;">Dr Alexis

<span style="text-transform:uppercase;">Groppi</span></a></span><br>

Directeur adjoint du CBiB - Chargé de mission du CGFB<br>

146, rue Léo Saignat - Case 68 - 33076 Bordeaux Cedex<br>

T. &#43;33 5 57 57 12 18<br>

P. &#43;33 6 35 95 04 87<br>

<a href="http://www.cbib.u-bordeaux2.fr/" style="color:#009DE0;

          text-decoration:none;">www.cbib.u-bordeaux2.fr</a><br>

</p>

</div>

</div>

_______________________________________________<br>

khmer mailing list<br>

<a href="mailto:khmer@lists.idyll.org">khmer@lists.idyll.org</a><br>

http://lists.idyll.org/listinfo/khmer<br>

</blockquote>

</div>

<br>

</div>

</body>

</html>