<html>
  <head>
    <meta content="text/html; charset=ISO-8859-1"
      http-equiv="Content-Type">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    Hi,<br>
    <br>
    <div class="moz-cite-prefix">Le 13/03/2013 14:12, Eric McDonald a
      &eacute;crit&nbsp;:<br>
    </div>
    <blockquote
cite="mid:CAGhFaV1RKgHuyLih2DY2Rw--j-RjZeh3WW0UxwDqcTUj7Eoy7w@mail.gmail.com"
      type="cite">
      <div dir="ltr">Hi Alexis,
        <div><br>
        </div>
        <div style="">First, let me say thank you for being patient and
          working with us in spite of all the problems you are
          encountering.</div>
      </div>
    </blockquote>
    <br>
    That's bioinformatician life ;)<br>
    <br>
    <blockquote
cite="mid:CAGhFaV1RKgHuyLih2DY2Rw--j-RjZeh3WW0UxwDqcTUj7Eoy7w@mail.gmail.com"
      type="cite">
      <div dir="ltr">
        <div style=""><br>
        </div>
        <div style="">With regards to the floating point exception, I
          see several opportunities for a division-by-zero condition in
          the threading utilities used by the script. These
          opportunities exist if an input file is empty. (The problem
          may be coming from another place, but this would be my first
          guess.) What does the following command say:</div>
        <div style=""><br>
        </div>
        <div style="">&nbsp; ls -lh&nbsp;<span
            style="font-family:monospace;font-size:10px">/scratch/ag/khmer/</span><a
            moz-do-not-send="true" href="http://174r1_table.kh/"
            target="_blank" style="font-family:monospace;font-size:10px">174r1_table.kh</a>&nbsp;<span
            style="font-family:monospace;font-size:10px">/mnt/var/home/ag/174r1_</span><span
            style="font-family:monospace;font-size:10px">prinseq_good_bFr8.fasta.keep</span></div>
      </div>
    </blockquote>
    <br>
    &nbsp;The result : (the files are not empty)<br>
    <tt>-rw-r--r-- 1 ag users 299M 12 mars&nbsp; 20:54
      /mnt/var/home/ag/174r1_prinseq_good_bFr8.fasta.keep</tt><tt><br>
    </tt><tt>-rw-r--r-- 1 ag users 141G 12 mars&nbsp; 21:05
      /scratch/ag/khmer/174r1_table.kh</tt><br>
    <br>
    <blockquote
cite="mid:CAGhFaV1RKgHuyLih2DY2Rw--j-RjZeh3WW0UxwDqcTUj7Eoy7w@mail.gmail.com"
      type="cite">
      <div dir="ltr"><br>
        <div style="">Also, since you appear to be using TORQUE as your
          resource manager/batch system, could you please attach the
          complete output and error files for the job? (These files
          should be of the form &lt;job_name&gt;.o2693 and
          &lt;job_name&gt;.e2693, where &lt;job_name&gt; is the name of
          your job. There may only be one or the other of these files,
          depending on site defaults and whether you specified "-j oe"
          or "-j eo" in your job submission.)<br>
        </div>
      </div>
    </blockquote>
    <br>
    I re run the job since I have deleted previous (2693) err/out files.<br>
    Here is the new file (merged with the option -j oe in the bash
    script) :<br>
    <br>
    <tt>#############################</tt><tt><br>
    </tt><tt>User: ag</tt><tt><br>
    </tt><tt>Date: Wed Mar 13 14:59:21 CET 2013</tt><tt><br>
    </tt><tt>Host: rainman.cbib.u-bordeaux2.fr</tt><tt><br>
    </tt><tt>Directory: /mnt/var/home/ag</tt><tt><br>
    </tt><tt>PBS_JOBID: 2695.rainman</tt><tt><br>
    </tt><tt>PBS_O_WORKDIR: /mnt/var/home/ag</tt><tt><br>
    </tt><tt>PBS_NODEFILE:&nbsp; rainman</tt><tt><br>
    </tt><tt>#############################</tt><tt><br>
    </tt><tt>#############################</tt><tt><br>
    </tt><tt>Debut filter-below-abund: Wed Mar 13 14:59:21 CET 2013</tt><tt><br>
    </tt><tt>starting threads</tt><tt><br>
    </tt><tt>starting writer</tt><tt><br>
    </tt><tt>loading...</tt><tt><br>
    </tt><tt>... filtering 0</tt><tt><br>
    </tt><tt>/var/lib/torque/mom_priv/jobs/2695.rainman.SC: line 49:
      54757 Floating point exception(core dumped)
      ./khmer-BETA/sandbox/fi</tt><tt><br>
    </tt><tt>lter-below-abund.py /scratch/ag/khmer/174r1_table.kh
      /mnt/var/home/ag/174r1_prinseq_good_bFr8.fasta.keep</tt><tt><br>
    </tt><tt><br>
    </tt><tt>real&nbsp;&nbsp;&nbsp; 3m54.873s</tt><tt><br>
    </tt><tt>user&nbsp;&nbsp;&nbsp; 0m0.085s</tt><tt><br>
    </tt><tt>sys&nbsp;&nbsp;&nbsp;&nbsp; 2m2.180s</tt><tt><br>
    </tt><tt>Date fin: Wed Mar 13 15:03:15 CET 2013</tt><tt><br>
    </tt><tt>Job finished</tt><br>
    <br>
    Thanks again for your help :)<br>
    <br>
    Alexis<br>
    <br>
    <blockquote
cite="mid:CAGhFaV1RKgHuyLih2DY2Rw--j-RjZeh3WW0UxwDqcTUj7Eoy7w@mail.gmail.com"
      type="cite">
      <div dir="ltr">
        <div style="">
        </div>
        <div style=""><br>
        </div>
        <div style="">Thanks,</div>
        <div style="">&nbsp; Eric</div>
        <div style=""><span style="font-family:monospace;font-size:10px"><br>
          </span></div>
      </div>
      <div class="gmail_extra"><br>
        <br>
        <div class="gmail_quote">On Wed, Mar 13, 2013 at 5:38 AM, Alexis
          Groppi <span dir="ltr">&lt;<a moz-do-not-send="true"
              href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span>
          wrote:<br>
          <blockquote class="gmail_quote" style="margin:0 0 0
            .8ex;border-left:1px #ccc solid;padding-left:1ex">
            <div text="#000000" bgcolor="#FFFFFF"> Hi Eric,<br>
              <br>
              Thanks for your answer.<br>
              But unfortunately, after many attempts I'm getting this
              error :<tt><br>
                <br>
              </tt><tt>starting threads</tt><tt><br>
              </tt><tt>starting writer</tt><tt><br>
              </tt><tt>loading...</tt><tt><br>
              </tt><tt>... filtering 0</tt><tt><br>
              </tt><tt>/var/lib/torque/mom_priv/jobs/<a
                  moz-do-not-send="true" href="http://2693.rainman.SC"
                  target="_blank">2693.rainman.SC</a>: line 46: 63657
                Floating point exception(core dumped)
                ./khmer-BETA/sandbox/filter-below-abund.py
                /scratch/ag/khmer/<a moz-do-not-send="true"
                  href="http://174r1_table.kh" target="_blank">174r1_table.kh</a>
                /mnt/var/home/ag/174r1_prinseq_good_bFr8.fasta.keep</tt><tt><br>
              </tt><tt><br>
              </tt><tt>real&nbsp;&nbsp;&nbsp; 3m30.163s</tt><tt><br>
              </tt><tt>user&nbsp;&nbsp;&nbsp; 0m0.088s</tt><br>
              <br>
              Your opinion ?<br>
              <br>
              Thanks<br>
              <br>
              Alexis<br>
              <br>
              <br>
              <div>Le 13/03/2013 00:55, Eric McDonald a &eacute;crit&nbsp;:<br>
              </div>
              <div>
                <div class="h5">
                  <blockquote type="cite">
                    <div dir="ltr">Hi Alexis,
                      <div><br>
                      </div>
                      <div>One way to get the 'bleeding-edge' branch is
                        to clone it into a fresh directory; for example:</div>
                      <div>&nbsp; &nbsp;git clone <a moz-do-not-send="true"
                          href="http://github.com/ged-lab/khmer.git"
                          target="_blank">http://github.com/ged-lab/khmer.git</a>
                        -b bleeding-edge khmer-BETA</div>
                      <div><br>
                      </div>
                      <div>Assuming you already have a clone of the
                        'ged-lab/khmer' repo, then you should also be
                        able to do:</div>
                      <div>&nbsp; git fetch origin</div>
                      <div>&nbsp; git checkout bleeding-edge</div>
                      <div>Depending on how old your Git client is and
                        what its defaults are, you may have to do the
                        following instead:</div>
                      <div>&nbsp; git checkout --track -b bleeding-edge
                        origin/bleeding-edge</div>
                      <div><br>
                      </div>
                      <div>Hope this helps,</div>
                      <div>&nbsp; Eric</div>
                    </div>
                    <div class="gmail_extra"><br>
                      <br>
                      <div class="gmail_quote">On Tue, Mar 12, 2013 at
                        11:32 AM, Alexis Groppi <span dir="ltr">&lt;<a
                            moz-do-not-send="true"
                            href="mailto:alexis.groppi@u-bordeaux2.fr"
                            target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span>
                        wrote:<br>
                        <blockquote class="gmail_quote" style="margin:0
                          0 0 .8ex;border-left:1px #ccc
                          solid;padding-left:1ex">
                          <div text="#000000" bgcolor="#FFFFFF"> <br>
                            <div>Le 12/03/2013 16:16, C. Titus Brown a
                              &eacute;crit&nbsp;:<br>
                            </div>
                            <div>
                              <blockquote type="cite">
                                <pre>On Tue, Mar 12, 2013 at 04:15:05PM +0100, Alexis Groppi wrote:
</pre>
                                <blockquote type="cite">
                                  <pre>Hi Titus,

Thanks for your answer
Actually it's my second attempt with filter-below-abund.
The first time, I thought the problem was coming from the location of my  
<a moz-do-not-send="true" href="http://table.kh" target="_blank">table.kh</a> file : in a storage element with poor level performance of I/O
I killed the job after 24h, moved the file in a best place and re run it
But with the same result : no completion after 24h

Any Idea ?

Thanks

Cheers From Bordeaux :)

Alexis

PS : The command line was the following :

./filter-below-abund.py <a moz-do-not-send="true" href="http://174r1_table.kh" target="_blank">174r1_table.kh</a> 174r1_prinseq_good_bFr8.fasta.keep

Is this correct ?
</pre>
                                </blockquote>
                                <pre>Yes, looks right... Can you try with the bleeding-edge branch, which now
incorporates a potential fix for this issue?</pre>
                              </blockquote>
                            </div>
                            From here : <a moz-do-not-send="true"
                              href="https://github.com/ged-lab/khmer/tree/bleeding-edge"
                              target="_blank">https://github.com/ged-lab/khmer/tree/bleeding-edge</a>
                            ?<br>
                            or <br>
                            here : <a moz-do-not-send="true"
                              href="https://github.com/ctb/khmer/tree/bleeding-edge"
                              target="_blank">https://github.com/ctb/khmer/tree/bleeding-edge</a>
                            ?<br>
                            <br>
                            Do I have to make a fresh install ? and How&nbsp;
                            ?<br>
                            Or just replace all the files and folders ?<br>
                            <br>
                            Thanks :)<br>
                            <br>
                            Alexis
                            <div>
                              <div><br>
                                <br>
                                <blockquote type="cite">
                                  <pre>thanks,
--titus

</pre>
                                  <blockquote type="cite">
                                    <pre>Le 12/03/2013 14:41, C. Titus Brown a ?crit :
</pre>
                                    <blockquote type="cite">
                                      <pre>On Tue, Mar 12, 2013 at 10:48:03AM +0100, Alexis Groppi wrote:
</pre>
                                      <blockquote type="cite">
                                        <pre>Metagenome assembly :
My data :
- original (quality filtered) data : 4463243 reads (75 nt) (Illumina)
1/ Single pass digital normalization with normalize-by-median (C=20)
==&gt; file .keep of 2560557 reads
2/ generated a hash table by load-into-counting on the .keep file
==&gt; file .kh of ~16Go (huge file ?!)
3/ filter-below-abund with C=100 from the two previous file (<a moz-do-not-send="true" href="http://table.kh" target="_blank">table.kh</a>
and reads.keep)
Still running after 24 hours  :(

Any advice to speed up this step ? ... and the others (partitionning ...) ?

I can have an access to a HPC : ~3000 cores.
</pre>
                                      </blockquote>
                                      <pre>Hi Alexis,

filter-below-abund and filter-abund have occasional bugs that prevent them
from completing.  I would kill and restart.  For that few reads it should
take no more than a few hours to do everything.

Most of what khmer does cannot easily be distributed across multiple chassis,
note.

best,
--titus
</pre>
                                    </blockquote>
                                    <pre>-- 
</pre>
                                  </blockquote>
                                </blockquote>
                                <br>
                              </div>
                            </div>
                            <span><font color="#888888">
                                <div>-- <br>
                                  <img
                                    src="cid:part12.04040604.04000609@u-bordeaux2.fr"
                                    border="0"></div>
                              </font></span></div>
                          <br>
_______________________________________________<br>
                          khmer mailing list<br>
                          <a moz-do-not-send="true"
                            href="mailto:khmer@lists.idyll.org"
                            target="_blank">khmer@lists.idyll.org</a><br>
                          <a moz-do-not-send="true"
                            href="http://lists.idyll.org/listinfo/khmer"
                            target="_blank">http://lists.idyll.org/listinfo/khmer</a><br>
                          <br>
                        </blockquote>
                      </div>
                      <br>
                      <br clear="all">
                      <div><br>
                      </div>
                      -- <br>
                      <div dir="ltr">
                        <div>Eric McDonald</div>
                        <div>HPC/Cloud Software Engineer</div>
                        <div>&nbsp; for the Institute for Cyber-Enabled
                          Research (iCER)</div>
                        <div>&nbsp; and the Laboratory for Genomics,
                          Evolution, and Development (GED)</div>
                        <div>Michigan State University</div>
                        <div>P: <a moz-do-not-send="true"
                            href="tel:517-355-8733" value="+15173558733"
                            target="_blank">517-355-8733</a></div>
                      </div>
                    </div>
                  </blockquote>
                  <br>
                </div>
              </div>
              <span class="HOEnZb"><font color="#888888">
                  <div>-- <br>
                    <img
                      src="cid:part16.01060504.07000405@u-bordeaux2.fr"
                      border="0"></div>
                </font></span></div>
          </blockquote>
        </div>
        <br>
        <br clear="all">
        <div><br>
        </div>
        -- <br>
        <div dir="ltr">
          <div>Eric McDonald</div>
          <div>HPC/Cloud Software Engineer</div>
          <div>&nbsp; for the Institute for Cyber-Enabled Research (iCER)</div>
          <div>&nbsp; and the Laboratory for Genomics, Evolution, and
            Development (GED)</div>
          <div>Michigan State University</div>
          <div>P: 517-355-8733</div>
        </div>
      </div>
    </blockquote>
    <br>
    <div class="moz-signature">-- <br>
      <img src="cid:part17.09020606.02060503@u-bordeaux2.fr" border="0"></div>
  </body>
</html>