<div dir="ltr">Alexis,<div><br></div><div>Sorry, I didn&#39;t mean to imply anything bad about David. As someone who has previously worked as a HPC systems administrator, I know that I have felt annoyed when users fill the wrong file system. So, if he was annoyed, then I understand.</div>
<div><br></div><div>I just realized that we didn&#39;t check something very basic.... What is your PYTHONPATH environment variable set to? What is the result, if you add:</div><div>  echo $PYTHONPATH</div><div>before the commands in the script?</div>
<div><br></div><div>Also, did you install &#39;khmer&#39; into your virtualenv (i.e., did you do &quot;python setup.py install&quot; at some point after you had done &quot;. /mnt/var/home/ag/env/bin/activate&quot;)? If so, then we have likely been using the wrong &#39;khmer&#39; modules this whole time... To verify this, what does the following command tell you:</div>
<div>  python -c &quot;import khmer.thread_utils as tu; print tu.__file__&quot;</div><div>We want to use the Python modules under your &#39;khmer-BETA/python&#39; directory and not the ones under your &#39;/mnt/var/home/ag/env/lib/python2.7/site-packages&#39; directory. I should have asked you to check this much earlier in the debugging process, especially since I was helping someone else with a similar issue.</div>
<div>  </div><div class="gmail_extra">Thank you,</div><div class="gmail_extra">  Eric<br><br><div class="gmail_quote">On Thu, Mar 14, 2013 at 7:28 AM, Alexis Groppi <span dir="ltr">&lt;<a href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
  
    
  
  <div text="#000000" bgcolor="#FFFFFF">
    Hi Eric,<br>
    <br>
    <br>
    <div>Le 14/03/2013 11:47, Eric McDonald a
      écrit :<br>
    </div>
    <blockquote type="cite">
      <div dir="ltr">Hi Alexis,
        <div><br>
        </div><div class="im">
        <div>The &#39;coredump&#39; file comes from a standard Unix
          feature - it is simply the image of the Python process as it
          was in memory at the time of crash. This is nothing that any
          &#39;khmer&#39; script produces explicitly. This is made by your
          operating system.</div>
        <div><br>
        </div>
        <div>You should be able to disable the core dumps if
          your systems engineer is getting upset about the space they
          are using. Please add the following before other commands in
          your job script:</div>
        <div>  ulimit -c 0</div>
      </div></div>
    </blockquote>
    Done, but for some mysterious reason, its not taken in account...<br>
    But David is a cool guy ;)<div class="im"><br>
    <blockquote type="cite">
      <div dir="ltr">
        <div><br>
        </div>
        <div>(Note: we could actually use the core dumps for
          debugging. I refrained from suggesting this to you yesterday,
          since describing the process can be somewhat complicated.)</div>
        <div><br>
        </div>
        <div>Anyway, thanks for rerunning with the diagnostics
          I suggested. The exit code is 136, which is what you get if a
          process experiences a floating-point exception. If the
          exception had occurred within &#39;filter-below-abund.py&#39; proper,
          the exit code would&#39;ve been 1 rather than 136. This is what I
          wanted to double-check.</div>
        <div><br>
        </div>
        <div>I suspect that something bad is happening within
          the Python interpreter. This may be due to some subtle bug
          involving Python&#39;s global interpreter lock and &#39;khmer&#39; not
          doing something proper with regards to that. I will attempt to
          analyze the problem in more detail later today.</div>
        <div><br>
        </div>
        <div>I know you must be getting tired of working on
          this problem</div>
      </div>
    </blockquote>
    <br></div>
    Not at all. Thanks for your work !<div class="im"><br>
    <br>
    <blockquote type="cite">
      <div dir="ltr">
        <div>, but if you want to try one more thing (for now),
          then it would be appreciated. Could you edit your copy of
          &#39;filter-below-abund.py&#39; and change:</div>
        <div>  WORKER_THREADS = 8</div>
        <div>to:</div>
        <div>  WORKER_THREADS = 1</div>
        <div>and see if that helps?</div>
      </div>
    </blockquote></div>
    Done also... but unfortunately same result :(<br>
    <br>
    <br>
    Alexis<div><div class="h5"><br>
    <br>
    <br>
    <blockquote type="cite">
      <div dir="ltr">
        <div><br>
        </div>
        <div>Thanks,</div>
        <div>  Eric</div>
        <div><br>
        </div>
        <div>P.S. If you want to reduce memory usage, you can
          decrease the Bloom filter size by adjusting the &quot;-x&quot; parameter
          that you use in the scripts to create your .kh files. Making
          this number smaller will reduce memory usage but will also
          increase the false positive rate, so be careful about tuning
          this too much.</div>
        <div><br>
        </div>
        <div class="gmail_extra"><br>
          <br>
          <div class="gmail_quote">On Thu, Mar 14, 2013 at 5:42 AM,
            Alexis Groppi <span dir="ltr">&lt;<a href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span>
            wrote:<br>
            <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
              <div text="#000000" bgcolor="#FFFFFF"> Hi Eric,<br>
                <br>
                I&#39;ve tried all the suggestions you made<br>
                But same result (see attached e/o file)<br>
                <br>
                But with the help of David (the system engineer of the
                lab) I think we have found the bug : <br>
                 ==&gt; filter-below-abund.py fills a directory ( <i><span>/</span>var/spool/abrt/ccpp-<a href="tel:2013-03-14-10" value="+12013031410" target="_blank">2013-03-14-10</a>\:24\:13-26642.new<span>/</span></i>
                coredump/) until it reaches all the available space. 
                (see below)<br>
                ==&gt; Then it crashes<br>
                <br>
                Is there a way to modify this ?<br>
                <br>
                Thanks again<br>
                <br>
                Alexis<br>
                **************************************************<br>
                <tt>[root@rainman ~]# ll -h </tt><tt><i><span>/</span>var/spool/abrt/ccpp-<a href="tel:2013-03-14-10" value="+12013031410" target="_blank">2013-03-14-10</a>\:24\:13-26642.new<span>/</span></i></tt><tt>
                </tt><tt><br>
                </tt><tt>total 12G </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users    4 14 mars  10:24
                  analyzer </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users    6 14 mars  10:24
                  architecture </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users  150 14 mars  10:24
                  cmdline </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users  12G 14 mars  10:24
                  coredump </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users 1,5K 14 mars  10:24
                  environ </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   31 14 mars  10:24
                  executable </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   27 14 mars  10:24
                  hostname </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   26 14 mars  10:24
                  kernel </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users  13K 14 mars  10:24
                  maps </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   26 14 mars  10:24
                  os_release </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   71 14 mars  10:24
                  reason </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   10 14 mars  10:24
                  time </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users    3 14 mars  10:24 uid
                </tt><tt><br>
                </tt><tt> </tt><tt><br>
                </tt><tt> </tt><tt><br>
                </tt><tt>[root@rainman ~]# ll -h </tt><tt><i><span>/</span>var/spool/abrt/ccpp-<a href="tel:2013-03-14-10" value="+12013031410" target="_blank">2013-03-14-10</a>\:24\:13-26642.new<span>/</span></i></tt><tt>
                </tt><tt><br>
                </tt><tt>total 18G </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users    4 14 mars  10:24
                  analyzer </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users    6 14 mars  10:24
                  architecture </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users  150 14 mars  10:24
                  cmdline </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users  18G 14 mars  10:25
                  coredump </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users 1,5K 14 mars  10:24
                  environ </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   31 14 mars  10:24
                  executable </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   27 14 mars  10:24
                  hostname </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   26 14 mars  10:24
                  kernel </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users  13K 14 mars  10:24
                  maps </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   26 14 mars  10:24
                  os_release </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   71 14 mars  10:24
                  reason </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users   10 14 mars  10:24
                  time </tt><tt><br>
                </tt><tt>-rw-r----- 1 abrt users    3 14 mars  10:24 uid
                </tt><tt><br>
                </tt><br>
                <br>
                <div>Le 13/03/2013 22:58, Eric McDonald a écrit :<br>
                </div>
                <div>
                  <div>
                    <blockquote type="cite">
                      <div dir="ltr">Forwarding my earlier reply to the
                        list, since I didn&#39;t reply-to-all earlier.
                        <div><br>
                        </div>
                        <div>Also, Alexis, you may wish to change the
                          following in your job script:</div>
                        <div>  #PBS -l nodes=1:ppn=1</div>
                        <div>to</div>
                        <div>  #PBS -l nodes=1:ppn=8</div>
                        <div>assuming that you have 8-core nodes
                          available. &#39;filter-below-abund.py&#39; uses 8
                          threads by default; if a &#39;khmer&#39; job runs on
                          the same node as another job, it may try using
                          more CPU cores than it was allocated and that
                          could create problems with your systems
                          administrators. And, if a job&#39;s threads are
                          restricted to the requested number of cores,
                          then you will also not be getting optimal
                          performance by using more threads (8) than
                          available cores (1).<br>
                          <br>
                          <div class="gmail_quote">---------- Forwarded
                            message ----------<br>
                            From: <b class="gmail_sendername">Eric
                              McDonald</b> <span dir="ltr">&lt;<a href="mailto:emcd.msu@gmail.com" target="_blank">emcd.msu@gmail.com</a>&gt;</span><br>
                            Date: Wed, Mar 13, 2013 at 3:12 PM<br>
                            Subject: Re: [khmer] How to speed up the
                            filter-below-abund script ?<br>
                            To: <a href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a><br>
                            <br>
                            <br>
                            <div dir="ltr">Alexis,
                              <div><br>
                              </div>
                              <div>I just realized that the
                                floating-point exception is from inside
                                the Python interpreter itself. If the
                                floating-point exception had appeared
                                from within the &#39;filter-below-abund.py&#39;
                                script, then we shoul have seen a
                                traceback from the exception, ending
                                with:</div>
                              <div>  ZeroDivisionError: float division
                                by zero</div>
                              <div>Instead, we are seeing:</div>
                              <div>
                                <div>  <span style="font-family:monospace">line
                                    49: 54757 Floating point
                                    exception(core dumped)</span></div>
                              </div>
                              <div> from your job shell. (I should&#39;ve
                                noticed that earlier.)</div>
                              <div><br>
                                <div class="gmail_extra">Would you
                                  please add the following lines to your
                                  job script somewhere before you invoke
                                  &#39;filter-below-abund.py&#39;:</div>
                                <div class="gmail_extra">  python
                                  --version</div>
                                <div class="gmail_extra">   which python</div>
                                <div class="gmail_extra"><br>
                                </div>
                                <div class="gmail_extra">And would you
                                  please add the following line
                                  _immediately after_ you invoke
                                  &#39;filter-below-abund.py&#39;:</div>
                                <div class="gmail_extra">  echo &quot;Exit
                                  Code: $?&quot;</div>
                                <div class="gmail_extra"><br>
                                </div>
                                <div class="gmail_extra">Also, would you
                                  remove the &#39;time&#39; command from in
                                  front of your invocation of
                                  &#39;filter-below-abund.py&#39;?</div>
                                <div class="gmail_extra"><br>
                                </div>
                                <div class="gmail_extra"> And, one more
                                  action before trying again... please
                                  run:</div>
                                <div class="gmail_extra">  git pull</div>
                                <div class="gmail_extra">in your
                                  &#39;khmer-BETA&#39; directory. (I added
                                  another possible fix to the
                                  &#39;bleeding-edge&#39; branch. This command
                                  will pull that fix into your clone.)</div>
                                <div class="gmail_extra"><br>
                                </div>
                                <div class="gmail_extra">Thank you,</div>
                                <div class="gmail_extra">  Eric
                                  <div>
                                    <div><br>
                                      <br>
                                      <div class="gmail_quote">On Wed,
                                        Mar 13, 2013 at 10:13 AM, Alexis
                                        Groppi <span dir="ltr">&lt;<a href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span>
                                        wrote:<br>
                                        <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
                                          <div text="#000000" bgcolor="#FFFFFF"> Hi,<br>
                                            <br>
                                            <div>Le 13/03/2013 14:12,
                                              Eric McDonald a écrit :<br>
                                            </div>
                                            <blockquote type="cite">
                                              <div dir="ltr">Hi Alexis,
                                                <div><br>
                                                </div>
                                                <div>
                                                  <div>First, let me say
                                                    thank you for being
                                                    patient and working
                                                    with us in spite of
                                                    all the problems you
                                                    are encountering.</div>
                                                </div>
                                              </div>
                                            </blockquote>
                                            <br>
                                            That&#39;s bioinformatician life
                                            ;)
                                            <div><br>
                                              <br>
                                              <blockquote type="cite">
                                                <div dir="ltr">
                                                  <div><br>
                                                  </div>
                                                  <div>With regards to
                                                    the floating point
                                                    exception, I see
                                                    several
                                                    opportunities for a
                                                    division-by-zero
                                                    condition in the
                                                    threading utilities
                                                    used by the script.
                                                    These opportunities
                                                    exist if an input
                                                    file is empty. (The
                                                    problem may be
                                                    coming from another
                                                    place, but this
                                                    would be my first
                                                    guess.) What does
                                                    the following
                                                    command say:</div>
                                                  <div><br>
                                                  </div>
                                                  <div>  ls -lh <span style="font-family:monospace;font-size:10px">/scratch/ag/khmer/</span><a href="http://174r1_table.kh/" style="font-family:monospace;font-size:10px" target="_blank">174r1_table.kh</a> <span style="font-family:monospace;font-size:10px">/mnt/var/home/ag/174r1_</span><span style="font-family:monospace;font-size:10px">prinseq_good_bFr8.fasta.keep</span></div>

                                                </div>
                                              </blockquote>
                                              <br>
                                            </div>
                                             The result : (the files are
                                            not empty)<br>
                                            <tt>-rw-r--r-- 1 ag users
                                              299M 12 mars  20:54
                                              /mnt/var/home/ag/174r1_prinseq_good_bFr8.fasta.keep</tt><tt><br>
                                            </tt><tt>-rw-r--r-- 1 ag
                                              users 141G 12 mars  21:05
                                              /scratch/ag/khmer/<a href="http://174r1_table.kh" target="_blank">174r1_table.kh</a></tt>
                                            <div><br>
                                              <br>
                                              <blockquote type="cite">
                                                <div dir="ltr"><br>
                                                  <div>Also, since you
                                                    appear to be using
                                                    TORQUE as your
                                                    resource
                                                    manager/batch
                                                    system, could you
                                                    please attach the
                                                    complete output and
                                                    error files for the
                                                    job? (These files
                                                    should be of the
                                                    form
                                                    &lt;job_name&gt;.o2693
                                                    and
                                                    &lt;job_name&gt;.e2693,
                                                    where
                                                    &lt;job_name&gt; is
                                                    the name of your
                                                    job. There may only
                                                    be one or the other
                                                    of these files,
                                                    depending on site
                                                    defaults and whether
                                                    you specified &quot;-j
                                                    oe&quot; or &quot;-j eo&quot; in
                                                    your job
                                                    submission.)<br>
                                                  </div>
                                                </div>
                                              </blockquote>
                                              <br>
                                            </div>
                                            I re run the job since I
                                            have deleted previous (2693)
                                            err/out files.<br>
                                            Here is the new file (merged
                                            with the option -j oe in the
                                            bash script) :<br>
                                            <br>
                                            <tt>#############################</tt><tt><br>
                                            </tt><tt>User: ag</tt><tt><br>
                                            </tt><tt>Date: Wed Mar 13
                                              14:59:21 CET 2013</tt><tt><br>
                                            </tt><tt>Host: <a href="http://rainman.cbib.u-bordeaux2.fr" target="_blank">rainman.cbib.u-bordeaux2.fr</a></tt><tt><br>
                                            </tt><tt>Directory:
                                              /mnt/var/home/ag</tt><tt><br>
                                            </tt><tt>PBS_JOBID:
                                              2695.rainman</tt><tt><br>
                                            </tt><tt>PBS_O_WORKDIR:
                                              /mnt/var/home/ag</tt><tt><br>
                                            </tt><tt>PBS_NODEFILE: 
                                              rainman</tt><tt><br>
                                            </tt><tt>#############################</tt><tt><br>
                                            </tt><tt>#############################</tt><tt><br>
                                            </tt><tt>Debut
                                              filter-below-abund: Wed
                                              Mar 13 14:59:21 CET 2013</tt>
                                            <div><tt><br>
                                              </tt><tt>starting threads</tt><tt><br>
                                              </tt><tt>starting writer</tt><tt><br>
                                              </tt><tt>loading...</tt><tt><br>
                                              </tt><tt>... filtering 0</tt><tt><br>
                                              </tt></div>
                                            <tt>/var/lib/torque/mom_priv/jobs/<a href="http://2695.rainman.SC" target="_blank">2695.rainman.SC</a>:
                                              line 49: 54757 Floating
                                              point exception(core
                                              dumped)
                                              ./khmer-BETA/sandbox/fi</tt><tt><br>
                                            </tt><tt>lter-below-abund.py
                                              /scratch/ag/khmer/<a href="http://174r1_table.kh" target="_blank">174r1_table.kh</a>
/mnt/var/home/ag/174r1_prinseq_good_bFr8.fasta.keep</tt><tt><br>
                                            </tt><tt><br>
                                            </tt><tt>real    3m54.873s</tt><tt><br>
                                            </tt><tt>user    0m0.085s</tt><tt><br>
                                            </tt><tt>sys     2m2.180s</tt><tt><br>
                                            </tt><tt>Date fin: Wed Mar
                                              13 15:03:15 CET 2013</tt><tt><br>
                                            </tt><tt>Job finished</tt><br>
                                            <br>
                                            Thanks again for your help
                                            :)<br>
                                            <br>
                                            Alexis
                                            <div>
                                              <div><br>
                                                <br>
                                                <blockquote type="cite">
                                                  <div dir="ltr">
                                                    <div> </div>
                                                    <div><br>
                                                    </div>
                                                    <div>Thanks,</div>
                                                    <div>  Eric</div>
                                                    <div><span style="font-family:monospace;font-size:10px"><br>
                                                      </span></div>
                                                  </div>
                                                  <div class="gmail_extra"><br>
                                                    <br>
                                                    <div class="gmail_quote">On
                                                      Wed, Mar 13, 2013
                                                      at 5:38 AM, Alexis
                                                      Groppi <span dir="ltr">&lt;<a href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span>
                                                      wrote:<br>
                                                      <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
                                                        <div text="#000000" bgcolor="#FFFFFF"> Hi Eric,<br>
                                                          <br>
                                                          Thanks for
                                                          your answer.<br>
                                                          But
                                                          unfortunately,
                                                          after many
                                                          attempts I&#39;m
                                                          getting this
                                                          error :<tt><br>
                                                          <br>
                                                          </tt><tt>starting
                                                          threads</tt><tt><br>
                                                          </tt><tt>starting
                                                          writer</tt><tt><br>
                                                          </tt><tt>loading...</tt><tt><br>
                                                          </tt><tt>...
                                                          filtering 0</tt><tt><br>
                                                          </tt><tt>/var/lib/torque/mom_priv/jobs/<a href="http://2693.rainman.SC" target="_blank">2693.rainman.SC</a>:
                                                          line 46: 63657
                                                          Floating point
                                                          exception(core
                                                          dumped)
                                                          ./khmer-BETA/sandbox/filter-below-abund.py
/scratch/ag/khmer/<a href="http://174r1_table.kh" target="_blank">174r1_table.kh</a>
/mnt/var/home/ag/174r1_prinseq_good_bFr8.fasta.keep</tt><tt><br>
                                                          </tt><tt><br>
                                                          </tt><tt>real   
                                                          3m30.163s</tt><tt><br>
                                                          </tt><tt>user   
                                                          0m0.088s</tt><br>
                                                          <br>
                                                          Your opinion ?<br>
                                                          <br>
                                                          Thanks<br>
                                                          <br>
                                                          Alexis<br>
                                                          <br>
                                                          <br>
                                                          <div>Le
                                                          13/03/2013
                                                          00:55, Eric
                                                          McDonald a
                                                          écrit :<br>
                                                          </div>
                                                          <div>
                                                          <div>
                                                          <blockquote type="cite">
                                                          <div dir="ltr">Hi

                                                          Alexis,
                                                          <div><br>
                                                          </div>
                                                          <div>One way
                                                          to get the
                                                          &#39;bleeding-edge&#39;
                                                          branch is to
                                                          clone it into
                                                          a fresh
                                                          directory; for
                                                          example:</div>
                                                          <div>   git
                                                          clone <a href="http://github.com/ged-lab/khmer.git" target="_blank">http://github.com/ged-lab/khmer.git</a>
                                                          -b
                                                          bleeding-edge
                                                          khmer-BETA</div>
                                                          <div><br>
                                                          </div>
                                                          <div>Assuming
                                                          you already
                                                          have a clone
                                                          of the
                                                          &#39;ged-lab/khmer&#39;
                                                          repo, then you
                                                          should also be
                                                          able to do:</div>
                                                          <div>  git
                                                          fetch origin</div>
                                                          <div>  git
                                                          checkout
                                                          bleeding-edge</div>
                                                          <div>Depending
                                                          on how old
                                                          your Git
                                                          client is and
                                                          what its
                                                          defaults are,
                                                          you may have
                                                          to do the
                                                          following
                                                          instead:</div>
                                                          <div>  git
                                                          checkout
                                                          --track -b
                                                          bleeding-edge
origin/bleeding-edge</div>
                                                          <div><br>
                                                          </div>
                                                          <div>Hope this
                                                          helps,</div>
                                                          <div>  Eric</div>
                                                          </div>
                                                          <div class="gmail_extra"><br>
                                                          <br>
                                                          <div class="gmail_quote">On

                                                          Tue, Mar 12,
                                                          2013 at 11:32
                                                          AM, Alexis
                                                          Groppi <span dir="ltr">&lt;<a href="mailto:alexis.groppi@u-bordeaux2.fr" target="_blank">alexis.groppi@u-bordeaux2.fr</a>&gt;</span>
                                                          wrote:<br>
                                                          <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">

                                                          <div text="#000000" bgcolor="#FFFFFF"> <br>
                                                          <div>Le
                                                          12/03/2013
                                                          16:16, C.
                                                          Titus Brown a
                                                          écrit :<br>
                                                          </div>
                                                          <div>
                                                          <blockquote type="cite">
                                                          <pre>On Tue, Mar 12, 2013 at 04:15:05PM +0100, Alexis Groppi wrote:
</pre>
                                                          <blockquote type="cite">
                                                          <pre>Hi Titus,

Thanks for your answer
Actually it&#39;s my second attempt with filter-below-abund.
The first time, I thought the problem was coming from the location of my  
<a href="http://table.kh" target="_blank">table.kh</a> file : in a storage element with poor level performance of I/O
I killed the job after 24h, moved the file in a best place and re run it
But with the same result : no completion after 24h

Any Idea ?

Thanks

Cheers From Bordeaux :)

Alexis

PS : The command line was the following :

./filter-below-abund.py <a href="http://174r1_table.kh" target="_blank">174r1_table.kh</a> 174r1_prinseq_good_bFr8.fasta.keep

Is this correct ?
</pre>
                                                          </blockquote>
                                                          <pre>Yes, looks right... Can you try with the bleeding-edge branch, which now
incorporates a potential fix for this issue?</pre>
                                                          </blockquote>
                                                          </div>
                                                          From here : <a href="https://github.com/ged-lab/khmer/tree/bleeding-edge" target="_blank">https://github.com/ged-lab/khmer/tree/bleeding-edge</a>
                                                          ?<br>
                                                          or <br>
                                                          here : <a href="https://github.com/ctb/khmer/tree/bleeding-edge" target="_blank">https://github.com/ctb/khmer/tree/bleeding-edge</a>
                                                          ?<br>
                                                          <br>
                                                          Do I have to
                                                          make a fresh
                                                          install ? and
                                                          How  ?<br>
                                                          Or just
                                                          replace all
                                                          the files and
                                                          folders ?<br>
                                                          <br>
                                                          Thanks :)<br>
                                                          <br>
                                                          Alexis
                                                          <div>
                                                          <div><br>
                                                          <br>
                                                          <blockquote type="cite">
                                                          <pre>thanks,
--titus

</pre>
                                                          <blockquote type="cite">
                                                          <pre>Le 12/03/2013 14:41, C. Titus Brown a ?crit :
</pre>
                                                          <blockquote type="cite">
                                                          <pre>On Tue, Mar 12, 2013 at 10:48:03AM +0100, Alexis Groppi wrote:
</pre>
                                                          <blockquote type="cite">
                                                          <pre>Metagenome assembly :
My data :
- original (quality filtered) data : 4463243 reads (75 nt) (Illumina)
1/ Single pass digital normalization with normalize-by-median (C=20)
==&gt; file .keep of 2560557 reads
2/ generated a hash table by load-into-counting on the .keep file
==&gt; file .kh of ~16Go (huge file ?!)
3/ filter-below-abund with C=100 from the two previous file (<a href="http://table.kh" target="_blank">table.kh</a>
and reads.keep)
Still running after 24 hours  :(

Any advice to speed up this step ? ... and the others (partitionning ...) ?

I can have an access to a HPC : ~3000 cores.
</pre>
                                                          </blockquote>
                                                          <pre>Hi Alexis,

filter-below-abund and filter-abund have occasional bugs that prevent them
from completing.  I would kill and restart.  For that few reads it should
take no more than a few hours to do everything.

Most of what khmer does cannot easily be distributed across multiple chassis,
note.

best,
--titus
</pre>
                                                          </blockquote>
                                                          <pre>-- 
</pre>
                                                          </blockquote>
                                                          </blockquote>
                                                          <br>
                                                          </div>
                                                          </div>
                                                          <span><font color="#888888">
                                                          <div>-- <br>
                                                          <img src="cid:part23.00000406.06040504@u-bordeaux2.fr" border="0"></div>
                                                          </font></span></div>
                                                          <br>
_______________________________________________<br>
                                                          khmer mailing
                                                          list<br>
                                                          <a href="mailto:khmer@lists.idyll.org" target="_blank">khmer@lists.idyll.org</a><br>
                                                          <a href="http://lists.idyll.org/listinfo/khmer" target="_blank">http://lists.idyll.org/listinfo/khmer</a><br>
                                                          <br>
                                                          </blockquote>
                                                          </div>
                                                          <br>
                                                          <br clear="all">
                                                          <div><br>
                                                          </div>
                                                          -- <br>
                                                          <div dir="ltr">
                                                          <div>Eric
                                                          McDonald</div>
                                                          <div>HPC/Cloud
                                                          Software
                                                          Engineer</div>
                                                          <div>  for the
                                                          Institute for
                                                          Cyber-Enabled
                                                          Research
                                                          (iCER)</div>
                                                          <div>  and the
                                                          Laboratory for
                                                          Genomics,
                                                          Evolution, and
                                                          Development
                                                          (GED)</div>
                                                          <div>Michigan
                                                          State
                                                          University</div>
                                                          <div>P: <a href="tel:517-355-8733" value="+15173558733" target="_blank">517-355-8733</a></div>
                                                          </div>
                                                          </div>
                                                          </blockquote>
                                                          <br>
                                                          </div>
                                                          </div>
                                                          <span><font color="#888888">
                                                          <div>-- <br>
                                                          <img src="cid:part27.09090006.02020605@u-bordeaux2.fr" border="0"></div>
                                                          </font></span></div>
                                                      </blockquote>
                                                    </div>
                                                    <br>
                                                    <br clear="all">
                                                    <div><br>
                                                    </div>
                                                    -- <br>
                                                    <div dir="ltr">
                                                      <div>Eric McDonald</div>
                                                      <div>HPC/Cloud
                                                        Software
                                                        Engineer</div>
                                                      <div>  for the
                                                        Institute for
                                                        Cyber-Enabled
                                                        Research (iCER)</div>
                                                      <div>  and the
                                                        Laboratory for
                                                        Genomics,
                                                        Evolution, and
                                                        Development
                                                        (GED)</div>
                                                      <div>Michigan
                                                        State University</div>
                                                      <div>P: <a href="tel:517-355-8733" value="+15173558733" target="_blank">517-355-8733</a></div>
                                                    </div>
                                                  </div>
                                                </blockquote>
                                                <br>
                                              </div>
                                            </div>
                                            <span><font color="#888888">
                                                <div>-- <br>
                                                  <img src="cid:part29.09010604.03010000@u-bordeaux2.fr" border="0"></div>
                                              </font></span></div>
                                        </blockquote>
                                      </div>
                                      <br>
                                      <br clear="all">
                                      <div><br>
                                      </div>
                                      -- <br>
                                      <div dir="ltr">
                                        <div>Eric McDonald</div>
                                        <div>HPC/Cloud Software Engineer</div>
                                        <div>  for the Institute for
                                          Cyber-Enabled Research (iCER)</div>
                                        <div>  and the Laboratory for
                                          Genomics, Evolution, and
                                          Development (GED)</div>
                                        <div>Michigan State University</div>
                                        <div>P: <a href="tel:517-355-8733" value="+15173558733" target="_blank">517-355-8733</a></div>
                                      </div>
                                    </div>
                                  </div>
                                </div>
                              </div>
                            </div>
                          </div>
                          <br>
                          <br clear="all">
                          <div><br>
                          </div>
                          -- <br>
                          <div dir="ltr">
                            <div>Eric McDonald</div>
                            <div>HPC/Cloud Software Engineer</div>
                            <div>  for the Institute for Cyber-Enabled
                              Research (iCER)</div>
                            <div>  and the Laboratory for Genomics,
                              Evolution, and Development (GED)</div>
                            <div>Michigan State University</div>
                            <div>P: <a href="tel:517-355-8733" value="+15173558733" target="_blank">517-355-8733</a></div>
                          </div>
                        </div>
                      </div>
                      <br>
                      <fieldset></fieldset>
                      <br>
                      <pre>_______________________________________________
khmer mailing list
<a href="mailto:khmer@lists.idyll.org" target="_blank">khmer@lists.idyll.org</a>
<a href="http://lists.idyll.org/listinfo/khmer" target="_blank">http://lists.idyll.org/listinfo/khmer</a>
</pre>
                    </blockquote>
                    <br>
                  </div>
                </div>
                <span><font color="#888888">
                    <div>-- <br>
                      <img src="cid:part34.09010103.00080709@u-bordeaux2.fr" border="0"></div>
                  </font></span></div>
            </blockquote>
          </div>
          <br>
          <br clear="all">
          <div><br>
          </div>
          -- <br>
          <div dir="ltr">
            <div>Eric McDonald</div>
            <div>HPC/Cloud Software Engineer</div>
            <div>  for the Institute for Cyber-Enabled Research (iCER)</div>
            <div>  and the Laboratory for Genomics, Evolution, and
              Development (GED)</div>
            <div>Michigan State University</div>
            <div>P: <a href="tel:517-355-8733" value="+15173558733" target="_blank">517-355-8733</a></div>
          </div>
        </div>
      </div>
    </blockquote>
    <br>
    </div></div><span class=""><font color="#888888"><div>-- <br>
      <img src="cid:part35.04020306.00030803@u-bordeaux2.fr" border="0"></div>
  </font></span></div>

</blockquote></div><br><br clear="all"><div><br></div>-- <br><div dir="ltr"><div>Eric McDonald</div><div>HPC/Cloud Software Engineer</div><div>  for the Institute for Cyber-Enabled Research (iCER)</div><div>  and the Laboratory for Genomics, Evolution, and Development (GED)</div>
<div>Michigan State University</div><div>P: 517-355-8733</div></div>
</div></div>