<html>
  <head>
    <meta content="text/html; charset=ISO-8859-1"
      http-equiv="Content-Type">
  </head>
  <body bgcolor="#FFFFFF" text="#000000">
    Yes the steady ballooning is quite obvious, espescially if I take
    some time staring at top command output etc. Thank you for your time
    I will then hope that someone will look at this stuff here. As a
    note might it be that my graafik.ht is corrupted somehow or
    something? It is even smaller by size than the 50m.ht which I was
    nicely able to partition, as additional information to anybody
    interested the data used was ~36M 250 bp reads. <br>
    Jens-Konrad<br>
    <div class="moz-cite-prefix">On 04/13/2013 05:35 AM, Eric McDonald
      wrote:<br>
    </div>
    <blockquote
cite="mid:CAGhFaV38cmW-mLXOQE5uffopXbvmK-=w8tauntMghLppO=1TXQ@mail.gmail.com"
      type="cite">
      <div dir="ltr">
        <div style="">Jens-Konrad,</div>
        <div><br>
        </div>
        Thanks for providing this information.
        <div>&nbsp;15: &nbsp;<span style="color:rgb(0,0,0);white-space:pre-wrap">resources_used.mem
            = 52379536kb</span></div>
        <div><font color="#000000"><span style="white-space:pre-wrap">
              30: </span></font><span
            style="color:rgb(0,0,0);white-space:pre-wrap">resources_used.mem
            = 90676068kb</span></div>
        <div><font color="#000000"><span style="white-space:pre-wrap">
              45: </span></font><span
            style="color:rgb(0,0,0);white-space:pre-wrap">resources_used.mem
            = 122543188kb</span></div>
        <div><font color="#000000"><span style="white-space:pre-wrap">Definitely
              some ballooning memory use there.<br>
            </span></font>
          <div><br>
          </div>
          <div style="">One more thing you may wish to examine from the
            command line is:</div>
          <div style="">&nbsp;&nbsp;qmgr -c "l s" | grep 'resources_'</div>
          <div style="">This will tell you about any default resources
            (such as physical memory) that your PBS server is assigning
            to new jobs. That said, I do believe that your jobs are
            exhausting available memory.</div>
        </div>
        <div style="">So, now the question is whether anything can be
          done about it. Unless someone with more experience with the
          partitioning code decides to speak up, I am going to have
          analyze your chosen parameters and the pieces of code in
          question to see if I can deduce anything. I might not be able
          to do this until Monday - I am too tired to do it tonight
          (here in US Eastern time) and have a busy weekend ahead of
          me.&nbsp;</div>
        <div style=""><br>
        </div>
        <div style="">I promise I will get back to you with some better
          answers if no one else decides to say anything. While you are
          waiting for a response and if you want to test your hypothesis
          about the number of threads correlating to increased memory
          use, then I would recommend using a smaller data set and
          seeing what kind of scaling in the memory use you see as you
          change the number of threads.</div>
        <div style=""><br>
        </div>
        <div style="">Have a good weekend,</div>
        <div style="">&nbsp; Eric</div>
        <div style=""><br>
        </div>
      </div>
      <div class="gmail_extra"><br>
        <br>
        <div class="gmail_quote">On Fri, Apr 12, 2013 at 7:30 AM,
          Jens-Konrad Preem <span dir="ltr">&lt;<a
              moz-do-not-send="true" href="mailto:jpreem@ut.ee"
              target="_blank">jpreem@ut.ee</a>&gt;</span> wrote:<br>
          <blockquote class="gmail_quote" style="margin:0 0 0
            .8ex;border-left:1px #ccc solid;padding-left:1ex">
            <div bgcolor="#FFFFFF" text="#000000">
              <div class="im">
                <div>On 04/11/2013 02:58 AM, Eric McDonald wrote:<br>
                </div>
              </div>
              <div>
                <div class="h5">
                  <blockquote type="cite">
                    <div dir="ltr">Forgot to reply to all, in case the
                      answer will help anyone else on the list....<br>
                      <br>
                      <div class="gmail_quote">---------- Forwarded
                        message ----------<br>
                        From: <b class="gmail_sendername">Eric McDonald</b>
                        <span dir="ltr">&lt;<a moz-do-not-send="true"
                            href="mailto:emcd.msu@gmail.com"
                            target="_blank">emcd.msu@gmail.com</a>&gt;</span><br>
                        Date: Wed, Apr 10, 2013 at 7:57 PM<br>
                        Subject: Re: [khmer] parition-graph memory
                        requirements<br>
                        To: Jens-Konrad Preem &lt;<a
                          moz-do-not-send="true"
                          href="mailto:jpreem@ut.ee" target="_blank">jpreem@ut.ee</a>&gt;<br>
                        <br>
                        <br>
                        <div dir="ltr">Hi,
                          <div><br>
                          </div>
                          <div> Sorry for the delayed reply.</div>
                          <div><br>
                          </div>
                          <div>Thanks for sharing your job scripts. I
                            notice that you are specifying the 'vmem'
                            resource. However, if PBS is also enforcing
                            a limit on the 'mem' resource (physical
                            memory), then you may be encountering that
                            limit. Do you know what default value is
                            assigned by your site's PBS server for the
                            'mem' resource?</div>
                          <div><br>
                          </div>
                          <div>Again, if you run:</div>
                          <div>&nbsp; qstat -f &lt;job_id&gt;</div>
                          <div>you should be able to determine both the
                            resources allocated for the job and how much
                            the job is actually using. Please let us
                            know the results of this command, if you
                            would like help interpreting them and
                            figuring out how to change your PBS resource
                            request, if necessary.</div>
                          <div><br>
                          </div>
                          <div>As a side note, smaller k-mer lengths
                            mean that more k-mers are being extracted
                            from each sequence. This means that the hash
                            tables are being more densely populated.
                            And, that means that you are more likely to
                            need larger hash tables to avoid a
                            significant false positive rate. But, I
                            think a better thing to say is that the
                            amount of memory used by the hash tables is
                            independent of k-mer size. So, changing
                            k-mer length does not affect memory usage
                            for many parts of khmer. (I would have to
                            look more closely to see how this affects
                            the partitioning code.)</div>
                          <div><br>
                          </div>
                          <div>Hope that helps,</div>
                          <div>&nbsp; Eric</div>
                          <div><br>
                          </div>
                        </div>
                        <div>
                          <div>
                            <div class="gmail_extra"><br>
                              <br>
                              <div class="gmail_quote">On Wed, Apr 10,
                                2013 at 4:23 AM, Jens-Konrad Preem <span
                                  dir="ltr">&lt;<a
                                    moz-do-not-send="true"
                                    href="mailto:jpreem@ut.ee"
                                    target="_blank">jpreem@ut.ee</a>&gt;</span>
                                wrote:<br>
                                <blockquote class="gmail_quote"
                                  style="margin:0 0 0
                                  .8ex;border-left:1px #ccc
                                  solid;padding-left:1ex">
                                  <div bgcolor="#FFFFFF" text="#000000">
                                    Hi,<br>
                                    <br>
                                    In an extreme act of foolishness I
                                    do seem to have lost my error logs.
                                    (I have been messing with the
                                    different&nbsp; scripts&nbsp; here a lot and
                                    so got rid of some of the outputs,&nbsp;
                                    in some ill thought out
                                    "housekeeping" event).<br>
                                    <br>
                                    I do attach here a bunch of PBS
                                    scripts that I used to get as far as
                                    I am. I did use a different script
                                    for most of the normalize and
                                    partition pipeline, so I'd have time
                                    to look at the outputs and get a
                                    sense of time taken for each. The
                                    scripts are in following order -
                                    supkhme(normalize),
                                    suprem(filter-below),
                                    supload(load-graph), and finally
                                    supart(partition-graph). (As can be
                                    seen I try to do the meta-genome
                                    analysis as per the guide.txt)<br>
                                    All the previous scripts completed
                                    without complaint, producing the 5.2
                                    Gb "graafik" graph.<br>
                                    <br>
                                    The partition graph had failed a few
                                    times after running an hour or so
                                    always with error messages
                                    concerning memory. Now the latest
                                    script there demands 240 Gb of
                                    memory which is maximum I can demand
                                    in the near future, and still failed
                                    with an error message concerning
                                    memory.<br>
                                    <br>
                                    I am right now working on
                                    reproducing the error, so I can then
                                    supply you with .logs and .error
                                    files, when no error occurs the
                                    better for me of course.<br>
                                    I decided to try different k-values
                                    this time as suggested by <a
                                      moz-do-not-send="true"
                                      href="https://khmer.readthedocs.org/en/latest/guide.html"
                                      target="_blank">https://khmer.readthedocs.org/en/latest/guide.html</a>
                                    (20 for normalization, and 32 for
                                    partitioning) those should make the
                                    graph file all the bigger - I used
                                    the smaller ones to avoid running
                                    out of memory but as it doesn't seem
                                    to help then what the heck. ;D.
                                    Right now I am at the load-graph
                                    stage with the new set. As it will
                                    complete in few hours I'll put the
                                    partition-graph on the run and then
                                    we will see if it dies within an
                                    hour. If so I'll post a new set of
                                    scripts and logs.<br>
                                    <br>
                                    Thank you for your time,<br>
                                    Jens-Konrad
                                    <div>
                                      <div><br>
                                        <br>
                                        <br>
                                        <br>
                                        <div>On 04/10/2013 04:18 AM,
                                          Eric McDonald wrote:<br>
                                        </div>
                                        <blockquote type="cite">
                                          <div dir="ltr">Hi Jens-Konrad,
                                            <div><br>
                                            </div>
                                            <div>Sorry for the delayed
                                              response. (I was on
                                              vacation yesterday and
                                              hoping that someone more
                                              familiar with the
                                              partitioning code would
                                              answer.)</div>
                                            <div><br>
                                            </div>
                                            <div>My understanding of the
                                              code is that decreasing
                                              the subset size will
                                              increase the number of
                                              partitions but will not
                                              change the overall graph
                                              coverage. Therefore, I
                                              would not expect it to
                                              lower memory requirements.
                                              (The overhead from
                                              additional partitions
                                              might raise them some, but
                                              I have not analyzed the
                                              code deeply enough to say
                                              one way or another about
                                              that.) As far as changing
                                              the number of threads
                                              goes, each thread does
                                              seem to maintain a local
                                              list of traversed k-mers
                                              (hidden in the C++
                                              implementation) but I do
                                              not yet know how much that
                                              would impact memory usage.
                                              Have you tried using a
                                              fewer number of threads?</div>
                                            <div><br>
                                            </div>
                                            <div>But, rather than
                                              guessing about causation,
                                              let's try to get some more
                                              diagnostic information.
                                              Does the script die
                                              immediately? (How long
                                              does the PBS job execute
                                              before failure?) Can you
                                              attach the output and
                                              error files for a job, and
                                              also the job script? What
                                              does</div>
                                            <div>&nbsp; qstat -f
                                              &lt;job_id&gt;</div>
                                            <div>where &lt;job_id&gt; is
                                              the ID of your running
                                              job, tell you about memory
                                              usage?</div>
                                            <div><br>
                                            </div>
                                            <div>Thanks,</div>
                                            <div>&nbsp; Eric</div>
                                            <div><br>
                                            </div>
                                            <div><br>
                                            </div>
                                          </div>
                                          <div class="gmail_extra"><br>
                                            <br>
                                            <div class="gmail_quote">On
                                              Mon, Apr 8, 2013 at 3:34
                                              AM, Jens-Konrad Preem <span
                                                dir="ltr">&lt;<a
                                                  moz-do-not-send="true"
href="mailto:jpreem@ut.ee" target="_blank">jpreem@ut.ee</a>&gt;</span>
                                              wrote:<br>
                                              <blockquote
                                                class="gmail_quote"
                                                style="margin:0 0 0
                                                .8ex;border-left:1px
                                                #ccc
                                                solid;padding-left:1ex">Hi,<br>
                                                I am having trouble with
                                                completing a
                                                partition-graph.py job.<br>
                                                No matter the
                                                configurations It seems
                                                to terminate with error
                                                messages hinting at low
                                                memory etc. *<br>
                                                Does LOWering the subset
                                                size reduce the memory
                                                use, what about LOWering
                                                the amount of parallel
                                                threads?<br>
                                                The <a
                                                  moz-do-not-send="true"
href="http://graafik.ht" target="_blank">graafik.ht</a> is 5.2G large, I
                                                had the script running
                                                as a PBS job with 240 GB
                                                RAM allocated. (That's
                                                as much as I can get it,
                                                maybe I'll have an
                                                opportunity in the next
                                                week to double it, but I
                                                wouldn't count on it).<br>
                                                Is it expected for the
                                                script to require so
                                                much RAM, or is there
                                                some bug or some misuse
                                                by my part. Would there
                                                be any configuration to
                                                get past this?<br>
                                                <br>
                                                Jens-Konrad Preem, MSc.,
                                                University of Tartu<br>
                                                <br>
                                                <br>
                                                <br>
                                                * the latest
                                                configuration after I
                                                thought on smaller
                                                subset size<br>
                                                ./khmer/scripts/partition-graph.py

                                                &nbsp;--threads 24
                                                --subset-size 1e4
                                                graafik<br>
                                                terminated with<br>
                                                cannot allocate memory
                                                for thread-local data:
                                                ABORT<br>
                                                <br>
                                                <br>
_______________________________________________<br>
                                                khmer mailing list<br>
                                                <a
                                                  moz-do-not-send="true"
href="mailto:khmer@lists.idyll.org" target="_blank">khmer@lists.idyll.org</a><br>
                                                <a
                                                  moz-do-not-send="true"
href="http://lists.idyll.org/listinfo/khmer" target="_blank">http://lists.idyll.org/listinfo/khmer</a><br>
                                              </blockquote>
                                            </div>
                                            <br>
                                            <br clear="all">
                                            <div><br>
                                            </div>
                                            -- <br>
                                            <div dir="ltr">
                                              <div>Eric McDonald</div>
                                              <div>HPC/Cloud Software
                                                Engineer</div>
                                              <div>&nbsp; for the Institute
                                                for Cyber-Enabled
                                                Research (iCER)</div>
                                              <div>&nbsp; and the Laboratory
                                                for Genomics, Evolution,
                                                and Development (GED)</div>
                                              <div>Michigan State
                                                University</div>
                                              <div>P: <a
                                                  moz-do-not-send="true"
href="tel:517-355-8733" value="+15173558733" target="_blank">517-355-8733</a></div>
                                            </div>
                                          </div>
                                        </blockquote>
                                        <br>
                                      </div>
                                    </div>
                                    <span><font color="#888888">
                                        <pre cols="72">-- 
Jens-Konrad Preem, MSc, University of Tartu</pre>
                                      </font></span></div>
                                  <br>
_______________________________________________<br>
                                  khmer mailing list<br>
                                  <a moz-do-not-send="true"
                                    href="mailto:khmer@lists.idyll.org"
                                    target="_blank">khmer@lists.idyll.org</a><br>
                                  <a moz-do-not-send="true"
                                    href="http://lists.idyll.org/listinfo/khmer"
                                    target="_blank">http://lists.idyll.org/listinfo/khmer</a><br>
                                  <br>
                                </blockquote>
                              </div>
                              <br>
                              <br clear="all">
                              <div><br>
                              </div>
                              -- <br>
                              <div dir="ltr">
                                <div>Eric McDonald</div>
                                <div>HPC/Cloud Software Engineer</div>
                                <div>&nbsp; for the Institute for
                                  Cyber-Enabled Research (iCER)</div>
                                <div>&nbsp; and the Laboratory for Genomics,
                                  Evolution, and Development (GED)</div>
                                <div>Michigan State University</div>
                                <div>P: <a moz-do-not-send="true"
                                    href="tel:517-355-8733"
                                    value="+15173558733" target="_blank">517-355-8733</a></div>
                              </div>
                            </div>
                          </div>
                        </div>
                      </div>
                      <br>
                      <br clear="all">
                      <div><br>
                      </div>
                      -- <br>
                      <div dir="ltr">
                        <div>Eric McDonald</div>
                        <div>HPC/Cloud Software Engineer</div>
                        <div>&nbsp; for the Institute for Cyber-Enabled
                          Research (iCER)</div>
                        <div>&nbsp; and the Laboratory for Genomics,
                          Evolution, and Development (GED)</div>
                        <div>Michigan State University</div>
                        <div>P: <a moz-do-not-send="true"
                            href="tel:517-355-8733" value="+15173558733"
                            target="_blank">517-355-8733</a></div>
                      </div>
                    </div>
                    <br>
                    <fieldset></fieldset>
                    <br>
                    <pre>_______________________________________________
khmer mailing list
<a moz-do-not-send="true" href="mailto:khmer@lists.idyll.org" target="_blank">khmer@lists.idyll.org</a>
<a moz-do-not-send="true" href="http://lists.idyll.org/listinfo/khmer" target="_blank">http://lists.idyll.org/listinfo/khmer</a>
</pre>
                  </blockquote>
                </div>
              </div>
              OK.<br>
              I post a failed run complete with PBS script, error log.,
              and qstat-f snapshots at different times.<br>
              I find it weird that I managed to complete the test run on
              iowa-corn50M which had a graph file even larger. Might the
              number of used threads pump up the memory? I used the
              sample commands from the web-page for corn. These used 4
              threads at max. <br>
              <span class="HOEnZb"><font color="#888888"> Jens-Konrad
                  Preem<br>
                </font></span></div>
            <br>
            _______________________________________________<br>
            khmer mailing list<br>
            <a moz-do-not-send="true"
              href="mailto:khmer@lists.idyll.org">khmer@lists.idyll.org</a><br>
            <a moz-do-not-send="true"
              href="http://lists.idyll.org/listinfo/khmer"
              target="_blank">http://lists.idyll.org/listinfo/khmer</a><br>
            <br>
          </blockquote>
        </div>
        <br>
        <br clear="all">
        <div><br>
        </div>
        -- <br>
        <div dir="ltr">
          <div>Eric McDonald</div>
          <div>HPC/Cloud Software Engineer</div>
          <div>&nbsp; for the Institute for Cyber-Enabled Research (iCER)</div>
          <div>&nbsp; and the Laboratory for Genomics, Evolution, and
            Development (GED)</div>
          <div>Michigan State University</div>
          <div>P: 517-355-8733</div>
        </div>
      </div>
    </blockquote>
    <br>
    <pre class="moz-signature" cols="72">-- 
Jens-Konrad Preem, MSc, University of Tartu</pre>
  </body>
</html>