[khmer] exceeding defined RAM limits?

C. Titus Brown ctb at msu.edu
Wed Dec 18 07:46:48 PST 2013


On Wed, Dec 18, 2013 at 03:43:22PM +0000, Oh, Julia (NIH/NHGRI) [F] wrote:
> [ohjs at helix khmer]$ git checkout master
> Branch master set up to track remote branch master from origin.
> Switched to a new branch 'master'
> [ohjs at helix khmer]$ make
> 
> ===> lots of stuff, ending with:
> 
> copying build/lib.linux-x86_64-2.6/khmer/_khmermodule.so -> khmer
> make[1]: Leaving directory `/home/ohjs/khmer/python'
> 
> [ohjs at helix khmer]$ git branch
>   bleeding-edge
> * master

OK, great! This is the latest development version; can you see if you can
reproduce the problem with it?  (Sadly, I expect you will, as we haven't
made many significant changes to normalize-by-median's machinery...)

best,
--titus

> On Dec 18, 2013, at 8:10 AM, C. Titus Brown <ctb at msu.edu> wrote:
> 
> > On Wed, Dec 18, 2013 at 03:07:57AM +0000, Oh, Julia (NIH/NHGRI) [F] wrote:
> >> Titus -- thanks for the tip on variable coverage; will definitely try that out.
> > 
> > Great -- should significantly improve sensitivity to low coverage "stuff"!
> > 
> >> Michael -- pretty sure I did a git clone. The last date in my directory is Sept 5th -- but not sure if that would be the pull date or your last-modified date.
> > 
> > OK, and then one last check... did you check out the 'master' or 'legacy'
> > branch?  What does 'git branch' report?
> > 
> > To check out master, do:
> > 
> > git checkout master
> > make
> > 
> > cheers,
> > --titus
> > 
> >> On Dec 17, 2013, at 8:16 PM, Michael R. Crusoe <mcrusoe at msu.edu> wrote:
> >> 
> >> Hello Julia,
> >> 
> >> What version of khmer are you using?
> >> 
> >> That is, did you install via `pip` or a `git clone`?
> >> 
> >> 
> >> On Tue, Dec 17, 2013 at 5:14 PM, C. Titus Brown <ctb at msu.edu> wrote:
> >> On Tue, Dec 17, 2013 at 04:36:34PM -0800, C. Titus Brown wrote:
> >>> On Tue, Dec 17, 2013 at 07:53:18PM +0000, Oh, Julia (NIH/NHGRI) [F] wrote:
> >>> Now, on to your real question :)
> >>> 
> >>>> $ python2.7 /home/ohjs/khmer/scripts/normalize-by-median.py -C 5 -k 20 -N 4 -x 16e9 round2.unaligned.keep.abundfilt
> >>>> 
> >>>> I thought I would be maxing out at 64 GB of RAM for the hash table (I've also used 32e9), but I get the following RAM usage report:
> >>>> 
> >>>> 4986693.biobos elapsed time:        23358 seconds
> >>>> 4986693.biobos walltime:         06:28:36 hh:mm:ss
> >>>> 4986693.biobos memory limit:       249.00 GB
> >>>> 4986693.biobos memory used:        249.76 GB
> >>>> 4986693.biobos cpupercent used:     98.00 %
> >>> 
> >>> What the heck!? That's not supposed to happen!
> >>> 
> >>> This is either a bug or (most likely) caused by an overabundance of
> >>> high-abundance k-mers.  The latter is easy to fix -- I've filed a bug report to
> >>> fix it in the software overall [0] -- but for now it would require you to modify
> >>> the script.  If you're up for that, put
> >>> 
> >>>      ht.set_use_bigcount(False)
> >>> 
> >>> at line 186 of normalize-by-median:
> >> 
> >> Darn it, that can't be the problem; I just wrote a test against this
> >> behavior and we actually did things right in the script and ignored
> >> high abundance k-mers.
> >> 
> >> So, this must be a bug of some sort.  Umm... Michael, any ideas?!
> >> 
> >> cheers,
> >> --titus
> >> --
> >> C. Titus Brown, ctb at msu.edu
> >> 
> >> _______________________________________________
> >> khmer mailing list
> >> khmer at lists.idyll.org
> >> http://lists.idyll.org/listinfo/khmer
> >> 
> >> 
> >> 
> >> --
> >> Michael R. Crusoe: Software Engineer and Bioinformatician  mcrusoe at msu.edu
> >> @ the Genomics, Evolution, and Development lab; Michigan State University
> >> http://ged.msu.edu/     http://orcid.org/0000-0002-2961-9670    @biocrusoe<http://twitter.com/biocrusoe>
> >> 
> > 
> > -- 
> > C. Titus Brown, ctb at msu.edu
> 

-- 
C. Titus Brown, ctb at msu.edu
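
For reference, the expectation quoted above (-N 4 -x 16e9 giving roughly
64 GB) follows from simple arithmetic. The sketch below is not khmer code.
It assumes each table entry takes one byte, so the tables need about
N * x bytes in total, and it assumes "GB" means 1e9 bytes (the cluster
report may use a different convention). The function name and the
bytes_per_entry parameter are hypothetical, for illustration only.

```python
GB = 1e9  # assumption: decimal gigabytes


def expected_table_bytes(n_tables, table_size, bytes_per_entry=1):
    """Rough memory estimate for n_tables tables of table_size entries.

    bytes_per_entry=1 is an assumption, not something stated in the thread.
    """
    return n_tables * table_size * bytes_per_entry


# Values from the thread: -N 4 with -x 16e9 (and also -x 32e9).
reported_bytes = 249.76 * GB  # "memory used" from the job report

for table_size in (16e9, 32e9):
    estimate = expected_table_bytes(4, table_size)
    print("x = %.0e: expected about %.1f GB, reported %.2f GB" % (
        table_size, estimate / GB, reported_bytes / GB))
```

Under these assumptions neither setting accounts for the reported usage,
which is consistent with the suspicion in the thread that something other
than the table size is driving memory consumption.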



