[khmer] khmer v1.2 has been released

Michael R. Crusoe mcrusoe at msu.edu
Mon Dec 8 09:32:13 PST 2014


Addendum to the known issues: There is a longstanding issue that we weren't
able to fix with this release.

Multithreaded reading will drop reads. This major issue has been present
for several khmer releases and was only found via a much larger test case
that we had been previously using. Credit to @camillescott. Workaround:
disable threading. The next release will fix this and the other FAST[AQ]
parsing issues. https://github.com/ged-lab/khmer/issues/681

Given the other fixes we had accumulated we decided it was more important
to get them out and to not rush the review of the new parser.

On Mon, Dec 8, 2014 at 11:46 AM, Michael R. Crusoe <mcrusoe at msu.edu> wrote:

> This is the v1.2 release of khmer: minor new features and bug fixes. The
> start of this release cycle coincided with the Mozilla Science Lab Global
> Sprint 2014. We honor and thank the 19 new contributors (including four
> Michigan State University undergraduates) who volunteered their time to
> contribute!
>
> Docs at: https://khmer.readthedocs.org/en/v1.2/
> New items of note:
>
> @mr-c <https://github.com/mr-c> and @ctb <https://github.com/ctb> are
> proud to announce khmer's code of conduct
> http://khmer.readthedocs.org/en/v1.2/dev/CODE_OF_CONDUCT.html #664
> <https://github.com/ged-lab/khmer/pull/664>
> All scripts list which files have been created during their execution #477
> <https://github.com/ged-lab/khmer/issues/477> @bocajnotnef
> <https://github.com/bocajnotnef>
> All scripts now only output status messages to STDERR instead of STDOUT
> #626 <https://github.com/ged-lab/khmer/pull/626> @b-wyss
> <https://github.com/b-wyss>
> docs/ a fairly major re-organization and brand new developer docs @ctb
> <https://github.com/ctb> @mr-c <https://github.com/mr-c>
> load-into-counting.py: --summary-info: machine readable summary in JSON
> or TSV format#649 <https://github.com/ged-lab/khmer/pull/649> @kdmurray91
> <https://github.com/kdmurray91>
> scripts/extract-partitions.py: added documentation for .dist columns #516
> <https://github.com/ged-lab/khmer/pull/516> @chuckpr
> <https://github.com/chuckpr>
> Makefile: a new target make install-dependencies is useful for developers
> #539 <https://github.com/ged-lab/khmer/pull/539> @mr-c
> <https://github.com/mr-c>
> Sandbox scripts have been cleaned up, or removed (see the
> sandbox/README.rst for details)#589
> <https://github.com/ged-lab/khmer/pull/589> @ctb <https://github.com/ctb>
> Notable bugs fixed/issues closed:
>
> do-partition.py's excessive spawning of threads fixed. #637
> <https://github.com/ged-lab/khmer/pull/637> @camillescott
> <https://github.com/camillescott>
> Fixed unique k-mer count reporting in load-graph, load-into-counting, and
> normalize-by-median.#562 <https://github.com/ged-lab/khmer/issues/562>
> @mr-c <https://github.com/mr-c>
> Clarified and test the requirement for a 64-bit operating system #529
> <https://github.com/ged-lab/khmer/issues/529> @Echelon9
> <https://github.com/Echelon9>
> Removed some of the broken multi-threading options #511
> <https://github.com/ged-lab/khmer/pull/511> @majoras-masque
> <https://github.com/majoras-masque>
> Fix table.get("wrong_length_string") gives core dump #585
> <https://github.com/ged-lab/khmer/pull/585> @Echelon9
> <https://github.com/Echelon9>
> filter-abund lists parameters that it doesn't use #524
> <https://github.com/ged-lab/khmer/issues/524> @jstapleton
> <https://github.com/jstapleton>
> Reduction of memory required to run the test suite #542
> <https://github.com/ged-lab/khmer/pull/542> @leogargu
> <https://github.com/leogargu>
> BibTeX included in CITATIONS #541
> <https://github.com/ged-lab/khmer/pull/541> @HLWiencko
> <https://github.com/HLWiencko>
> Additional fixes/features
>
> delete ScoringMatrix::assign as it is unused #502
> <https://github.com/ged-lab/khmer/issues/502> @RodPic
> <https://github.com/RodPic>
> Root all of our C++ exceptions to a common base exception #508
> <https://github.com/ged-lab/khmer/issues/508> @iglpdc
> <https://github.com/iglpdc>
> deleted KhmerError #503 <https://github.com/ged-lab/khmer/issues/503>
> @drlabratory <https://github.com/drlabratory>
> normalize-by-median reporting output after main loop exits, in case it
> hadn't been triggered #586 <https://github.com/ged-lab/khmer/pull/586>@ctb
> <https://github.com/ctb>
> Many issues discovered by cppcheck cleaned up #506
> <https://github.com/ged-lab/khmer/issues/506> @brtaylor92
> <https://github.com/brtaylor92>
> Developers have a new Makefile target to autofix formatting: make format
> #612 <https://github.com/ged-lab/khmer/pull/612> @brtaylor92
> <https://github.com/brtaylor92>
> normalize-by-median.py test coverage increased #361
> <https://github.com/ged-lab/khmer/issues/361> @SherineAwad
> <https://github.com/SherineAwad>
> Several unused functions were removed #599
> <https://github.com/ged-lab/khmer/issues/599> @brtaylor92
> <https://github.com/brtaylor92>
> Developer docs now link to the stdc++ docs as appropriate #629
> <https://github.com/ged-lab/khmer/pull/629> @mr-c
> <https://github.com/mr-c>
> Added tests for non-sequential access to input files #644
> <https://github.com/ged-lab/khmer/pull/644> @bocajnotnef
> <https://github.com/bocajnotnef>
> Removed khmer/theading_args.py #653
> <https://github.com/ged-lab/khmer/pull/653> @bocajnotnef
> <https://github.com/bocajnotnef>
> Improved test for maximum k value #658
> <https://github.com/ged-lab/khmer/issues/658> @pgarland
> <https://github.com/pgarland>
> ReadParser no longer crashes if n_threads = 0 #86
> <https://github.com/ged-lab/khmer/issues/86> @jiarong
> <https://github.com/jiarong>
> Known issues:
>
> All of these are pre-existing.
>
> Some users have reported that normalize-by-median.py will utilize more
> memory than it was configured for. This is being investigated in
> #266 <https://github.com/ged-lab/khmer/issues/266>
>
> Some FASTQ files confuse our parser when running with more than one thread.
> For example, while using load-into-counting.py. If you experience this then
> add "--threads=1" to your command line. This issue is being tracked in
> #249 <https://github.com/ged-lab/khmer/issues/249>
>
> If your k-mer table is truncated on write, an error may not be reported;
> this
> is being tracked in #443 <https://github.com/ged-lab/khmer/issues/443>.
> However, khmer will now (correctly) fail when trying to read a truncated
> file
> (See #333 <https://github.com/ged-lab/khmer/pull/333>).
>
> Paired-end reads from Casava 1.8 currently require renaming for use in
> normalize-by-median and abund-filter when used in paired mode. The
> integration of a fix for this is being tracked in #23
> <https://github.com/ged-lab/khmer/issues/23>
>
> Some scripts only output FASTA even if given a FASTQ file. This issue
> is being tracked in #46 <https://github.com/ged-lab/khmer/issues/46>
>
> A user reported that abundance-dist-single.py fails with small files and
> many
> threads. This issue is being tracked in #75
> <https://github.com/ged-lab/khmer/issues/75>
> Contributors
>
> @mr-c <https://github.com/mr-c>, @ctb <https://github.com/ctb>, *
> @bocajnotnef <https://github.com/bocajnotnef>, *@Echelon9
> <https://github.com/Echelon9>, *@jlippi <https://github.com/jlippi>, *
> @kdmurray91 <https://github.com/kdmurray91>, @qingpeng
> <https://github.com/qingpeng>, *@leogargu <https://github.com/leogargu>, *
> @jiarong <https://github.com/jiarong>, *@brtaylor92
> <https://github.com/brtaylor92>, *@iglpdc <https://github.com/iglpdc>,
> @camillescott <https://github.com/camillescott>, *@HLWiencko
> <https://github.com/HLWiencko>, *@cowguru2000
> <https://github.com/cowguru2000>, *@drlabratory
> <https://github.com/drlabratory>, *@jstapleton
> <https://github.com/jstapleton>, *@b-wyss <https://github.com/b-wyss>, *
> @jgluck <https://github.com/jgluck>, @fishjord
> <https://github.com/fishjord>, *@SherineAwad
> <https://github.com/SherineAwad>, *@pgarland <https://github.com/pgarland>,
> *@majoras-masque <https://github.com/majoras-masque>, @chuckpr
> <https://github.com/chuckpr>, *@RodPic <https://github.com/RodPic>,
> @luizirber <https://github.com/luizirber>, *@jrherr
> <https://github.com/jrherr>
>
> * Denotes new contributor
>
>
> --
> Michael R. Crusoe:  Programmer & Bioinformatician   mcrusoe at msu.edu
>  @ the Genomics, Evolution, and Development lab; Michigan State U
> https://impactstory.org/MichaelRCrusoe http://twitter.com/biocrusoe
>



-- 
Michael R. Crusoe:  Programmer & Bioinformatician   mcrusoe at msu.edu
 @ the Genomics, Evolution, and Development lab; Michigan State U
https://impactstory.org/MichaelRCrusoe http://twitter.com/biocrusoe
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.idyll.org/pipermail/khmer/attachments/20141208/e2905f76/attachment-0001.htm>


More information about the khmer mailing list