<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class="">
<div class="">(cc’ing the list back for the reply)</div>
<div class=""><br class="">
</div>
<div class="">Not sure re: whether khmer deals with this though it might explain the issue. &nbsp;You could run a count in your file for any empty lines, something like &quot;zgrep&nbsp;-c&nbsp;'^$’ file_name.fastq.gz” (or pipe in the data via zcat if your system doesn’t have
 zgrep). &nbsp;Not sure if cutadapt does anything silly like leave spaces, so you may need to adjust the grep accordingly. &nbsp;</div>
<div class=""><br class="">
</div>
<div class="">Both the sequence and the quality would be empty, so the # of records with empty lines should be count / 2.</div>
<div class=""><br class="">
</div>
<div class="">chris</div>
<br class="">
<div>
<blockquote type="cite" class="">
<div class="">On Sep 16, 2015, at 9:08 PM, Will Shoemaker &lt;<a href="mailto:wrshoema@umail.iu.edu" class="">wrshoema@umail.iu.edu</a>&gt; wrote:</div>
<br class="Apple-interchange-newline">
<div class="">
<div dir="ltr" class="">Hi Chris,
<div class=""><br class="">
</div>
<div class="">I checked the cutadapt docs and all the versions keep empty fastq reads and they don't have an option to remove them. I don't know how to check for empty fsatq reads using bash commands (I don't know what an empty fastq read looks like), but I
 checked the number of reads in both the original and quality filtered fastq files and they have the same number of reads, so cutadapt is likely keeping empty reads.&nbsp;</div>
<div class=""><br class="">
</div>
<div class="">Is this still an issue for the newest version of khmer?&nbsp;</div>
<div class=""><br class="">
</div>
<div class="">Best,</div>
<div class="">Will</div>
</div>
<div class="gmail_extra"><br class="">
<div class="gmail_quote">On Wed, Sep 16, 2015 at 1:35 PM, Fields, Christopher J <span dir="ltr" class="">
&lt;<a href="mailto:cjfields@illinois.edu" target="_blank" class="">cjfields@illinois.edu</a>&gt;</span> wrote:<br class="">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div style="word-wrap:break-word" class="">I may be mis-remembering this, but I recall cutadapt giving empty sequences with paired data before (maybe this has been fixed).&nbsp; Are any of the sequences empty?
<div class=""><br class="">
</div>
<div class="">chris</div>
<div class=""><br class="">
<div class="">
<blockquote type="cite" class="">
<div class="">
<div class="h5">
<div class="">On Sep 16, 2015, at 12:27 PM, Will Shoemaker &lt;<a href="mailto:wrshoema@umail.iu.edu" target="_blank" class="">wrshoema@umail.iu.edu</a>&gt; wrote:</div>
<br class="">
</div>
</div>
<div class="">
<div class="">
<div class="h5">
<div dir="ltr" class="">Hello,
<div class=""><br class="">
</div>
<div class="">I am unable to merge pairs of MiSeq reads using the khmer scrip interleave-reads.py in khmer version 1.3. The R1 and R2 files have had the first 10 bases trimmed off and have been quality filtered using cutadapt v1.9.&nbsp;</div>
<div class=""><br class="">
</div>
<div class="">Using the command zcat file_name.fastq.gz | echo $((`wc -l`/4)) on each set of reads, I found that the number of reads in R1 and R2 is the same.&nbsp;</div>
<div class=""><br class="">
</div>
<div class="">The command I'm running is:&nbsp;</div>
<div class="">interleave-reads.py -o output.fastq.gz R1.fastq.gz R2.fastq.gz &nbsp; (file names changed for readability)&nbsp;</div>
<div class=""><br class="">
</div>
<div class="">My OS is Linux 2.6.32-573.3.1.el6.x86_64 x86_64</div>
<div class=""><br class="">
</div>
<div class="">Attached is a txt file of the khmer output.</div>
<div class=""><br class="">
</div>
<div class="">Could this be an issue of cutadapt changing the file format? I am able to run assemblies on cutadapt processed reads.<br clear="all" class="">
<div class=""><br class="">
</div>
<div class=""><br class="">
</div>
<div class="">Best,</div>
<div class="">Will Shoemaker&nbsp;</div>
-- <br class="">
<div class="">
<div dir="ltr" class="">
<div class="">
<div dir="ltr" class="">
<div class="">
<div dir="ltr" class="">Will Shoemaker&nbsp;
<div class="">
<div dir="ltr" class=""><font face="arial, helvetica, sans-serif" color="#20124d" class="">Indiana University<br class="">
Graduate Student: Lennon Lab</font></div>
<div dir="ltr" class=""><font face="arial, helvetica, sans-serif" color="#20124d" style="font-size:small" class="">Evolution, Ecology, &amp; Behavior Program</font></div>
</div>
<div class=""><font face="arial, helvetica, sans-serif" color="#20124d" style="font-size:small" class="">Jordan Hall 238</font></div>
<div class=""><font face="arial, helvetica, sans-serif" color="#20124d" style="font-size:small" class=""><a href="mailto:wrshoema@umail.iu.edu" target="_blank" class="">wrshoema@umail.iu.edu</a><br class="">
</font></div>
<div class=""><a href="https://urldefense.proofpoint.com/v2/url?u=https-3A__twitter.com_shoemakah&amp;d=AwMFaQ&amp;c=8hUWFZcy2Z-Za5rBPlktOQ&amp;r=fbHa8Njtvh9VmSnzJxiEUTW9NWDwMMwQAzhgZDO41GQ&amp;m=2hmDL9wRzmlU_2g0L0tNoUVmhlNMH4HLOevw22IEZds&amp;s=83OIHhrZUaG5B6v_qoLzlJTxaI8_4EB_ZY-KTuz_aeg&amp;e=" target="_blank" class="">@shoemakah</a><br class="">
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<span class="">&lt;khmer_error.txt&gt;</span>_______________________________________________<span class=""><br class="">
khmer mailing list<br class="">
<a href="mailto:khmer@lists.idyll.org" target="_blank" class="">khmer@lists.idyll.org</a><br class="">
<a href="https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.idyll.org_listinfo_khmer&amp;d=AwMFaQ&amp;c=8hUWFZcy2Z-Za5rBPlktOQ&amp;r=fbHa8Njtvh9VmSnzJxiEUTW9NWDwMMwQAzhgZDO41GQ&amp;m=u_DlmqMPGHijsrZ9pAPXLoTqzk5dxyWDlo_2q7Ns38o&amp;s=lXG01P2mA6bN-cYiGtRXjf5Z4gmw35U9yDe-V7joBJo&amp;e=" target="_blank" class="">http://lists.idyll.org/listinfo/khmer</a><br class="">
</span></div>
</blockquote>
</div>
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
<br clear="all" class="">
<div class=""><br class="">
</div>
-- <br class="">
<div class="gmail_signature">
<div dir="ltr" class="">
<div class="">
<div dir="ltr" class="">
<div class="">
<div dir="ltr" class="">Will Shoemaker&nbsp;
<div class="">
<div dir="ltr" class=""><font face="arial, helvetica, sans-serif" color="#20124d" class="">Indiana University<br class="">
Graduate Student: Lennon Lab</font></div>
<div dir="ltr" class=""><font face="arial, helvetica, sans-serif" color="#20124d" style="font-size:small" class="">Evolution, Ecology, &amp; Behavior Program</font></div>
</div>
<div class=""><font face="arial, helvetica, sans-serif" color="#20124d" style="font-size:small" class="">Jordan Hall 238</font></div>
<div class=""><font face="arial, helvetica, sans-serif" color="#20124d" style="font-size:small" class=""><a href="mailto:wrshoema@umail.iu.edu" target="_blank" class="">wrshoema@umail.iu.edu</a><br class="">
</font></div>
<div class=""><a href="https://urldefense.proofpoint.com/v2/url?u=https-3A__twitter.com_shoemakah&amp;d=AwMFaQ&amp;c=8hUWFZcy2Z-Za5rBPlktOQ&amp;r=fbHa8Njtvh9VmSnzJxiEUTW9NWDwMMwQAzhgZDO41GQ&amp;m=u_DlmqMPGHijsrZ9pAPXLoTqzk5dxyWDlo_2q7Ns38o&amp;s=bjJBicFWoouMwyO7lfcM1Mv465XnxIrOOVS-Qo0cUdk&amp;e=" target="_blank" class="">@shoemakah</a><br class="">
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</blockquote>
</div>
<br class="">
</body>
</html>