[data-carpentry-discuss] R/SQL integration lesson

Tracy Teal tkteal at datacarpentry.org
Mon Feb 2 10:29:54 PST 2015


> Ethan White <mailto:ethan at weecology.org>
> February 2, 2015 at 1:16 PM
> On Thu, Jan 29, 2015 at 1:29 PM, Ted Hart <edmund.m.hart at gmail.com 
> <mailto:edmund.m.hart at gmail.com>> wrote:
> > Lately I've been doing a lot of database work.  One thing I've found 
> is that
> > the provenance of your work flow with databases is much easier to 
> preserve
> > if it's written in R and you interact with the database via RJDBC rather
> > than a stand alone database environment.  When I do that I find I 
> have all
> > these adhoc queries that I never remember what order to execute them in.
> >
> > My question is, would this be a  worthwhile lesson for DC?  Or is it too
> > advanced? Or  would efforts be better spent working on the existing
> > curriculum?  Or would this make a reasonable addendum to the SQL lesson
> > (it's actually kind of trivial so it might not be worth it's own 
> lesson).
>
> I agree that this would be a great lesson, and I would be happy to do 
> a translation to Python.
We're interested and working on more advanced content, so I agree this 
would be great. And thanks Ethan for offering to do the translation. It 
should be done with the current biology dataset 
(https://github.com/datacarpentry/datacarpentry/tree/master/data/biology) to 
build on the existing lessons. Will that data work for the lesson you 
have in mind?

Thanks!
-Tracy
>
> Ethan
>
>
> _______________________________________________
> dc-discuss mailing list
> dc-discuss at lists.idyll.org
> http://lists.idyll.org/listinfo/dc-discuss
> François Michonneau <mailto:francois.michonneau at gmail.com>
> January 29, 2015 at 1:42 PM
> Hi Ted,
>
> I think it would be a really valuable addition to the SQL lesson. It
> would have to be taught after the R lesson, but I think it would
> really help tie together these two lessons and motivate more the SQL
> lesson. For people who are already working a little in R, this lesson
> would show them that databases can directly be integrated into their
> workflow.
>
> Cheers,
> -- Francois
>
>
>
> _______________________________________________
> dc-discuss mailing list
> dc-discuss at lists.idyll.org
> http://lists.idyll.org/listinfo/dc-discuss
> Ted Hart <mailto:edmund.m.hart at gmail.com>
> January 29, 2015 at 1:29 PM
> Lately I've been doing a lot of database work.  One thing I've found 
> is that the provenance of your work flow with databases is much easier 
> to preserve if it's written in R and you interact with the database 
> via RJDBC rather than a stand alone database environment.  When I do 
> that I find I have all these adhoc queries that I never remember what 
> order to execute them in.
>
> My question is, would this be a  worthwhile lesson for DC?  Or is it 
> too advanced? Or  would efforts be better spent working on the 
> existing curriculum?  Or would this make a reasonable addendum to the 
> SQL lesson (it's actually kind of trivial so it might not be worth 
> it's own lesson).
>
> I welcome some thoughts as I'm happy to take existing code and 
> formalize it into a lesson if the group things it's worthwhile.
>
> Best,
> Ted
>
> _______________________________________________
> dc-discuss mailing list
> dc-discuss at lists.idyll.org
> http://lists.idyll.org/listinfo/dc-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.idyll.org/pipermail/dc-discuss/attachments/20150202/9189fd2b/attachment-0001.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: compose-unknown-contact.jpg
Type: image/jpeg
Size: 770 bytes
Desc: not available
URL: <http://lists.idyll.org/pipermail/dc-discuss/attachments/20150202/9189fd2b/attachment-0001.jpg>


More information about the dc-discuss mailing list