Data Carpentry

From iDigBio
Jump to navigation Jump to search

This wiki supports the Data Carpentry Workshop to be held at the University of Florida at iDigBio. (Date to be announced). It is the first in a series of four biodiversity informatics workshops planned in the upcoming year (2014-2015).

Digitization Training Workshops Wiki Home

Planning Team

  • François Michonneau (FLMNH - iDigBio)
  • Katja Seltmann (TTD-TCN, AMNH)
  • Matt Collins (ACIS - iDigBio)
  • Dan Stoner (ACIS - iDigBio)
  • Deborah Paul (FSU - iDigBio)
  • Tracy K. Teal (BEACON)
  • Pam Soltis (FLMNH - iDigBio PI)
  • Derek Masaki (USGS)
  • Shari Ellis (iDigBio)
  • Kevin Love (iDigBio)
  • Mike Smorul (SESYNC)
  • and assistance from Nirav Merchant at iPlant.

Workshop Evaluation

  • link to pre-workshop survey
  • link here at end of workshop

Agenda

  • pre-workshop meeting (online)
    • Software installed?
    • Instructors available
    • Questions?
  • pre-workshop meeting / dinner day before
    • All welcome. Place/time TBA.

Course Overview - Day 1
8:30-9:00 Introductions / Overview / Why Data Carpentry? / How to organize data projects
9:00-10:00 Better use of spreadsheets (Tracy Teal)
10:00-10:30 Break
10:30-12:00 Better use of spreadsheets part II (Tracy Teal)
12:00-1:30 Lunch (with OpenRefine Demo - Deb Paul)
1:30-3:00 SQL Introduction (Matt Collins)
3:00-3:30 Break
3:30-5:00 SQL part II (Matt Collins)
5:00-5:30 Review / Wrap up for tomorrow


Course Overview - Day 2
8:30-10:00 Introduction to the shell (Tracy Teal)
10:00-10:30 Break
10:30-12:30 Introduction to R (François Michonneau)
12:30-1:30 Lunch (with OpenRefine Demo - Deb Paul)
1:30-3:00 Manipulating and plotting data in R (François Michonneau)
3:00-3:30 Break
3:30-4:30 Getting data in and out of R/Integrating R in your workflow (François Michonneau)
4:30-5:00 Scaling it up: DEMO using iPlant Discovery Environment (DE) (François Michonneau, ...)
5:00-5:30 Advanced Shell (Matt C.)
5:30-6:00 Review / Wrap up / Evaluations and Feedback
Thank you!

Link to Workshop Report

Logistics

Remote Participation

Remote participation will be provided via Adobe Connect: [room URL to be decided]

  • Remote can join those present for notes in (Google Doc) or (MoPad)?

Presentation Documents

  • links to any presentations (like power points) here.

Workshop Recordings

Day 1

  • [8:30am-10:00am]
  • [10:30am-12:00pm]
  • [1:00pm-3:00pm]
  • [3:30-6pm]

Day2

  • [8:30am-10:00am]
  • [10:30am-12:00pm]
  • [1:00pm-3:00pm]
  • [3:30-6pm]

Data Carpentry Resources and Links