2013 AOCR Hackathon Wiki: Difference between revisions
Jump to navigation
Jump to search
mNo edit summary |
mNo edit summary |
||
Line 1: | Line 1: | ||
= Welcome to the 2013 iDigBio AOCR Hackathon Wiki = | = Welcome to the 2013 iDigBio AOCR Hackathon Wiki = | ||
*Short URL to this hackathon wiki http://tinyurl.com/aocrhackathonwiki | *Short URL to this hackathon wiki http://tinyurl.com/aocrhackathonwiki | ||
*Those participating in the first iDigBio AOCR Hackathon need an iDigBio account. | *Those participating in the first iDigBio AOCR Hackathon need an iDigBio account. | ||
Line 6: | Line 5: | ||
== Links to Logistics, Communication, and Participant Information == | == Links to Logistics, Communication, and Participant Information == | ||
*[[2013 Hackathon Participants| Participant List]]<br> | |||
*[http://tinyurl.com/aocrHack Call for Participation]<br> | |||
*[http://tinyurl.com/idigbioAOCRHackathon Application Form]*<br> | |||
*[[2013 Hackathon Logistics| Travel, Food, Lodging, Connectivity Logistics]]<br> | |||
*2013 Hackathon Listserv, a mailing list for Hackathon Participants at aocr-hackathon-l@lists.ufl.edu | |||
=== Overview of the Challenge === | === Overview of the Challenge === |
Revision as of 00:36, 11 January 2013
Welcome to the 2013 iDigBio AOCR Hackathon Wiki
- Short URL to this hackathon wiki http://tinyurl.com/aocrhackathonwiki
- Those participating in the first iDigBio AOCR Hackathon need an iDigBio account.
- Note: This wiki page undergoing frequent updates and some participants have wiki edit permissions and will add to / update / edit these pages before, during and after the hackathon.
Links to Logistics, Communication, and Participant Information
- Participant List
- Call for Participation
- Application Form*
- Travel, Food, Lodging, Connectivity Logistics
- 2013 Hackathon Listserv, a mailing list for Hackathon Participants at aocr-hackathon-l@lists.ufl.edu
Overview of the Challenge
- 2013 iDigBio AOCR Hackathon Challenge
- overall description of the problem
- the specific challenge: parse OCR output to find values for these core data elements
- read about the_metrics to be used
"core" fields
link to explanations and examples of the 3 data sets
set 1: LBCC label images set 2: NYBG and BRIT label images set 3: CalBug ENT label images
link to page summarizing the rules we followed to transcribe the gold set (and others)
Known OCR, ML, NLP Issues and challenges
Human-in-the-loop: User Interface Wish List
*Thank you NESCent, Hilmar Lapp and the HIP working group for this model.