2013 AOCR Hackathon Wiki: Difference between revisions

From iDigBio
== Overview of the Challenge ==
*2013 iDigBio AOCR [[Hackathon Challenge]]
**overall description of [[Hackathon Challenge#the_challenge| The Challenge]]
**[[Hackathon Challenge#the_specific_task| The Specific Task]]: parse OCR output to find values for these [[Hackathon Challenge#core_data_elements| core data elements]]
**[[Hackathon Challenge#metrics_and_evaluation| Metrics and Evaluation]] to be used
   [[2013 hackathon data elements| "core" fields]]



Revision as of 01:03, 11 January 2013

Welcome to the 2013 iDigBio AOCR Hackathon Wiki

  • Short URL to this hackathon wiki: http://tinyurl.com/aocrhackathonwiki
  • Those participating in the first iDigBio AOCR Hackathon need an iDigBio account.
  • Note: This wiki page is undergoing frequent updates. Some participants have wiki edit permissions and will add to, update, and edit these pages before, during, and after the hackathon.

Links to Logistics, Communication, and Participant Information

Overview of the Challenge

  "core" fields   

link to explanations and examples of the 3 data sets

  set 1: LBCC label images
  set 2: NYBG and BRIT label images
  set 3: CalBug ENT label images

link to page summarizing the rules we followed to transcribe the gold set (and others)

Text Transcription Issues

Known OCR, ML, NLP Issues and challenges

Human-in-the-loop: User Interface Wish List

*Thank you NESCent, Hilmar Lapp and the HIP working group for this model.