Access to Digitization Tools and Methods: Difference between revisions
| mNo edit summary | mNo edit summary | ||
| Line 5: | Line 5: | ||
| This broad digitisation symposium included three sessions, covering different elements of digitisation. The key focus was to cover the developments that are occurring in digitisation but with a strong emphasis on the '''accessibility''' of tools and protocols ('''think open access, open source'''). | This broad digitisation symposium included three sessions, covering different elements of digitisation. The key focus was to cover the developments that are occurring in digitisation but with a strong emphasis on the '''accessibility''' of tools and protocols ('''think open access, open source'''). | ||
| [[File:Tdwg2014SymposiumLogo_0.png|right|400px]] | |||
| Examples of topics include tools for data/metadata capture and enrichment such as Optical Character Recognition (OCR), text mining, Natural Handwriting Recognition (NHR), Natural Language Processing (NLP), their availability and how they are being adopted and adapted. How are these tools being used currently, and how can we ensure that they are accessible to all? In addition, what are the tools in use for image capture and management, quality control and long-term preservation of images? What techniques are in use by many institutes, who are capturing images of their natural history collections and related objects like field notebooks, illustrations, labels, card catalogs, journals, and literature? | Examples of topics include tools for data/metadata capture and enrichment such as Optical Character Recognition (OCR), text mining, Natural Handwriting Recognition (NHR), Natural Language Processing (NLP), their availability and how they are being adopted and adapted. How are these tools being used currently, and how can we ensure that they are accessible to all? In addition, what are the tools in use for image capture and management, quality control and long-term preservation of images? What techniques are in use by many institutes, who are capturing images of their natural history collections and related objects like field notebooks, illustrations, labels, card catalogs, journals, and literature? | ||
| ==[[Digitization Resources|Digitization Resources Wiki Home]]== | ==[[Digitization Resources|Digitization Resources Wiki Home]]== | ||
Revision as of 01:06, 11 December 2014
This wiki supports the BIS(TDWG) 2014 Symposium: Access to Digitisation Tools and Methods, Jönköping, Sweden, October 27th, 2014.
This broad digitisation symposium included three sessions, covering different elements of digitisation. The key focus was to cover the developments that are occurring in digitisation but with a strong emphasis on the accessibility of tools and protocols (think open access, open source).
Examples of topics include tools for data/metadata capture and enrichment such as Optical Character Recognition (OCR), text mining, Natural Handwriting Recognition (NHR), Natural Language Processing (NLP), their availability and how they are being adopted and adapted. How are these tools being used currently, and how can we ensure that they are accessible to all? In addition, what are the tools in use for image capture and management, quality control and long-term preservation of images? What techniques are in use by many institutes, who are capturing images of their natural history collections and related objects like field notebooks, illustrations, labels, card catalogs, journals, and literature?
Digitization Resources Wiki Home
BIS(TDWG) 2014 Symposium: Access to Digitisation Tools and Methods, Agenda and Logistics
- Agenda in Google Doc
- TDWG 2014 Program
- Start time: 11 am on Monday October 27th at the 00 Elmia Congress Centre, Rydberg Hall, Jönköping, Sweden.
- TDWG 2014 Symposium: Access to Digitisation Tools and Methods - Calendar Announcement
- BIS(TDWG) 2014 Conference Website
- Twitter: @iDigBio @TDWG #tdwg2014 #tdwg #digitization and conveners: @emhaston @idbdeb @vsmithuk
Collaborative Notes Documents
- Google Doc for Group Notes
- Schedule is embedded in the Google Doc
 
Workshop Recordings
Monday, 27 October 2014
- 11:00 - 11:10 am Discovery and access to digitisation tools and methods. (recording) Elspeth M Haston, Robert Cubey
- 11:10 - 11:30 am The Open Drawer Project - Providing free access to high resolution images of entomological collection drawers. (recording) Alexander Kroupa, Falko Glöckler, Bernhard Schurian, Felix Maier, Stefan Schmidt, Gregor Hagedorn, Christoph Häuser
- 11:30 - 11:50 am StanDAP-Herb develops a standard process for extracting metadata from digitised herbarium specimens. (recording) Agnes Kirchhoff, Walter G. Berendsohn, Ulrich Bügel, Fernando Chaves, Cailin Guan, Markus Lindhorst, Dominik Röpert, Eduard Santamaria, Karl-Heinz Steinke, Hangyan Zheng
- 11:50 - 12:10 pm Moving beyond the box: automating the digitisation of insect collections. (recording) Pieter Holtzhausen, Stéfan van der Walt, Alice Heaton, Laurence Livermore, Vladimir Blagoderov, Ben Price, Lawrence Hudson, Vincent Smith (Jump to 14:00 minutes into this recording for this talk).
- 12:10 - 12:30 pm ZooSphere - Development of a software for automated spheric image capturing and interactive 3D visualization of biological collection objects. (recording) Martin Pluta, Falko Glöckler, Alexander Kroupa, Bernhard Schurian
- 2:00 - 2:20 pm Capturing Inventory level information about collections as a step in object to image to data workflows. (recording) Paul J Morris, James Hanken, David Lowery, Bertram Ludäscher, James A. Macklin, Robert A Morris, Tianhong Song, Patrick Sweeney
- 2:20 - 2:40 pm Data Discovery and Doer Happiness: Uses for Optical Character Recognition (OCR) Output. (recording) Deborah Paul, Andrea Matsunaga, Miao Chen, Jason Best, Sylvia Orli, William Ulate, Reed Beaman
- 2:40 - 3:00 pm Enriching the legacy literature with OCR corrections and text-mined semantic metadata. (recording mp4) Riza Batista-Navarro, Aminul Islam, William Ulate, Jennifer Hammock, Axel Soto, Sophia Ananiadou, Evangelos Milios
- 3:00 - 3:20 pm Managing Digitization Projects with Biospex. (recording) Greg Riccardi, Austin Mast, Elizabeth Ellwood, Robert Bruhn, Jeremy Spinks (Note that talk has discussion from last talk. This talk begins at the 1 min:40 sec mark. Follows with discussion.
- Optical character recognition (OCR) in linking entomological labels with field notebook data. Tero Mononen, Riitta Tegelberg, Janne Karppinen, Mira Sääskilahti, Hannu Saarenmaa, Tommi Koskinen, Jyrki Muona (not recorded)
- 4:20 - 4:40 pm What do you do when your Network Manager tells you there is no more space and they mean it?. (recording) Sharon Grant, Kate Webbink, Marc Lambruschi, Mike Yoshida
- 4:40 - 5:00 pm ENVIRONMENTS-EOL: identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life. (recording) Evangelos Pafilis, Sune Frankild, Lucia Fanini, Sarah Faulwetter, Christina Pavloudi, Julia Schnetzer, Aikaterini Vasileiadou, Umer Ijaz, Christos Arvanitidis, Robert Stevenson, Lars Juhl Jensen Talk begins at 3 minutes:30 seconds into the video. Sound capture only. See slides (below).
- 5:00 - 5:20 pm Case study of reuse of digitised content by creative industry in games: Europeana Creative. (recording) Jiri Frank
Presentation PowerPoints and PDFs
Monday, 27 October 2014
- Discovery and access to digitisation tools and methods. Elspeth M Haston, Robert Cubey
- The Open Drawer Project - Providing free access to high resolution images of entomological collection drawers. (pdf) Alexander Kroupa, Falko Glöckler, Bernhard Schurian, Felix Maier, Stefan Schmidt, Gregor Hagedorn, Christoph Häuser
- StanDAP-Herb develops a standard process for extracting metadata from digitised herbarium specimens. (pdf) Agnes Kirchhoff, Walter G. Berendsohn, Ulrich Bügel, Fernando Chaves, Cailin Guan, Markus Lindhorst, Dominik Röpert, Eduard Santamaria, Karl-Heinz Steinke, Hangyan Zheng
- Moving beyond the box: automating the digitisation of insect collections. (ppt) Pieter Holtzhausen, Stéfan van der Walt, Alice Heaton, Laurence Livermore, Vladimir Blagoderov, Ben Price, Lawrence Hudson, Vincent Smith
- ZooSphere - Development of a software for automated spheric image capturing and interactive 3D visualization of biological collection objects. (pdf) Martin Pluta, Falko Glöckler, Alexander Kroupa, Bernhard Schurian
- Capturing Inventory level information about collections as a step in object to image to data workflows. (pdf) Paul J Morris, James Hanken, David Lowery, Bertram Ludäscher, James A. Macklin, Robert A Morris, Tianhong Song, Patrick Sweeney
- Data Discovery and Doer Happiness: Uses for Optical Character Recognition (OCR) Output. (pptx) Deborah Paul, Andrea Matsunaga, Miao Chen, Jason Best, Sylvia Orli, William Ulate, Reed Beaman
- Enriching the legacy literature with OCR corrections and text-mined semantic metadata. (pptx) Riza Batista-Navarro, Aminul Islam, William Ulate, Jennifer Hammock, Axel Soto, Sophia Ananiadou, Evangelos Milios
- Managing Digitization Projects with Biospex. (pptx) Greg Riccardi, Austin Mast, Elizabeth Ellwood, Robert Bruhn, Jeremy Spinks
- Optical character recognition (OCR) in linking entomological labels with field notebook data. (pdf) Tero Mononen, Riitta Tegelberg, Janne Karppinen, Mira Sääskilahti, Hannu Saarenmaa, Tommi Koskinen, Jyrki Muona
- What do you do when your Network Manager tells you there is no more space and they mean it?. (pptx) Sharon Grant, Kate Webbink, Marc Lambruschi, Mike Yoshida
- ENVIRONMENTS-EOL: identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life. (pdf) Evangelos Pafilis, Sune Frankild, Lucia Fanini, Sarah Faulwetter, Christina Pavloudi, Julia Schnetzer, Aikaterini Vasileiadou, Umer Ijaz, Christos Arvanitidis, Robert Stevenson, Lars Juhl Jensen
- Case study of reuse of digitised content by creative industry in games: Europeana Creative. (pdf) Jiri Frank
