Difference between revisions of "User:GChriss/MediaDigitization"
m (suppress TOC) |
m (→High-Resolution Imaging: fallback to archive.org page) |
||
(11 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
__NOTOC__ |
__NOTOC__ |
||
− | An informal, introductory session led by [[ |
+ | An informal, introductory session led by [[User:GChriss]] on the following novel digization techniques: |
===hOCR Workflow Tools=== |
===hOCR Workflow Tools=== |
||
+ | The hOCR Workflow Tools project is a collection of tools to facilitate generation of text-searchable digital documents and is particularly useful in contexts where traditional OCR techniques would fare poorly (''e.g.'' handwritten notes). It's implemented ''via'' two [http://www.inkscape.org/en/ Inkscape] extensions: |
||
⚫ | |||
+ | :[https://gitorious.org/hocr-workflow/inkscape-hocr Inkscape Extension: Export Image Overlay Text as hOCR] |
||
+ | :[https://gitorious.org/hocr-workflow/inkscape-hocrpdf Inkscape Extension: Create Multi-Page PDF from hOCR HTML Directory] |
||
+ | |||
+ | Accurate text-searchable documents bring new life and layers of reader engagement to source materials. |
||
<br /> |
<br /> |
||
===The BookLiberator=== |
===The BookLiberator=== |
||
+ | The [http://bookliberator.org BookLiberator] is an innovative, lightweight book-scanner design that's feature-complete, including scanning speed, with [http://www.diybookscanner.org larger, traditional models]. This mini-topic covers a number of design changes from the original-project design and new media processing tools summarized in the following blog post: |
||
− | + | :[http://gchriss.tumblr.com/post/84946122863/bookliberator http://gchriss.tumblr.com/post/84946122863/bookliberator] |
|
+ | |||
+ | A BookLiberator lightning talk transcript is available: |
||
⚫ | |||
+ | |||
+ | One of the WiFi-active, 'anti-motion'-triggered Canon PowerShot cameras used in the design refresh will be on display. |
||
<br /> |
<br /> |
||
+ | |||
===High-Resolution Imaging=== |
===High-Resolution Imaging=== |
||
+ | Using image sensors with a high pixel density (defined as the number of sensor pixels divided by total sensor size) combined with [http://en.wikipedia.org/wiki/Angular_resolution high-resolving-power lenses] it's possible to image arbitrary surfaces in much higher detail than using document scanning or traditional macro photography techniques. For an example image created using this technique please see: |
||
− | + | :[https://web.archive.org/web/20140106062717/http://media.openvideo.pro/u/gchriss/m/docuzoom-microscale-1-dollar-bill/ https://web.archive.org/web/20140106062717/http://media.openvideo.pro/u/gchriss/m/docuzoom-microscale-1-dollar-bill/] |
|
+ | |||
+ | Beyond an introduction to the novel technique and how it can be applied in historical research contexts, a working "works/doesn't yet work/future work" status update will be presented with a particular focus on large-document automated scanning. An [http://www3.elphel.com/sites/default/files/Elphel_353_Brochure_2010_V05.pdf Elphel 353L] camera as well as a [https://www.olimex.com/wiki/A10-OLinuXino-LIME A10-OLinuXino-LIME] interfaced with an [http://www.ebay.com/itm/5-Mega-pixel-Camera-Module-OV5642-w-CS-mount-Lens-/281312875657 OV5642 image sensor] ''via'' GPIO pins will be on display. |
||
<br /> |
<br /> |
||
⚫ | |||
− | See [https://gitorious.org/openvideo_reference_build https://gitorious.org/openvideo_reference_build] |
||
⚫ | |||
+ | The ‘Open Video Reference Build’ is a set of tools designed to facilitate working with [http://openvideoconference.org/ open video] in multiple contexts such as software development, live-streaming, A/V conferencing, video editing, and machine recognition. It currently consists of three BASH scripts that create a series of well-defined software packages running in a libre, long-term-support operating system: [http://trisquel.info/ Trisquel]. |
||
+ | Video can be difficult to work with. The Open Video Reference Build is designed to reduce as much complexity as possible without sacrificing build precision or extensibility. See: [https://gitorious.org/openvideo_reference_build https://gitorious.org/openvideo_reference_build] |
||
− | Mini-descriptions for each of the above to follow shortly. The room will be subject to change but will be held during in the morning session. |
Latest revision as of 17:58, 30 April 2015
An informal, introductory session led by User:GChriss on the following novel digization techniques:
hOCR Workflow Tools
The hOCR Workflow Tools project is a collection of tools to facilitate generation of text-searchable digital documents and is particularly useful in contexts where traditional OCR techniques would fare poorly (e.g. handwritten notes). It's implemented via two Inkscape extensions:
- Inkscape Extension: Export Image Overlay Text as hOCR
- Inkscape Extension: Create Multi-Page PDF from hOCR HTML Directory
Accurate text-searchable documents bring new life and layers of reader engagement to source materials.
The BookLiberator
The BookLiberator is an innovative, lightweight book-scanner design that's feature-complete, including scanning speed, with larger, traditional models. This mini-topic covers a number of design changes from the original-project design and new media processing tools summarized in the following blog post:
A BookLiberator lightning talk transcript is available:
One of the WiFi-active, 'anti-motion'-triggered Canon PowerShot cameras used in the design refresh will be on display.
High-Resolution Imaging
Using image sensors with a high pixel density (defined as the number of sensor pixels divided by total sensor size) combined with high-resolving-power lenses it's possible to image arbitrary surfaces in much higher detail than using document scanning or traditional macro photography techniques. For an example image created using this technique please see:
Beyond an introduction to the novel technique and how it can be applied in historical research contexts, a working "works/doesn't yet work/future work" status update will be presented with a particular focus on large-document automated scanning. An Elphel 353L camera as well as a A10-OLinuXino-LIME interfaced with an OV5642 image sensor via GPIO pins will be on display.
Open Video Reference Build
The ‘Open Video Reference Build’ is a set of tools designed to facilitate working with open video in multiple contexts such as software development, live-streaming, A/V conferencing, video editing, and machine recognition. It currently consists of three BASH scripts that create a series of well-defined software packages running in a libre, long-term-support operating system: Trisquel.
Video can be difficult to work with. The Open Video Reference Build is designed to reduce as much complexity as possible without sacrificing build precision or extensibility. See: https://gitorious.org/openvideo_reference_build