Bug 56734 (hOCR) - Add support for hOCR format as input to Writer
Summary: Add support for hOCR format as input to Writer
Status: NEW
Alias: hOCR
Product: LibreOffice
Classification: Unclassified
Component: filters and storage (show other bugs)
Version:
(earliest affected)
3.6.3.2 release
Hardware: All All
: medium enhancement
Assignee: Not Assigned
URL:
Whiteboard: BSA
Keywords:
Depends on:
Blocks: Format-Filters
  Show dependency treegraph
 
Reported: 2012-11-04 10:06 UTC by Callegar
Modified: 2018-05-21 18:44 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Callegar 2012-11-04 10:06:55 UTC
Namely, automatic conversion between the hocr html-derived format for the representation of ocr data to odt.

This would allow more strict integration with ocr tools like tesseract or cuneiform.
Comment 1 bfoman (inactive) 2012-11-04 12:51:05 UTC
Enhancement request.
Comment 2 Roman Eisele 2012-11-22 07:19:10 UTC
For first information about the hOCR format (with additional links), see e.g.
   http://en.wikipedia.org/wiki/hOCR

A valid enhancement request, therefore set Status to NEW.

The Component should be either “Writer” or “filters and storage”.