Bug 56734 (hOCR) - Add support for hOCR format as input to Writer
Summary: Add support for hOCR format as input to Writer
Status: NEW
Alias: hOCR
Product: LibreOffice
Classification: Unclassified
Component: filters and storage
Hardware: All All
Importance: medium enhancement
Assignee: Not Assigned
Whiteboard: BSA
Depends on:
Blocks: Format-Filters
Reported: 2012-11-04 10:06 UTC by Callegar
Modified: 2018-05-21 18:44 UTC

Description Callegar 2012-11-04 10:06:55 UTC
Namely, automatic conversion from hOCR, the HTML-derived format for representing OCR data, to ODT.

This would allow tighter integration with OCR tools such as Tesseract or Cuneiform.
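
To illustrate the kind of input such a filter would have to handle, here is a minimal sketch (not an actual LibreOffice filter) that pulls paragraph text out of an hOCR file using only the Python standard library. It assumes the input marks paragraphs with class "ocr_par" and words with class "ocrx_word", as Tesseract's hOCR output does; the names HocrParagraphs and hocr_to_paragraphs are hypothetical, and layout data (the bbox values in the title attributes) and the actual ODT writing are left out.

```python
from html.parser import HTMLParser

# HTML elements that never get a closing tag; they must not touch the stack.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class HocrParagraphs(HTMLParser):
    """Collect the words of each ocr_par element as one paragraph string."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []  # finished paragraphs, in document order
        self._words = []      # words of the paragraph currently being read
        self._stack = []      # hOCR label ("ocr_par", "ocrx_word" or "") per open element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        label = next((c for c in classes if c in ("ocr_par", "ocrx_word")), "")
        self._stack.append(label)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self._stack:
            return
        label = self._stack.pop()
        # Closing a paragraph element flushes the collected words.
        if label == "ocr_par" and self._words:
            self.paragraphs.append(" ".join(self._words))
            self._words = []

    def handle_data(self, data):
        # Keep only text nested inside a word element (possibly wrapped in
        # inline markup such as <strong> or <em>).
        text = data.strip()
        if text and "ocrx_word" in self._stack:
            self._words.append(text)


def hocr_to_paragraphs(hocr):
    """Return the recognised text of an hOCR document, one string per paragraph."""
    parser = HocrParagraphs()
    parser.feed(hocr)
    parser.close()
    return parser.paragraphs
```

Each returned string would map to one text paragraph in the resulting ODT document; a real import filter would also need to use the bounding boxes to preserve the page layout.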
Comment 1 bfoman (inactive) 2012-11-04 12:51:05 UTC
Enhancement request.
Comment 2 Roman Eisele 2012-11-22 07:19:10 UTC
For an introduction to the hOCR format (with additional links), see e.g.

A valid enhancement request, therefore set Status to NEW.

The Component should be either “Writer” or “filters and storage”.