Bug 138977 - Automatic OCR text overlay in Libreoffice draw. Horrendously annoying and difficult to turn off
Summary: Automatic OCR text overlay in Libreoffice draw. Horrendously annoying and dif...
Status: RESOLVED INSUFFICIENTDATA
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Draw (show other bugs)
Version:
(earliest affected)
6.4.5.2 release
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-12-16 17:25 UTC by mcconnachieac
Modified: 2021-10-15 03:54 UTC (History)
1 user (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description mcconnachieac 2020-12-16 17:25:37 UTC
Description:
I'm not sure this really classes as a 'bug' but more of a sort of interaction design abomination.

I recently scanned a 100 page or so document. It turned out draw is a pretty cool program I already had that could take some of those segments of the documents, save them as different files, and also take out some of the duplicates that happened from me scanning things twice by mistake. Worked great.

PROBLEM

EVERY SINGLE wordy page, had OCR text placed on top automatically and I had to manually delete, and every word had its own separate text box. There is no easy way to turn this off that I could find through searching the help, looking around the program or even on a duckduckgo search. When exporting as a PDF it had this unremovable overlay scrawled all over my important documents.

Someone of my developed high computer skills should  not have to struggle with this.  I hope this gets addressed asap as I hate bluders like this in great software and it reinforces the idea that open source software is clunky and unusable.

Steps to Reproduce:
1. Choose a pdf file that is a scan of a computer-produced document with no actual text - so it is essentially a pdf image.
2. open said pdf file in Draw
3. Gaze in horror

Actual Results:
Dozens and dozens of text boxes appear on top of the words that have to be manually selected and deleted one by one. Even a scan of my passport picture produces the letter T on top of my face??

Expected Results:
No text should appear scrawled on top of my scanned documents. At the very most it should ask me if I want to do such a thing.


Reproducible: Always


User Profile Reset: No



Additional Info:
Version: 6.4.5.2 (x64) windows 7
Comment 1 user546 2021-03-17 11:31:54 UTC
When I open a pdf in acrobat the text is fine, but when I open it in libreoffice,  it doesn't show the real text (it's a scanned book), I guess it only shows the OCR text (which did not make a great job). 

So after rotating stuff and export it export as pdf, it create a pdf with the OCR text (which is mostly unreadable).

I thought the text OCR was just a layer in front of the image. So I try to delete the OCR text, but there is nothing (visible) behind. 

It's not intuitive at all.


We need a box "remove OCR" (without having to convert it as an image, which damage the qaulity). And when we open a pdf it should we expect to open first the real pdf, a pdf converted by the OCR.  (Though I imagine it's not easy to do)
Comment 2 Timur 2021-03-17 12:17:46 UTC
This big is missing sample PDF,please attach and search for similar bugs.
Comment 3 QA Administrators 2021-09-14 05:20:10 UTC Comment hidden (obsolete)
Comment 4 QA Administrators 2021-10-15 03:54:40 UTC
Dear mcconnachieac,

Please read this message in its entirety before proceeding.

Your bug report is being closed as INSUFFICIENTDATA due to inactivity and
a lack of information which is needed in order to accurately
reproduce and confirm the problem. We encourage you to retest
your bug against the latest release. If the issue is still
present in the latest stable release, we need the following
information (please ignore any that you've already provided):

a) Provide details of your system including your operating
   system and the latest version of LibreOffice that you have
   confirmed the bug to be present

b) Provide easy to reproduce steps – the simpler the better

c) Provide any test case(s) which will help us confirm the problem

d) Provide screenshots of the problem if you think it might help

e) Read all comments and provide any requested information

Once all of this is done, please set the bug back to UNCONFIRMED
and we will attempt to reproduce the issue. Please do not:

a) respond via email 

b) update the version field in the bug or any of the other details
   on the top section of our bug tracker

Warm Regards,
QA Team

MassPing-NeedInfo-FollowUp