Bug 138810 - Libre Office Draw: OCR text jump to front of image
Summary: Libre Office Draw: OCR text jump to front of image
Status: RESOLVED DUPLICATE of bug 104770
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Draw (show other bugs)
Version:
(earliest affected)
7.0.1.2 release
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-12-11 06:02 UTC by ajku500
Modified: 2022-03-09 18:11 UTC (History)
1 user (show)

See Also:
Crash report or crash signature:


Attachments
ocrmypdf-ocr.pdf (80.78 KB, application/pdf)
2022-03-09 17:03 UTC, ajku500
Details
ocrmypdf-ocr-export-libre-office.pdf (318.18 KB, application/pdf)
2022-03-09 17:04 UTC, ajku500
Details
pdf-gimp.pdf (146.35 KB, application/pdf)
2022-03-09 17:04 UTC, ajku500
Details
pdf-gimp-ocrmypdf.pdf (76.98 KB, application/pdf)
2022-03-09 17:05 UTC, ajku500
Details
pdf-gimp-ocrmypdf-export-libreoffice.pdf (303.33 KB, application/pdf)
2022-03-09 17:05 UTC, ajku500
Details
scanner-ocr.pdf (193.35 KB, application/pdf)
2022-03-09 17:06 UTC, ajku500
Details
scanner-ocr-export-libreoffice.pdf (329.19 KB, application/pdf)
2022-03-09 17:06 UTC, ajku500
Details

Note You need to log in before you can comment on or make changes to this bug.
Description ajku500 2020-12-11 06:02:39 UTC
Description:
I am using Libre Office Version: 7.0.1.2 Build ID: 00(Build:2) CPU threads: 2; OS: Linux 4.15; UC:\fakepath\Sample-raster-OCRmypdf-pdf.odfI render: default; VCL: gtk3 Locale: en-IN (en_IN); UI: en-US Ubuntu package version: 1:7.0.1_rc2-0ubuntu0.18.04.1 Calc: threaded

I noted that whenever I open a pdf file made of image + OCR in Libre Office Draw for editing the last most text/or text in a line/ or complete text on a page on every page is visible above the image. This cause lot of problem if I am working with 100s of pages. I have to select image on every page and move it to front. 

I further noted in my system I scanned image using HP deskjet 1050 scanner using Xsane.:
(Condition-1) If I open this image for editing in Gimp or okular and ocr resulting pdf. The resultant pdf show problem as described above.
(Condition-2) If I just OCR my scanned pdf file(without editing with any other siftware) and then open resulting file in Libre Office Draw it shows such issues.
I also downloaded .pdf with OCR files from various websites and found similar problem as described above.


Steps to Reproduce:
1. take a image or .pdf file open it with okular/ gimp and print it as .pdf file using option force rasterization.
2. ocr resulting .pdf file using any ocr software in my case ocrmypdf. 
3. open resulting .pdf file in Libre Office Draw. Any of the ocr text will be visible above the image. 
4. export .pdf from Libre Office Draw. the resulting .pdf will also have same issue. 

Actual Results:
The ocr text must be on back of image as it was in source pdf. But opened file and resulting .pdf both have any of the ocr text visible on front of the image. 

Expected Results:
The ocr text must be on back of image as it was in source pdf.


Reproducible: Always


User Profile Reset: No


OpenGL enabled: Yes

Additional Info:
Discussion about problem here: https://ask.libreoffice.org/en/question/281660/libre-office-draw-ocr-text-jump-to-front-of-image/
Comment 1 Xisco Faulí 2021-02-09 12:13:40 UTC
You can't confirm your own bugs. Moving it back to UNCONFIRMED until someone
else confirms it.
Comment 2 Natali Eroshenko 2021-08-13 09:54:27 UTC
don`t repro in Version: 7.1.5.2 (x64) / LibreOffice Community
Build ID: 85f04e9f809797b8199d13c421bd8a2b025d52b5
CPU threads: 6; OS: Windows 10.0 Build 19042; UI render: Skia/Raster; VCL: win
Locale: en-US (ru_RU); UI: en-US
Calc: threaded
Comment 3 Natali Eroshenko 2021-08-13 09:54:41 UTC
don`t repro in Version: 7.3.0.0.alpha0+ / LibreOffice Community
Build ID: 85f04e9f809797b8199d13c421bd8a2b025d52b5
CPU threads: 6; OS: Windows 10.0 Build 19042; UI render: Skia/Raster; VCL: win
Locale: en-US (ru_RU); UI: en-US
Calc: threaded
Comment 4 Timur 2021-08-13 12:43:45 UTC
You need to attach PDF from step2 and please screenshot after step 3. 
I tried with PDF you attached in Ask but couldn't see a problem.

This may be a duplicate of bug 104770.

Note: Lo 7.0.1.2 is old, many bugs were fixed with 7.0.6 and 7.1.5 and soon to be 7.1.6.
Comment 5 QA Administrators 2022-02-10 03:41:32 UTC Comment hidden (obsolete)
Comment 6 ajku500 2022-03-09 17:03:06 UTC
Created attachment 178754 [details]
ocrmypdf-ocr.pdf
Comment 7 ajku500 2022-03-09 17:04:04 UTC
Created attachment 178755 [details]
ocrmypdf-ocr-export-libre-office.pdf
Comment 8 ajku500 2022-03-09 17:04:55 UTC
Created attachment 178756 [details]
pdf-gimp.pdf
Comment 9 ajku500 2022-03-09 17:05:22 UTC
Created attachment 178757 [details]
pdf-gimp-ocrmypdf.pdf
Comment 10 ajku500 2022-03-09 17:05:47 UTC
Created attachment 178758 [details]
pdf-gimp-ocrmypdf-export-libreoffice.pdf
Comment 11 ajku500 2022-03-09 17:06:09 UTC
Created attachment 178759 [details]
scanner-ocr.pdf
Comment 12 ajku500 2022-03-09 17:06:33 UTC
Created attachment 178760 [details]
scanner-ocr-export-libreoffice.pdf
Comment 13 ajku500 2022-03-09 17:09:15 UTC
I am sorry for delay. Had some personal problems. Now I have updated the version of libreoffice. The detail is below:
Version: 7.3.1.3 / LibreOffice Community
Build ID: 30(Build:3)
CPU threads: 2; OS: Linux 4.15(Linux Mint 19.2 Tina); UI render: default; VCL: gtk3
Locale: en-IN (en_IN); UI: en-US
Ubuntu package version: 1:7.3.1~rc3-0ubuntu0.18.04.1~lo1
Calc: threaded
-----------------------------------------------------------
 
Here are the steps to Reproduce:

(1) Scan document/ create pdf using gimp and ocr using any method through scanner or ocrmypdf (Both use tesseract in my case). All the text in generated pdfs hidden behind the image. files/ screenshots: 
(2) Open document with LibreOffice Draw. Now the last text/ character at each page will expose (Or visible in front).
3) Now  export document open in LibreOffice Draw as pdf. In the exported pdf the the last text/ character at each page will expose (Or visible in front).

Expected result: the OCR text should remain in the background.

1.png:  LibreOffice Version (Link:https://imgur.com/a/PDGtz1G)
2.png:  Document OCR with ocrmypdf (ocrmypdf-ocr.pdf)(Link:https://imgur.com/dArgfaW)
3.png:  Document OCR with scanner (scanner-ocr.pdf)(Link:https://imgur.com/2zgyPmt)
4.png:  Document exported from gimp and OCR with ocrmypdf (pdf-gimp-ocrmypdf.pdf)(Link:https://imgur.com/EGuM1EV)
5.png:  Document ocrmypdf-ocr.pdf open in LibreOffice draw(Link:https://imgur.com/A3Ed0oU)
6.png:  Document scanner-ocr.pdf open in LibreOffice draw (Link:https://imgur.com/LDCSeGY)
7.png:  Document pdf-gimp-ocrmypdf.pdf open in LibreOffice draw (Link:https://imgur.com/sJKm3Qv)
8.png:  Document after ocrmypdf-ocr.pdf exported as pdf from LibreOffice draw (Link:https://imgur.com/mjOqyLF)
9.png:  Document after scanner-ocr.pdf exported as pdf from LibreOffice draw (Link: https://imgur.com/nByltvf)
10.png: Document after pdf-gimp-ocrmypdf.pdf exported as pdf from LibreOffice draw (Link: https://imgur.com/6qIhASj)

pdf files already attached in comments 6 to 12.
Comment 14 Timur 2022-03-09 18:11:43 UTC
I didn't test but let's assume duplicate.

*** This bug has been marked as a duplicate of bug 104770 ***