Description: When opening a PDF in LibreOffice, the Romanian special characters are not shown. Steps to Reproduce: 1. Open a PDF file with Romanian special characters (ă, î, ș, ț, Ă, Î, Ș, Ț) 2. The Romanian text should be written with a font which includes all the above characters (generic Windows fonts, such as the Arial family, Times New Roman etc. usually include these characters) 3. Actual Results: The characters ș, ț, Ș, Ț are not shown Expected Results: Ș, Ț, Ș, ț should be seen Reproducible: Always User Profile Reset: Yes OpenGL enabled: Yes Additional Info: I've tested this on several .pdf files and the result is always the same
Created attachment 147317 [details] Missing diacritics
Created attachment 147318 [details] Same text in the original PDF
Hello, Thank you for filing the bug. Please send us a sample document, as this makes it easier for us to verify the bug. I have set the bug's status to 'NEEDINFO', so please do change it back to 'UNCONFIRMED' once you have attached a document. (Please note that the attachment will be public, remove any sensitive information before attaching it.) How can I eliminate confidential data from a sample document? https://wiki.documentfoundation.org/QA/FAQ#How_can_I_eliminate_confidential_data_from_a_sample_document.3F Thank you
Created attachment 148029 [details] PDF with Romanian diacritics This is a PDF file where the Romanian diacritics ș, Ș, ț and Ț are not shown when opening in Draw
Created attachment 148030 [details] Another PDF with Romanian diacritics Same behavior, the ș, Ș, ț and Ț characters are missing
Confirm.Version: 6.3.0.0.alpha0+ Build ID: ef58bf56ad292656ad2de0a417eda72cc170f782 CPU threads: 4; OS: Linux 4.15; UI render: default; VCL: gtk3; and Version 4.1.0.0.alpha0+ (Build ID: efca6f15609322f62a35619619a6d5fe5c9bd5a)
I am romanian also and I confirm on Version: 6.3.0.0.alpha0+ Build ID: afbbdcc216a84b59fb263777659b044c4a7cf6f0 CPU threads: 4; OS: Linux 4.15; UI render: default; VCL: gtk3; TinderBox: Linux-rpm_deb-x86_64@86-TDF, Branch:master, Time: 2019-01-13_03:54:12 Locale: ro-RO (ro_RO.UTF-8); UI-Language: en-US Calc: threaded
Dear san_ionut, To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year. There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present. If you have time, please do the following: Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/ If the bug is present, please leave a comment that includes the information from Help - About LibreOffice. If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice. Please DO NOT Update the version field Reply via email (please reply directly on the bug tracker) Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case) If you want to do more to help you can test to see if your issue is a REGRESSION. To do so: 1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from https://downloadarchive.documentfoundation.org/libreoffice/old/ 2. Test your bug 3. Leave a comment with your results. 4a. If the bug was present with 3.3 - set version to 'inherited from OOo'; 4b. If the bug was not present in 3.3 - add 'regression' to keyword Feel free to come ask questions or to say hello in our QA chat: https://kiwiirc.com/nextclient/irc.freenode.net/#libreoffice-qa Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-UntouchedBug
I downloaded again the attachment from comment 5 and open with LibreOffice. The same missing diacritics. Version: 7.2.0.0.alpha0+ Build ID: 5adc93a9a9426ef79054751be2904896f787a8a2 CPU threads: 4; OS: Linux 5.8; UI render: default; VCL: gtk3 Locale: ro-RO (ro_RO.UTF-8); UI: en-US Calc: threaded
On current master the fonts are correctly applied as Arial or Arial Bold. One issue I notice is that, when you open the pdf with a pdf reader (such as evince on linux) and make selection on the text, I only get "Derula i" rather than "Derulați".
I confirm what Kevin has notice: strange behaviour in PDF: diacritics are correct only visual, not when selecting a word. When I copy-paste text from PDF to any text editor there is NO diacritics. Should be not our bug?
Was the ț in Derulați exported as an image or any other object in the pdf? Is it ok if prepare a document using draw or writer, export to pdf and then reopen with Draw? As the char is shown when opening with a pdf viewer, of course it is Draw's bug. But it should be very hard to fix if we don't know where to find the char text on the page.
And the more strange thing is that, if I open the pdf with FireFox, the ț is shown and selectable... The xpdf script (<libreoffice path>/program/xpdfimport) processed the ț char as blank space. So, if it is libreoffice bug, then it must be something wrong with the xpdf (or even in the upstream poppler which is used by libreoffice to process the pdf)
I GUESS the problem may be somewhere in member function PDFOutDev::drawChar in sdext/source/pdfimport/xpdfwrapper/pdfioutdev_gpl.cxx i.e., need debug to see if the special char is correctly included but is then lost within this function.
Created attachment 174472 [details] odf with text Derulati
Created attachment 174473 [details] PDF generated from derulati.odt
I attached an odt file with text "Derulați" and also the exported PDF file generated form it. As you can see the PDF if it is opened with Draw is correctly open and the text and diactricits are well read by LibreOffice. So, I this the problem should be in the PDF file used by the reporter. Maybe is not well encoded in PDF.
I can reproduce this issue with any of the attached PDFs, any one else can reproduce? Version: 7.4.3.2 / LibreOffice Community Build ID: 1048a8393ae2eeec98dff31b5c133c5f1d08b890 CPU threads: 10; OS: Mac OS X 13.0.1; UI render: default; VCL: osx Locale: en-US (en_EG.UTF-8); UI: en-US Calc: threaded
I can NOT repro with Version: 7.5.0.0.alpha1+ (X86_64) / LibreOffice Community Build ID: ad085990b8073a122ac5222e5220f8f1d6826dcf CPU threads: 16; OS: Linux 5.15; UI render: default; VCL: gtk3 Locale: ro-RO (ro_RO.UTF-8); UI: en-US Calc: threaded Also I can NOT repro with Version: 7.4.0.2 / LibreOffice Community Build ID: 1512ce97d7ed39dce3121f7e15651fd8895f950e CPU threads: 16; OS: Linux 5.15; UI render: default; VCL: gtk3 Locale: ro-RO (ro_RO.UTF-8); UI: en-US Calc: threaded But I have to mention that I have ttf-mscorefonts-installer (Microsoft Fonts) installed on my Linux computer. (sudo add-apt-repository multiverse AND sudo apt update && sudo apt install ttf-mscorefonts-installer)
Sorry, I meant to say I can not reproduce. Closing, please reopen if you can still reproduce with up to date versions of LibreOffice.