Description: Recently, an old regression that prevented opening Arabic/Persian/Hebrew PDF files correctly in Draw is fixed by patches from Kevin Suo (thanks!). But, there are still problems that prevent one from being able to edit the PDF file. The main problem is that characters are shaped in Arabic script according to the position of the character. Thus, for example if you convert the text ضمیمه into PDF, you will get ﺿﻤﯿﻤﻪ which is ﺿ ﻤ ﯿ ﻤ ﻪ instead of ض م ی م ه. Steps to Reproduce: 1. Open attachment 129523 [details] 2. Try to edit the text from first line Actual Results: Characters are shaped ones, instead of normal Arabic/Persian characters Expected Results: Characters should be unshaped, so that the user can edit the text Reproducible: Always User Profile Reset: No Additional Info: Happens in the LO 7.5 dev master after the fix
Would probably need to happen before the sd Textbox objects are instantiated--so would be a final step in the PDF import filter. Once the draw textboxes are built it too late to rework the text stream.
For the purpose of this, I think applying NFKC normalization to the text should be enough: >>> unicodedata.normalize("NFKC", "ﺿ ﻤ ﯿ ﻤ ﻪ") 'ض م ی م ه' But NFKC normalization can change text meaning for other Unicode code points, so it should be applied to Arabic Presentation Forms characters exclusively.
I can’t reproduce this on master or 7.5 builds, the text is imported as regular Arabic characters not presentation forms. The file line in the PDF is copied as “ضمیمه شماره ” after import. Tested with: Version: 7.5.3.2 (X86_64) / LibreOffice Community Build ID: 9f56dff12ba03b9acd7730a5a481eea045e468f3 CPU threads: 6; OS: Mac OS X 13.4; UI render: default; VCL: osx Locale: en-EG (en_EG.UTF-8); UI: en-US Calc: threaded and: Version: 7.6.0.0.alpha1+ (X86_64) / LibreOffice Community Build ID: 244f9cf66bc36f229ccb5712bc8d80166b92266d CPU threads: 6; OS: Mac OS X 13.4; UI render: Skia/Metal; VCL: osx Locale: en-EG (en_EG.UTF-8); UI: en-US Calc: threaded
Dear Hossein, This bug has been in NEEDINFO status with no change for at least 6 months. Please provide the requested information as soon as possible and mark the bug as UNCONFIRMED. Due to regular bug tracker maintenance, if the bug is still in NEEDINFO status with no change in 30 days the QA team will close the bug as INSUFFICIENTDATA due to lack of needed information. For more information about our NEEDINFO policy please read the wiki located here: https://wiki.documentfoundation.org/QA/Bugzilla/Fields/Status/NEEDINFO If you have already provided the requested information, please mark the bug as UNCONFIRMED so that the QA team knows that the bug is ready to be confirmed. Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-NeedInfo-Ping
Dear Hossein, Please read this message in its entirety before proceeding. Your bug report is being closed as INSUFFICIENTDATA due to inactivity and a lack of information which is needed in order to accurately reproduce and confirm the problem. We encourage you to retest your bug against the latest release. If the issue is still present in the latest stable release, we need the following information (please ignore any that you've already provided): a) Provide details of your system including your operating system and the latest version of LibreOffice that you have confirmed the bug to be present b) Provide easy to reproduce steps – the simpler the better c) Provide any test case(s) which will help us confirm the problem d) Provide screenshots of the problem if you think it might help e) Read all comments and provide any requested information Once all of this is done, please set the bug back to UNCONFIRMED and we will attempt to reproduce the issue. Please do not: a) respond via email b) update the version field in the bug or any of the other details on the top section of our bug tracker Warm Regards, QA Team MassPing-NeedInfo-FollowUp
(In reply to خالد حسني from comment #3) I also can't reproduce with: Version: 24.2.0.0.alpha1+ (X86_64) / LibreOffice Community Build ID: 516f800f84b533db0082b1f39c19d1af40ab29c8 CPU threads: 4; OS: Linux 6.5; UI render: default; VCL: gtk3 Locale: he-IL (en_IL); UI: en-US when opening in Draw. So, closing this as WORKSFORME; but - Khaled, you are more than welcome to reopen if this shows up again or if I've misunderstood something.
I still see multiple issues with the imported file in the latest LO 24.2 dev master: The smiley is reversed, (: becomes ): الله (ligature) becomes هللا In the first line one brackets is reversed, but not the other one. Parenthesizes in the text are reversed. Parenthesizes in the links are reversed. Version: 24.8.0.0.alpha0+ (X86_64) / LibreOffice Community Build ID: 5056da285da2f130d741add1f8432cd590116a96 CPU threads: 12; OS: Linux 6.2; UI render: default; VCL: gtk3 Locale: en-US (en_US.UTF-8); UI: en-US Calc: CL threaded I think this issue should remain open, until a fix is provided.