I try to open the following PPTX file under openSUSE – Version: 6.3.3.2.0+ http://www.cosmo-model.org/content/consortium/generalMeetings/general2019/wg7/Gebhardt_GM2019_parallelWG1-WG7.pptx Result: File format error found at SfxBaseModel::storeToStorage: 0x20d(row,col) I can open that file with PowerPoint 2013 under Windows.
Created attachment 156581 [details] gdb bt On pc Debian x86-64 with master sources updated today, I could reproduce this.
I reproduce this as well, even in latest LibreOffice 6.4.1.2 on Windows (32-bit and 64-bit). Let me know if my sample is needed, but Tobias' looks like same thing -- exact same "SfxBaseModel::storeToStorage: 0x20d(row,col)" message, and similar content in slides (it's the pages with calculus math symbols.)
Created attachment 159341 [details] Flight management course pptx version Issue reported via Twitter in DM. Flight management course pptx having the exact same problem.
Created attachment 159342 [details] Flight management course odp version converted from pptx to odp using Office 365 - March 2020 Issue reported via Twitter in DM. Flight management course pptx having the exact same problem.
Created attachment 159418 [details] Reduced example I tried to reduce the example which allows to reproduce the bug. It may help to investigate.
I noticed these logs on console: warn:legacy.osl:4632:5136:sax/source/expatwrap/saxwriter.cxx:399: lone 2nd Unicode surrogate warn:legacy.osl:4632:5136:sax/source/expatwrap/saxwriter.cxx:424: illegal Unicode character The problem seems related to a wrong use of surrogates (Unicode notion) at a moment. Indeed, surrogates are always by pairs (high and low surrogates). Most of the times, it's ok but sometimes you got one of them but not followed by the other part of the pair. For the moment, I don't know why.
Symbols and Unicode convert from the reduced slide: 𝑋 %uD835%uDC4B 𝑢 %uD835%uDC62 ≡ %u2261 𝜕 %uD835%uDF15
Created attachment 159420 [details] Minimal example So the pb is triggered with character "𝜕" (%uD835%uDF15) Why the surrogates mechanism doesn't work for it and it works for the others, I don't know yet.
Created attachment 159432 [details] bt Part of bt where partial d is analyzed On ICU, I found this line: source/data/unidata/confusables.txt:1115:1D715 ; 2202 ; MA #* ( 𝜕 → ∂ ) MATHEMATICAL ITALIC PARTIAL DIFFERENTIAL → PARTIAL DIFFERENTIAL #
Eike: I'm a bit a lost between starmath/i18npool and icu. Any thought why this character which uses surrogates (but other with also surrogates don't fail) is wrongly parsed?
I gave a try with: https://gerrit.libreoffice.org/c/core/+/91941 At least all the attached files open with it. However, I'm still confused why some surrogate pairs are well taken into account others not...
Locally, I got no problem to build but on Jenkins it fails :-(
Thanks to Stephan Bergmann, I could simplify the patch, waiting for Jenkins results.
Julien Nabet committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/11b57129b53e1e2d71a5f969e2417226b4e2ddd9 tdf#129372: PPTX: error at SfxBaseModel::storeToStorage: 0x20d(row,col) It will be available in 7.0.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Not sure if the patch is ok, I submitted it so it'll be on daily build in 24/48 hours and people may test it. Meanwhile, I cherry-picked it for 6.4 branch and let it for review (anyway, I can't submit myself since it must be validated by at least one person).
Also reproduced in Version: 5.4.0.0.alpha1+ Build ID: 9feb7f7039a3b59974cbf266922177e961a52dd1 CPU threads: 4; OS: Linux 4.19; UI render: default; VCL: gtk3; Locale: en-US (en_US.UTF-8); Calc: group
but not in Version: 5.2.0.0.alpha0+ Build ID: 3ca42d8d51174010d5e8a32b96e9b4c0b3730a53 Threads 4; Ver: 4.19; Render: default;
For the record, the document opens fine up to https://cgit.freedesktop.org/libreoffice/core/commit/?id=d81d104833f0ee9349ebcd0d79d2de84ba9a7262 author Michael Stahl <mstahl@redhat.com> 2016-02-12 18:22:51 +0100 committer Michael Stahl <mstahl@redhat.com> 2016-02-12 18:54:33 +0100 commit d81d104833f0ee9349ebcd0d79d2de84ba9a7262 (patch) tree 20069a32b56b52b9b8cdb4d37c5a0b22bfeb5c82 parent e2bfae9006e6adc4de17d0167dac6661b002f126 (diff) sfx2: related tdf#56270: loss of embedded objects imported from DOCX after this commit, LibreOffice prompts a Genera Error: General input/output error. LibreOffice started to crash after https://cgit.freedesktop.org/libreoffice/core/commit/?id=178f5306979ef55a5682191dcdafb9e926e57cde Bisected with bibisect-linux-64-5.2
Julien Nabet committed a patch related to this issue. It has been pushed to "libreoffice-6-4": https://git.libreoffice.org/core/commit/67af725a8623a509960a8463f7876fcd680565ad tdf#129372: PPTX: error at SfxBaseModel::storeToStorage: 0x20d(row,col) It will be available in 6.4.4. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Backport on 6.3 branch waiting for Jenkins: https://gerrit.libreoffice.org/c/core/+/91902
Julien Nabet committed a patch related to this issue. It has been pushed to "libreoffice-6-3": https://git.libreoffice.org/core/commit/ab0078509c352ee5d7b8ae5334d49f7c14fc26a5 tdf#129372: PPTX: error at SfxBaseModel::storeToStorage: 0x20d(row,col) It will be available in 6.3.6. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Xisco Fauli committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/ed3b44ef622bc87da2425322521c293c2a46a1c5 tdf#129372: Add unittest It will be available in 7.0.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Verified in Version: 7.0.0.0.alpha0+ Build ID: 35fc5ef0a759884b24ed8b83cd05702a0fab64cc CPU threads: 4; OS: Linux 4.19; UI render: default; VCL: gtk3; Locale: en-US (en_US.UTF-8); UI-Language: en-US Calc: threaded @Julien, thanks for fixing this issue!