Description:
Whenever I convert an XML file to PDF with LibreOffice, the application crashes if the string "pwi" appears anywhere in the file. This happens even when the string is not part of the XML declaration but is embedded elsewhere in the document. The issue occurs consistently with any valid XML file containing this string. I suspect a problem in how LibreOffice parses this character sequence in XML documents. Removing "pwi" prevents the crash.

Steps to Reproduce:
1. Create an XML file with the content "<?xmlpwi".
2. Run "soffice --nodefault --nofirststartwizard --nolockcheck --nologo --norestore --convert-to pdf:writer_pdf_Export --outdir . <<CREATED_FILE>>.xml".
3. No PDF file is created and no error is reported.
4. Create an XML file with the content "<?xmlpwa".
5. Run "soffice --nodefault --nofirststartwizard --nolockcheck --nologo --norestore --convert-to pdf:writer_pdf_Export --outdir . <<CREATED_FILE>>.xml".
6. The PDF file is created.

Actual Results:
LibreOffice crashes every time it processes an XML file containing the string "pwi".

Expected Results:
LibreOffice should process the XML file correctly and generate a PDF without crashing.

Reproducible: Always

User Profile Reset: Yes

Additional Info:
Removing or replacing the string "pwi" avoids the issue, but this is not a suitable workaround because I cannot control the content of all XML files. The issue may be related to how the LibreOffice XML parser handles certain character sequences.
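The reproduction steps above could be scripted roughly as follows. This is a minimal sketch, not part of the original report: the helper names (write_samples, build_command, convert) and the file names pwi.xml and pwa.xml are made up for illustration, and it assumes that soffice is on the PATH and that the output PDF takes the input file's base name. The command-line flags are copied verbatim from the report.

```python
import subprocess
from pathlib import Path

# Flags copied verbatim from the report.
SOFFICE_FLAGS = [
    "--nodefault", "--nofirststartwizard", "--nolockcheck",
    "--nologo", "--norestore",
    "--convert-to", "pdf:writer_pdf_Export",
]


def write_samples(directory: Path) -> dict[str, Path]:
    """Write the two minimal sample files from the report (hypothetical file names)."""
    samples = {"pwi": "<?xmlpwi", "pwa": "<?xmlpwa"}
    paths = {}
    for name, content in samples.items():
        path = directory / f"{name}.xml"
        path.write_text(content, encoding="utf-8")
        paths[name] = path
    return paths


def build_command(source: Path, outdir: Path, binary: str = "soffice") -> list[str]:
    """Assemble the conversion command line used in the report."""
    return [binary, *SOFFICE_FLAGS, "--outdir", str(outdir), str(source)]


def convert(source: Path, outdir: Path) -> bool:
    """Run the conversion and report whether a PDF appeared in outdir.

    Assumption: the PDF is named after the input file's stem.
    """
    subprocess.run(build_command(source, outdir), check=False, timeout=120)
    return (outdir / f"{source.stem}.pdf").exists()
```

Calling convert() on each path returned by write_samples() should, on an affected build, return False for pwi.xml and True for pwa.xml, matching steps 3 and 6.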
Could you attach an example XML file so that this is quicker to reproduce? Also, could you try a recent LO version such as 24.2.6?
Created attachment 196887 [details] text.xml
7.6.7 is just the earliest version we know of that has this bug. We are already on 24.2.6 and see the same behaviour.
(In reply to marcel.hoedl from comment #2)
> Created attachment 196887 [details]

That attachment is *not* an XML file at all. While it may be reasonable to expect LibreOffice to open it as plain text, your report still talks about *XML*, so it is also valid behavior to just ignore invalid XML. Please provide a valid XML file showing this problem.
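For reference, whether a sample counts as XML can be checked quickly before attaching it. The sketch below uses Python's standard xml.etree.ElementTree parser; the helper name is_well_formed is hypothetical, and the check only tests well-formedness, which is not necessarily the same test LibreOffice's type detection applies.

```python
import xml.etree.ElementTree as ET


def is_well_formed(text: str) -> bool:
    """Return True if text parses as a well-formed XML document."""
    try:
        ET.fromstring(text)
    except ET.ParseError:
        return False
    return True
```

Both minimal samples from the report ("<?xmlpwi" and "<?xmlpwa") fail this check, while a complete document such as '<?xml version="1.0"?><root>pwi</root>' passes.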
If you replace "pwi" with "pwa", LibreOffice converts the file. I know that's not valid XML; I just tried to give you a minimal example of the issue.
(In reply to marcel.hoedl from comment #5) GIGO principle doesn't require that different samples of garbage produce the same result. But well, if you decide that a request for a sample doesn't deserve attention, so be it.
Created attachment 196897 [details]
test.xml

I was answering the last comment from my smartphone and did not want to create an XML file on that device. Here it is.
https://gerrit.libreoffice.org/c/core/+/174607
Mike Kaganski committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/core/commit/8f25697591ecfd615a3142528ca13ee4d0d2c562

tdf#163295: XMLFilterDetect: make sure to only detect own types

It will be available in 25.2.0.

The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.
The latest dev version worked for us. Do you already have an ETA for the release?
(In reply to marcel.hoedl from comment #10) https://wiki.documentfoundation.org/ReleasePlan
Mike Kaganski committed a patch related to this issue.
It has been pushed to "libreoffice-24-8":

https://git.libreoffice.org/core/commit/9fcc9fe94f5073632c4fc50b153767acfa8f87ff

tdf#163295: XMLFilterDetect: make sure to only detect own types

It will be available in 24.8.3.

The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.