Bug 131008 - Fileopen: Bullet symbol Soft Hyphen from MSO Subset: Latin1-Supplement not shown in .PPTX
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Impress
Version (earliest affected): 4.2.0.4 release
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: bibisected, bisected, filter:pptx, regression
Depends on:
Blocks: Hyphenation
Reported: 2020-02-28 13:47 UTC by Walter Cheuk
Modified: 2023-10-04 10:44 UTC (History)
CC List: 5 users

See Also:
Crash report or crash signature:


Attachments
The .pptx file that have problem (7.22 MB, application/vnd.openxmlformats-officedocument.presentationml.presentation)
2020-02-28 13:49 UTC, Walter Cheuk
The .pptx file opened in LO Impress (161.30 KB, image/png)
2020-02-28 13:50 UTC, Walter Cheuk
The .pptx file opened in Powerpoint (180.47 KB, image/png)
2020-02-28 13:51 UTC, Walter Cheuk
The .pptx just text (25.31 KB, application/vnd.openxmlformats-officedocument.presentationml.presentation)
2020-04-02 13:33 UTC, Timur
Comparison LibreOffice 7.4 master and PP 2016 (154.21 KB, image/png)
2022-04-26 07:54 UTC, Xisco Faulí

Description Walter Cheuk 2020-02-28 13:47:20 UTC
Description:
Some bullet point symbols in the .PPTX file are not shown in LO, while they display fine in PowerPoint.

Steps to Reproduce:
1. Open the .pptx file

Actual Results:
Some bullet point symbols are missing.

Expected Results:
The bullet point symbols should be shown


Reproducible: Always


User Profile Reset: No



Additional Info:
Version: 6.4.1.2
Build ID: 4d224e95b98b138af42a64d84056446d09082932
CPU threads: 8; OS: Mac OS X 10.14.6; UI render: default; VCL: osx; 
Locale: zh-HK (zh_Hant.UTF-8); UI-Language: en-US
Calc: threaded
Comment 1 Walter Cheuk 2020-02-28 13:49:26 UTC
Created attachment 158255 [details]
The .pptx file that have problem
Comment 2 Walter Cheuk 2020-02-28 13:50:27 UTC
Created attachment 158256 [details]
The .pptx file opened in LO Impress
Comment 3 Walter Cheuk 2020-02-28 13:51:07 UTC
Created attachment 158257 [details]
The .pptx file opened in Powerpoint
Comment 4 eisa01 2020-02-29 13:37:57 UTC
Can confirm, also present on Windows

Bullet was shown on LO 3.3, so a regression

Version: 7.0.0.0.alpha0+
Build ID: 0cb4f304abf6f8dd6b40eb800788d2fe80581813
CPU threads: 4; OS: Mac OS X 10.14.6; UI render: default; VCL: osx; 
Locale: en-US (en_US.UTF-8); UI-Language: en-US
Calc: threaded
Comment 5 raal 2020-03-03 06:05:22 UTC
This seems to have begun at the commit below.
Adding Cc: to Khaled Hosny; could you possibly take a look at this one?
Thanks
 b8b0416620feecf3ede3305830a2b145c62a5bf9 is the first bad commit
commit b8b0416620feecf3ede3305830a2b145c62a5bf9
Author: Matthew Francis <mjay.francis@gmail.com>
Date:   Fri Sep 18 11:08:32 2015 +0800

    source-hash-bff8fa97e16f0f06fddc5545ea36c8bd2b18a580
    
    commit bff8fa97e16f0f06fddc5545ea36c8bd2b18a580
    Author:     Khaled Hosny <khaledhosny@eglug.org>
    AuthorDate: Mon May 6 11:08:29 2013 +0200
    Commit:     Khaled Hosny <khaledhosny@eglug.org>
    CommitDate: Mon May 6 17:22:31 2013 +0200
    
        Enable HarfBuzz by default
Comment 6 Timur 2020-04-02 12:17:45 UTC Comment hidden (obsolete)
Comment 7 Timur 2020-04-02 13:33:58 UTC
Created attachment 159268 [details]
The .pptx just text

The PPTX opens in MSO 2016. I am attaching a simplified version with just the text.

This bullet is shown:

<a:pPr marL="228600" indent="-228240">
	<a:lnSpc>
		<a:spcPct val="90000"/>
	</a:lnSpc>
	<a:spcBef>
		<a:spcPts val="1001"/>
	</a:spcBef>
	<a:buClr>
		<a:srgbClr val="000000"/>
	</a:buClr>
	<a:buFont typeface="Arial"/>
	<a:buChar char="•"/>
</a:pPr>

This dash bullet is not:

<a:pPr marL="685800" lvl="1" indent="-228240">
	<a:lnSpc>
		<a:spcPct val="90000"/>
	</a:lnSpc>
	<a:spcBef>
		<a:spcPts val="499"/>
	</a:spcBef>
	<a:buClr>
		<a:srgbClr val="000000"/>
	</a:buClr>
	<a:buFont typeface="Calibri"/>
	<a:buChar char="­"/>
</a:pPr>
Comment 8 Timur 2020-04-02 13:42:58 UTC Comment hidden (me-too)
Comment 9 Timur 2020-04-02 13:49:54 UTC
In <a:buChar char="­"/> I see dash - as char in Notepad, but it's not copied here. 
It's a custom bullet in MSO: char code 00AD (Soft Hyphen) from Subset: Latin1-Supplement.
Hyphen-minus from Basic Latin is 002D.

Looks like the author made a mistake that MSO allows but LO does not.
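
Because the soft hyphen is invisible in most viewers, it can help to print the code point of each bullet character rather than rely on how it looks. The following is a minimal Python sketch, not part of LibreOffice: the helper name bullet_code_points and the trimmed sample fragment are made up for illustration, and only the DrawingML namespace URI and the buChar/char names come from the file format.

```python
import unicodedata
import xml.etree.ElementTree as ET

# DrawingML main namespace, bound to the "a:" prefix in the slide XML.
A_NS = "http://schemas.openxmlformats.org/drawingml/2006/main"


def bullet_code_points(xml_text):
    """Return, for each <a:buChar>, a list of (code point, Unicode name) pairs."""
    root = ET.fromstring(xml_text)
    result = []
    for bu in root.iter(f"{{{A_NS}}}buChar"):
        chars = bu.get("char", "")
        result.append(
            [(f"U+{ord(c):04X}", unicodedata.name(c, "<unnamed>")) for c in chars]
        )
    return result


# Hypothetical, trimmed-down version of the paragraph properties quoted above;
# the bullet character is written as an escape so that it stays visible here.
SAMPLE = (
    f'<a:pPr xmlns:a="{A_NS}" marL="685800" lvl="1" indent="-228240">'
    '<a:buFont typeface="Calibri"/>'
    '<a:buChar char="\u00ad"/>'
    '</a:pPr>'
)

print(bullet_code_points(SAMPLE))
```

For the sample fragment this reports U+00AD (SOFT HYPHEN), whereas a typed dash would be reported as U+002D (HYPHEN-MINUS).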
Comment 10 Xisco Faulí 2022-04-26 07:54:08 UTC
Created attachment 179776 [details]
Comparison LibreOffice 7.4 master and PP 2016
Comment 11 Xisco Faulí 2022-04-26 07:54:24 UTC
Still reproducible in

Version: 7.4.0.0.alpha0+ / LibreOffice Community
Build ID: 67d8c245daf7c6ee33e908255f2bbd0ea8d08206
CPU threads: 8; OS: Linux 5.10; UI render: default; VCL: gtk3
Locale: es-ES (es_ES.UTF-8); UI: en-US
Calc: threaded
Comment 12 ⁨خالد حسني⁩ 2023-01-16 14:53:54 UTC
This is tricky to fix. We generally don’t want U+00AD to be rendered unless the line was broken at it, but there is no line breaking involved here. We might want to make it visible when used as a bullet (possibly under some MS compatibility flag). The decision would have to be taken inside GenericSalLayout::LayoutText(), though, and there we don’t know where the text came from, so we cannot make such a decision.

Maybe the import filter should replace soft hyphen with hyphen-minus, if MS Office always makes it visible.
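
The substitution idea can be tried from the user side by pre-processing the slide XML before opening the file. The sketch below is an assumption-laden illustration in Python, not LibreOffice's import filter code: the function name replace_soft_hyphen_bullets and the simplified sample markup are hypothetical, and it only touches bullets whose char attribute is exactly U+00AD.

```python
import xml.etree.ElementTree as ET

# DrawingML main namespace, bound to the "a:" prefix in the slide XML.
A_NS = "http://schemas.openxmlformats.org/drawingml/2006/main"
SOFT_HYPHEN = "\u00ad"
HYPHEN_MINUS = "-"

# Keep the "a:" prefix when the tree is serialized again.
ET.register_namespace("a", A_NS)


def replace_soft_hyphen_bullets(xml_text):
    """Return xml_text with soft-hyphen bullets changed to hyphen-minus."""
    root = ET.fromstring(xml_text)
    for bu in root.iter(f"{{{A_NS}}}buChar"):
        if bu.get("char") == SOFT_HYPHEN:
            bu.set("char", HYPHEN_MINUS)
    return ET.tostring(root, encoding="unicode")


# Simplified, hypothetical markup: one soft-hyphen bullet, one ordinary bullet.
SAMPLE = (
    f'<a:txBody xmlns:a="{A_NS}">'
    '<a:p><a:pPr><a:buChar char="\u00ad"/></a:pPr></a:p>'
    '<a:p><a:pPr><a:buChar char="\u2022"/></a:pPr></a:p>'
    '</a:txBody>'
)

fixed = ET.fromstring(replace_soft_hyphen_bullets(SAMPLE))
print([bu.get("char") for bu in fixed.iter(f"{{{A_NS}}}buChar")])
```

Applying this to a real presentation would mean rewriting the XML parts inside the .pptx zip package. Bullets may also be defined in layouts or masters rather than on the slide itself, so a slide-only pass might miss some.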