Bug 72308 - FORMATTING: Better way of .doc to .html conversion with centering
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version: unspecified (earliest affected)
Hardware: Other All
Importance: medium minor
Assignee: Not Assigned
URL:
Whiteboard: BSA
Keywords: filter:html
Duplicates: 101203
Depends on:
Blocks: (X)HTML-Export
 
Reported: 2013-12-04 11:59 UTC by grave_123
Modified: 2022-05-07 11:24 UTC (History)
CC List: 3 users

See Also:
Crash report or crash signature:


Attachments
The one of many files that I experience this frustration with in .doc format (989.00 KB, application/msword)
2013-12-04 11:59 UTC, grave_123

Description grave_123 2013-12-04 11:59:03 UTC
Created attachment 90222
The one of many files that I experience this frustration with in .doc format

Problem description: 

I've been experiencing this for a long time in LibreOffice Writer and frankly I'm tired of it. I help a client turn documents he wrote in Microsoft Word 2000/2003 into HTML so I can put them up on the Internet. Every single time I convert a .doc file to .html, the titles and other centered text in the document become flush left. After looking at the page source I started hacking at the problem and discovered that LibreOffice sets some goofy version of the align property that looks like this:

ALIGN=CENTER

That attribute does nothing to center the text once the file is converted. What I DID find works, however, is putting text-align: center in the STYLE attribute of the <P> tag.

For example:

<P ALIGN=CENTER STYLE="widows: 0; orphans: 0"><B>FOOBAR</B></P>

Will not work. But THIS does center the text:

<P STYLE="widows: 0; orphans: 0; text-align: center"><B>FOOBAR</B></P>


Is there any way someone can get Writer to do this instead of that goofy ALIGN=CENTER thing? Thanks :)
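
One possible stopgap until the export changes is to post-process the saved file. Below is a minimal sketch in Python, not anything LibreOffice ships: it moves a legacy ALIGN attribute on <P> tags into the inline STYLE attribute, as in the hand edit above. The function name align_to_inline_style and the regular expressions are assumptions made up for illustration, and a regex pass like this only copes with simple, well-formed tags.

```python
import re

# Matches an opening <p ...> tag (any case) and captures its attribute text.
P_TAG = re.compile(r"<p\b([^>]*)>", re.IGNORECASE)
# Matches a legacy align attribute, quoted or unquoted.
ALIGN_ATTR = re.compile(
    r"""\s*\balign\s*=\s*["']?(left|right|center|justify)["']?""",
    re.IGNORECASE,
)
# Matches a double-quoted style attribute and captures its declarations.
STYLE_ATTR = re.compile(r'(\bstyle\s*=\s*")([^"]*)(")', re.IGNORECASE)


def _rewrite_tag(match):
    attrs = match.group(1)
    align = ALIGN_ATTR.search(attrs)
    if align is None:
        return match.group(0)  # no align attribute: leave the tag untouched
    declaration = "text-align: " + align.group(1).lower()
    attrs = ALIGN_ATTR.sub("", attrs, count=1)  # drop the legacy attribute

    if STYLE_ATTR.search(attrs):
        # Append the declaration to the existing inline style.
        def extend(m):
            existing = m.group(2).strip().rstrip(";")
            joined = existing + "; " + declaration if existing else declaration
            return m.group(1) + joined + m.group(3)

        attrs = STYLE_ATTR.sub(extend, attrs, count=1)
    else:
        # No inline style yet: add one.
        attrs += ' style="' + declaration + '"'
    # Keep the original "<P" or "<p" spelling of the tag name.
    return match.group(0)[:2] + attrs + ">"


def align_to_inline_style(html):
    """Rewrite align attributes on <p> tags as inline text-align declarations."""
    return P_TAG.sub(_rewrite_tag, html)


if __name__ == "__main__":
    print(align_to_inline_style(
        '<P ALIGN=CENTER STYLE="widows: 0; orphans: 0"><B>FOOBAR</B></P>'
    ))
```

Run on the failing example above, this produces the second, working form of the tag. Tags whose attribute values contain ">" or that use single-quoted STYLE values are not handled.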


My current OS: Ubuntu 13.10
My version of LibreOffice: 4.1.2.3
Browser used: Mozilla Firefox 25.0.1


Steps to reproduce:
1. Open a .doc file in LibreOffice Writer and save it as .html using LibreOffice Writer
2. Open in Firefox
3. Anything meant to be centered is now flush left

Current behavior:

Centered text becomes flush left

Expected behavior:

Centered text
Operating System: All
Version: unspecified
Comment 1 retired 2013-12-29 12:53:59 UTC
Confirmed:4.2.0.1:OSX

* open attached example file
* save as ... html file
* open html file with LO

Text stays centered.

*But* open html-file with FF, WebKit or Chromium > Text is left.

Not sure if this is expected so > NeedAdvice

and NEW since the described behavior is confirmed.
Comment 2 grave_123 2013-12-29 14:16:48 UTC
(In reply to comment #1)
> Confirmed:4.2.0.1:OSX
> 
> * open attached example file
> * save as ... html file
> * open html file with LO
> 
> Text stays centered.
> 
> *But* open html-file with FF, WebKit or Chromium > Text is left.
> 
> Not sure if this is expected so > NeedAdvice
> 
> and NEW since the described behavior is confirmed.

No, this is not expected: text marked with "ALIGN=CENTER" is meant to be centered.

In a nutshell: The desired outcome is to have the text centered, not flush left. :)
Comment 3 Joel Madero 2015-05-02 15:41:51 UTC Comment hidden (obsolete)
Comment 4 Buovjaga 2015-06-20 14:04:42 UTC
Repro.

Win 7 Pro 64-bit Version: 5.1.0.0.alpha1+
Build ID: 3ecef8cedb215e49237a11607197edc91639bfcd
TinderBox: Win-x86@62-merge-TDF, Branch:MASTER, Time: 2015-06-19_23:16:58
Locale: fi-FI (fi_FI)
Comment 5 Buovjaga 2016-08-06 19:27:39 UTC
*** Bug 101203 has been marked as a duplicate of this bug. ***
Comment 6 QA Administrators 2017-09-01 11:16:17 UTC Comment hidden (obsolete)
Comment 7 Dennis Roczek 2018-04-03 22:52:36 UTC
still repro with

Version: 6.0.1.1 (x64)
Build ID: 60bfb1526849283ce2491346ed2aa51c465abfe6
CPU threads: 4; OS: Windows 10.0; UI render: GL; 
Locale: de-DE (de_DE); Calc: CL
Comment 8 QA Administrators 2019-04-04 03:03:57 UTC Comment hidden (obsolete)
Comment 9 Dennis Roczek 2020-05-06 17:09:18 UTC
still repro with

Version: 6.4.2.2 (x64)
Build ID: 4e471d8c02c9c90f512f7f9ead8875b57fcb1ec3
CPU threads: 4; OS: Windows 10.0 Build 18363; UI render: default; VCL: win; 
Locale: de-DE (de_DE); UI language: de-DE
Calc: CL
Comment 10 QA Administrators 2022-05-07 03:33:13 UTC Comment hidden (obsolete)
Comment 11 Michael Warner 2022-05-07 11:24:38 UTC
If I create a new document, and set some text centered, I get the HTML output:
<p align="center" style="line-height: 100%; margin-bottom: 0in">
Centered</p>

And when I view this in a web browser, it is centered.

When I open the attached document, and export to HTML, it has align="center" attribute on the <p> tag but is rendered as left-justified (as described in Comment 0). So, in that sense, this is confirmed. 

However, the reason the text is not centered is that the HTML exported from the attached document also contains a <style> tag in the header: 
		p { color: #000000; text-align: justify; orphans: 2; widows: 2; direction: ltr; background: transparent }

The text-align:justify style on p is overriding the align="center" attribute on the element. Looking in LO Writer, I can see that justified alignment is set on the "Text Body" paragraph style.
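
To check whether an exported file is affected by this conflict, something like the following could be used. It is a rough sketch using Python's standard html.parser module; the names AlignConflictFinder and find_overridden_alignments are made up for this example, and the CSS handling is deliberately naive (it only looks for a text-align declaration in a rule whose selector ends in p).

```python
import re
from html.parser import HTMLParser

# Finds a rule whose selector ends in `p` and captures its declarations.
P_RULE = re.compile(r"(?:^|[\s}])p\s*\{([^}]*)\}", re.IGNORECASE)
# Captures the keyword of a text-align declaration.
TEXT_ALIGN = re.compile(r"text-align\s*:\s*([a-z]+)", re.IGNORECASE)


class AlignConflictFinder(HTMLParser):
    """Collects <style> text and the align attributes found on <p> tags."""

    def __init__(self):
        super().__init__()
        self.in_style = False
        self.css_chunks = []   # raw text found inside <style> elements
        self.align_attrs = []  # align values on <p> tags without inline text-align

    def handle_starttag(self, tag, attrs):
        if tag == "style":
            self.in_style = True
        elif tag == "p":
            attr_map = dict(attrs)
            value = attr_map.get("align")
            inline = attr_map.get("style") or ""
            # An inline text-align would win anyway, so skip those paragraphs.
            if value and not TEXT_ALIGN.search(inline):
                self.align_attrs.append(value.lower())

    def handle_endtag(self, tag):
        if tag == "style":
            self.in_style = False

    def handle_data(self, data):
        if self.in_style:
            self.css_chunks.append(data)


def find_overridden_alignments(html):
    """Return align values on <p> tags that a `p { text-align }` rule overrides."""
    finder = AlignConflictFinder()
    finder.feed(html)
    finder.close()
    sheet_align = None
    for rule in P_RULE.finditer("".join(finder.css_chunks)):
        found = TEXT_ALIGN.search(rule.group(1))
        if found:
            sheet_align = found.group(1).lower()  # a later rule replaces an earlier one
    if sheet_align is None:
        return []
    return [a for a in finder.align_attrs if a != sheet_align]
```

A non-empty result means the file has paragraphs whose align attribute is expected to lose to the stylesheet rule, which is the situation in the attached document.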

So, the bug here is that in LO, alignment set directly on text overrides a paragraph style, but in HTML the opposite holds: the stylesheet rule wins over the align attribute. The align="center" attribute is the HTML 4 way to do it; the "text-align:center" style is the HTML 5 way. This goes back to the fact that we aren't really generating proper HTML 5 for export. See discussion in Bug 95861.
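
The precedence described in this comment can be summed up in a few lines. This is only a simplified model in Python for illustration (the function effective_text_align is hypothetical, and it ignores selector specificity, !important and inheritance): an inline style declaration beats a stylesheet rule, which in turn beats the legacy align attribute.

```python
def effective_text_align(align_attr=None, sheet_rule=None, inline_style=None):
    """Return the alignment a browser would be expected to use for a paragraph.

    align_attr   -- value of the legacy align attribute, if present
    sheet_rule   -- text-align from a matching <style> rule, if present
    inline_style -- text-align from the element's style attribute, if present
    """
    # Highest priority first: inline style, stylesheet rule, legacy attribute.
    for candidate in (inline_style, sheet_rule, align_attr):
        if candidate:
            return candidate.lower()
    # Simplifying assumption: left-to-right text with no alignment set.
    return "left"


if __name__ == "__main__":
    # New document: only the attribute is present.
    print(effective_text_align(align_attr="center"))
    # Attached document: the p rule in <style> wins over the attribute.
    print(effective_text_align(align_attr="CENTER", sheet_rule="justify"))
    # Reporter's hand edit: the inline style wins over the p rule.
    print(effective_text_align(sheet_rule="justify", inline_style="center"))
```

Under this model the three cases from the report come out as center, justify and center, which matches what was observed in the browsers.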