Created attachment 143783 [details]
Example DOC with 4 pages
Attached is DOC from bug 93977 (with faux TOC that's not relevant here).
It's 4 pages in MSO and was also 4 pages in LO up to 22.214.171.124.
Then in 4.3.1 it miraculously got an empty page after page 2 so it's 5 pages now up to 6.2+.
No issue now with same document saved in MSO as DOCX but problem was similar with 4.3. I see it fixed in 5.3.0.
It would be nice to both find a regression, if it's a regression, and where it was fixed for DOCX. I wouldn't surprised if JL did that, he had many fixes.
Anyway, we should always test both DOC and DOCX in cases like these.
Created attachment 143784 [details]
2nd example DOC with 1 page
Not sure if the same issue but here is 2nd example DOC with 1 page that I created in MSO from attachment 64325 [details] from regression bug 52208 that was fixed for DOCX but not for DOC.
Maybe bug 116872 (or more in general meta bug 118920)
bibisect-44max pinpoints the 5 pages starting with commit c5ed52b1cd6f22787c94bec035ceecf9e1da3271
Author: Lubo\u0161 Lu\u0148ák
CommitDate: Mon Jul 21 11:02:04 2014 +0200
ww8import create a pagedesc if continuous section changes margins (bnc#875383)
This is similar to what writerfilter does. MSWord can have one page with several
different margins, which are saved using continuous sections, which causes all
kinds of trouble, because either we treat them as Writer sections, which means
we lose some of the data, or we treat them as Writer page styles, which causes
spurious page breaks if in the wrong place. Either option has its problems, but
here it seems slightly better to go for keeping the data and hoping the page
break will be in a place where a break will be anyway.
*** This bug has been marked as a duplicate of bug 86468 ***
In this case (of continuous breaks) it might be good to have one format do it one way, and the other format do it another way. At least one of the two ways should work for whoever wants to share with non-ODT users.
This is an area where LO works completely differently from MS, so it is impossible to match (unless Writer is re-written).
confirmed that the .docx fix was from bug 64372. Doc and Docx/RTF are very different code in terms of importing, so rarely will lessons learned in one transfer over to the other. (Exporting shares a lot of code though.)
attachment 143783 [details] is 4 pages from LO 6.4 - OK
attachment 143784 [details] is still 2 pages - NOK