Bug 82393

Summary: FILEOPEN: LibreOffice Writer skips parts between "ü" character in plain text file
Product: LibreOffice
Reporter: Jonas Müller <forseti>
Component: filters and storage
Assignee: Not Assigned <libreoffice-bugs>
Status: RESOLVED DUPLICATE
Severity: normal
CC: forseti, momonasmon
Priority: medium
Version: 4.3.0.4 release
Hardware: x86-64 (AMD64)
OS: Windows (All)
Whiteboard:
Crash report or crash signature:
Regression By:
Attachments: Example files and screenshots showing the file in Writer and Notepad++ for comparison

Description Jonas Müller 2014-08-09 18:16:55 UTC
Created attachment 104354
Example files and screenshots showing the file in Writer and Notepad++ for comparison

Follow these steps:

1. Open "opened_correctly.d" in LibreOffice Writer. Writer uses the font "Liberation Mono" and opens the file correctly. The same goes for dozens of other .d text files I tried.

2. Now open "opened_incorrectly" in LibreOffice Writer. Writer uses "Courier" as the font, and some parts near the end are missing. Writer shows:

...
	AI_Output (self, other,"Grd_264_Gardist_GARDEAUFNAHME_Info_07_02"); //Geh doch mal zum Alten Lager und sprich mit Thorus darckgebracht hab, sollte ich mal mit Thorus reden. Vielleicht nimmt er mich in die Reihen der Gardisten auf."); 
...

3. Now, for comparison, open "opened_incorrectly" in Notepad.exe, which displays the file correctly:

...
	AI_Output (self, other,"Grd_264_Gardist_GARDEAUFNAHME_Info_07_02"); //Geh doch mal zum Alten Lager und sprich mit Thorus darüber!
	
	Log_CreateTopic (GE_BecomeGuard,LOG_NOTE);
	B_LogEntry (GE_BecomeGuard,"Wenn ich die Eier zurückgebracht hab, sollte ich mal mit Thorus reden. Vielleicht nimmt er mich in die Reihen der Gardisten auf.");
...



As you can see, LibreOffice skips the part after the ü character and then continues after the next ü character. However, "opened_correctly" also contains ü characters.

Comment 1 Urmas 2014-08-09 22:30:33 UTC
Confirming; seems like a type autodetection error.

Comment 2 Maxim Monastirsky 2014-08-10 05:15:00 UTC
(Unfortunately) it works as expected. Bug 78731 comment 6 explains the reasons behind it. To ensure that plain text files are imported correctly, they must carry one of the following extensions: txt, csv, tsv, tab.
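
A minimal sketch of a pre-check based on the extension list above, in case it helps anyone with many such files. The helper names are hypothetical, the case-insensitive comparison is an assumption, and the list is taken from this comment only; nothing here is LibreOffice code.

```python
from pathlib import Path

# Extensions named in this comment; assumed here to be the complete list.
PLAIN_TEXT_EXTENSIONS = {".txt", ".csv", ".tsv", ".tab"}


def has_plain_text_extension(filename: str) -> bool:
    """Return True if the file name ends in one of the listed extensions.

    The comparison ignores case, which is an assumption of this sketch.
    """
    return Path(filename).suffix.lower() in PLAIN_TEXT_EXTENSIONS


def with_txt_extension(filename: str) -> str:
    """Suggest a name for a renamed copy of the file.

    Names that already carry a listed extension are returned unchanged;
    otherwise ".txt" is appended to the full original name.
    """
    if has_plain_text_extension(filename):
        return filename
    return filename + ".txt"
```

For example, `with_txt_extension("opened_incorrectly.d")` suggests `opened_incorrectly.d.txt`; opening a copy saved under that name should then go through the plain text import described above.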

*** This bug has been marked as a duplicate of bug 78731 ***