Bug 150173 - FILEOPEN FORMATTING: CSV columns leaking into each other, messing everything up.
Status: CLOSED INVALID
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc
Version (earliest affected): unspecified
Hardware: x86-64 (AMD64) Linux (All)
Importance: medium normal
Assignee: Not Assigned
Reported: 2022-07-27 18:24 UTC by theemptyriver
Modified: 2022-07-27 20:03 UTC
CC List: 0 users

See Also:
Crash report or crash signature:


Attachments
A CSV database containing data about a subreddit's posts. (70.62 KB, text/csv)
2022-07-27 18:24 UTC, theemptyriver
screenshot showing pandas and calc side by side comparing row 5 (121.91 KB, image/png)
2022-07-27 18:27 UTC, theemptyriver

Description theemptyriver 2022-07-27 18:24:49 UTC
Created attachment 181453 [details]
A CSV database containing data about a subreddit's posts.

Hello!
I recently downloaded this database, which Python's pandas library imports successfully and without errors.
In LibreOffice Calc, however, the columns are read incorrectly.

For example, in row 5 the column author_flair_richtext should have the value "[]" but instead contains Arabic text.
Comment 1 theemptyriver 2022-07-27 18:27:07 UTC
Created attachment 181454 [details]
screenshot showing pandas and calc side by side comparing row 5
Comment 2 Eike Rathke 2022-07-27 20:03:38 UTC
Unfortunately your screenshot hides the most important thing: the options used for import. Apparently you checked both Comma and Space as separators, and also Merge delimiters (at least with those options I get a similar wrong preview). Check only Comma, make sure Merge delimiters is unchecked, and set Character set to Unicode (UTF-8); then the import is fine.
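
To illustrate why those import options shift values into the wrong columns, here is a minimal Python sketch. It is only a rough imitation of the dialog settings, not Calc's actual parser: the sample data is made up (it is not taken from the attached file), and the regex split is an assumed stand-in for "Comma + Space + Merge delimiters".

```python
import csv
import io
import re

# Hypothetical two-line sample; the second field of the data row is empty.
SAMPLE = (
    "author,author_flair_css_class,author_flair_richtext,title\n"
    "some_user,,[],hello world\n"
)


def parse_comma_only(text):
    """Split on commas only, keeping empty fields in place."""
    return list(csv.reader(io.StringIO(text), delimiter=","))


def parse_merged_comma_space(text):
    """Treat any run of commas and/or spaces as a single separator.

    Assumption: this approximates checking Comma and Space together
    with Merge delimiters. Empty fields vanish, so later values
    slide to the left.
    """
    return [re.split(r"[, ]+", line) for line in text.splitlines()]


good = parse_comma_only(SAMPLE)
bad = parse_merged_comma_space(SAMPLE)

header = good[0]
col = header.index("author_flair_richtext")

print("comma only:      ", good[1][col])  # the intended "[]"
print("merged separators:", bad[1][col])  # text from a later column
```

With comma as the only separator the empty field is preserved and "[]" stays under author_flair_richtext. With merged comma/space separators the empty field is swallowed and the title is also split at its space, so text from a later column lands where "[]" should be, which resembles the reported symptom.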