When copying records from a Base table, In HTML format, the header has "charset=windows-1252" meta, but the data contents is in UTF-8. Please fix that error. When pasting this into Calc, The data are interpreted as Japanese encoding, instead of Windows-1252. As a consequence, the data cannot be fixed easily with codepage converter tools. Please insert the UTF-8 data from Base in the Windows-1252 encoding.
As I am unable to attach a test file, you can use the string "Образец текста" as a sample data with a new database to reproduce this.
Could you give some more detailed steps? I created a table with a LONGVARCHAR field and pasted Образец текста into it. How can I now copy in HTML format?
Just select the rows and copy them. In Calc, open the Paste button menu and choose 'HTML....'
(In reply to Urmas from comment #3) > Just select the rows and copy them. In Calc, open the Paste button menu and > choose 'HTML....' Ok, I managed to do it, but how do I confirm the encoding? I saved to .ods and opened content.xml, but there is no windows-1252 Win 7 Pro 64-bit Version: 5.2.0.0.alpha0+ Build ID: 259c1ed201f4277d74dfd600fed8c837cbf56abc CPU Threads: 4; OS Version: Windows 6.1; UI Render: default; TinderBox: Win-x86@39, Branch:master, Time: 2016-01-27_00:45:12 Locale: fi-FI (fi_FI)
Created attachment 122262 [details] Clipboard contents Here's what getting copied.
Ok, thanks, I used http://www.nirsoft.net/utils/inside_clipboard.html and confirmed. Win 7 Pro 64-bit Version: 5.2.0.0.alpha0+ Build ID: 259c1ed201f4277d74dfd600fed8c837cbf56abc CPU Threads: 4; OS Version: Windows 6.1; UI Render: default; TinderBox: Win-x86@39, Branch:master, Time: 2016-01-27_00:45:12 Locale: fi-FI (fi_FI)
I have a similar problem. I made a table with Base, input data in Japanese and tried to export the records to Calc following the instruction in the "Exporting data from Base" section in the page below. https://help.libreoffice.org/Common/Importing_and_Exporting_Data_in_Base If I paste the data on a new Calc sheet, I get accented Latin letters or symbols (Windows-1252 characters) instead of Japanese. Tried to paste as RTF (default) and HTML formats but both have the same encoding problem. Win 10 32-bit, LibreOffice Version: 5.1.2.2 Build ID: d3bf12ecb743fc0d20e0be0c58ca359301eb705f CPU Threads: 2; OS Version: Windows 6.2; UI Render: default; Locale: ja-JP (ja_JP)
Created attachment 124228 [details] Calc paste screenshot
Created attachment 124229 [details] Database file
** Please read this message in its entirety before responding ** To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year. There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present. If you have time, please do the following: Test to see if the bug is still present on a currently supported version of LibreOffice (5.2.7 or 5.3.3 https://www.libreoffice.org/download/ If the bug is present, please leave a comment that includes the version of LibreOffice and your operating system, and any changes you see in the bug behavior If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a short comment that includes your version of LibreOffice and Operating System Please DO NOT Update the version field Reply via email (please reply directly on the bug tracker) Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case) If you want to do more to help you can test to see if your issue is a REGRESSION. To do so: 1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) http://downloadarchive.documentfoundation.org/libreoffice/old/ 2. Test your bug 3. Leave a comment with your results. 4a. If the bug was present with 3.3 - set version to "inherited from OOo"; 4b. If the bug was not present in 3.3 - add "regression" to keyword Feel free to come ask questions or to say hello in our QA chat: http://webchat.freenode.net/?channels=libreoffice-qa Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-UntouchedBug-20170522
The bug is still present. If I copy and paste data from Base table to Calc, I get wrongly encoded characters. I think this bug is inherited from OOo. I will test on the oldest version later. Win 10 64-bit Version: 5.3.3.2 (x64) Build ID: 3d9a8b4b4e538a85e0782bd6c2d430bafe583448 CPU Threads: 8; OS Version: Windows 6.19; UI Render: default; Layout Engine: new; Locale: ja-JP (ja_JP); Calc: group
Just wonder if it could be a dup of tdf#37859 Anyone could give a try with a daily build? (see http://dev-builds.libreoffice.org/daily/master/)
tdf#37859 looks the same bug. I will test with the dev build.
(In reply to Julien Nabet from comment #12) > Just wonder if it could be a dup of tdf#37859 > > Anyone could give a try with a daily build? (see > http://dev-builds.libreoffice.org/daily/master/) Yep, it seems to be as now the characters in HTML are encoded as entities instead of being messed up. Let's close as dupe. Version: 6.0.0.0.alpha0+ (x64) Build ID: 2404a17e157273430d40ceaa1ab1275e7b50ba6e CPU threads: 4; OS: Windows 6.19; UI render: default; TinderBox: Win-x86_64@42, Branch:master, Time: 2017-06-16_23:41:27 Locale: fi-FI (fi_FI); Calc: group *** This bug has been marked as a duplicate of bug 37859 ***