Description:
See Steps to Reproduce.

Steps to Reproduce:
1. Open the attached file in Excel. Cell C1 shows no error.
2. Open the attached file in LibreOffice, preferably a version that includes the fix for tdf#93951.
3. Click OK if you are asked whether you want to recalculate.

Actual Results:
Cell C1 shows Error 501.

Expected Results:
No error in cell C1.

Reproducible: Always

User Profile Reset: No

Additional Info:
Version: 6.3.0.0.alpha0+ (x64)
Build ID: 82fb7f9f90a3ba9d53b3412124e513aa78ddca10
CPU threads: 4; OS: Windows 10.0; UI render: default; VCL: win;
TinderBox: Win-x86_64@42, Branch:master, Time: 2019-02-05_23:41:03
Locale: ja-JP (ja_JP); UI-Language: en-US
Calc: threaded
Created attachment 149424
file to be used in STR
Already reproducible with AOO 4.1.5.
FWIW, it works if the sheet name is quoted as usual, i.e.
='Sheet1・2'.A1
Likely the U+30FB KATAKANA MIDDLE DOT punctuation character ends the "word" there.
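To illustrate the suspected cause, here is a minimal Python sketch of a word scanner. It is not Calc's lexical analyzer; `scan_word` and its `midletters` parameter are hypothetical names, and the rule that a midletter character only continues a word when it sits between two alphanumeric characters is an assumption made for this example.

```python
def scan_word(text, midletters=frozenset()):
    """Return the leading run of text that this toy scanner treats as one word.

    Letters and digits always continue the word. A character listed in
    `midletters` continues it only when it directly follows a word
    character and is directly followed by another one (assumed rule).
    """
    end = 0  # index just past the last accepted alphanumeric character
    for i, ch in enumerate(text):
        if ch.isalnum():
            end = i + 1
        elif (ch in midletters and 0 < i < len(text) - 1
              and end == i and text[i + 1].isalnum()):
            continue  # tentatively accept; confirmed by the next character
        else:
            break
    return text[:end]


reference = "Sheet1\u30fb2.A1"  # unquoted reference Sheet1・2.A1

# Without a midletter rule the word stops at U+30FB.
print(scan_word(reference))
# With U+30FB treated as midletter the whole sheet name is one word.
print(scan_word(reference, midletters={"\u30fb"}))
```

In this sketch the first call yields only `Sheet1`, so the rest of the reference is left unparsed, while the second call yields the complete sheet name `Sheet1・2`. Quoting the sheet name sidesteps the question because the scanner then reads up to the closing quote.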
Eike Rathke committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/core/+/a10df00bd4a2ae293b077701d32c720a63080ebf%5E%21

Resolves: tdf#123575 treat Katakana middle dots as midletter

It will be available in 6.3.0.

The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.
Pending review https://gerrit.libreoffice.org/68096 for 6-2
Thank you, Eike. To tell the truth, I wrote this bug report based on the following post:
https://ask.libreoffice.org/ja/question/183520/
The reporter there also gave other examples, such as U+3010 【 and U+3011 】, and thought there could be more similar cases. I too optimistically assumed that the failure to recognize all of these characters had a single cause, and for simplicity I gave only the MIDDLE DOT without investigating the code. Should I file a new bug report?
I don't see why we should accept characters like
U+3010 LEFT BLACK LENTICULAR BRACKET
U+3011 RIGHT BLACK LENTICULAR BRACKET
as midletter characters, i.e. non-letter, non-numeric characters that continue a word. The middle dots are already a borderline case; where would that end if we introduced a whole bunch of exemptions for characters that are Unicode punctuation characters?

Alternatively, we could simply accept every Unicode character (>=0xA0) in a sheet name, but that doesn't quite fit how the lexical analyzer works, as it doesn't know in advance that it is scanning a sheet name (and shouldn't). In any case, that is nothing for a quick fix.
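For reference, the Unicode general categories of the characters discussed in this report can be checked with Python's standard `unicodedata` module. This is only a lookup sketch; `describe` and `is_high` are hypothetical helper names, and `is_high` merely mirrors the ">=0xA0" idea mentioned in this thread, not any existing Calc code.

```python
import unicodedata


def describe(ch):
    """Return (code point, general category, character name) for one character."""
    return (f"U+{ord(ch):04X}", unicodedata.category(ch), unicodedata.name(ch))


def is_high(ch):
    """Hypothetical predicate for the 'accept everything >= 0xA0' alternative."""
    return ord(ch) >= 0xA0


# U+30FB is "other punctuation" (Po); the lenticular brackets are
# opening (Ps) and closing (Pe) punctuation.
for ch in "\u30fb\u3010\u3011":
    print(*describe(ch), is_high(ch))
```

All three characters are punctuation and all three lie above 0xA0, so a blanket ">=0xA0" rule would accept them alike, whereas a midletter exemption has to name each character individually.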
Eike Rathke committed a patch related to this issue.
It has been pushed to "libreoffice-6-2":

https://git.libreoffice.org/core/+/24e17b53b88dee234f23cf3fad3c7bc75b2d9ab7%5E%21

Resolves: tdf#123575 treat Katakana middle dots as midletter

It will be available in 6.2.2.

The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.