Bug 133782 - [EDITING] Add option to stop absolutely all autoconversion of non-numeric characters into numerics
Summary: [EDITING] Add option to stop absolutely all autoconversion of non-numeric cha...
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc (show other bugs)
Version:
(earliest affected)
6.4.1.2 release
Hardware: x86-64 (AMD64) Windows (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: AutoCorrect-Complete
  Show dependency treegraph
 
Reported: 2020-06-08 07:56 UTC by zzz
Modified: 2023-05-21 18:32 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description zzz 2020-06-08 07:56:28 UTC
In current version, even if I turn of all auto-correction switches, when I type the "circle letter:〇" U+3007 into a cell, it is automatically converted to numeric 0 (zero).
In Japanese, the circle:"〇"(U+3007) and cross:"×"(Ux00D7) means "Yes"/"No", used in almost all questionnairs and tables, and should never been converted to numbers, especially zero. Excel does not do such conversion. I understand that some people might need such conversion, so adding an option to choose Calc's behaviour is the best solution.
Comment 1 V Stuart Foote 2020-06-08 14:22:23 UTC
I'll take your word it is undesirable for ja-JA usage
 
The conversion from U+3007 "〇" -> to U+0030 "0" does happens when I set the  language for the cell to Japanese and enter the glyph or <Alt>+X convert. But not with en-US.

While an existing "〇" value entry is retained with a prime ' auto prepended to retain it as text.

As the conversion to a number follows normal escape behavior, suggests this is internal to the calc module editshell(s), rather than an auto-correction localization, but I could not track it down.

@Eike?
Comment 2 zzz 2020-06-09 01:40:26 UTC
I forgot to mention that my Language Of | User interface setting is English (USA), and Locale settings is "Japanese".

Re:>V Stuart Foote 2020-06-08 14:22:23 UTC
   >I'll take your word it is undesirable for ja-JA usage
Correct, if that means the data locale. If you mean Language Of | User interface, then I don't want Calc to change any data handling behaviour by UI settings.
Sorry that I wasn't clear enough, and thanks for confirming.

  >when I set the  language for the cell to Japanese
Do you mean the Format Cells | Asian Text Font | Language ?

One modification to my request:
I observe a lot of Japanese language users mistakenly enter the so-called "wide" characters in cell formulae. Excel cells autoconverts these to ASCII, so many users keep this habit.
(a) "wide" numbers, "0123456789" (U+FF1x) in places of numbers. Excel cells autoconverts this to numbers (both from keyboard IME and from paste). LibreOffice Calc already does this.
(b)Similary, many users enter "wide" symbols and letters such as "=A2" to mean ASCII formula "=A2", "’A" to mean ASCII literal "'A" etc.
Excel response differs whether they are from keyboard into cell, or into edit bar, or pasted into cell! LibreOffice Calc does no conversion.
I personally don't do and don't like these (especially the random conversions in literals), but think some people may want LibreOffice Calc to behave similarly to Excel.
This means we need two or more independent option entries to toggle autoconvert:
(a)"0123456789": default=autoconvert
(b)"=A2": default=autoconvert
(c)"’A": default=autoconvert Excel compatible
(d)"〇": default=no convert

For a tentative solution only for me, just turn off the "〇"->0 autoconversion.
Comment 3 QA Administrators 2022-06-10 03:34:06 UTC Comment hidden (obsolete)
Comment 4 zzz 2022-06-10 08:02:47 UTC
Problem still remains in:
Version: 7.3.2.2 (x64) / LibreOffice Community
Build ID: 49f2b1bff42cfccbd8f788c8dc32c1c309559be0
CPU threads: 4; OS: Windows 10.0 Build 19044; UI render: Skia/Raster; VCL: win
Locale: ja-JP (ja_JP); UI: en-US
Calc: CL

A correction to my previous description about the letter:
  There are three identically-looking letters that often appear in the Japanese IME,
    ○ U+25CB WHITE CIRCLE
    ◯ U+25EF LARGE CIRCLE
    〇 U+3007 IDEOGRAPHIC NUMBER ZERO
  The Unicode name for U+3007 is "IDEOGRAPHIC NUMBER ZERO", one of the ideographic numerics such as
  一 U+4E00 CJK UNIFIED IDEOGRAPH-4E00
    means ideographic number one
  三 U+4E09 CJK UNIFIED IDEOGRAPH-4E09
    means ideographic number three
  二 U+4E8C CJK UNIFIED IDEOGRAPH-4E8C
    means ideographic number two
  四 U+56DB CJK UNIFIED IDEOGRAPH-56DB
    means ideographic number four
  Excel (by default) does not treat them as numbers.

My conclusion is the same as before:
  In data locale "Japanese" and when auto-correction switches are off,
  (priority 1) stop the U+3007 "〇" -> to U+0030 "0" conversion by default.
  (priority 2) add a switch to re-enable the conversion. I don't need it, but some other user might want it.