I have developed Marathi spell check for Libre office. For some words the right click option shows funny characters as shown in the image here... https://s3.amazonaws.com/loffice/spell_check.png The first 4 options are correct. I do not know about the rest. I guess this is happening for very short (1 or 2 character) words. I did not notice this before 5.1.0.3
Is the problem still in 5.1.1? Or you can also test with a master build: http://dev-builds.libreoffice.org/daily/master/Win-x86@42/current/
Created attachment 123752 [details] junk chars in spell check
(In reply to shantanu from comment #2) > Created attachment 123752 [details] > junk chars in spell check This is with 5.1.1?
5.1.1.3 is the version I have tested. The 5.2 version has only "HP" word added to the right click as shown in the second attachment.
Created attachment 123807 [details] new version has only HP on right click
Hello Shantanu, it's a duplicate of bug 97179. László, could you look at word HP in suggestions (see comment 4)? Thanks
(In reply to raal from comment #6) > Hello Shantanu, > it's a duplicate of bug 97179. > > László, could you look at word HP in suggestions (see comment 4)? Thanks So if this bug is a duplicate of bug 97179, which has been fixed, is this bug still valid?
Dear reporter, could you please try to reproduce it with the latest version of LibreOffice from https://www.libreoffice.org/download/libreoffice-fresh/ ? I have set the bug's status to 'NEEDINFO'. Please change it back to 'UNCONFIRMED' if the bug is still present in the latest version.
The word "HP" is still there. Tested on Ubuntu - Libre office version 5.3.4.2
Please attach a sample document, as this makes it easier for us to verify the bug. I have set the bug's status to 'NEEDINFO'. Please change it back to 'UNCONFIRMED' once the requested document is provided. (Please note that the attachment will be public, remove any sensitive information before attaching it. See https://wiki.documentfoundation.org/QA/FAQ#How_can_I_eliminate_confidential_data_from_a_sample_document.3F for help on how to do so.)
Dear Bug Submitter, This bug has been in NEEDINFO status with no change for at least 6 months. Please provide the requested information as soon as possible and mark the bug as UNCONFIRMED. Due to regular bug tracker maintenance, if the bug is still in NEEDINFO status with no change in 30 days the QA team will close the bug as INSUFFICIENTDATA due to lack of needed information. For more information about our NEEDINFO policy please read the wiki located here: https://wiki.documentfoundation.org/QA/Bugzilla/Fields/Status/NEEDINFO If you have already provided the requested information, please mark the bug as UNCONFIRMED so that the QA team knows that the bug is ready to be confirmed. Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-NeedInfo-Ping-20180129
Dear Bug Submitter, Please read this message in its entirety before proceeding. Your bug report is being closed as INSUFFICIENTDATA due to inactivity and a lack of information which is needed in order to accurately reproduce and confirm the problem. We encourage you to retest your bug against the latest release. If the issue is still present in the latest stable release, we need the following information (please ignore any that you've already provided): a) Provide details of your system including your operating system and the latest version of LibreOffice that you have confirmed the bug to be present b) Provide easy to reproduce steps – the simpler the better c) Provide any test case(s) which will help us confirm the problem d) Provide screenshots of the problem if you think it might help e) Read all comments and provide any requested information Once all of this is done, please set the bug back to UNCONFIRMED and we will attempt to reproduce the issue. Please do not: a) respond via email b) update the version field in the bug or any of the other details on the top section of our bug tracker Warm Regards, QA Team MassPing-NeedInfo-20180302
The HP and other 2 letter options are still displayed on right click when I check for spelling of very short words. Screenshot attached. Tested on windows and libreoffice (6.2.4.2)
Created attachment 152608 [details] The words like "HP" are still seen on right click spell check options
Can you please upload a sample document with a few of such words?
(In reply to Aron Budea from comment #15) > Can you please upload a sample document with a few of such words? Setting to NEEDINFO
Created attachment 153263 [details] marathi word misspelled shows 2 letter english words as suggestion Google search for "libre office marathi spell check" and install the extension before checking the words in this document.
Created attachment 153264 [details] English spell check also showing junk / irrelevant suggestions type "mn" in English and you will get irrelevant spelling suggestions like AI, HP, Sun, UX, Xen on right click. Checked using Version: 6.2.5.2 (x64) CPU threads: 1; OS: Windows 10.0; UI render: default; VCL: win; Locale: en-US (en_US); UI-Language: en-US
(In reply to Shantanu from comment #18) > Created attachment 153264 [details] > English spell check also showing junk / irrelevant suggestions > > type "mn" in English and you will get irrelevant spelling suggestions like > AI, HP, Sun, UX, Xen on right click. > Checked using Version: 6.2.5.2 (x64) > CPU threads: 1; OS: Windows 10.0; UI render: default; VCL: win; > Locale: en-US (en_US); UI-Language: en-US Yeah, I do get those. Arch Linux 64-bit Version: 6.4.0.0.alpha0+ Build ID: 37fc9f51a8de11d40632e8cda17ccf1fa4b1f503 CPU threads: 8; OS: Linux 5.2; UI render: default; VCL: gtk3; Locale: fi-FI (fi_FI.UTF-8); UI-Language: en-US Calc: threaded Built on 6 August 2019
Dear Shantanu, To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year. There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present. If you have time, please do the following: Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/ If the bug is present, please leave a comment that includes the information from Help - About LibreOffice. If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice. Please DO NOT Update the version field Reply via email (please reply directly on the bug tracker) Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case) If you want to do more to help you can test to see if your issue is a REGRESSION. To do so: 1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from https://downloadarchive.documentfoundation.org/libreoffice/old/ 2. Test your bug 3. Leave a comment with your results. 4a. If the bug was present with 3.3 - set version to 'inherited from OOo'; 4b. If the bug was not present in 3.3 - add 'regression' to keyword Feel free to come ask questions or to say hello in our QA chat: https://web.libera.chat/?settings=#libreoffice-qa Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-UntouchedBug
Reproduced. I type "mn" (without quotes) and get irrelevant suggestions like AI, HP, LG, UI, UX Please remove them from .dic file so that those words will not show up as seen in the attachment. Version: 24.2.1.2 (AARCH64) / LibreOffice Community Build ID: 420(Build:2) CPU threads: 2; OS: Linux 6.5; UI render: default; VCL: gtk3 Locale: en-US (C.UTF-8); UI: en-US Ubuntu package version: 4:24.2.1~rc2-0ubuntu0.22.04.1~lo1 Calc: threaded
Created attachment 194357 [details] remove 2 letter words from .dic file
The correct solution is to limit the number of suggestions shown. I believe this behaviour has been happening for years.
The group of these five words appear for every two-letter misspelled word in any language: AI, HP, LG, UI, UX For example, type "pn" "tk" or "ln" and right-click. Isn't this behavior strange? I do not think limiting the number of suggestions is going to help.
These are just suggestions - the computer has no brains. If none of the (visible) suggestions is helpful, then the human brain needs to decide how to deal with the supposed spelling violation. I see no bug here...
Asked for opinions in the dev chat and there was support for closing as notabug. A spell checker is not a word prediction system.
Hmmm... the enhancement I could think of is something similar to bug 97179 Comment 6's solution. Where you could: - DO NOT suggest "technical" dictionary words on 1-letter typos. I think that would help mitigate a lot of Shantanu's original issue. - - - This way, something like... CASE 1: attachment 153263 [details] 's 1-letter "typo" like: - य = U+092F = DEVANAGARI LETTER YA Right-Click currently gets: - 5 Marathi + 5 useless "technical words" at the bottom. After: - 5 "Marathi dictionary" suggestions CASE 2: A 1-letter "typo" like: - ȵ = U+0235 = LATIN SMALL LETTER N WITH CURL Right-Click currently gets this in English: - 15 1-letter suggestions + 5 "technical words" After: - 15 (or 20) 1-letter suggestions - - - > The group of these five words appear for every two-letter misspelled word in any language: > > AI, HP, LG, UI, UX If you go to: - Tools > Options - Languages and Locales > Writing Aids there are the "User-Defined Dictionaries". These exact words are a part of the dictionary: - technical [All] If you thought they were completely clogging up your Right-Click suggestions for your specific language—Marathi—then all you have to do is just: - UNCHECK the box for that dictionary. - - - And what is the "technical" dictionary? It just has a big list of: - (International) company names - Acer - AMD - Asus - ChatGPT - HP - IBM - Facebook - Microsoft - LibreOffice - Technical extensions/terms/formats - HDMI - HEVC - HTTP - FLAC - PNG - GPT - LAN - iOS - Greek/mathematical symbols - α - β - γ This ensures that *any* language talking about "DOCX" or "PNG" files, or an enormous company like "Microsoft" or "ChatGPT", won't get red squigglies. And because many of these "technical words" are 2, 3, or 4 characters long, they often appear in Right-Click on those 1, 2, or 3 character long "misspelled words"! - - - > If none of the (visible) suggestions is helpful, then the human brain needs to decide how to deal with the supposed spelling violation. > > I see no bug here... Yep, same. "Annoyance", yes. Bug, no. Uncheck the "technical [All]" dictionary in this specific case, and all those complaints disappear. But I think the enhancement I came up with at the top would be a big step in the right direction. And it's non-intrusive. And I don't think anybody would mind it. :) Sure, those 2- and 3-character cases would still exist (and those are trickier, see below). But that suggestion I made would completely eliminate the busted 1-character typo case! :) - - - > A spell checker is not a word prediction system. Yep. If you want more technical details, I summarized quite a bit back in: - /r/LibreOffice: "Is the auto-correction tool of many languages changing correct words into others?" - https://old.reddit.com/r/libreoffice/comments/135kn9j/is_the_autocorrection_tool_of_many_languages/jkc7dz2/ - Especially the "AutoCorrect Categories" and "Recommended Resources" and "More Spellchecking Resources". Especially see the info about "Edit Distance": - https://en.wikipedia.org/wiki/Edit_distance The Right-Click suggestions get "ranked", and mostly go by simple rules like: - "Is there just a missing accent?" - -> Add an accent - "Is there just a capital letter missing?" - -> Capitalize the word. - "Did you accidentally squish 2 correct words together?" - -> Add a SPACE between. - "Are 2 letters accidentally flipped?" - = Transposition - "How many characters are different?" - 1 character difference goes much higher up the suggestion list. - 2 character difference goes much lower on the suggestion list. So words that are much "closer" to the typo, get pushed up to very top of the list, like: - cafe vs. café - Same letters, but only missing 1 accent. - microsoft vs. Microsoft - Same letters, but only 1 capitalization. - libreoffice vs. LibreOffice - Same letters, but only 2 capitalization. And ones that are much "further"/worse, appear towards the bottom of the list, like: - misspellling -> dispelling - Accidentally 3 'l's. - Plus an "ss" -> "s". - Plus an 'm' -> 'd'. - = 3 full letters away!!! The user likely wanted "misspelling", which is the 1st/"very best" Right-Click choice... but the 4th ranked "dispelling" could still be possible. Like Comment #25 says, it's then up to the user to decide to take that advice (or ignore it).