Created attachment 68984 [details]
Screen shot of (kpv) Komi-language text operating as Latin

I am working with several minority languages, producing speller.zhfst binary files for voikko/2/mor-(LANGUAGE-CODE) directories.

On my own MacBook Pro (Snow Leopard), I can copy the required "speller.zhfst" and "voikko-fi_FI.pro" files to ~/.voikko/2/mor-la/. LibreOffice then recognizes "Latin" as having a spell checker, shown by the green _ABC_ mark before the language name.

So I open an Erzya-language document (myv), select the entire text, and go to Tools/Language/For all Text/more... There I select Latin, which has the two files I described above. The same can be done with a Komi-language document (kpv).

NB! If, however, I make the directories
~/.voikko/2/mor-myv/ for Erzya
~/.voikko/2/mor-kpv/ for Komi (Zyrian)
and copy the very same files, I get a different result. LibreOffice/Preferences/Language Settings/Voikko/Finnish writing aids Vocabulary recognizes the presence of the "voikko-fi_FI.pro" files, but Tools/Language/For all Text/more... does not show any evidence of spell checkers for either of the languages.

I am hoping to release open-source beta spell checkers for Erzya, Komi-Zyrian and possibly Meadow Mari this year. More Uralic minority languages are in the works for next year.

Please help me resolve this problem.

Yours,
Jack Rueter
Created attachment 69069 [details] Screen shot of (kpv) Komi-language text operating as Latin Thank you very much for your bug report! However, I do not know how to help here, sorry. I have adapted some files and converted the TIFF screenshot to a PNG image (which works better in some browsers) to help others to find and, hopefully, process this bug report.
@ Andras: Can you help here? Or can you please point Jack Rueter to someone else who can help with these problems? Thank you!
I think instead of "abusing" Latin for spellchecking these minority languages, we should add them to LibreOffice, so users can create documents in these languages. It would solve the spell checker recognition as well. I need the list of language names and ISO language codes.
Created attachment 69073 [details]
attachment-18096-0.html

Hi! Here is the mapping.

struct Bcp47ToOOoMapping
{
    const char * bcpTag;
    const char * oooLanguage;
    const char * oooRegion;
};

Khanty      = "kca" "kca" "RU"
Komi-Zyrian = "kpv" "kpv" "RU"
Livonian    = "liv" "liv" "LV"
Moksha      = "mdf" "mdf" "RU"
Meadow Mari = "mhr" "mhr" "RU"
Hill Mari   = "mrj" "mrj" "RU"
Erzya       = "myv" "myv" "RU"
Nganasan    = "nio" "nio" "RU"
Olonets     = "olo" "olo" "RU"
Veps        = "vep" "vep" "RU"
Võro        = "vro" "vro" "EE"
Nenets      = "yrk" "yrk" "RU"

The speller.zhfst files are constructed with open-source tools on the Giellatekno infrastructure, and the Voikko data are available from their server:
https://victorio.uit.no/langtech/trunk/langs/myv/tools/spellcheckers/hfst/
https://victorio.uit.no/langtech/trunk/langs/myv/tools/spellcheckers/hfst/voikko-fi_FI.pro

Presently the speller.zhfst file for Erzya will be available for download at divvun.no/static_files/myv.zhfst

The Erzya myv.zhfst accompanies this message. It is in beta status.

On Thu, Oct 25, 2012 at 5:41 PM, <bugzilla-daemon@freedesktop.org> wrote:
> Andras Timar <timar74@gmail.com> changed bug 56346 <https://bugs.freedesktop.org/show_bug.cgi?id=56346>
>
> What            Removed       Added
> Status          UNCONFIRMED   NEEDINFO
> Ever confirmed                1
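For anyone reading along, here is a minimal C++ sketch of how the list above could be expressed with the quoted struct. It is only an illustration under assumptions: the array name kUralicMappings and the helper findMapping are hypothetical and are not taken from the LibreOffice source. Only the struct layout and the tag/region values come from the comment above.

```cpp
#include <cstring>

// Same field layout as the struct quoted in the comment above.
struct Bcp47ToOOoMapping
{
    const char* bcpTag;
    const char* oooLanguage;
    const char* oooRegion;
};

// Entries transcribed from the list above.
// The array name is hypothetical (illustration only).
static const Bcp47ToOOoMapping kUralicMappings[] = {
    { "kca", "kca", "RU" }, // Khanty
    { "kpv", "kpv", "RU" }, // Komi-Zyrian
    { "liv", "liv", "LV" }, // Livonian
    { "mdf", "mdf", "RU" }, // Moksha
    { "mhr", "mhr", "RU" }, // Meadow Mari
    { "mrj", "mrj", "RU" }, // Hill Mari
    { "myv", "myv", "RU" }, // Erzya
    { "nio", "nio", "RU" }, // Nganasan
    { "olo", "olo", "RU" }, // Olonets
    { "vep", "vep", "RU" }, // Veps
    { "vro", "vro", "EE" }, // Võro
    { "yrk", "yrk", "RU" }, // Nenets
};

// Hypothetical helper: linear search by BCP 47 tag.
// Returns a pointer to the matching entry, or nullptr if the tag is not listed.
const Bcp47ToOOoMapping* findMapping(const char* bcpTag)
{
    for (const Bcp47ToOOoMapping& entry : kUralicMappings)
    {
        if (std::strcmp(entry.bcpTag, bcpTag) == 0)
            return &entry;
    }
    return nullptr;
}
```

With this sketch, findMapping("myv") yields the Erzya entry with region "RU", while a tag that is not in the list, such as "la", yields nullptr.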
Created attachment 69074 [details] attachment-18096-1.dat
Created attachment 69075 [details] myv.zhfst
Andras Timar committed a patch related to this issue. It has been pushed to "master": http://cgit.freedesktop.org/libreoffice/core/commit/?id=112d9e66d4c81168e955178c5c35480cb6303bb2 fdo#56346 add a few more Uralic languages to languages dropdown The patch should be included in the daily builds available at http://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: http://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Komi-Zyrian, Meadow Mari and Erzya were already there; I added the rest.