Some locales of languages may be missing hyphenation info; but users still do want hyphenation, even for obscure locales. It makes sense to allow falling back on on more general locale, or on a popular specific locale which does have hyphenation data available. Example: en_IL - no hyphenation available en - would work (if this is defined at all) en_UK or en_US - good enough fallback options for en_IL A mechanism for enabling such fallbacks should be implemented and exposed to users.
In a way similar to bug 83561, which would implement the fallback mechanism needed for language variants. I am unsure how hyphenation rules work, and how possible it is to split that from the rest of the dictionary (i.e. using the dictionary for spellcheck but a fallback for hyphenation if it is missing). Marco and Sophie, any opinion on it?
(In reply to Stéphane Guillou (stragu) from comment #1) > In a way similar to bug 83561, which would implement the fallback mechanism > needed for language variants. > > I am unsure how hyphenation rules work, and how possible it is to split that > from the rest of the dictionary (i.e. using the dictionary for spellcheck > but a fallback for hyphenation if it is missing). Marco and Sophie, any > opinion on it? Heya, hyphenation is very hard. I have coded it into Proofing Tool GUI: https://proofingtoolgui.org See the subchapter in the user guide that explains how to use it: https://proofingtoolgui.org/proofingtoolgui_files/ProofingToolGUI_manual_V30.html#5.4.hyphenation It is very hard. Anyway, my two cents: for English languages, you can probably use the one in Gerrit as a fallback. The English languages that ship with my .oxt commit are either updated by me or by Kevin Atkinson, and none of us two change the hyphenation. Kevin simply converts wordlists into .dic automatically and I don't change such complex things, so definitely that until someone with great knowledge decides to improve the hyphenator, it won't be ever changed (it will always be this). For other languages, I am not sure. :-)
(In reply to Marco A.G.Pinto from comment #2) > Heya, hyphenation is very hard. Perhaps, but this bug is not about hyphenation. Just about choosing which already-implemented hyphenation scheme to use.
(In reply to Eyal Rozenberg from comment #3) > (In reply to Marco A.G.Pinto from comment #2) > > Heya, hyphenation is very hard. > > Perhaps, but this bug is not about hyphenation. Just about choosing which > already-implemented hyphenation scheme to use. scheme? Use the English hyphenator already there for English variants. Is this it?
I don't understand if you are asking for other thing. :-)
(In reply to Marco A.G.Pinto from comment #4) > Use the English hyphenator already there for English variants. > > Is this it? My example was for English, but suppose I want to hyphenate text in locale he_PL . That's probably not available, but it's more likely for us to have hyphenation capability for he, or for he_IL. I suggest that one of those be used as fallback. And the same for English or any other language.