Description: Libre office writer does not hyphenate the words correctly if zero-width-joiner is used anywhere in the word written in "Marathi" language. As per the validation script, the word "फक्तवित्तवान" can be broken at 3 places, but it is breaking at unexpected place in Writer as shown in the attached image. Steps to Reproduce: 1) Check config rule: # docker run --disable-content-trust -it shantanuo/hyphenate grep -iA1 'joiner' ../dicts/hyph_mr_IN.dic % Do not break either side of ZERO-WIDTH JOINER (U+200D) 22 % Break after ZERO-WIDTH NON JOINER (U+200C) 1 2) Run this docker command and check the output. Validation script has not broken the joiner but non-joiner is broken as expected in case of last 2 lines: # docker run -it shantanuo/hyphenate ./example ../dicts/hyph_mr_IN.dic /hi_sample.text वि=त्त वि=त्त वि=त्त उप=क्र=मा=वर=च्या=हीच उप=क्र=मा=वर=च्या=ही=बद्दल=चा वि=त्=त=वान वि=त्त=वान 3) Download and install the Marathi spell check extension for Libre office from... https://extensions.libreoffice.org/extensions/marathi-spellchecker/1.8 Start writer and type the word वित्तवान at the end of the line and notice the break between वित्-तवान Actual Results: Probably wrong: वित्=तवान Expected Results: वि=त्तवान OR वित्त=वान OR वित्तवा=न Reproducible: Always User Profile Reset: Yes Additional Info:
Created attachment 156862 [details] zero width joiner does not allow correct hyphenation Word is broken at क् and त् incorrectly
Let me put it another way, Note only the 4 lines from this page... https://gitlab.com/smc/hyphenation/blob/master/mr_IN/hyph_mr_IN.dic % Do not break either side of ZERO-WIDTH JOINER (U+200D) 22 % Break after ZERO-WIDTH NON JOINER (U+200C) 1 Is the rule mentioned above correctly set?
@Julien, any idea about this issue ?
I know nothing about hyphenation=> uncc myself.
Shantanu: isn't the problem outside of LibreOffice? Shouldn't you report to the extension maintainer?
I guess you are correct. The extension owner is the right person to resolve this. Closing.