Bug 64926 - Tibetan word breaking
Summary: Tibetan word breaking
Status: RESOLVED FIXED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Linguistic
Version (earliest affected): unspecified
Hardware: Other All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard: target:4.2.0 target:4.1.0.1
Keywords:
Depends on:
Blocks:
 
Reported: 2013-05-23 16:33 UTC by Elie Roux
Modified: 2013-06-14 09:52 UTC
CC List: 4 users

See Also:
Crash report or crash signature:


Attachments
patch for core/plain/i18npool/source/breakiterator/breakiterator_unicode.cxx (808 bytes, text/plain)
2013-05-23 16:33 UTC, Elie Roux

Description Elie Roux 2013-05-23 16:33:06 UTC
Created attachment 79713
patch for core/plain/i18npool/source/breakiterator/breakiterator_unicode.cxx

Dear all,

I'm currently trying to build Tibetan support for hunspell, and I've realized it won't work out of the box in LibreOffice. After some searching, I'm providing a patch that I cannot test (compilation takes far too long on my old laptop). Could you please comment on it and include it? It should make spellchecking possible for Tibetan (and Dzongkha).

Thank you,
-- 
Elie
Comment 1 Elie Roux 2013-05-25 08:28:47 UTC
I realize I did not specify why word breaking wouldn't work for Tibetan without ICU: in Tibetan, syllables (what we could call words) are separated by tshegs (the character ་, U+0F0B), not by spaces. Dzongkha works exactly the same way.
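
To illustrate the point about tshegs, here is a minimal sketch of splitting UTF-8 text into syllables at U+0F0B. It is not the attached patch and not how ICU's break iterator works internally; `splitOnTsheg` and `kTsheg` are hypothetical names used only for this illustration, and the sketch ignores other Tibetan punctuation such as the shad.

```cpp
#include <string>
#include <vector>

// UTF-8 encoding of U+0F0B TIBETAN MARK INTERSYLLABIC TSHEG.
const std::string kTsheg = "\xE0\xBC\x8B";

// Hypothetical helper: split UTF-8 text into syllables at each tsheg.
// Empty pieces (for example from a trailing tsheg) are dropped.
// Spaces are NOT treated as separators, which is the point of the example.
std::vector<std::string> splitOnTsheg(const std::string& text) {
    std::vector<std::string> syllables;
    std::string::size_type start = 0;
    while (start <= text.size()) {
        std::string::size_type end = text.find(kTsheg, start);
        if (end == std::string::npos) {
            end = text.size();  // no further tsheg: take the rest
        }
        if (end > start) {
            syllables.push_back(text.substr(start, end - start));
        }
        start = end + kTsheg.size();  // skip past the separator
    }
    return syllables;
}
```

A byte-level search is safe here because UTF-8 is self-synchronizing, so the three-byte tsheg sequence cannot match in the middle of another character. A breaker that only looks for spaces would return the whole run as a single "word", which is why spellchecking needs a tsheg-aware iterator.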
Comment 2 Michael Meeks 2013-05-25 16:58:19 UTC
Eike / Andras - another nice patch from Elie - can you review ? :-)
Comment 3 Andras Timar 2013-05-29 08:01:32 UTC
 https://gerrit.libreoffice.org/4082
Comment 4 Elie Roux 2013-05-29 11:35:33 UTC
Thank you very much! I have to say I didn't know about this tool; it seems very good, and I'll use it next time.
Comment 5 Commit Notification 2013-05-31 10:06:14 UTC
Elie Roux committed a patch related to this issue.
It has been pushed to "master":

http://cgit.freedesktop.org/libreoffice/core/commit/?id=878b09a1a936c2d5bf4461e0123fafd377d948e2

fdo#64926 use icu's breakiterator for Tibetan and Dzongkha



The patch should be included in the daily builds available at
http://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More
information about daily builds can be found at:
http://wiki.documentfoundation.org/Testing_Daily_Builds
Affected users are encouraged to test the fix and report feedback.
Comment 6 Elie Roux 2013-05-31 12:34:40 UTC
Thank you very much, I'll try the nightly build tomorrow.
Comment 7 Michael Meeks 2013-06-05 10:34:26 UTC
Assuming this is fixed - and marking it thus :-) Sounds like a nice feature to add here: https://wiki.documentfoundation.org/ReleaseNotes/4.2

Thanks !
Comment 8 Commit Notification 2013-06-14 09:52:01 UTC
Elie Roux committed a patch related to this issue.
It has been pushed to "libreoffice-4-1":

http://cgit.freedesktop.org/libreoffice/core/commit/?id=9a47b6a3727ff6153b129058a220e93e0dbbbb60&h=libreoffice-4-1

fdo#64926 use icu's breakiterator for Tibetan and Dzongkha


It will be available in LibreOffice 4.1.
