Bug 49849 - : Word wrapping in Hebrew handled incorrectly: lines break mid-word
Summary: : Word wrapping in Hebrew handled incorrectly: lines break mid-word
Status: RESOLVED FIXED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice (show other bugs)
Version:
(earliest affected)
3.5.2 release
Hardware: Other All
: medium normal
Assignee: Caolán McNamara
URL:
Whiteboard: BSA target:3.6.0
Keywords:
Depends on:
Blocks:
 
Reported: 2012-05-12 12:17 UTC by ndvpublic+bugs
Modified: 2012-05-28 23:50 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments
Same word in Hebrew written over and over again. Last occurrence breaks mid-word. (67.98 KB, image/png)
2012-05-12 12:17 UTC, ndvpublic+bugs
Details
Document demonstrating the issue of Hebrew lines improperly broken (8.81 KB, application/vnd.oasis.opendocument.text)
2012-05-13 08:22 UTC, ndvpublic+bugs
Details

Note You need to log in before you can comment on or make changes to this bug.
Description ndvpublic+bugs 2012-05-12 12:17:01 UTC
Created attachment 61519 [details]
Same word in Hebrew  written over and over again. Last occurrence breaks mid-word.

Problem description: 

Steps to reproduce:
1. .... Create a new Writer document.
2. .... Write a long enough sentence in Hebrew so that it spans more than one line.

Current behavior:
The line will break mid-word.

Expected behavior:
Line should break either before or after a word.

Platform (if different from the browser): 
Arch Linux, 64-bit.
Comment 1 ndvpublic+bugs 2012-05-12 12:21:26 UTC
Possibly related: issue began after upgrading to LO 3.5.2, which is compiled against the new ICU 49.
Comment 2 Caolán McNamara 2012-05-13 08:15:22 UTC
can you paste in the same word here
Comment 3 ndvpublic+bugs 2012-05-13 08:21:02 UTC
Yeah, the word is
מילים

I'll attach the document too.
Comment 4 ndvpublic+bugs 2012-05-13 08:22:57 UTC
Created attachment 61557 [details]
Document demonstrating the issue of Hebrew lines improperly broken
Comment 5 Caolán McNamara 2012-05-13 09:30:14 UTC
confirmed
Comment 6 Caolán McNamara 2012-05-13 09:35:00 UTC
http://unicode.org/versions/Unicode6.1.0/

"Two new Line_Break property values have been added. The first is for Hebrew letters: lb=HL. It is used in the definition of a new rule, LB21a, in UAX #14, for handling line breaking for Hebrew characters next to hyphens. The second, lb=CJ, allows for better customization of Japanese line breaking. Implementations of Unicode line breaking may need to be updated to correctly handle these additional line break property values."

might be it
Comment 7 Not Assigned 2012-05-13 14:47:46 UTC
Caolan McNamara committed a patch related to this issue.
It has been pushed to "master":

http://cgit.freedesktop.org/libreoffice/core/commit/?id=20c24114143d6d38774b56a142fd4ae05094308e

Resolves: fdo#49849 implement Unicode 6.1 hebrew line breaking rules
Comment 8 Caolán McNamara 2012-05-13 14:58:24 UTC
in the "internal-icu" case, icu 49 is only in use on master, not 3-5. So if the problem appears in a 3.5 or earlier version, it means a distro has built against a system icu 49 so if your distro has this problem point them to the above commit, and bug #49885 if you want to help out :-)