Bug 160256 - Support marking text as having an undetermined language (ISO 639-2 "und" code)
Status: VERIFIED FIXED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice
Version: Inherited From OOo (earliest affected)
Hardware: All All
Importance: medium enhancement
Assignee: Eike Rathke
Whiteboard: target:24.8.0 inReleaseNotes:24.8
Keywords: needsDevEval
Blocks: Languages
Reported: 2024-03-18 09:26 UTC by Eyal Rozenberg
Modified: 2024-04-19 14:01 UTC

Description Eyal Rozenberg 2024-03-18 09:26:09 UTC
Both before and after the resolution of bug 151290, it is useful to be able to choose an "undetermined" language for a stretch of text. There is an ISO 639-2 code for this, "und":

https://en.wikipedia.org/wiki/List_of_ISO_639-2_codes

It should be available in the list(s) of languages in the UI.
Comment 1 Stéphane Guillou (stragu) 2024-03-18 12:13:31 UTC
Fair enough, I can see how this could be useful in-app as well as for interoperability with specialised tools in which a distinction between "no language" and "undetermined" matters, e.g. transcription tools, applications in linguistics, script writing, databases and datasets in which a record's language is yet to be determined...
It could be listed at the top of the list as [Undetermined], next to [None].
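
To make the "no language" versus "undetermined" distinction concrete, here is a minimal sketch in Python. The four codes are the special-purpose codes reserved by ISO 639-2; the function names are hypothetical, and nothing here reflects how LibreOffice stores or maps its [None] entry internally, which this report does not state.

```python
# ISO 639-2 special-purpose codes. These are defined by the standard;
# the helper functions below are illustrative names only.
SPECIAL_CODES = {
    "und": "Undetermined",           # language exists but has not been identified
    "mul": "Multiple languages",     # text mixes several languages
    "zxx": "No linguistic content",  # nothing to assign a language to
    "mis": "Uncoded languages",      # language has no code of its own
}


def describe_special_code(code):
    """Return the label for a special-purpose code, or None for an ordinary code."""
    return SPECIAL_CODES.get(code.strip().lower())


def is_undetermined(code):
    """True only for 'und'; 'zxx' (no linguistic content) is a different case."""
    return code.strip().lower() == "und"
```

A tool consuming such tags could treat "und" as "language still to be detected" while skipping "zxx" text entirely, which is the kind of distinction the use cases above rely on.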
Comment 2 V Stuart Foote 2024-03-18 16:02:04 UTC
@Eike, Regina: I can't imagine this hasn't come up previously. Any recollections?
Comment 3 Eike Rathke 2024-04-03 13:30:52 UTC
Just type "und" into the character format language box and hit Enter.
It will be displayed as "Unknown language {und}" after that.
Comment 4 Eyal Rozenberg 2024-04-03 20:14:39 UTC
(In reply to Eike Rathke from comment #3)

Interesting... that's more than I knew was currently implemented.

Regardless of that point,

* This suggestion would only be relevant before bug 151290 (after which, language is no longer an aspect of formatting).
* There are 3 language boxes: Western, Complex, Asian. Why should the user do this in the Western language selection box? Should it work in any one of them, or in all of them?
* There's no list item to choose this with the mouse.
* There's no completion for it.
* ...and it's easy to mistake for Udmurt, which does have completion.
Comment 5 Eike Rathke 2024-04-11 17:03:46 UTC
I'll add "[Multiple]" {mul} and "[Undetermined]" {und} to the top of the list.
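
As an illustration only, the resulting list order could look like the Python sketch below. The function name, the data layout, and the relative order of the pinned entries are assumptions made for this example; the actual change is in the commit referenced in this report and is not reproduced here.

```python
def build_language_list(languages):
    """Return (display name, code) pairs with the special entries pinned on top.

    `languages` maps display names to codes. The pinned entries and their
    order are assumptions for illustration; [None] carries no code here.
    """
    pinned = [
        ("[None]", None),
        ("[Multiple]", "mul"),
        ("[Undetermined]", "und"),
    ]
    pinned_codes = {code for _, code in pinned}
    # Ordinary languages follow, sorted by display name, without
    # repeating any code that is already pinned.
    rest = sorted(
        ((name, code) for name, code in languages.items() if code not in pinned_codes),
        key=lambda item: item[0].casefold(),
    )
    return pinned + rest
```

With the special entries pinned, "[Undetermined]" can be picked with the mouse and no longer sits next to Udmurt in the alphabetical part of the list.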
Comment 6 Commit Notification 2024-04-12 10:07:30 UTC
Eike Rathke committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/core/commit/d4375cf6e6086c3b9cb169cbb40781e8b1124b4d

Resolves: tdf#160256 Add "[Multiple]" and "[Undetermined]" to language list

It will be available in 24.8.0.

The patch should be included in the daily builds available at
https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More
information about daily builds can be found at:
https://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.
Comment 7 Stéphane Guillou (stragu) 2024-04-19 14:01:00 UTC
Verified in:

Version: 24.8.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: bdf3b5ce49b0e4ee1b4525d344cfb037ef473059
CPU threads: 8; OS: Linux 6.5; UI render: default; VCL: gtk3
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: CL threaded

In release notes: https://wiki.documentfoundation.org/index.php?title=ReleaseNotes%2F24.8&type=revision&diff=748339&oldid=748277

Thanks Eike!