Bug Hunting Session
Bug 78371 - UNO: Add "Clear Direct Character Formatting" and "Clear Direct Paragraph Formatting" functions
Summary: UNO: Add "Clear Direct Character Formatting" and "Clear Direct Paragraph Form...
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
Inherited From OOo
Hardware: All All
: medium enhancement
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: UNO-Command-New Writer-Menus Clear-Formatting
  Show dependency treegraph
 
Reported: 2014-05-07 06:26 UTC by Milos Sramek
Modified: 2019-08-15 13:50 UTC (History)
7 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Milos Sramek 2014-05-07 06:26:48 UTC
Hi,
I work on translation of the LibreOffice guides to Slovak. I use for that the OmegaT translation tool, because it can read and write odt files.

I observe a problem, which can be perhaps solved by adding a new function "Clear Direct Character Formatting" along to the existing "Clear Direct Formatting".

The problem:
OmegaT (and other similar CAT tolls) breaks text in sentences, which are then translated. It honours formating, which means that the translator sees the formating tags (in OmegaT they are abbreviated e.g., to <f1> text </f1). 

If there is too much of direct formating, the text becomes untranslatable by CAT tools, because tags appear everywhere and also inside words. This is exactly the case of the LO guides. The remedy for this is to use styles and then to remove the direct formating. 

The problem is that "Clear Direct Formatting" removes all direct formating - in the case of LO guides it breaks headers, lists etc. The reason may be that the guides were not written "the right way". However, even authors of the LO guides do not use LO "the right way".

I think that introduction of the "Clear Direct Character Formatting" (cleaning up stuff only inside paragraphs withou influencing the rest) would solve this problem. It will make translation of odt files by CAT tools possible. 

I currently use a workaround for that: I unzip the odt file and clean the content.xml file by my own python script. It does not work perfect but still solves the problem. The proposed solution would make possible to translate odt files for all.

Thanks for considering this proposal
--
Milos
Comment 1 Owen Genat (retired) 2014-07-18 14:27:16 UTC
Seems like a reasonable request, especially as there is a clearly indicated use case. Status set to NEW.
Comment 2 Timur 2016-02-10 20:29:41 UTC
What you ask for exists in Apply Style dropdown as "Clear formatting".
But, it's not so obvious, especially that it's different from Format - Clear Direct Formatting, so I'd also like to have it added to Format menu.
Comment 3 Octavio Alvarez 2016-05-04 19:47:47 UTC
(In reply to Milos Sramek from comment #0)
> The problem:
> OmegaT (and other similar CAT tolls) breaks text in sentences, which are
> then translated. It honours formating, which means that the translator sees
> the formating tags (in OmegaT they are abbreviated e.g., to <f1> text </f1). 

Do you have a sample file and/or screenshots? It would help document and understand your use case better.

> The problem is that "Clear Direct Formatting" removes all direct formating -
> in the case of LO guides it breaks headers, lists etc. The reason may be
> that the guides were not written "the right way". However, even authors of
> the LO guides do not use LO "the right way".

I tried to reproduce it: I created a new document with sample text and applied lists (both, directly and by list styles) and a header. I did "Select All" and tested both "Clear Direct Formatting" and "Clear Formatting" from the "Apply Styles" dropdown [*]. The result was: lists and headers were kept appropriately. So I could not reproduce in LibO 5.1.3.1 on Debian Sid.

Maybe the "not using the right way" is getting in between? Can you point us to a sample problematic guide?

[*] "Clear Direct Formatting" does what it says: it clears just direct formatting (direct paragraph, character, etc.). OTOH, "Clear Formatting" from the dropdown will clear all direct formatting *and* set character styles to Default and paragraph styles to Text Body.

> I think that introduction of the "Clear Direct Character Formatting"
> (cleaning up stuff only inside paragraphs withou influencing the rest) would
> solve this problem. It will make translation of odt files by CAT tools
> possible.

I am also of the idea that "Clear Direct [__styletype__] Formatting" should exist somewhere.
Comment 4 Yousuf Philips (jay) (retired) 2017-04-23 23:45:26 UTC
Definitely useful UNO commands that could one day go into a group/split button for users to be more precise with what they want to do.