Excel has a feature where you can remove duplicates with a single click. This would be very helpful to have in LibreOffice as well.
Like this: http://milospjanic.blogspot.com/2011/10/how-to-remove-duplicates-in-libreoffice.html ?
Yes, that is what I was looking for. But I still think it should be an optional button/dialog available on the main toolbar without having to go through all that. Like this: https://thinkandbegin.files.wordpress.com/2012/05/remove-dup-2.png
*** Bug 92990 has been marked as a duplicate of this bug. ***
Below also another program that easily performs the procedure. WPS Spreadsheets http://pt-br.tinypic.com/r/2h663gj/8
Additional detail: in Excel, the function *removes* duplicates in-place, while filter in LO only allows hiding in-place, or removing by copying to a different location. So actually, LO lacks the functionality.
(In reply to Joel Madero from comment #1) > Like this: > http://milospjanic.blogspot.com/2011/10/how-to-remove-duplicates-in- > libreoffice.html ? IMHO, in Excel it made more clear for users. Different dialogue with very simple options. And in result in Excel we have only rows without any duplicates and without copying of result to another range...
*** Bug 109519 has been marked as a duplicate of this bug. ***
I would also like to request that "Remove Duplicates" is added to Calc. In Excel, removing duplicates from a column is a one click action. In Calc I have to go through multiple steps to obtain this. It's a waste of time if it's encessary to do it in multiple spreadsheets, it's a waste of time to make a macro and other spreadsheets offer this by default. There is an extension that adds this function to Calc so I propose that this is added to the default installation, that a .uno command is created for this button to be integrated in the Notebookbars as well. https://extensions.libreoffice.org/extensions/remove-duplicates
Created attachment 145904 [details] Remove Duplicates button in Data Ribbon in Excel 2016
Well, that extension apparently does not work in 6.1
Dear Markus and Eike, Would it be possible to use the code on the extension to provide this function by default on Calc? It is a really useful data manipulation option that Calc should have by default like most Office suites have.
*** Bug 73712 has been marked as a duplicate of this bug. ***
Well, Muhammet Kara after checking the extension confirmed that it is written in Basic, meaning that it is a macro. If someone would translate it to C++, it would be awesome.
*** Bug 124758 has been marked as a duplicate of this bug. ***
(In reply to Pedro from comment #13) > Well, Muhammet Kara after checking the extension confirmed that it is > written in Basic, meaning that it is a macro. > > If someone would translate it to C++, it would be awesome. Now there is faster extension https://extensions.libreoffice.org/extensions/remove-duplicates-fast for it
Couldn't this extension be added by default to LibreOffice?
(In reply to Roman Kuznetsov from comment #15) > (In reply to Pedro from comment #13) > > Well, Muhammet Kara after checking the extension confirmed that it is > > written in Basic, meaning that it is a macro. > > > > If someone would translate it to C++, it would be awesome. > > Now there is faster extension > https://extensions.libreoffice.org/extensions/remove-duplicates-fast for it Couldn't your extension be added by default to LibreOffice? Also, please consider adding support for it to create its own tab in the Tabbed Bar UI.
I don't know much about LibreOffice software and people are using it and I am the one who knows start working on it. The https://www.assignmenthelper.com.au/do-my-assignment/ share some of the tips and features which will help me more than for sure.
Since we have a working solution with the standard filter dialog it should be easy to add a new UNO command and run the filter procedure with a predefined setting. Code pointer sc/source/ui/view/tabvwshc.cxx #308 sc/source/ui/dbgui/sfiltdlg.cxx Would understand this as a medium to interesting difficulty.
Just to mention: this is not just filtering. It also deletes the cells with duplicate values. But I guess that is one minor thing to add.
pay someone to do assignments Swithering over unaccomplished and half-and-half assignments and the time is running on the fast track so you feel you can not catch it? Every time you pick up the topic you tangle up with the content in thinking how to start and finish it up? If these are your concerns, then pay someone to do assignments is the handpicked solution students and experts prefer. You can ask us for top-quality assignment and at My Assignment Helper Online, we are ready 24x7 to provide you the best assignment service. https://myassignmenthelperonline.com/pay-someone-to-do-my-assignment.html
Being the outstanding website development company in delhi It is renowned for providing beautiful websites and allows you to enjoy the seamless experience from website design and development to website optimization for various platforms for all your website design and development needs. We work both for higher rankings as well as for conversions. We are Top 5 Digital marketing company in delhi that ensures your business visibility on top of the search engines. At Wondermouse Technologies- website development company, our team has a wealth of experience in making sure it is easy to use and professional. https://www.wondermouse.in/web-development-company-in-delhi/ https://www.wondermouse.in/top-digital-marketing-company-in-delhi/ https://www.wondermouse.in/best-website-development-company-gurgaon/
https://www.univention.com/blog-en/2020/11/onlyoffice-docs-brand-new-name-and-big-updates-for-spreadsheets/ OnlyOffice, which is an open source (GPL3) office suite added the RemoveDuplicates function. Maybe we could use their function.
Here's the code pointer in OnlyOffice: https://github.com/ONLYOFFICE/sdkjs/blob/6df8cd8e34e4591dc5d35675cbb067498d40d9c7/cell/view/WorksheetView.js#L20104 https://github.com/ONLYOFFICE/sdkjs/blob/6df8cd8e34e4591dc5d35675cbb067498d40d9c7/cell/view/WorksheetView.js#L20176
(In reply to Pedro from comment #23) > https://www.univention.com/blog-en/2020/11/onlyoffice-docs-brand-new-name- > and-big-updates-for-spreadsheets/ > > OnlyOffice, which is an open source (GPL3) office suite added the > RemoveDuplicates function. > Maybe we could use their function. OnlyOffice doesn't use LibreOffice, so I don't see how we could use their function.
Maybe look at it and make something similar in LibO codebase. Thanks for the explanation on the Telegram channel.
Hello! Is there any news about function?
Having a group of super-expedient and deft academic experts truly helps in delivering express online assignment help in UK on time. You will make certain to get the most awesome aspect help with assignments within the timeframe that you set at the time of placing the request with us. <a href="https://www.greatassignmenthelp.com/uk/">assignment help</a>
Nice Bug tracking tool like team foundation. If you want to get more information related to your programming and tracking the programming, contact Australia Best Tutor
Created attachment 177433 [details] Only Office also acquired the Remove Duplicates functionality Only Office also got the Remove Duplicates functionality by default.
Created attachment 177434 [details] Remove Duplicates in Google Sheets This feature is also present in Google Sheets.
*** Bug 144744 has been marked as a duplicate of this bug. ***
Note that any feature request having a working extension with compatible license is already an easy hack. Just use its source code as the template that provides the required logic - it uses UNO commands, which may be easily converted to a C++ code, and assign a new UNO command to that new function. (Given that Remove Duplicates Fast is based on Remove Duplicates (https://github.com/ACTom/lo-extension-removeduplicates) which is GPLv3, which doesn't allow to use its code in LO directly because we need an MPL-compatible license, interested parties may ask the author for changed license in a github issue.)
What MPL specific license do you recommend?
Asked the dev in a new issue.
The developer already changed the license to MPL 2.0. Hopefully this allows someone to pick this up. :D
Important, also the values of the duplicate selected cells should be deleted and not the entire rows. Moving the remaining data up to leave the blank spaces below.
Created attachment 184664 [details] Why remove the whole row? The filter hides the entire 8th row, including the letter f (view image attached). The correct behavior would be to remove values 1 and 4 from row 8. Do not filter by hiding row 8 but move values 2 and 1 from row 9 up one cell and leave B9 and C9 empty. I know this was a standard filter, but the button that they put specifically to remove duplicates should not be a filter, but actually remove the duplicate values within the selection and not the entire row.
Created attachment 185548 [details] WPS Worksheets Manage Duplicates menu WPS Worksheets is vastly superior in managing duplicates to any other office suite. It not only allows for removal, but also highlighting and also fetching unique values or highlight them.
Created attachment 185549 [details] Dialog It allows selecting duplicates in selected range, in two ranges within the sheet, in different sheets in the worksheet and in different worksheets!
Created attachment 185550 [details] Highlight values
Created attachment 185551 [details] Fetch unique values This is a feature already available in MSO since at least 2007 and it's a feature that's been actively worked on in other Office suites as well, to a point where it is very well designed. All in all, this is a feature that is sorely lacking in Calc for several years now and with the evolution of this feature visible in OnlyOffice and WPS Worksheets (available in Linux as well), and with the RemoveDuplicates extension having compatible license with LibO it's hard to understand why this has been overlooked for so long now.
(In reply to Pedro from comment #42) > with the RemoveDuplicates > extension having compatible license with LibO it's hard to understand why > this has been overlooked for so long now. 0. both extensions https://extensions.libreoffice.org/en/extensions/show/remove-duplicates https://extensions.libreoffice.org/en/extensions/show/remove-duplicates-fast are licensed GPL (whatever version) and thus are *not* compatible with LibreOffice licensing. 1. even if they were, that tells nothing about the source code whether it would fit into LO core code (or even be in C++ that it could). 2. if those extensions fulfil the requirements, then why not use them.
If we realize bug 149933 searching for duplicates could be one option too.
(In reply to Eike Rathke from comment #43) > (In reply to Pedro from comment #42) > > with the RemoveDuplicates > > extension having compatible license with LibO it's hard to understand why > > this has been overlooked for so long now. > 0. both extensions > https://extensions.libreoffice.org/en/extensions/show/remove-duplicates > https://extensions.libreoffice.org/en/extensions/show/remove-duplicates-fast > are licensed GPL (whatever version) and thus are *not* compatible with > LibreOffice licensing. > > 1. even if they were, that tells nothing about the source code whether it > would fit into LO core code (or even be in C++ that it could). > > 2. if those extensions fulfil the requirements, then why not use them. Eike Rathke, the developer changed the license on his github repo to MPL 2.0.
Created attachment 185837 [details] License changed to MPL2.0
The Fast extension was an improvement done by Mike Kaganski and Kompilainnen I believe. They did not change the license on their extension yet.
Am definitely missing this in Calc right now.
(In reply to Pedro from comment #47) > The Fast extension was an improvement done by Mike Kaganski and Kompilainnen > I believe. They did not change the license on their extension yet. Since our extension was based on the previous one, our license was necessarily the same. Since the old extension's license is now MPL 2.0, I am glad to re-license my work under MPL 2.0. Roman's turn.
I'm assuming this bug is about adding the command. If we also want to simplify/alter the filtering dialog - that should be a separate bug. (If I'm wrong - please change the title, clarify the bug's scope in a comment, and refer to the comment in the title)
(In reply to Eyal Rozenberg from comment #50) The original request was to implement a feature *like Excel's "remove duplicates"*. Filters are orthogonal to that, they never remove any duplicates, only hide or do a partial copy.
Your work is truly remarkable! Thank you for providing us with this wonderful website and all your hard work.
Should the function run based on values or formulas? In other words is =1+1 the same as =2?
(In reply to Heiko Tietze from comment #53) > Should the function run based on values or formulas? In other words is =1+1 > the same as =2? Values: =2 or =1+1 and ="b" or =char(98) are the same. Only first cell must remain.
(In reply to Heiko Tietze from comment #53) > Should the function run based on values or formulas? In other words is =1+1 > the same as =2? FYI Excel does consider =2 and =1+1 as duplicates. It seems Excel only considers the cell value that is being shown, regardless of the formula. TBH I find it a bit intrusive. But I believe many users will want this feature to behave similarly to what Excel does.
The objective of Remove Duplicates is to remove duplicates of values, not formulas or calculations. There's a reason this is in the Data tab of excel and not in Formulas. Initially, keeping the scope focused on having a Remove Duplicates that simply removes duplicates of values is the most important. If Sahil Gautam is motivated to keep working on this afterwards then maybe this can be expanded upon in the future much like WPS Office did in their Worksheets module (their Excel equivalent). WPS Office has the implementation with more functionalities of Remove Duplicates.
(In reply to Pedro from comment #56) > The objective of Remove Duplicates is to remove duplicates of values, not > formulas or calculations. Do you argue that =1+2 is not the same as =2+1? Or =1+3 != =2+2. Or ="b" != =char(98). And I'm against a dialog here to fine-tune the operation. Makes the workflow heavy.
(In reply to Heiko Tietze from comment #57) > (In reply to Pedro from comment #56) > > The objective of Remove Duplicates is to remove duplicates of values, not > > formulas or calculations. > > Do you argue that =1+2 is not the same as =2+1? Or =1+3 != =2+2. Or ="b" != > =char(98). > > And I'm against a dialog here to fine-tune the operation. Makes the workflow > heavy. Being able to put a dialog where you can mark if you want to look at formulas instead of values: I think that in case of looking at formulas and not values, the formulas that are repeated changing only the relative references, should be considered the same. I think that =A3*2 in row 5 should be the same as =A4*2 in row 6. Before checking if they are equal, the relative references part should be removed from the formulas (without $ before) for checking.
(In reply to Heiko Tietze from comment #57) > And I'm against a dialog here to fine-tune the operation. Makes the workflow > heavy. Oh :-D LOL. You simply can't have this without a dialog. At all. The "duplicate" concept is *SO COMPLEX*, that you simply can't make all agree on your definition of it. See text import dialog for a similar complexity. Or sort. People might want to remove duplicates based on some subset of columns (but remove all the cells in the area). They might want formulas to make the difference. They might want to treat equality of numbers with epsilon, or use "text as shown". They might want to work by rows or by columns. They might want to shift up or right.
(In reply to Mike Kaganski from comment #59) > The "duplicate" concept is *SO COMPLEX*... This is exactly what I mean. You cannot implement a swiss-army knife for every scenario. If the one inbuilt function is not sufficient in some _rare_ use cases, those need to be accomplished by alternative methods. But the primary workflow should be supported as easy as possible- ie. one click to remove duplicates, as the function label says.
(In reply to Heiko Tietze from comment #60) > This is exactly what I mean. You cannot implement a swiss-army knife for > every scenario. Yes you can. And you need to. Just because we are the office suite, and not a tool for one single task. See how we *do* try to implement it in case of filtering, or text import, or file format support. See how other office suite implements it. Excel provides a dialog. Google Sheets provides a dialog. People here expect a dialog. It is simply unavoidable.
> This is exactly what I mean. You cannot implement a swiss-army knife for every scenario. If the one inbuilt function is not sufficient in some _rare_ use cases, those need to be accomplished by alternative methods. But the primary workflow should be supported as easy as possible- ie. one click to remove duplicates, as the function label says. Heiko did you even try to use this function in Excel or any other office suite before commenting? A dialog is REQUIRED if not even for the case that you need to have a "this data has headers" check mark. The button in Excel opens two sequential dialogs. Even the Calc extension requires a dialog. If you don't feel this is essential because it doesn't fit your needs please don't try to cripple something that is crucial and sorely missing from Calc. The Excel function simplifies things by referring to Remove Duplicate Values. This does not include formulas since the formula is not a value but provides you with one. Take into consideration that in Excel if you expand duplicate removal to multiple columns, you also need to select the columns that count for duplicates. If you select multiple columns, it will only remove rows when both selected columns have duplicates.
(In reply to Pedro from comment #62) > did you even try to use this function... Am I disallowed to comment if not? (No need to answer, totally off-topic; ultimately it means "implement exactly as I need")
So We concluded on "dialog needed".
(In reply to Pedro from comment #62) > A dialog is REQUIRED if not even for the case that you need to have a "this > data has headers" check mark. The button in Excel opens two sequential > dialogs. Indeed Excel opens a dialog before removing duplicates, but the dialog does not offer many options... f.i. there's nothing about how to handle cells with formulas. I understand the need for a dialog here... but there should also be a way to simply click a button and remove duplicates without a dialog disrupting the workflow, similarly to what we have with "Sort" (which has a dialog) and "Sort ascending" (no dialog needed).
(In reply to Rafael Lima from comment #65) > but there should also be a way to > simply click a button and remove duplicates without a dialog disrupting the > workflow, similarly to what we have with "Sort" (which has a dialog) and > "Sort ascending" (no dialog needed). No. This *might* turn out to be useful - when you have implemented the dialog, and then start seeing requests to "just make my Click the toolbar - then press Enter difficult sequence easier, because pressing Enter is do much tiresome" requests. It may be justified by the user demand - but not made pro-actively: multiplying UNO commands without the sizable demand is exactly the bloat that should be avoided, not the dialogs that user ask for.
(In reply to Mike Kaganski from comment #66) > multiplying UNO commands without the sizable demand is > exactly the bloat that should be avoided, not the dialogs that user ask for. AFAIK it would be possible to have both (dialog / non-dialog) functionalities with a single UNO command, depending on the arguments passed to it. Another real use-case of removing duplicates without the dialog would be for writing macros, where all parameters of the UNO command would be provided by the macro and no dialog would be necessary. In summary... I'm in favor of having a dialog here, but it would also be cool to have the ability to run this UNO command without showing the dialog as well.
(In reply to Rafael Lima from comment #67) > AFAIK it would be possible to have both (dialog / non-dialog) > functionalities with a single UNO command, depending on the arguments passed > to it. This is exactly why I asked for an optional argument for the UNO command, when reviewed the proposed patch.
*** Bug 159980 has been marked as a duplicate of this bug. ***
(In reply to Pedro from comment #39) > Created attachment 185548 [details] > WPS Worksheets Manage Duplicates menu > > WPS Worksheets is vastly superior in managing duplicates to any other office > suite. > It not only allows for removal, but also highlighting and also fetching > unique values or highlight them. great
great
Sahil Gautam committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/29fd68bb682006ccaa5aaed516c064b5b6368463 tdf#85976 [RFE] Add a "Remove Duplicate Records" command It will be available in 25.2.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
I see these problems: (1) On first call, "by column" was selected although that is the less common case. (2) The meaning of "Orientation" is not really clear. Suggestion: Compare: rows columns (3) The purpose of "Items" is unclear. (4) Help page does not exist. (5) If a database range is selected, the dialog does not consider the property "Contains column labels" of the database range. (6) It is not usual to use "Okay", but other dialogs have it named "OK". (7) Command has no extended tip.
The help page patch needs code review https://gerrit.libreoffice.org/c/help/+/173142 I asked Oliver & Ilmari for merging it but they said it can only be done after the feature patch is released.
Sahil Gautam committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/help/commit/0d61990c85ed2135a6064b6caf1e989e820cd65c tdf#85976 Help page for HandleDuplicateRecords dialog
The Help page was approved to provide a landing page for the dialog and UNO command. A follow up is needed to fix typos and improve style.
@Regina, I fixed most of them in https://gerrit.libreoffice.org/c/core/+/174201 I didn't understand the 5th point about database range. I tested it with plain rows and columns in calc. Is it supposed to do more than that? Can you please provide some steps to reproduce the issue...
(In reply to Sahil Gautam from comment #77) > @Regina, I fixed most of them in > https://gerrit.libreoffice.org/c/core/+/174201 > I didn't understand the 5th point about database range. I tested it with > plain rows and columns in calc. Is it supposed to do more than that? Can you > please provide some steps to reproduce the issue... https://help.libreoffice.org/latest/en-US/text/scalc/guide/database_define.html Regina: are label ranges relevant as well? https://help.libreoffice.org/latest/en-US/text/scalc/01/04070400.html
Created attachment 196782 [details] Database ranges examples to (5): A database range is an <table:database-range> element in file format. It has the attribute 'table:contains-header' with values 'true' (default) and 'false', and the attribute 'table:orientation' with values 'row' (default) and 'column'. The attributes are only written to file, if the value is not default. This results in 4 combinations. The attachment has for each one a database range. Each database range is on a sheet of its own. Database range and sheet are named according the desired combination. The 'table:orientation="column"' case is not fully implemented in LibreOffice, but only for sorting. It does not exist in the 'table'-feature of Excel. You should control, whether the settings are imported correctly and adjust them if necessary, before you make tests with the new "remove duplicate" feature. To see the settings set the cursor into the data range. Then go to menu Data > Sort. The name of the database range and the sheet says you, which setting is intended. You can see and alter the 'label'-settings too in menu Data > Define Range. Select a range and open the "Options" part. Depending on the range, the option 'Contains column labels' should be checked or not. To select a database range use menu Data > Select Range. From point of a user I expect, that when I select a database range, the new "show/remove" duplicate records dialog has pre-selected the orientation and label settings according to the settings in the database range. to (3): With renaming items -> records the purpose is still not clear. Perhaps you explain here in the bug report the purpose in detail. Then we can help to find a short description.
(In reply to Buovjaga from comment #78) > > Regina: are label ranges relevant as well? > > https://help.libreoffice.org/latest/en-US/text/scalc/01/04070400.html Yes, if the selection is not a database range but has cells which belong to a label range, then the dialog should open with the corresponding settings pre-selected. As user I would expect this, but I don't know how effortful it is to implement. And a further remark: (8) A rectangular cell range can have rows/columns, which are directly hidden by the user, or they can be hidden by a filter, or they can be hidden in a collapsed group. Are such hidden rows/columns effected by "remove duplicates"? Are such hidden rows/columns evaluated for the criterion "duplicate"?
Created attachment 196787 [details] duplicate records dialog evolution
(8) Yes, the dialog also considers the hidden rows/columns for duplicates comparison. (3) The UI was designed by Heiko (please refer to the "duplicate records dialog evolution" attachment). It grouped the relevant controls/widgets into sections like "Actions:", "Items:" (now "Records:" in the latest patch (still not final)). The design to me looked like "Howard Roark's designs from *The Fountainhead*" :), I just couldn't dare to change the masterpiece;
(In reply to Regina Henschel from comment #79) > From point of a user I expect, that when I select a database range, the new > "show/remove" duplicate records dialog has pre-selected the orientation and > label settings according to the settings in the database range. Code pointer for Sahil... check ScTabPageSortFields::Init() https://opengrok.libreoffice.org/xref/core/sc/source/ui/dbgui/tpsort.cxx?r=57c7269f#116 Which is where the dialog checks the database range properties and set them in the Sort dialog.
In regard to (3) and the help text for it: The current help has: <paragraph role="paragraph" id="par_id61725963172527"><emph>Items:</emph> shows the headers for the selected records. If "data includes headers" checkbox is checked, then it contains the headers of the records, else it's either the row number or the column name depending on the orientation. The user can select/unselect the records to be compared. In the column header, it contains a checkbox to toggle state for all the records in the treeview.</paragraph> From my tests, I would say that the sentence "The user can select/unselect the records to be compared." is wrong. Content problem: In the language of databases, the user does not select/unselect "records" but "fields". Example: Animal | Location | Year Deer | West | 2023 Deer | West | 2021 Deer | East | 2021 Deer | East | 2023 If "all" is selected, all four records are different and thus retained. If only "Animal" and "Location" is selected, then the field "Year" is not used in the comparisons. The comparisons actually use the shortened records: Deer | West Dear | West Deer | East Deer | East Thus these records are retained: (assuming first in top-down direction) Deer | West | 2023 Deer | East | 2021 I have no solution for (3) because the term "field" is related to databases and very technical. But "record" is surely wrong. Perhaps talk to Heiko and Olivier? Style problem: The help addresses the user. Therefore "The user can select/unselect..." is an unsuitable wording. Common are wordings with imperative, e.g. "Select foo", "Choose bar", or descriptive "Selects ...", "Shows ...", "Displays", or wordings with "You ...", or constructs of the form "To get result foo, do bar."
In regard to (8): If I use a filter (Auto, Standard and Advanced) on the range and that hides some rows, and when I then call the "Duplicate Records..." dialog, I get the error message "No Data found to operate on." This does not happen, when I hide some rows manually or have some rows in a collapsed group. Is it intended, that the feature does not work for a cell area with active filter? If yes, the error message and the help should tell this.
If "By Row" is selected, the text for Headers is "Data contains row headers". That is misleading. If "By Row" is selected it compares and deletes rows. Thus the text should be "Range contains column labels". The text should be the same as in the Sort dialog and in the Options in the "Define Database Range" dialog. And the other way round, if "By Column" is selected, it should be "Range contains row labels".
(In reply to Regina Henschel from comment #85) > [...] I get the error message "No Data found to operate on." > > This does not happen, when I hide some rows manually or have some rows in a > collapsed group. > > Is it intended, that the feature does not work for a cell area with active > filter? If yes, the error message and the help should tell this. The error message only shows up if the the active cell is not on the data (or very next to it). I tried it with autofilter, and couldn't reproduce it. I had 4 rows with 1231 abca (2 columns), and then I hid the second row ( |2|b|), and called duplicate records when the cell focus was on |3|c(here)|, the dialog appeared as expected.
Steps to reproduce the problem: 1. Open attachment 196782 [details] "Database ranges examples" and go to sheet "VertLabel". 2. Menu Data > Select Range. In that "Select Database Range" dialog select "VertWithLabel". OK. Now the range is selected and the cell cursor is on A1. 3. Menu Data > More Filters... > Standard Filter. Select Field name "Region", Condition = and Value "East". OK. Now the range shows only records with Region "East". Cell cursor is still on A1. 4. Menu Data > Duplicate Records... => Error message.
Sahil Gautam committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/7a1636a24f8a4c856348bb6781aef4a494227691 tdf#85976 change labels as suggested in comment 73 on the ticket It will be available in 25.2.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
If possible add this action to the Data tab of the Tabbed UI. Thank you for your great work Sahil!
The "Records" element group (the "All" and the list with checkboxes) is completely unclear from the user point of view. If I didn't know what it means, I will never guess that it's "what to compare to decide if two records (which actually contain all columns, not only checked) are duplicates or not". The help (still mentioning old "Items" term) doesn't help to understand that, too. It talks what it shows, but that's actually unimportant; the important is that the checkboxes define what would be "primary key" in SQL.
Some variation of "Compare by:" or "Fields to compare:" would be much better label.
Patch https://gerrit.libreoffice.org/c/core/+/175704 changes "Compare: " to "Compare by: " and "Records: " to "Rows: " (if compare by columns is selected), and "Columns: " (if compare by rows is selected). Suggestions for better labels are always welcome!
patch https://gerrit.libreoffice.org/c/core/+/175764 adds "Handle Duplicate Records" button to the notebookbar. The label "Handle Duplicate Records" is very long, takes so much space on the notebookbar. I have "Handle Duplicates" in my mind, need more suggestions...
(In reply to Sahil Gautam from comment #94) > patch https://gerrit.libreoffice.org/c/core/+/175764 adds "Handle Duplicate > Records" button to the notebookbar. The label "Handle Duplicate Records" is > very long, takes so much space on the notebookbar. I have "Handle > Duplicates" in my mind, need more suggestions... Just "Duplicates"? Besides, bug 163117
I would keep the name as close as possible to other Office suites. This is only a "Remove Duplicates" action ? Then name it Remove Duplicates as that is the convention in other office suites and users will be familiarized to look for that.name of this action in : Google Sheets - Remove Duplicates Excel - Remove Duplicates Only Office - Remove Duplicates WPS Spreadsheets (more features besides removing duplicates) - Manage duplicates Therefore call it "Remove Duplicates" or "Manage Duplicates". Just "Duplicates" doesn't say an action to take.
(In reply to Pedro from comment #96) > I would keep the name as close as possible to other Office suites. > This is only a "Remove Duplicates" action ? It also selects duplicates (other than removing them) depending on which option is selected in the "Action" section of the dialog.
(In reply to Sahil Gautam from comment #97) > (In reply to Pedro from comment #96) > > I would keep the name as close as possible to other Office suites. > > This is only a "Remove Duplicates" action ? > > It also selects duplicates (other than removing them) depending on which > option is selected in the "Action" section of the dialog. I would name it "Manage Duplicates" like WPS Spreadsheets then.
Sahil Gautam committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/22949c16e65fccd40d6313c6b6c0d7906f72a999 tdf#85976 Change label from "Handle Duplicate Records" to "Duplicates" It will be available in 25.2.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Sahil Gautam committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/965287a9edb982b4f1857e7a57a73f0bdfd7e330 tdf#85976 Make labels more intuitive in "Duplicate Records Dialog" It will be available in 25.2.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Please add to https://wiki.documentfoundation.org/ReleaseNotes/25.2.
(In reply to Mike Kaganski from comment #92) > Some variation of "Compare by:" or "Fields to compare:" would be much better > label. @Mike I think you suggested these labels as a replacement for "Records: " and not the first label "Compare: " []rows []columns? I changed "Compare: " to "Compare by: ", and (while reading the updated labels for updating help page) I felt something was wrong.
Sahil Gautam committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/608e1452c51efa4f9bbcea8ed9a538ff974eed28 tdf#85976 Add 'Handle Duplicate Records' button to the notebookbar It will be available in 25.2.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.