Bug 74665 - Ability to generate a histogram from non-pre-binned data
Summary: Ability to generate a histogram from non-pre-binned data
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc (show other bugs)
Version:
(earliest affected)
Inherited From OOo
Hardware: All All
: medium enhancement
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: Data-Statistics
  Show dependency treegraph
 
Reported: 2014-02-07 12:16 UTC by Tomaz Vajngerl
Modified: 2024-02-21 00:16 UTC (History)
11 users (show)

See Also:
Crash report or crash signature:


Attachments
screenshot of results in Gnumeric (data and plot) (82.88 KB, image/png)
2024-02-21 00:16 UTC, Stéphane Guillou (stragu)
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Tomaz Vajngerl 2014-02-07 12:16:20 UTC
Add missing "Histogram" function from Data Analysis Toolkit
Comment 1 Stanislav Horacek 2015-02-24 07:21:28 UTC
*** Bug 82716 has been marked as a duplicate of this bug. ***
Comment 2 Jean-Baptiste Faure 2015-02-24 21:18:49 UTC
changed source and target. Make this bug the duplicate of bug 82716.

Best regards. JBF

*** This bug has been marked as a duplicate of bug 82716 ***
Comment 3 Tomaz Vajngerl 2015-02-24 23:49:23 UTC
''Bug 82716 isn't directly related to this one. That bug is about easier creating a histogram chart so you don't have to put extra histogram related data into the sheet - a dedicated histogram chart type. This one is about calculating the histogram from data and putting the results back into the sheet. 

I'll put this one back to status NEW.
Comment 4 Gerry 2015-08-09 09:22:04 UTC
For competitive comparison, MS Office 2016 is going to implement a Histogram chart type: https://blogs.office.com/2015/07/02/introducing-new-and-modern-chart-types-now-available-in-office-2016-preview/
Comment 5 Gerry 2016-02-13 11:02:04 UTC
Isn't bug 81530 (which is linked to this page as "See also") actually a duplicate? 

Both bugs refer to the chart type HISTOGRAM, while this bug refers to its entry in the Data Analysis Toolkit and the other bug in the Insert -> Chart types.
Comment 6 Tomaz Vajngerl 2016-02-13 20:08:42 UTC
No.. that are 2 exclusive features. You can make a histogram function for the data analysis toolkit without adding a histogram chart type and the other way around.
Comment 7 Xisco Faulí 2017-07-13 12:23:58 UTC
Setting Assignee back to default. Please assign it back to yourself if you're
still working on this issue
Comment 8 Heiko Tietze 2017-08-02 14:06:15 UTC
(In reply to Tomaz Vajngerl from comment #6)
> No.. that are 2 exclusive features. You can make a histogram function for
> the data analysis toolkit without adding a histogram chart type and the
> other way around.

Tomaz, what do you expect from a 'histogram' function that is not available with frequency()? https://help.libreoffice.org/Calc/Array_Functions#FREQUENCY
The common understanding of a histogram is the graphical representation of what frequency calculates.
Comment 9 Tomaz Vajngerl 2017-08-02 15:50:14 UTC
This is not about adding a new HISTOGRAM function.

This is all about quickly preparing data to create data for a histogram which will use existing formulas (like FREQUENCY). Bin calculation is the key here, which will be created automatically depending on user input (number of bins) and other options. See how Analysis Toolkit works in Excel..
Comment 10 Yousuf Philips (jay) (retired) 2017-08-03 11:30:32 UTC
Feature is available in Excel and Gnumeric, so there isnt any reason not to also have it in Calc.
Comment 11 Heiko Tietze 2017-08-03 12:44:50 UTC
The design team talked about the request in the meeting. The requested Data > Statistics wizard is generic, so no mockup is needed. First, the bins have to be calculated equidistantly between min and max based on the user input on how many, followed by frequencies() and the histogram generation.
Comment 12 Eyal Rozenberg 2023-03-10 19:34:27 UTC
I'm not quite sure I understand what this bug is about.

Does LO have a "data analysis toolkit"? Is this asking for some special wizard for many statistical operations which also involves histograms?

I'd like like to just be able to generate histograms (= two-column bin, frequency tables) from unbinned data, numerical or categorical. Is this the right bug for that or should I open another one?
Comment 13 Tomaz Vajngerl 2023-03-11 01:44:35 UTC
(In reply to Eyal Rozenberg from comment #12)
> I'm not quite sure I understand what this bug is about.
> 
> Does LO have a "data analysis toolkit"? Is this asking for some special
> wizard for many statistical operations which also involves histograms?

Yes, we have most functions found in "data analysis toolkit" in Data -> Statistics. And yes - just like Histogram in the "data analysis toolkit". 

> I'd like like to just be able to generate histograms (= two-column bin,
> frequency tables) from unbinned data, numerical or categorical. Is this the
> right bug for that or should I open another one?

Sounds like the correct place.
Comment 14 Tomaz Vajngerl 2023-03-11 01:45:33 UTC
BTW.. looking at this - this could be an easy hack with difficulty "interesting"...
Comment 15 Xisco Faulí 2023-03-13 11:38:35 UTC
(In reply to Tomaz Vajngerl from comment #14)
> BTW.. looking at this - this could be an easy hack with difficulty
> "interesting"...

Adding Hossein
Comment 16 Stéphane Guillou (stragu) 2024-02-21 00:16:36 UTC
Created attachment 192673 [details]
screenshot of results in Gnumeric (data and plot)

This is an example output of Gnumeric 1.12.51's Statistics > Descriptive Statistics > Frequency Tables > Histogram dialog.
It can generate a chart as well as the underlying binned data, so essentially this report + bug 81530 + bug 82716 all into one dedicated dialog.

The dialog has many options in the following tabs:
- Input (range can be grouped, and use labels)
- Cutoffs (can come from a range too)
- Bins (how boundaries are split between bins)
- Graphs and Options (chart or not chart, what kind of chart, percentages or fractions, cumulative or not...)
- Output (location and formatting)

Quite powerful.