Bug 166665 - Error in the statistic file provided in the documentation
Summary: Error in the statistic file provided in the documentation
Status: RESOLVED FIXED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Documentation (show other bugs)
Version:
(earliest affected)
25.2.3.2 release
Hardware: All All
: medium normal
Assignee: Julien Nabet
URL:
Whiteboard: target:25.8.0
Keywords:
Depends on:
Blocks: HelpGaps-NewFeatures Help-Changes-Features
  Show dependency treegraph
 
Reported: 2025-05-20 11:58 UTC by J22Gim
Modified: 2025-05-25 09:10 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:


Attachments
corrected file (17.04 KB, application/vnd.oasis.opendocument.spreadsheet)
2025-05-20 14:35 UTC, J22Gim
Details

Note You need to log in before you can comment on or make changes to this bug.
Description J22Gim 2025-05-20 11:58:18 UTC
Description:
The dataset and the analysis to demonstrate how to perform ANOVA in Calc are contained in the following link:
Link 1: https://help.libreoffice.org/latest/en-US/text/scalc/01/statistics_anova.html 

On that webpage, a file is offered, which is located at the following link:
Link 2: https://help.libreoffice.org/latest/media/files/scalc/statistics_data.ods

The problem is that both the dataset and the analysis in the provided file are different from those shown on the website. 
The file is wrong, the website is right.

What is wrong with the file?
The ANOVA tab has a dataset (B3:D15). For some reason, the last number of the 'Maths' column is not a number but a text. You can confirm this by seeing how many numbers are counted in G6:G9. The last number (B15) is formatted as text. And because of that, the whole ANOVA analysis is wrong. (Also, for some reason, there is a space in that column as well (B8), although this is not relevant for the analysis at this stage).

The dataset is correctly shown on the website (link 1). If you edit the statistics_data.ods file and change cell B15 to a number, then the 'ANOVA - Single Factor' output (F3:L14) is updated and shows correct results, which are the same as on the website.

Steps to Reproduce:
It's all written in the description.

Actual Results:
.

Expected Results:
.


Reproducible: Always


User Profile Reset: Yes

Additional Info:
.
Comment 1 J22Gim 2025-05-20 14:35:55 UTC
Created attachment 200888 [details]
corrected file

The attached file has the corrected dataset. I only modified one single cell (formatted $ANOVA.B15 as number), didn't touch anything else.
Comment 2 Julien Nabet 2025-05-20 19:59:44 UTC
On pc Debian x86-64 with master sources updated today, I confirm this.

There are one file per locale to change.
These have been added with: ef6f345505552524570d22387543bf45ffa4f8f4
"Add sample file for Calc Data Statistics"
(see https://gerrit.libreoffice.org/c/help/+/164319).

If all these files must be changed manually, it would be good to know if the space in B8 should be removed too.

Olivier: I don't know if there's a quick way to make the changes in all these files.
If there's no quick way perhaps we should seize the opportunity to convert them in fods so if they need another change, a quick replace may be possible.
Comment 3 J22Gim 2025-05-21 12:46:56 UTC
(In reply to Julien Nabet from comment #2)
> 
> If all these files must be changed manually, it would be good to know if the
> space in B8 should be removed too.
> 

Well, when one is teaching how to use a statistical/data science method, you'd use a clean, simple dataset. Maybe it's ok that it is unbalanced (i.e., some values are missing) if you intend to show that the test is robust against this. But the space may be problematic for some purposes, so I would definitely remove it if possible.

Ideally, the table should be the simplest possible: only numbers.
Complication ok-ish: missing values (i.e. empty cells).
Please remove: spaces or any kind of text.

Thanks for looking into this, and for this great software!
Comment 4 Julien Nabet 2025-05-24 09:28:08 UTC
I've just a sent a first patch here for en-GB, en-US and en-ZA UI:
https://gerrit.libreoffice.org/c/help/+/185734

Waiting for Olivier's feedback to know if it's ok and so if I can keep on.
Comment 5 Commit Notification 2025-05-24 13:16:55 UTC
Julien Nabet committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/help/commit/d42cdacbd8c27fd4ff5be331031b25724bf418bf

Related tdf#166665: change in en-GB, en-US and en-ZA files
Comment 6 Commit Notification 2025-05-24 13:22:58 UTC
Olivier Hallot committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/help/commit/f946fe66f4a28a46f4530ed45ed719fa8a09ab87

tdf#166665 Correct statistics example file
Comment 7 Commit Notification 2025-05-25 08:07:46 UTC
Julien Nabet committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/help/commit/5ed646461fd12fd8770e2812015ce4074e595c37

Related tdf#166665: am/statistics_data.ods until ./ca-valencia/statistics_data.ods
Comment 8 Commit Notification 2025-05-25 08:16:49 UTC
Julien Nabet committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/help/commit/6fdaaf839e51d33b781dd31b877b583ebf30d2ea

Related tdf#166665: from cs/statistics_data.ods to soffice eu/statistics_data.ods
Comment 9 Commit Notification 2025-05-25 09:09:56 UTC
Julien Nabet committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/help/commit/3b9dbcec0f18073dd622b6f759ec154695abc51e

Related tdf#166665:  from fi/statistics_data.ods until the end
Comment 10 Julien Nabet 2025-05-25 09:10:53 UTC
After these patchs, I suppose we need to wait for synchro with wiki help.