Created attachment 190068 [details] pdf for testing purposes libreoffice --headless --infilter="calc_pdf_addstream_import" --convert-to xlsx:"Calc MS Excel 2007 XML" Untitledspreadsheet-Sheet1.pdf Error: source file could not be loaded
I think that conversion, it's not available.
So I don't have any instruments in command line to do this, even with double or triple conversion?
I think so. https://ask.libreoffice.org/t/does-libreoffice-have-a-convert-from-pdf-to-csv-or-ods-function/38134
But the filter "calc_pdf_addstream_import" seems to exist, even though it is not exposed in the GUI. In sdext/source/pdfimport/config/pdf_import_filter.xcu, it has the flags "NOTINFILEDIALOG NOTINCHOOSER" I only see it used here: https://opengrok.libreoffice.org/xref/core/sdext/source/pdfimport/filterdet.cxx?r=9dd0af94#357
For Calc I didn't find another import filter for pdf in documentation. But anyway I tried different combinations and nothing helped me to convert pdf to xlsx
As was mentioned by Mike Kaganski on IRC, the filter is specifically for hybrid PDFs, as shown in line https://opengrok.libreoffice.org/xref/core/sdext/source/pdfimport/config/pdf_import_filter.xcu?r=0f613adb#197 I think the error message should be better than "file not found". Something like "input file is not a hybrid PDF" would be a whole lot better.
Created attachment 190099 [details] hybrid PDF with embedded ODS You can test with this PDF as input to see that the filter does work when the right format is supplied.