Invalid cell coordinate 1 during IOFactory::load #2501

petruchek · 2022-01-14T20:36:20Z

This is:

- [X] a bug report
- [ ] a feature request
- [ ] **not** a usage question (ask them on https://stackoverflow.com/questions/tagged/phpspreadsheet or https://gitter.im/PHPOffice/PhpSpreadsheet)

What is the expected behavior?

The $spreadsheet object is created successfully.

What is the current behavior?

Error: PhpOffice\PhpSpreadsheet\Exception: Invalid cell coordinate 1

What are the steps to reproduce?

Please provide a Minimal, Complete, and Verifiable example of code that exhibits the issue without relying on an external Excel file or a web server:

<?php

require __DIR__ . '/vendor/autoload.php';

$spreadsheet = \PhpOffice\PhpSpreadsheet\IOFactory::load("./example.xlsx");

Which versions of PhpSpreadsheet and PHP are affected?

Latest (1.21.0) is still affected, tested on PHP 7.4.

File to reproduce

I'm attaching the file - I don't know how it was generated, but I can open it with Excel 2013 without any warnings. When I re-save the file without making any changes, the problem disappears (but the file is obviously not the same one - even the file size differs). It looks like the original file has some unusual structure that PhpSpreadsheet is unable to handle correctly.
example.xlsx

oleibman · 2022-01-15T15:16:46Z

Thank you for supplying your example file. It has a merge range stored as 1:1, i.e. the entire first row is merged as cell A1. PhpSpreadsheet is not expecting this. When you resave the file with Excel, the merge range is now stored as A1:XFD1, and PhpSpreadsheet does understand that.
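
To illustrate the difference between the two stored forms, here is a minimal sketch of how a whole-row or whole-column merge range could be expanded into the fully qualified form that Excel writes on re-save. The function `normalizeMergeRange` and the constants are hypothetical names chosen for this example; this is not PhpSpreadsheet's API and not necessarily how the actual fix works. It assumes the Xlsx limits of column XFD and row 1048576.

```php
<?php

// Hypothetical helper for illustration only; not part of PhpSpreadsheet.
const MAX_COLUMN = 'XFD';  // last column available in the Xlsx format
const MAX_ROW = 1048576;   // last row available in the Xlsx format

function normalizeMergeRange(string $range): string
{
    // Whole rows, e.g. "1:1" or "2:5": add the first and last column.
    if (preg_match('/^(\d+):(\d+)$/', $range, $m) === 1) {
        return 'A' . $m[1] . ':' . MAX_COLUMN . $m[2];
    }

    // Whole columns, e.g. "A:C": add the first and last row.
    if (preg_match('/^([A-Z]+):([A-Z]+)$/i', $range, $m) === 1) {
        return strtoupper($m[1]) . '1:' . strtoupper($m[2]) . MAX_ROW;
    }

    // Anything else (e.g. "B2:D4") is assumed to be fully qualified already.
    return $range;
}
```

With this sketch, `normalizeMergeRange('1:1')` gives `A1:XFD1`, the form that loads without the exception.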

This is a relatively easy oversight to fix. But, in doing so, it appears that the logic for setting up the merge range within PhpSpreadsheet is very costly in terms of memory and speed, a problem which isn't really exposed until you deal with large ranges like that. So, the fix, which I am working on, will be a little more complicated than merely fixing the reported problem.

Fix PHPOffice#2501. Merge range can be supplied as entire rows or columns, e.g. `1:1` or `A:C`. PhpSpreadsheet is expecting a row and a column to be specified for both parts of the range, and fails when the unexpected format shows up.

The code to clear cells within the merge range is very inefficient in terms of both memory and time, especially when the range is large (e.g. for an entire row or column). More efficient code is substituted. It is possible that we can get even more efficient by deleting the cleared cells rather than setting them to null. However, that needs more research, and there is no reason to delay this fix while I am researching.

When Xlsx Writer encounters a null cell, it writes it to the output file. For cell merges (especially involving whole rows or columns), this results in a lot of useless output. It is changed to skip the output of null cells when (a) the cell style matches its row's style, or (b) the row style is not specified and the cell style matches its column's style.
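
As a rough illustration of why clearing a whole-row merge can be cheap, the sketch below walks only the cells that already exist in a sparse store instead of visiting all 16384 coordinates in `A1:XFD1`. The array-based store and the functions `columnIndex` and `clearMergedCells` are assumptions made for this example; they are not PhpSpreadsheet's internals, and the actual change in #2504 may differ.

```php
<?php

// Hypothetical sketch: the sheet is modelled as a sparse array of
// coordinate => value. None of this is PhpSpreadsheet's real code.

// Convert column letters to a 1-based index ("A" => 1, "XFD" => 16384).
function columnIndex(string $letters): int
{
    $index = 0;
    foreach (str_split(strtoupper($letters)) as $ch) {
        $index = $index * 26 + (ord($ch) - ord('A') + 1);
    }
    return $index;
}

// Set every existing cell inside $range to null, except the top-left anchor.
function clearMergedCells(array $cells, string $range): array
{
    if (preg_match('/^([A-Z]+)(\d+):([A-Z]+)(\d+)$/', $range, $m) !== 1) {
        throw new InvalidArgumentException("Unsupported range: $range");
    }
    $minCol = columnIndex($m[1]);
    $minRow = (int) $m[2];
    $maxCol = columnIndex($m[3]);
    $maxRow = (int) $m[4];
    $anchor = $m[1] . $m[2];

    // Cost is proportional to the number of stored cells, not the range size.
    foreach ($cells as $coordinate => $value) {
        if ($coordinate === $anchor
            || preg_match('/^([A-Z]+)(\d+)$/', (string) $coordinate, $c) !== 1) {
            continue;
        }
        $col = columnIndex($c[1]);
        $row = (int) $c[2];
        if ($col >= $minCol && $col <= $maxCol && $row >= $minRow && $row <= $maxRow) {
            $cells[$coordinate] = null;
        }
    }

    return $cells;
}
```

For a sheet holding only `A1`, `B1`, `XFD1` and `A2`, clearing `A1:XFD1` touches four entries and leaves `A1` and `A2` unchanged.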

* Xlsx Reader Merge Range For Entire Column(s) or Row(s): Fix #2501.
* Scrutinizer: See if these changes appease it.
* Improved CellIterators: Finally figured out how to improve efficiency here, meaning that there is no longer a reason to change Writer/Xlsx, so restore that.
* No Change for CellIterator: I had thought a change was needed for CellIterator, but it isn't.

oleibman mentioned this issue Jan 17, 2022

Xlsx Reader Merge Range For Entire Column(s) or Row(s) #2504

Merged

oleibman closed this as completed in #2504 Jan 23, 2022
