Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

XML streaming to handle large Excel files #3

Closed
felipenoris opened this issue Apr 19, 2018 · 5 comments
Closed

XML streaming to handle large Excel files #3

felipenoris opened this issue Apr 19, 2018 · 5 comments
Assignees

Comments

@felipenoris
Copy link
Owner

felipenoris commented Apr 19, 2018

One drawback of the current implementation for this package is that the whole XML file from a Excel package are parsed in the moment the file is read at XML.read("filename.xlsx") worksheet is parsed when accessing any information of the worksheet.

One idea is to stream XML for worksheet XML files.

I found that EzXML.jl has support for streaming XML files.

@stugorf
Copy link

stugorf commented Feb 28, 2019

@felipenoris I want to read an Excel file from an AWS S3 bucket without downloading it. It looks like this feature of ExcelFiles should work. This is how I normally read a file from a stream. Can you clarify how to read a stream using the XML Streaming feature:

file_content = S3.get_object(aws, get_config);
load(file_content, "Sheet1") |> DataFrame

Thank you.

@felipenoris
Copy link
Owner Author

felipenoris commented Feb 28, 2019

Hi! For now it is not possible to read an excel file from stream. We would need #26 to be implemented.

@stugorf
Copy link

stugorf commented Feb 28, 2019

Thank you. Do you know of any packages that can read from stream currently?

@felipenoris
Copy link
Owner Author

I think there isn't. But I'll try to work on #26 though.

@stugorf
Copy link

stugorf commented Mar 1, 2019 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants