Defered data for download button #5053

Vinno97 · 2022-07-28T12:38:04Z

Problem

The download button currently expects its data to be available when declaring the button. If data needs to be read from disk (or worse: compiled multiple disk sources), this can make the app needlessly slow.
In my app, the data downloading is not a common use case, but the packing of the data for downloading is relatively expensive. Caching helps, but only when the data doesn't change.

Solution

I propose a method to only load and preprocess (archive, pickle, etc) when the download is actually requested.

I propose to also allow a function as a data type that gets called as soon as the download button is pressed. This callback then returns the actual data.

def get_data():
    data = some_heavy_data_loading()
    return data

st.download_button("Download Data", get_data, file_name="my-data.dat")

Possible additions:

Currently a download button accepts str, bytes, TextIO, BinaryIO, or io.RawIOBase. With deferred loading, it would also be possible to accept a file pointer and stream the data to the user. This might bring huge speed and memory benefits when downloading large files.

Technically this streaming would also be possible without deferred loading, but then you're keeping unnecessary files open.

Community voting on feature requests enables the Streamlit team to understand which features are most important to our users.

If you'd like the Streamlit team to prioritize this feature request, please use the 👍 (thumbs up emoji) reaction in response to the initial post.

The text was updated successfully, but these errors were encountered:

LukasMasuch · 2022-07-28T14:47:01Z

@Vinno97 Thanks for the suggestion. This would be indeed a nice addition to the download button, especially when dealing with large files. I will forward this feature request to our product team.

tomgallagher · 2022-07-29T05:36:47Z

In the meantime, I'm using this as a way of ensuring that page flow is not interrupted by large file prep

def customDownloadButton(df):
    if st.button('Prepare downloads'):
        #prep data for downloading
        csv = convert_df(df)
        json_lines = convert_json(df)
        parquet = convert_parquet(df)
        tab1, tab2, tab3 = st.tabs(["Convert to CSV", "Convert to JSON", "Convert to Parquet"])
        with tab1:
            st.download_button('Download', csv, file_name='data.csv')
        with tab2:
            st.download_button('Download', json_lines, file_name='data.json')
        with tab3:
            st.download_button('Download', parquet, file_name='data.parquet')

jrieke · 2022-07-30T00:08:27Z

Yes agree! Back when we implemented download button, I know that we also thought about allowing users to pass a function. Not sure if we cut that just to reduce scope or if there were any reasons against doing that. Will revisit!

xR86 · 2022-08-29T14:55:59Z

I also had this issue, but it appears that it does approximately what you proposed, @Vinno97 ?
The docs mention that you could have a callback for this.

Not sure if I'm missing some nuance with blocking when downloading large files, but I've already used this for data to be generated on click, regardless if it's data files or octet streams to be saved as files (eg: zip).

Lifted from the docs:

@st.cache
 def convert_df(df):
     # IMPORTANT: Cache the conversion to prevent computation on every rerun
     return df.to_csv().encode('utf-8')

csv = convert_df(my_large_df)

st.download_button(
     label="Download data as CSV",
     data=csv,
     file_name='large_df.csv',
     mime='text/csv',
 )

@jrieke Was this functionality added in the meantime and not linked to this issue ?

jrieke · 2022-09-23T22:59:36Z

Nope we didn't implement this yet. We don't have a timeline yet but I'm 99 % sure we want to do this at some point.

amirhessam88 · 2022-12-26T02:44:38Z

Any progress on this ? Do we have an ETA when this bug is gonna be fixed?

wolfgang-koch · 2023-01-09T11:49:14Z

I would appreciate if this gets resolved. I already tried to address this issue on the forum a couple months ago: https://discuss.streamlit.io/t/create-download-file-upon-clicking-a-button/32613
My idea was to solve this using some JS, but it's messy and causes some slight shifting down of the page content.

In my opinion, st.download_button should only fill memory with the file's content upon acutally clicking the button instead of on every script re-run.

jzluo · 2023-01-19T03:11:29Z

I'd also like to voice appreciation this feature. I finally tracked down my app's occasional hanging to this issue. In the meantime, gating the download button behind a "prepare data for download" button like @tomgallagher's example above is a clumsy but okay workaround.

HStep20 · 2023-01-28T16:13:33Z

This would be a great feature. I know its highly requested, but when working with APIs, the lack of this feature makes it a miserable experience. It has to hit the API each time the page is reloaded to prep the download, meaning lots of requests within a quota are used up. Its even worse if you have multiple tabs on a page, each of which download a different dataset for the user - It means x api calls per page load, per tab, each time the script is rerun.

Ive mitigated it by using a nested button like tom suggested, to 'get' data, then show the download button to download it, but a proper way to combine both into one UX Action would be amazing.

masonearles · 2023-03-14T03:41:51Z

+1

ElenaBossolini · 2023-08-12T19:32:53Z

Same problem here. In my case I need to generate a excel file from multiple large pandas dataframes (one dataframe per sheet). I write the data as BytesIO.
The experience is that going from a pandas dataframes to a BytesIO buffer takes about 0.003s, but on the streamlit app, the user is left hanging for multiple seconds. Something between 5s and 10s.

SabraHealthCare · 2023-10-28T12:31:35Z

def get_data():
    data = some_heavy_data_loading()
    return data

st.download_button("Download Data", get_data, file_name="my-data.dat")

def get_data():
st.write("test")
data = some_heavy_data_loading()
return data

I added 'st.write("test")' in get_data, and found that "test"was printed before download_button. it means the get_data() still runs even download button is un-clicked.

andrewpimm · 2023-10-30T10:16:26Z

def get_data():
    data = some_heavy_data_loading()
    return data

st.download_button("Download Data", get_data, file_name="my-data.dat")
def get_data(): st.write("test") data = some_heavy_data_loading() return data

I added 'st.write("test")' in get_data, and found that "test"was printed before download_button. it means the get_data() still runs even download button is un-clicked.

Unless there has been an update that hasn't been announced here, I'm not sure that a function can be called from st.download_button in this way.

jsulopzs · 2023-12-13T09:25:25Z

+1 to this feature, it'd be great for developers to create custom calculators that provide business value and a rich UX.

CharlesFr · 2023-12-17T20:33:04Z

any updates on this feature?

ViniciusgCaetano · 2024-01-06T15:03:52Z

+1

zbjdonald · 2024-01-11T08:13:58Z

any updates on this feature?

LarsHill · 2024-01-13T14:18:12Z

I came across this issue as well. Besides large data payloads being created on every run, it is annoying that there is no way to create the data only after the download button is clicked.
In my case the raw data to be downloaded is created and stored as session state "after" the position of the download button in the code. Now when I click the download button the previously created data state is downloaded but not the current state.

Here is an example:

create_data = st.button("Create data")

if "data" not in st.session_state:
    st.session_state.data = None

st.download_button(
    label="Export",
    data=st.session_state.data,
    file_name=file_name,
)

if create_data:
    # logic to create data here
    st.session_state.data = create_data_logic()

Now, first I click on the "creata data" button and afterwards I click the "download" button but only None is downloaded.
Only on an app rerun the session state available to the download button is updated and the correct data is downloaded.

If the data creation process could happen in a callback after the download button is clicked, there would be no issue...

Currently this workaround does the job for me, but I feel this should be natively possible in strewamlit without js hacks...

anki-code · 2024-02-08T16:03:36Z

Personally I want to say that Streamlit is very unpleasant for new users and I need to google every step and I continuously facing with issues with use cases. And yes, I want to +1 this bug too because when I want to download the data I want to click on the button, wait processing and get the data.

sfc-gh-pkommini · 2024-02-27T01:51:17Z

Hi Team,

We currently have the same issue and makes st.download_button unusable in production. Is there a workaround till the callback function is added? Also is there an ETA for the data callback being added?

goyodiaz · 2024-02-27T17:12:26Z

@sfc-gh-pkommini The only workaround I ever found is using two buttons as posted above.

BenGravell · 2024-03-10T12:50:24Z

+1 on this issue.

A super basic use-case is offering users a download of PNG images. This is a typical desire of a user if you want "archival quality" and are willing to eat the storage size - forcing people into JPEG all the time is not nice. PNG being mostly uncompressed means the filesize / data payload is going to be higher. Even moderately large PNG of dims 3072 x 4096 ends up being ~26 MB, which is totally feasible for generating in-memory and offering for one-off downloads. The ask is just to defer the costly serialization operations until the user actually clicks the download button, rather than having to do it every time just to display a download button. The workaround is too fiddly and requires too much ad-hoc state management to really be called a solution IMO.

iandesj · 2024-04-03T17:40:09Z

My team encountered this bug when apps are deployed in replicas to something like Kubernetes.

Vinno97 added type:enhancement Requests for feature enhancements or new features status:needs-triage Has not been triaged by the Streamlit team labels Jul 28, 2022

LukasMasuch added feature:st.download_button and removed status:needs-triage Has not been triaged by the Streamlit team labels Jul 28, 2022

kajarenc mentioned this issue Nov 8, 2022

[WIP] Add two-step process for download data, should fix issue #5586 #5639

Closed

9 tasks

carolinedlu added status:in-progress We're on it! added-voting-callout and removed added-voting-callout labels Nov 10, 2022

MathCatsAnd mentioned this issue Jan 8, 2023

st.download_button: Create downloadable data only when button is clicked. #5899

Closed

kmcgrady mentioned this issue Jan 18, 2024

Convert and download markdown text as a pdf report #7894

Closed

4 tasks

chrieke mentioned this issue Mar 15, 2024

Export image chrieke/prettymapp#12

Open

jrieke added status:likely Will probably implement but no timeline yet and removed status:in-progress We're on it! labels Apr 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Defered data for download button #5053

Defered data for download button #5053

Vinno97 commented Jul 28, 2022 •

edited by carolinedlu

LukasMasuch commented Jul 28, 2022

tomgallagher commented Jul 29, 2022

jrieke commented Jul 30, 2022

xR86 commented Aug 29, 2022

jrieke commented Sep 23, 2022

amirhessam88 commented Dec 26, 2022

wolfgang-koch commented Jan 9, 2023

jzluo commented Jan 19, 2023

HStep20 commented Jan 28, 2023

masonearles commented Mar 14, 2023

ElenaBossolini commented Aug 12, 2023

SabraHealthCare commented Oct 28, 2023

andrewpimm commented Oct 30, 2023 •

edited

jsulopzs commented Dec 13, 2023

CharlesFr commented Dec 17, 2023

ViniciusgCaetano commented Jan 6, 2024

zbjdonald commented Jan 11, 2024

LarsHill commented Jan 13, 2024

anki-code commented Feb 8, 2024 •

edited

sfc-gh-pkommini commented Feb 27, 2024 •

edited

goyodiaz commented Feb 27, 2024

BenGravell commented Mar 10, 2024

iandesj commented Apr 3, 2024

Defered data for download button #5053

Defered data for download button #5053

Comments

Vinno97 commented Jul 28, 2022 • edited by carolinedlu

Problem

Solution

LukasMasuch commented Jul 28, 2022

tomgallagher commented Jul 29, 2022

jrieke commented Jul 30, 2022

xR86 commented Aug 29, 2022

jrieke commented Sep 23, 2022

amirhessam88 commented Dec 26, 2022

wolfgang-koch commented Jan 9, 2023

jzluo commented Jan 19, 2023

HStep20 commented Jan 28, 2023

masonearles commented Mar 14, 2023

ElenaBossolini commented Aug 12, 2023

SabraHealthCare commented Oct 28, 2023

andrewpimm commented Oct 30, 2023 • edited

jsulopzs commented Dec 13, 2023

CharlesFr commented Dec 17, 2023

ViniciusgCaetano commented Jan 6, 2024

zbjdonald commented Jan 11, 2024

LarsHill commented Jan 13, 2024

anki-code commented Feb 8, 2024 • edited

sfc-gh-pkommini commented Feb 27, 2024 • edited

goyodiaz commented Feb 27, 2024

BenGravell commented Mar 10, 2024

iandesj commented Apr 3, 2024

Vinno97 commented Jul 28, 2022 •

edited by carolinedlu

andrewpimm commented Oct 30, 2023 •

edited

anki-code commented Feb 8, 2024 •

edited

sfc-gh-pkommini commented Feb 27, 2024 •

edited