Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Copying data from grid takes a very long time for moderately sized data extracts #25284

Open
tatemian opened this issue Feb 1, 2024 · 5 comments

Comments

@tatemian
Copy link

tatemian commented Feb 1, 2024

Type: Performance Issue

Pretty straightforward. I work with large query returns, often in excess of 100,000 rows. Copying data from the data grid takes a very long time (well in excess of an hour, most often) to copy data to the clipboard, even for small data returns. There is just a small popup in the bottom right that says "Copying..." and there's no way to cancel the action other than running the query again or closing it. Of course, neither help the time it takes copy the data grid.

I'm rather new to ADS, but I don't remember it being like this when I first started using it. I'm currently having to save the data grid to a temp .csv file and copy the data from another program like Excel. Just a slight inconvenience, but it adds up with countless queries throughout the day.

Updated to 1.47.1 today and that did not solve the issue.

Any help is greatly appreciated.

Azure Data Studio version: azuredatastudio 1.47.1 (b6f7beb, 2024-01-10T02:15:02.628Z)
OS version: Windows_NT x64 10.0.19045
Restricted Mode: No
Preview Features: Enabled
Modes:

System Info
Item Value
CPUs Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz (8 x 2112)
GPU Status 2d_canvas: enabled
canvas_oop_rasterization: enabled_on
direct_rendering_display_compositor: disabled_off_ok
gpu_compositing: enabled
multiple_raster_threads: enabled_on
opengl: enabled_on
rasterization: enabled
raw_draw: disabled_off_ok
video_decode: enabled
video_encode: enabled
vulkan: disabled_off
webgl: enabled
webgl2: enabled
webgpu: enabled
Load (avg) undefined
Memory (System) 7.82GB (0.71GB free)
Process Argv
Screen Reader no
VM 0%
Workspace Info
;
Extensions: none
@bwyatt-mcr
Copy link

Yep. My issue #25282 is related to this, but my data sets are far smaller and it still struggles to do these basic tasks.

@kisantia
Copy link
Contributor

kisantia commented Feb 5, 2024

There was a recent change made to speed this up: #25152. Can you try insiders and see if the performance has improved?

@kisantia kisantia added Area - Query Editor Enhancement Request for new features or functionality labels Feb 5, 2024
@kisantia kisantia added this to the February 2024 Release milestone Feb 5, 2024
@tatemian
Copy link
Author

tatemian commented Feb 12, 2024

There was a recent change made to speed this up: #25152. Can you try insiders and see if the performance has improved?

I tried using v.1.48.0-insider. No notable improvement to grid copy speed. Referenced improvement simulated 50,000 rows of 4 columns resulting in ~3 second copy. I recreated that scenario in insiders and attempted grid copy of the data, which took 5 min, 12 seconds.

I did some more testing with my current setup (v.1.48.0-insiders 2024.02.09)

(rows x columns)
25,000 x 4 ... ~02m01s
25,000 x 8 ... ~02m22s
50,000 x 4 ... ~05m12s
50,000 x 8 ... ~10m29s
100,000 x 4 ... ~28m14s
100,000 x 8 ... ~36m31s

Hopefully this give a better idea of what I'm working with. I tried a 150,000 x 30 grid copy (more what I'm used to working with) and after almost two hours of waiting, I cancelled the copy request.

@cheenamalhotra
Copy link
Member

cheenamalhotra commented Feb 15, 2024

Can you give an example of what datatypes you're copying here? Sample data would be great too!
That would help us extrapolate data and reproduce the issue..

@tatemian
Copy link
Author

Can you give an example of what datatypes you're copying here? Sample data would be great too! That would help us extrapolate data and reproduce the issue..

Unfortunately, I cannot supply sample data due to the nature of my work, but here are the data types for the fields I used in the above test:

  1. [int, not null]

  2. [datetime, null]

  3. [char(1), null]

  4. [char(14), null]

  5. [varchar(64), null]

  6. [varchar(64), null]

  7. [varchar(64), null]

  8. [char(25), null]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Needs triage
Development

No branches or pull requests

5 participants