Missing selection statistics for gen2 #964

Closed
thelazydogsback opened this issue Dec 5, 2018 · 9 comments
Labels
💡 feature request New feature or request ✅ merged A fix for this issue has been merged

@thelazydogsback

Storage Explorer Version: 1.6.0
Platform/OS Version: win10 x64

It would be great to have the following when gen2 stats are supported:

  • Total number of files
  • Total number of directories
  • Max directory depth
  • Total size in bytes
  • Min/Avg/Max # of files per directory
  • Min/Avg/Max bytes per file
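
As a rough illustration of the statistics listed above, here is a minimal sketch that computes them from a flat file listing. It is not how Storage Explorer works; the function name `summarize` and the input shape (pairs of relative path and size in bytes) are assumptions made for this example. Directories are inferred from file paths, so empty directories are not counted, and files sitting directly at the selection root count toward the totals but not toward the per-directory figures.

```python
from collections import Counter
from pathlib import PurePosixPath
from statistics import mean


def summarize(files):
    """Summarize a selection given (relative_path, size_in_bytes) pairs, one per file.

    Hypothetical helper for illustration only.
    """
    files = [(path.strip("/"), size) for path, size in files]
    all_dirs = set()           # every directory that appears in some file's path
    files_per_dir = Counter()  # direct (non-recursive) file count per directory

    for path, _ in files:
        # Parents of "a/b/x" are "a/b", "a" and "."; drop "." (the selection root).
        parents = [str(p) for p in PurePosixPath(path).parents if str(p) != "."]
        all_dirs.update(parents)
        if parents:
            files_per_dir[parents[0]] += 1

    # Counter returns 0 for directories that hold only subdirectories.
    per_dir = [files_per_dir[d] for d in all_dirs]
    sizes = [size for _, size in files]

    def min_avg_max(values):
        return (min(values), mean(values), max(values)) if values else None

    return {
        "total_files": len(files),
        "total_directories": len(all_dirs),
        "max_directory_depth": max(
            (len(PurePosixPath(d).parts) for d in all_dirs), default=0
        ),
        "total_size_bytes": sum(sizes),
        "files_per_directory": min_avg_max(per_dir),
        "bytes_per_file": min_avg_max(sizes),
    }
```

Calling `summarize([("a/b/x.parquet", 100), ("a/c/z.parquet", 200)])` returns a dictionary with one entry per requested statistic; the min/avg/max entries are tuples, or `None` when there is nothing to measure.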
@MRayermannMSFT MRayermannMSFT added the 💡 feature request New feature or request label Dec 6, 2018
@MRayermannMSFT MRayermannMSFT added this to the 1.14.0 milestone Aug 5, 2019
@MRayermannMSFT MRayermannMSFT added this to Committed in Storage Explorer via automation Aug 19, 2019
@samuda2019

This is a basic feature missing in ADLS Gen2. It is very important to have this enabled. Please let us know the ETA on when it would be enabled.

@thelazydogsback
Author

To underscore why this is needed so badly, especially here: when using Spark/Databricks it's pretty easy to get your partitioning wrong, leading to horrible performance. For example, one can easily end up with 100,000 directories across 4 levels and only one or two files in each directory, rather than, let's say, 1,000 directories across two levels with a few hundred files per directory. If the number of directories is within an order of magnitude of the number of files, it can be a red flag.
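
The red-flag rule of thumb above can be sketched as a one-line check. The function name `partitioning_red_flag` and the default `factor=10` (one reading of "within an order of magnitude") are assumptions for this example, not part of any tool.

```python
def partitioning_red_flag(num_files, num_dirs, factor=10):
    """Return True when the directory count is within `factor`x of the file count.

    Hypothetical heuristic: factor=10 interprets "an order of magnitude".
    """
    if num_dirs == 0:
        return False  # nothing to flag when there are no directories
    return num_files < factor * num_dirs
```

With the numbers from the comment, 100,000 directories holding about 150,000 files in total would be flagged, while 1,000 directories holding about 300,000 files would not.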

Even better than just a static view would be to emit events that contain these stats over time.

@samuda2019

@thelazydogsback I couldn't agree more.

@MRayermannMSFT MRayermannMSFT modified the milestones: 1.14.0, 1.15.0 May 27, 2020
@MRayermannMSFT MRayermannMSFT modified the milestones: 1.15.0, 1.16.0 Jun 15, 2020
@Robert-Kostecki

Robert-Kostecki commented Sep 1, 2020

Hi, this "feature request" (or bug, depending on how you read it) has been open for nearly 21 months now. Is there any plan for these to be added anytime soon, please?

Alternatively, are you planning to fix "Azure Storage Explorer Folder Statistics missing in ADLS Gen2" (https://github.com/microsoft/AzureStorageExplorer/issues/1349), which was closed as a duplicate?

Can we have an update on the timeline please?

@JasonYeMSFT JasonYeMSFT modified the milestones: 1.16.0, 1.17.0 Sep 3, 2020
@dazzag24

This is a key missing feature.

@samuda2019

Any updates on this feature?

@MikeWedderburn-Clarke

+1

@mortenf

mortenf commented Oct 26, 2020

+1

@MRayermannMSFT MRayermannMSFT self-assigned this Oct 28, 2020
@MRayermannMSFT MRayermannMSFT moved this from Committed to In Progress in Storage Explorer Oct 29, 2020
@MRayermannMSFT
Member

Selection and folder statistics for Gen2 are going to be included in 1.17. The feature has parity with the existing normal blob experience; that is, it shows the number of active blobs and their total size in bytes.

Storage Explorer automation moved this from In Progress to Done Dec 8, 2020
@MRayermannMSFT MRayermannMSFT added the ✅ merged A fix for this issue has been merged label Dec 8, 2020