Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Elasticsearch storage usage to df.info() #321

Merged
merged 1 commit into from Nov 16, 2020

Conversation

V1NAY8
Copy link
Contributor

@V1NAY8 V1NAY8 commented Nov 7, 2020

  • Added the Elasticsearch Storage Usage to df.info()

Example:

>>> ed_df = ed.DataFrame('localhost', 'flights')
>>> ed_df.info()
<class 'eland.dataframe.DataFrame'>
Index: 13059 entries, 0 to 13058
Data columns (total 27 columns):
 #   Column              Non-Null Count  Dtype         
---  ------              --------------  -----         
 0   AvgTicketPrice      13059 non-null  float64       
 1   Cancelled           13059 non-null  bool          
 2   Carrier             13059 non-null  object        
 3   Dest                13059 non-null  object        
 4   DestAirportID       13059 non-null  object        
 5   DestCityName        13059 non-null  object        
 6   DestCountry         13059 non-null  object        
 7   DestLocation        13059 non-null  object        
 8   DestRegion          13059 non-null  object        
 9   DestWeather         13059 non-null  object        
 10  DistanceKilometers  13059 non-null  float64       
 11  DistanceMiles       13059 non-null  float64       
 12  FlightDelay         13059 non-null  bool          
 13  FlightDelayMin      13059 non-null  int64         
 14  FlightDelayType     13059 non-null  object        
 15  FlightNum           13059 non-null  object        
 16  FlightTimeHour      13059 non-null  float64       
 17  FlightTimeMin       13059 non-null  float64       
 18  Origin              13059 non-null  object        
 19  OriginAirportID     13059 non-null  object        
 20  OriginCityName      13059 non-null  object        
 21  OriginCountry       13059 non-null  object        
 22  OriginLocation      13059 non-null  object        
 23  OriginRegion        13059 non-null  object        
 24  OriginWeather       13059 non-null  object        
 25  dayOfWeek           13059 non-null  int64         
 26  timestamp           13059 non-null  datetime64[ns]
dtypes: bool(2), datetime64[ns](1), float64(5), int64(2), object(17)
memory usage: 64.000 bytes
Elasticsearch storage usage: 5.043 MB

@sethmlarson Please review 馃槂

@elasticmachine
Copy link

Since this is a community submitted pull request, a Jenkins build has not been kicked off automatically. Can an Elastic organization member please verify the contents of this patch and then kick off a build manually?

Copy link
Contributor

@sethmlarson sethmlarson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@sethmlarson
Copy link
Contributor

jenkins test this please

@sethmlarson
Copy link
Contributor

I don't think the failure is related to your changes, going to merge!

@sethmlarson sethmlarson merged commit 56f6ba6 into elastic:master Nov 16, 2020
@V1NAY8
Copy link
Contributor Author

V1NAY8 commented Nov 16, 2020

Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants