###**Vaex: Reading and Processing Huge Datasets in Seconds**
What is Vaex?

Vaex is a high-performance Python library for lazy, out-of-core DataFrames (similar to pandas), built to visualize and explore big tabular datasets. It calculates statistics such as mean, sum, count, and standard deviation on an N-dimensional grid at more than a billion (10^9) samples/rows per second. Visualization is done using histograms, density plots, and 3D volume rendering, allowing interactive exploration of big data. Vaex uses memory mapping, a zero-memory-copy policy, and lazy computations for best performance (no memory wasted).
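
To make "statistics on an N-dimensional grid" more concrete, here is a minimal pure-Python sketch of the one-dimensional case: the mean of one column, binned by another, computed in a single streaming pass. It only illustrates the idea; `binned_mean` and its parameters are hypothetical names chosen for this example and are not Vaex's API or implementation.

```python
from typing import Iterable, List, Optional, Tuple


def binned_mean(
    pairs: Iterable[Tuple[float, float]],
    lo: float,
    hi: float,
    n_bins: int,
) -> List[Optional[float]]:
    """Mean of y in each of n_bins equal-width bins of x over [lo, hi)."""
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for x, y in pairs:
        if lo <= x < hi:
            # Clamp to guard against floating-point rounding at the upper edge.
            i = min(int((x - lo) / width), n_bins - 1)
            sums[i] += y
            counts[i] += 1
    # Empty bins have no mean, so they are reported as None.
    return [s / c if c else None for s, c in zip(sums, counts)]


# Hypothetical (x, y) rows; only the per-bin accumulators are kept in memory.
rows = [(5, 10.0), (15, 20.0), (17, 40.0), (95, 1.0)]
grid = binned_mean(rows, lo=0, hi=100, n_bins=10)
```

Because the input is consumed as an iterable, the same function works on a stream of rows that never fits in memory at once, which is the property the paragraph above describes.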

In [10]:
#!pip install vaex

Collecting vaex
  Downloading vaex-3.0.0-py3-none-any.whl (3.9 kB)
Collecting vaex-arrow<0.6,>=0.5.0
  Downloading vaex_arrow-0.5.1-py3-none-any.whl (7.3 kB)
Collecting pyarrow>=0.15
  Downloading pyarrow-2.0.0-cp36-cp36m-manylinux2014_x86_64.whl (17.7 MB)
Collecting vaex-astro<0.8,>=0.7.0
  Downloading vaex_astro-0.7.0-py3-none-any.whl (10 kB)
Collecting vaex-core<3,>=2.0.0
  Downloading vaex_core-2.0.3-cp36-cp36m-manylinux2010_x86_64.whl (2.4 MB)
Collecting vaex-hdf5<0.7,>=0.6.0
  Downloading vaex_hdf5-0.6.0-py3-none-any.whl (11 kB)
Collecting s3fs<0.3
  Downloading s3fs-0.2.2.tar.gz (47 kB)
Collecting boto3>=1.9.91
  Downloading boto3-1.16.37-py2.py3-none-any.whl (130 kB)
Collecting botocore>=1.12.91
  Downloading botocore-1.19

In [2]:
import vaex
import pandas as pd
import numpy as np
n_rows = 1000000
n_cols = 500
df = pd.DataFrame(np.random.randint(0, 100, size=(n_rows, n_cols)), columns=['col%d' % i for i in range(n_cols)])
df.head()

Unnamed: 0,col0,col1,col2,col3,col4,col5,col6,col7,col8,col9,col10,col11,col12,col13,col14,col15,col16,col17,col18,col19,col20,col21,col22,col23,col24,col25,col26,col27,col28,col29,col30,col31,col32,col33,col34,col35,col36,col37,col38,col39,...,col460,col461,col462,col463,col464,col465,col466,col467,col468,col469,col470,col471,col472,col473,col474,col475,col476,col477,col478,col479,col480,col481,col482,col483,col484,col485,col486,col487,col488,col489,col490,col491,col492,col493,col494,col495,col496,col497,col498,col499
0,43,97,6,79,23,35,89,46,93,26,36,36,85,38,14,72,62,27,43,50,29,4,0,31,26,43,87,77,33,32,90,80,21,35,99,89,34,6,45,13,...,7,93,71,57,62,81,12,57,90,34,89,97,86,89,9,80,56,53,1,35,84,28,71,55,2,90,59,60,17,37,15,11,58,4,86,85,87,39,31,5
1,7,63,32,3,44,11,57,23,77,46,2,0,53,46,81,62,20,21,67,63,41,22,7,49,55,44,62,86,74,64,65,28,37,36,0,42,80,76,99,98,...,0,62,15,40,43,35,92,98,3,39,46,81,79,32,59,62,74,23,16,67,74,77,7,11,19,24,59,88,85,29,76,24,54,80,34,97,38,29,11,60
2,23,63,66,29,27,88,61,83,1,96,8,47,72,51,93,7,80,36,64,92,90,25,21,88,53,18,51,1,22,85,17,29,83,80,3,24,1,54,25,90,...,2,33,22,45,13,5,28,13,87,56,28,90,61,0,91,17,51,23,99,78,31,93,61,21,3,35,14,79,98,89,17,18,29,3,4,94,61,9,74,73
3,81,50,11,2,99,64,6,61,35,25,86,21,98,21,55,15,88,82,19,51,73,0,68,42,6,98,73,45,99,32,69,41,15,49,52,16,59,64,96,31,...,60,23,46,76,76,66,60,46,75,75,27,4,66,22,0,93,45,31,60,83,16,28,46,6,64,79,67,6,91,21,59,28,11,98,5,22,67,56,55,74
4,24,53,72,20,1,62,28,43,21,76,65,75,0,41,3,49,81,24,74,81,60,50,46,53,98,55,45,84,19,64,68,25,61,94,26,63,83,81,38,71,...,23,45,97,56,89,57,5,4,48,14,66,34,12,15,12,64,70,33,88,6,92,61,65,19,61,30,88,14,10,64,46,59,3,49,35,3,33,78,73,34


In [3]:
df.info(memory_usage='deep')

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1000000 entries, 0 to 999999
Columns: 500 entries, col0 to col499
dtypes: int64(500)
memory usage: 3.7 GB
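
As a sanity check on the figure above, the size can be worked out by hand: one million rows times 500 columns of 8-byte `int64` values. The short calculation below assumes the reported "GB" is in binary units (1024^3 bytes) and ignores the small index overhead.

```python
n_rows = 1_000_000
n_cols = 500
bytes_per_value = 8  # each int64 value occupies 8 bytes

total_bytes = n_rows * n_cols * bytes_per_value
total_gib = total_bytes / 1024**3  # binary gigabytes (assumed unit)

print(f"{total_bytes:,} bytes = {total_gib:.1f} GiB")
```

So the whole DataFrame has to sit in RAM as roughly 4 billion bytes, which is the cost the rest of this notebook tries to avoid.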


####**Creating CSV files**

In [4]:
file_path = 'final_data.csv'
df.to_csv(file_path, index=False)

####**Creating HDF5 files**

In [5]:
vaex_df = vaex.from_csv(file_path, convert=True, chunk_size=5_000_000)

In [6]:
type(vaex_df)

vaex.hdf5.dataset.Hdf5MemoryMapped
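
The `convert=True, chunk_size=...` call above reads the CSV piece by piece and writes a binary file next to it. The sketch below shows the general idea of chunked conversion using only the standard library. It is a conceptual illustration, not how Vaex implements it: `convert_csv_in_chunks`, the flat `.bin` output format, and the tiny demo data are all assumptions made for this example.

```python
import csv
import itertools
import os
import struct
import tempfile


def convert_csv_in_chunks(csv_path: str, out_path: str, chunk_size: int) -> int:
    """Stream an all-integer CSV into a flat int64 binary file, chunk by chunk.

    Returns the number of data rows written. At most `chunk_size` rows are
    held in memory at any time.
    """
    n_rows = 0
    with open(csv_path, newline="") as src, open(out_path, "wb") as dst:
        reader = csv.reader(src)
        next(reader)  # skip the header row
        while True:
            chunk = list(itertools.islice(reader, chunk_size))
            if not chunk:
                break
            for row in chunk:
                # Pack each row as little-endian 64-bit integers.
                dst.write(struct.pack(f"<{len(row)}q", *map(int, row)))
            n_rows += len(chunk)
    return n_rows


# Tiny demonstration on a temporary file (hypothetical data).
tmp_dir = tempfile.mkdtemp()
csv_path = os.path.join(tmp_dir, "tiny.csv")
bin_path = csv_path + ".bin"
with open(csv_path, "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["col0", "col1"])
    writer.writerows([[1, 2], [3, 4], [5, 6]])

rows_written = convert_csv_in_chunks(csv_path, bin_path, chunk_size=2)
```

The conversion is paid once; later sessions can open the binary file directly instead of parsing text again.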

####**Reading HDF5 files using the Vaex library**

In [7]:
vaex_df = vaex.open('final_data.csv.hdf5')

In [8]:
type(vaex_df)

vaex.hdf5.dataset.Hdf5MemoryMapped

In [9]:
vaex_df.head()

#,col0,col1,col2,col3,col4,col5,col6,col7,col8,col9,col10,col11,col12,col13,col14,col15,col16,col17,col18,col19,col20,col21,col22,col23,col24,col25,col26,col27,col28,col29,col30,col31,col32,col33,col34,col35,col36,col37,col38,col39,col40,col41,col42,col43,col44,col45,col46,col47,col48,col49,col50,col51,col52,col53,col54,col55,col56,col57,col58,col59,col60,col61,col62,col63,col64,col65,col66,col67,col68,col69,col70,col71,col72,col73,col74,col75,col76,col77,col78,col79,col80,col81,col82,col83,col84,col85,col86,col87,col88,col89,col90,col91,col92,col93,col94,col95,col96,col97,col98,col99,col100,col101,col102,col103,col104,col105,col106,col107,col108,col109,col110,col111,col112,col113,col114,col115,col116,col117,col118,col119,col120,col121,col122,col123,col124,col125,col126,col127,col128,col129,col130,col131,col132,col133,col134,col135,col136,col137,col138,col139,col140,col141,col142,col143,col144,col145,col146,col147,col148,col149,col150,col151,col152,col153,col154,col155,col156,col157,col158,col159,col160,col161,col162,col163,col164,col165,col166,col167,col168,col169,col170,col171,col172,col173,col174,col175,col176,col177,col178,col179,col180,col181,col182,col183,col184,col185,col186,col187,col188,col189,col190,col191,col192,col193,col194,col195,col196,col197,col198,col199,col200,col201,col202,col203,col204,col205,col206,col207,col208,col209,col210,col211,col212,col213,col214,col215,col216,col217,col218,col219,col220,col221,col222,col223,col224,col225,col226,col227,col228,col229,col230,col231,col232,col233,col234,col235,col236,col237,col238,col239,col240,col241,col242,col243,col244,col245,col246,col247,col248,col249,col250,col251,col252,col253,col254,col255,col256,col257,col258,col259,col260,col261,col262,col263,col264,col265,col266,col267,col268,col269,col270,col271,col272,col273,col274,col275,col276,col277,col278,col279,col280,col281,col282,col283,col284,col285,col286,col287,col288,col289,col290,col291,col292,col293,col294,col295,col296,col297,col298,col299,col300,col301,col302,col303,col304,col305,col306,col307,col308,col309,col310,col311,col312,col313,col314,col315,col316,col317,col318,col319,col320,col321,col322,col323,col324,col325,col326,col327,col328,col329,col330,col331,col332,col333,col334,col335,col336,col337,col338,col339,col340,col341,col342,col343,col344,col345,col346,col347,col348,col349,col350,col351,col352,col353,col354,col355,col356,col357,col358,col359,col360,col361,col362,col363,col364,col365,col366,col367,col368,col369,col370,col371,col372,col373,col374,col375,col376,col377,col378,col379,col380,col381,col382,col383,col384,col385,col386,col387,col388,col389,col390,col391,col392,col393,col394,col395,col396,col397,col398,col399,col400,col401,col402,col403,col404,col405,col406,col407,col408,col409,col410,col411,col412,col413,col414,col415,col416,col417,col418,col419,col420,col421,col422,col423,col424,col425,col426,col427,col428,col429,col430,col431,col432,col433,col434,col435,col436,col437,col438,col439,col440,col441,col442,col443,col444,col445,col446,col447,col448,col449,col450,col451,col452,col453,col454,col455,col456,col457,col458,col459,col460,col461,col462,col463,col464,col465,col466,col467,col468,col469,col470,col471,col472,col473,col474,col475,col476,col477,col478,col479,col480,col481,col482,col483,col484,col485,col486,col487,col488,col489,col490,col491,col492,col493,col494,col495,col496,col497,col498,col499
0,16,97,12,80,50,75,83,2,33,71,36,76,56,71,8,32,31,82,27,95,34,9,88,3,92,16,87,29,51,44,78,6,96,6,50,84,3,40,65,71,33,93,84,31,51,50,70,26,93,33,89,55,0,52,91,97,51,89,6,41,62,48,24,51,83,12,91,17,51,41,98,56,5,27,79,8,29,98,44,81,64,5,11,49,5,38,90,23,62,69,22,63,26,7,69,58,99,62,18,67,55,53,17,78,20,86,72,52,0,70,25,89,85,60,52,56,19,29,42,26,40,57,11,10,41,41,87,24,88,79,65,50,84,99,93,34,34,25,78,2,94,89,57,13,51,34,15,87,86,46,17,45,75,65,65,0,88,18,83,77,16,86,34,21,84,11,62,48,28,87,75,97,80,73,74,68,27,96,75,93,68,90,64,93,73,88,38,95,76,13,20,9,86,2,30,30,21,73,43,59,99,18,6,32,67,9,50,94,1,48,87,82,81,37,91,87,69,19,60,79,14,51,91,17,54,49,81,20,46,2,7,49,69,71,73,45,44,5,41,31,13,90,26,57,45,36,82,91,70,9,47,4,73,21,6,78,73,33,15,5,44,62,21,98,23,28,26,55,94,73,29,83,67,16,35,19,58,17,21,67,93,42,92,29,18,66,17,79,55,94,31,88,59,67,57,10,79,32,85,67,47,55,32,97,53,13,17,93,47,26,43,66,31,54,34,51,55,17,23,14,27,89,93,51,61,39,84,80,34,1,17,8,53,12,12,54,35,38,48,28,4,33,52,83,81,63,47,78,43,59,34,85,10,98,50,72,71,59,48,24,23,38,14,62,94,58,47,26,72,97,5,69,64,18,81,58,10,14,0,59,66,12,21,64,24,16,32,70,89,65,9,82,90,55,76,96,8,32,12,32,76,48,81,8,12,70,18,69,39,37,91,1,68,29,28,31,3,70,52,49,78,30,3,34,56,16,30,38,75,8,82,17,34,13,83,39,77,50,10,42,44,37,41,23,56,33,8,90,91,88,28,11,67,65,95,34,26,35,82,35,97,51,47,34,43,10,79,28,88,16,9,97,48,45,99,10,80,29,61,11,94,38,4,49,26,61,71,13,6,67,14,27,24,58,83,40,31,58,82,26
1,26,64,93,1,90,44,7,28,64,49,30,47,13,11,86,28,34,33,35,32,48,47,20,8,36,45,16,56,40,57,71,42,58,65,86,5,6,88,8,88,54,70,89,7,11,85,63,69,76,72,40,37,34,26,43,95,9,87,30,16,84,67,26,17,25,62,14,87,68,81,12,58,17,83,6,60,41,7,77,23,71,74,76,7,0,56,32,29,31,25,24,6,84,85,16,96,9,99,35,61,17,26,88,62,8,21,60,84,97,3,60,16,81,6,53,69,93,77,70,20,53,4,44,36,48,7,33,95,99,1,79,84,49,91,24,81,61,26,32,41,3,86,90,59,55,27,6,40,15,41,59,95,54,6,85,81,67,26,70,28,33,72,58,85,22,80,0,91,66,32,79,98,24,57,74,71,98,2,81,87,61,53,83,48,9,92,59,49,24,7,97,90,5,94,37,63,26,3,50,90,14,27,49,38,48,37,78,83,2,82,90,88,89,4,88,49,17,28,85,11,20,81,79,79,2,62,6,41,63,81,75,82,78,79,32,54,13,58,8,37,54,96,0,27,67,16,99,10,89,2,48,83,73,0,8,8,90,64,82,33,76,12,80,10,26,5,41,17,79,96,2,62,88,13,38,63,44,58,30,32,74,31,64,75,20,67,5,85,54,63,96,75,6,95,4,79,6,97,41,98,45,87,8,98,77,80,23,32,78,90,43,64,86,21,67,0,20,15,73,79,37,48,55,23,78,76,12,27,76,56,84,43,63,56,1,15,78,76,99,63,65,5,56,24,43,73,74,24,33,33,12,25,78,20,66,55,13,41,90,96,19,61,77,29,22,74,4,2,59,31,20,48,60,59,91,70,56,55,80,29,22,32,57,16,85,52,68,38,65,19,69,36,62,13,16,63,75,95,5,51,45,96,27,88,75,55,93,57,27,40,49,15,64,50,24,94,59,76,86,94,80,75,81,50,80,24,11,13,11,37,46,80,61,20,68,91,44,73,1,37,87,73,72,17,83,20,76,75,32,1,25,11,88,67,0,21,85,82,46,8,57,31,98,1,60,78,49,89,75,82,8,27,7,3,28,55,12,91,4,48,59,2,8,87,6,56,2,93,53,89,25,43,87,77,26,80,95,85,54,92
2,48,24,95,27,75,64,7,68,6,32,84,50,53,77,98,82,40,29,49,40,75,92,42,34,65,34,67,3,23,50,45,97,90,88,88,42,54,56,83,83,34,0,65,73,26,3,9,82,81,85,57,56,42,33,2,17,29,77,31,94,50,96,11,29,60,9,50,71,69,4,51,14,95,10,57,78,23,24,64,8,41,35,17,73,79,73,47,96,99,84,28,79,88,69,63,57,48,47,38,42,6,73,94,0,82,3,71,7,35,37,45,77,16,43,3,57,61,57,36,52,27,52,39,70,99,91,76,44,64,15,56,77,82,67,68,3,26,79,78,38,67,61,6,23,86,10,14,52,6,96,94,20,13,14,64,25,3,18,78,85,10,60,92,89,74,36,55,44,87,90,98,8,77,62,47,23,0,93,44,62,12,28,56,99,80,47,4,64,76,85,69,84,98,24,92,62,23,20,47,77,75,49,53,88,79,79,70,33,92,4,22,55,50,62,60,16,80,6,2,59,58,52,91,92,54,66,82,43,56,81,20,21,60,75,28,53,77,55,63,67,39,23,54,78,85,30,76,63,86,20,39,3,32,2,62,98,83,0,34,81,7,26,72,27,54,28,79,92,51,24,45,76,25,75,8,14,72,48,55,11,35,73,23,82,98,77,42,77,76,93,23,89,10,2,9,93,23,21,72,74,99,69,14,6,3,43,38,91,77,48,62,34,22,1,35,47,72,88,53,62,45,2,92,31,54,40,10,98,3,47,57,94,76,48,49,19,63,7,82,26,95,27,89,91,25,1,48,59,0,20,15,47,7,58,84,76,10,39,83,36,77,76,11,38,44,66,59,10,63,67,82,57,25,38,50,55,40,96,4,66,69,56,62,30,42,67,14,88,99,70,45,84,45,24,21,54,93,54,19,95,78,69,96,8,96,62,55,76,38,9,51,37,46,81,40,80,68,54,97,16,44,7,94,62,19,88,19,71,68,28,56,58,68,32,90,74,26,87,11,18,18,35,53,85,41,89,38,83,52,41,98,25,23,71,71,14,90,37,7,59,83,75,6,2,92,8,80,61,60,21,15,19,80,91,35,81,81,67,34,32,29,44,44,12,55,20,48,72,37,39,66,91,90,87,80,72,15,50,51,96
3,74,19,73,64,18,38,42,30,42,93,74,97,79,53,63,50,43,22,58,88,13,31,66,89,38,51,54,22,87,54,82,86,54,5,43,36,55,40,76,28,44,3,14,85,59,68,64,99,99,8,80,69,55,88,69,77,98,88,81,79,50,95,81,46,27,69,34,26,79,5,90,39,23,26,89,92,60,88,21,7,62,42,79,99,38,87,50,97,39,29,43,3,49,94,1,74,74,1,11,68,10,17,39,41,46,81,2,59,49,21,83,80,64,26,98,6,99,10,62,48,69,14,25,95,51,0,23,34,41,0,62,14,62,24,61,65,48,12,68,0,68,10,84,35,72,44,64,27,79,75,77,36,46,99,5,92,98,36,3,85,97,65,63,13,67,47,31,6,47,55,99,37,37,81,36,56,42,35,4,27,24,9,72,6,43,39,14,90,28,12,75,62,10,90,18,6,84,59,27,53,54,71,21,35,47,44,11,87,97,49,0,86,65,72,30,3,84,66,78,23,65,51,98,86,23,33,80,55,80,15,28,68,6,41,93,96,79,36,59,3,57,24,65,68,22,75,3,76,11,95,49,5,59,27,93,99,60,58,18,35,67,26,83,32,49,47,27,5,49,27,86,54,14,98,0,81,11,43,70,43,96,23,94,62,54,90,76,11,88,68,2,0,7,14,35,26,83,43,53,84,82,65,9,26,45,2,46,73,15,87,98,1,82,83,85,23,62,34,79,95,45,66,90,57,54,36,99,22,24,23,74,42,15,94,64,9,58,31,99,21,0,26,4,88,9,52,69,44,52,63,35,15,47,28,42,78,85,87,89,70,47,14,43,21,35,83,51,40,51,55,31,81,94,78,43,57,99,56,7,34,56,56,9,13,65,46,43,64,96,62,38,10,99,12,37,69,37,97,2,98,94,41,82,99,9,82,90,93,41,65,40,70,57,58,41,27,70,12,37,58,99,33,13,24,37,81,95,68,65,15,82,55,39,15,13,21,52,94,64,26,44,34,89,86,23,0,26,55,35,90,42,77,57,76,26,67,73,31,52,14,10,5,40,71,57,95,54,27,35,92,63,0,54,49,30,66,7,81,5,68,18,92,77,35,20,90,0,81,18,20,96,4,20,79,49,26,63,47,65,62
4,7,7,48,56,20,66,64,53,73,85,83,34,15,76,28,18,7,17,89,57,14,75,1,96,9,97,75,29,84,37,22,30,10,25,83,30,32,43,40,47,40,82,54,57,70,60,92,45,72,89,35,80,36,51,41,48,66,97,28,73,34,30,24,46,51,45,17,10,33,22,55,98,20,28,43,3,62,16,90,86,37,2,97,17,50,49,85,30,45,55,33,82,13,48,71,49,67,33,44,20,50,70,44,86,22,87,73,53,16,55,43,67,6,65,66,5,10,9,16,9,70,54,69,92,62,23,46,5,33,56,16,18,19,19,54,65,5,51,29,65,73,48,54,8,31,39,33,20,17,88,45,85,35,24,43,99,45,58,57,3,15,80,38,64,64,34,64,40,32,39,72,81,56,11,26,52,63,47,8,60,77,53,72,29,21,8,19,31,22,17,81,22,98,98,12,93,93,17,28,45,21,28,85,0,16,61,76,31,27,31,46,98,76,44,80,84,66,72,6,50,87,23,80,82,24,34,46,87,42,94,6,16,28,77,19,17,79,65,22,73,25,54,13,89,70,44,44,89,87,70,10,54,31,37,81,12,20,49,15,47,54,65,37,83,97,49,26,37,20,24,46,64,42,61,76,52,11,91,77,84,43,45,1,40,79,22,43,27,10,9,80,86,43,73,72,67,62,85,47,44,41,24,19,99,72,36,56,58,1,92,74,28,85,96,90,84,57,20,43,54,65,40,5,2,73,0,93,63,24,64,9,91,12,19,84,23,47,76,61,33,91,19,56,87,17,32,66,8,44,66,20,51,89,66,44,63,83,75,7,12,83,63,67,12,37,89,91,12,90,39,38,59,69,39,6,20,2,20,58,38,12,78,61,34,74,47,5,84,23,99,4,63,31,54,41,57,55,79,41,17,26,37,69,86,49,47,5,27,55,60,99,77,6,3,75,76,88,74,29,25,75,81,23,44,64,64,94,46,33,97,42,81,4,81,38,54,32,58,93,68,3,62,91,63,12,4,21,8,47,84,1,31,63,74,60,52,5,39,88,6,15,20,59,22,47,94,83,2,88,72,70,25,1,16,2,26,20,3,59,2,51,51,95,14,80,47,18,62,24,33,83,70,58,67,72,38,68,87,75,21
5,50,36,85,47,73,64,89,86,38,35,82,0,92,42,22,95,63,67,16,26,42,10,59,85,44,42,28,89,44,6,74,68,42,42,89,21,58,84,59,92,25,14,59,86,39,94,67,11,71,97,51,65,98,92,17,44,35,19,50,65,57,17,20,6,97,83,25,91,0,94,58,49,60,22,86,9,86,19,40,46,16,69,6,43,34,1,74,75,10,7,4,84,25,89,35,78,22,57,82,10,52,53,13,92,67,28,84,91,59,25,72,0,45,64,86,80,96,87,6,42,39,9,45,55,6,89,12,92,75,97,32,7,47,91,50,2,78,23,47,78,91,17,40,59,1,80,18,80,8,45,41,69,25,62,67,41,18,9,68,50,6,72,11,95,73,97,12,1,4,11,96,33,68,84,68,53,19,20,81,89,93,60,91,44,73,45,69,72,57,99,52,75,54,36,90,62,35,27,97,52,3,58,16,75,92,50,71,74,20,70,28,70,29,90,16,32,22,99,4,82,25,69,89,93,30,80,76,10,64,35,62,33,49,66,1,36,55,52,67,70,67,14,24,34,11,32,35,93,53,38,15,80,62,65,31,13,89,78,58,21,87,81,39,39,81,84,40,54,75,19,34,79,28,1,97,27,83,47,68,40,40,5,85,31,22,65,10,59,87,44,77,67,76,73,39,12,79,95,17,40,38,74,74,20,52,98,30,57,17,55,88,46,64,55,63,55,4,16,60,55,68,39,37,72,56,11,87,97,82,28,61,78,21,35,71,0,22,53,39,10,18,69,9,20,56,31,32,7,23,65,4,9,87,16,32,11,76,33,49,95,71,96,21,64,69,37,39,24,82,67,59,30,45,33,81,90,90,58,27,7,52,3,48,27,71,40,83,56,58,12,90,99,11,41,44,51,80,91,57,75,39,12,73,30,81,39,8,97,45,78,67,92,93,49,45,79,60,47,72,11,74,76,99,32,92,6,11,25,64,80,44,42,68,71,97,11,61,67,78,40,25,16,1,17,4,36,60,18,8,95,77,41,46,32,87,63,15,70,34,32,0,31,79,92,65,60,51,89,63,3,76,79,67,28,19,65,16,4,17,35,98,19,97,33,39,8,17,5,2,30,95,98,29,19,47,71,23,48,92,45
6,33,59,1,48,37,52,40,98,89,55,80,48,56,72,79,86,25,69,29,10,78,22,42,98,82,34,56,75,42,44,30,10,25,19,46,46,84,64,22,36,37,97,99,18,18,3,75,39,68,80,17,81,88,32,43,8,21,73,73,89,97,7,66,5,76,21,13,90,21,33,91,36,55,72,70,75,61,96,93,57,88,2,20,22,65,34,23,21,83,25,72,72,25,95,34,58,76,72,86,9,89,13,14,32,28,13,49,64,44,31,7,2,14,89,49,7,41,77,2,87,43,29,37,84,18,8,34,62,35,62,56,51,96,71,2,72,52,39,2,97,60,20,70,18,79,24,69,25,72,83,24,70,81,9,22,41,61,5,47,71,72,42,30,12,4,90,19,24,74,1,43,6,53,65,81,60,48,22,85,6,0,1,72,55,74,72,91,23,47,43,85,85,24,9,9,59,52,21,47,91,50,46,53,53,61,46,71,92,69,21,31,89,52,37,17,46,7,2,37,20,7,44,56,29,16,63,61,45,40,71,73,12,3,50,64,69,12,34,7,59,42,9,57,47,20,22,64,68,91,91,92,88,98,70,6,18,33,46,46,51,29,88,73,40,94,5,69,18,4,34,63,41,67,2,78,34,35,75,41,38,1,50,91,93,76,54,35,88,4,63,14,21,51,56,65,95,3,36,17,92,81,94,98,48,15,1,12,23,29,75,52,58,76,3,89,15,29,78,31,15,60,1,62,22,36,62,36,16,23,42,35,78,21,6,51,64,5,51,75,13,71,13,78,37,4,19,10,53,11,80,4,89,70,89,3,19,87,32,88,51,78,34,83,2,62,78,37,12,33,58,57,5,16,32,42,12,40,33,99,54,57,52,46,38,96,95,25,85,55,80,48,93,87,19,60,38,30,58,4,9,1,1,33,54,14,86,30,41,17,71,78,23,55,58,59,11,72,21,16,49,59,87,36,81,60,82,62,41,72,81,95,70,49,13,14,48,82,64,85,13,80,90,17,68,16,27,12,65,57,11,69,87,97,78,77,26,25,22,11,13,96,34,74,19,83,23,45,60,36,32,78,97,68,98,48,43,66,82,20,76,0,31,70,49,67,58,15,54,99,58,83,1,36,17,95,90,36,8,92,80
7,10,1,89,4,1,95,61,6,35,80,0,70,54,67,21,42,64,7,44,40,72,62,0,77,8,56,72,19,12,35,28,52,70,64,17,82,2,11,96,27,0,59,18,8,11,59,53,77,95,47,48,72,7,56,34,20,86,26,60,88,57,91,94,22,32,90,47,64,75,71,45,56,40,48,63,15,79,44,25,47,54,0,21,52,0,72,36,97,10,81,68,73,55,80,54,38,20,85,25,0,88,54,98,14,51,3,14,81,14,73,51,68,95,12,68,47,84,33,13,62,61,92,6,64,23,60,65,9,94,73,22,63,12,94,55,4,4,96,94,39,57,48,99,83,89,87,58,92,63,15,79,55,10,3,73,16,82,11,86,11,83,80,28,27,71,31,75,9,51,56,7,73,85,13,26,82,36,29,46,52,89,74,18,60,3,69,28,28,26,16,45,83,59,48,97,66,72,37,10,83,58,22,7,85,32,57,81,40,96,28,5,65,96,37,91,67,56,14,2,10,42,70,64,95,78,41,42,53,59,74,39,77,92,88,83,40,4,10,43,0,51,5,88,41,1,45,63,38,93,94,86,96,92,9,97,40,53,80,5,91,64,26,61,65,51,76,14,20,11,47,33,49,51,43,13,97,29,83,46,66,65,72,96,40,89,8,48,17,55,65,77,37,0,94,37,61,18,85,26,19,61,8,62,9,63,22,90,96,0,39,16,12,21,85,46,1,3,52,28,84,86,53,42,85,18,95,72,23,51,59,97,12,41,92,72,64,85,57,27,8,19,55,14,30,17,98,4,10,7,84,81,21,75,52,44,36,53,48,12,8,6,75,65,3,3,76,86,23,94,57,94,19,39,4,77,84,80,70,22,92,63,85,90,44,17,5,39,21,74,42,27,63,63,1,33,81,58,34,72,48,16,62,62,17,58,19,65,50,82,11,19,73,52,35,0,38,54,87,25,65,4,46,29,36,85,44,59,45,12,33,36,21,89,59,51,57,41,64,31,39,4,95,51,50,20,27,86,78,61,2,69,96,19,97,98,42,13,52,97,69,95,77,93,61,25,48,34,66,25,52,53,48,55,99,37,98,77,58,60,10,8,30,13,22,0,92,77,43,41,59,33,73,55,7,63,76,97,97,86,7
8,55,90,66,51,24,53,73,61,0,47,24,26,15,6,76,9,59,84,53,52,72,51,85,5,98,98,75,66,46,85,94,11,59,69,1,34,94,38,98,21,42,53,33,96,20,37,84,8,30,56,27,37,73,17,91,89,4,48,0,1,95,40,24,34,3,42,61,32,11,79,39,73,56,38,84,72,78,5,82,67,16,7,55,40,25,37,47,90,93,34,54,93,83,31,99,20,30,2,36,46,88,48,58,53,63,67,56,97,16,58,81,27,8,96,2,23,3,49,38,27,51,76,45,18,18,72,49,64,92,59,14,84,32,24,61,8,74,95,61,46,78,26,11,51,58,21,11,17,62,55,54,0,55,50,36,92,8,83,8,66,35,80,28,10,27,25,10,59,90,17,57,22,70,95,25,7,38,54,89,88,88,53,43,70,63,20,23,35,68,25,26,59,68,96,0,85,23,14,74,80,44,18,87,44,77,44,98,15,41,71,3,29,4,16,18,96,79,57,3,46,47,36,79,73,45,86,38,47,54,18,14,42,3,27,52,89,49,85,97,52,37,31,85,7,15,20,31,19,71,62,89,6,11,46,92,41,69,35,39,92,14,33,12,31,96,30,86,67,72,65,36,77,40,41,35,58,52,23,54,12,73,10,33,7,4,37,21,73,48,28,58,17,44,99,63,95,75,87,42,78,15,12,46,6,0,24,62,26,99,73,85,87,22,17,45,46,17,11,62,28,22,53,43,67,5,40,61,97,80,9,27,71,34,40,7,25,67,34,49,20,68,31,48,95,97,16,11,12,98,12,53,74,63,66,44,63,21,61,7,28,93,8,70,16,67,3,79,4,83,37,40,81,73,18,1,49,87,85,61,11,19,98,21,39,81,84,49,67,4,99,48,23,63,5,68,16,75,18,4,2,90,1,79,64,40,78,33,27,58,16,33,91,43,53,87,27,60,47,11,37,12,82,69,17,39,25,27,4,56,30,69,38,83,60,6,15,78,20,78,57,79,32,85,36,91,66,54,15,21,2,58,33,39,89,18,82,56,99,37,65,83,82,76,97,55,58,91,47,78,82,84,95,38,72,58,95,74,70,2,0,90,68,18,87,33,10,51,84,84,66,13,12,73,91,85,98,38,97,16,98
9,91,71,20,52,7,30,1,35,94,49,12,61,60,32,0,52,48,41,11,62,93,14,72,32,38,0,73,72,84,97,0,76,95,44,29,23,25,63,28,31,97,50,8,96,74,64,26,80,31,51,92,89,89,0,79,77,32,0,5,8,57,97,0,15,47,64,48,62,27,26,66,8,62,25,4,86,91,78,65,21,45,96,57,9,4,33,31,0,30,5,1,8,93,66,85,51,45,13,62,4,54,86,42,16,88,12,34,76,3,63,92,47,46,73,92,84,0,43,23,42,57,69,53,72,40,77,86,55,79,4,97,90,30,85,69,91,56,15,36,80,92,66,16,88,4,97,96,4,88,26,46,55,16,83,15,11,43,15,53,0,17,69,3,72,1,24,95,86,80,50,24,36,93,95,38,16,10,51,43,5,84,78,17,23,54,16,27,0,5,93,40,23,37,7,8,35,73,35,41,13,94,14,83,44,18,87,60,48,43,93,26,78,91,55,4,8,60,7,26,27,76,79,99,86,29,51,31,81,95,39,12,26,83,18,2,58,23,57,32,27,83,61,21,60,98,45,88,55,78,52,11,6,58,81,46,55,74,35,19,1,86,18,96,88,34,80,45,51,90,6,14,13,18,33,91,40,4,41,55,87,21,38,58,64,46,33,39,43,9,0,26,94,39,31,55,95,35,29,29,29,23,87,99,62,71,44,39,39,81,93,2,58,7,45,46,69,64,45,34,54,35,67,26,53,40,67,68,83,12,14,14,13,34,71,23,14,27,18,83,42,37,47,66,42,37,29,6,75,97,56,9,11,74,33,28,87,8,29,61,31,11,62,55,88,75,70,45,59,79,45,63,17,5,74,10,65,49,92,98,83,26,76,51,1,5,26,19,30,40,85,10,6,72,76,14,77,26,71,58,61,27,83,90,50,99,47,92,69,12,36,48,59,51,47,48,19,88,31,48,27,27,64,98,71,37,84,78,67,1,57,77,80,2,90,41,67,23,76,76,86,68,52,97,93,31,83,80,62,82,12,68,22,55,58,1,5,34,96,79,88,35,96,84,89,90,26,65,50,41,26,0,82,57,12,28,61,84,36,43,15,57,33,67,59,79,41,68,32,33,38,89,5,14,84,48,85,14,32,72,18


####**Expression system**
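Don't waste memory or time on feature engineering: Vaex transforms your data lazily, only when it is needed. Defining a new column stores the expression rather than computing a million values, which is why the cell below returns in microseconds.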

In [10]:
%%time
vaex_df['multiplication_col13'] = vaex_df.col1 * vaex_df.col3

CPU times: user 293 µs, sys: 13 µs, total: 306 µs
Wall time: 311 µs


In [11]:
vaex_df['multiplication_col13']

Expression = multiplication_col13
Length: 1,000,000 dtype: int64 (column)
---------------------------------------
     0  7760
     1    64
     2   648
     3  1216
     4   392
    ...     
999995  1547
999996  5148
999997  4692
999998  2730
999999  4482
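
The microsecond timing above makes sense once you see a column as a stored recipe rather than stored values. Here is a minimal pure-Python sketch of that idea. `LazyColumn` and `from_list` are hypothetical names invented for this illustration and do not reflect Vaex's internals; the sample values are the first three `col1` and `col3` entries from the `head()` output above.

```python
from typing import Callable, Sequence


class LazyColumn:
    """A column whose values are computed only when requested."""

    def __init__(self, length: int, compute: Callable[[int], int]):
        self._length = length
        self._compute = compute
        self.evaluations = 0  # counts how many values were actually computed

    def __len__(self) -> int:
        return self._length

    def __getitem__(self, i: int) -> int:
        self.evaluations += 1
        return self._compute(i)

    def __mul__(self, other: "LazyColumn") -> "LazyColumn":
        # Building the product only stores a recipe; no values are computed here.
        return LazyColumn(min(len(self), len(other)), lambda i: self[i] * other[i])


def from_list(values: Sequence[int]) -> LazyColumn:
    """Wrap an in-memory sequence as a lazy column."""
    return LazyColumn(len(values), lambda i: values[i])


col1 = from_list([97, 64, 24])
col3 = from_list([80, 1, 27])
product = col1 * col3  # returns immediately, nothing evaluated yet
evaluations_before = product.evaluations
first_two = [product[i] for i in range(2)]  # only now are two rows computed
```

Defining `product` costs the same whether the columns hold three rows or a billion; the work is deferred until rows are actually read.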

####**Out-of-core DataFrame**
Filtering and evaluating expressions does not waste memory by making copies: the data stays untouched on disk and is streamed only when needed. This postpones the point at which you need a cluster.
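
A rough way to picture filtering without copies is a memory-mapped file plus a generator that yields matching row indices. The sketch below uses only the standard library and is an assumption-laden illustration, not Vaex's mechanism: the file layout, `matching_rows`, and the threshold are made up for the example, and the values are the first ten `col2` entries from the `head()` output above.

```python
import mmap
import os
import struct
import tempfile
from typing import Iterator

# First ten col2 values from the head() output above.
values = [12, 93, 95, 73, 48, 85, 1, 89, 66, 20]

# Write the column to disk as packed 64-bit integers.
path = os.path.join(tempfile.mkdtemp(), "col2.bin")
with open(path, "wb") as f:
    f.write(struct.pack(f"={len(values)}q", *values))


def matching_rows(buf, threshold: int) -> Iterator[int]:
    """Yield indices of rows whose value exceeds `threshold`."""
    for i in range(len(buf) // 8):
        # Read one value straight from the mapped buffer, no intermediate copy
        # of the whole column.
        (value,) = struct.unpack_from("=q", buf, i * 8)
        if value > threshold:
            yield i


with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
    # The operating system pages data in on demand as the generator advances.
    selected = list(matching_rows(mm, 70))
```

The selection is just a list of row positions; the column data itself is never duplicated, and the file on disk is left unchanged.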

In [12]:
vaex_df[vaex_df.col2 > 70]

#,col0,col1,col2,col3,col4,col5,col6,col7,col8,col9,col10,col11,col12,col13,col14,col15,col16,col17,col18,col19,col20,col21,col22,col23,col24,col25,col26,col27,col28,col29,col30,col31,col32,col33,col34,col35,col36,col37,col38,col39,col40,col41,col42,col43,col44,col45,col46,col47,col48,col49,col50,col51,col52,col53,col54,col55,col56,col57,col58,col59,col60,col61,col62,col63,col64,col65,col66,col67,col68,col69,col70,col71,col72,col73,col74,col75,col76,col77,col78,col79,col80,col81,col82,col83,col84,col85,col86,col87,col88,col89,col90,col91,col92,col93,col94,col95,col96,col97,col98,col99,col100,col101,col102,col103,col104,col105,col106,col107,col108,col109,col110,col111,col112,col113,col114,col115,col116,col117,col118,col119,col120,col121,col122,col123,col124,col125,col126,col127,col128,col129,col130,col131,col132,col133,col134,col135,col136,col137,col138,col139,col140,col141,col142,col143,col144,col145,col146,col147,col148,col149,col150,col151,col152,col153,col154,col155,col156,col157,col158,col159,col160,col161,col162,col163,col164,col165,col166,col167,col168,col169,col170,col171,col172,col173,col174,col175,col176,col177,col178,col179,col180,col181,col182,col183,col184,col185,col186,col187,col188,col189,col190,col191,col192,col193,col194,col195,col196,col197,col198,col199,col200,col201,col202,col203,col204,col205,col206,col207,col208,col209,col210,col211,col212,col213,col214,col215,col216,col217,col218,col219,col220,col221,col222,col223,col224,col225,col226,col227,col228,col229,col230,col231,col232,col233,col234,col235,col236,col237,col238,col239,col240,col241,col242,col243,col244,col245,col246,col247,col248,col249,col250,col251,col252,col253,col254,col255,col256,col257,col258,col259,col260,col261,col262,col263,col264,col265,col266,col267,col268,col269,col270,col271,col272,col273,col274,col275,col276,col277,col278,col279,col280,col281,col282,col283,col284,col285,col286,col287,col288,col289,col290,col291,col292,col293,col294,col295,col296,col297,col298,col299,col300,col301,col302,col303,col304,col305,col306,col307,col308,col309,col310,col311,col312,col313,col314,col315,col316,col317,col318,col319,col320,col321,col322,col323,col324,col325,col326,col327,col328,col329,col330,col331,col332,col333,col334,col335,col336,col337,col338,col339,col340,col341,col342,col343,col344,col345,col346,col347,col348,col349,col350,col351,col352,col353,col354,col355,col356,col357,col358,col359,col360,col361,col362,col363,col364,col365,col366,col367,col368,col369,col370,col371,col372,col373,col374,col375,col376,col377,col378,col379,col380,col381,col382,col383,col384,col385,col386,col387,col388,col389,col390,col391,col392,col393,col394,col395,col396,col397,col398,col399,col400,col401,col402,col403,col404,col405,col406,col407,col408,col409,col410,col411,col412,col413,col414,col415,col416,col417,col418,col419,col420,col421,col422,col423,col424,col425,col426,col427,col428,col429,col430,col431,col432,col433,col434,col435,col436,col437,col438,col439,col440,col441,col442,col443,col444,col445,col446,col447,col448,col449,col450,col451,col452,col453,col454,col455,col456,col457,col458,col459,col460,col461,col462,col463,col464,col465,col466,col467,col468,col469,col470,col471,col472,col473,col474,col475,col476,col477,col478,col479,col480,col481,col482,col483,col484,col485,col486,col487,col488,col489,col490,col491,col492,col493,col494,col495,col496,col497,col498,col499,multiplication_col13
0,26,64,93,1,90,44,7,28,64,49,30,47,13,11,86,28,34,33,35,32,48,47,20,8,36,45,16,56,40,57,71,42,58,65,86,5,6,88,8,88,54,70,89,7,11,85,63,69,76,72,40,37,34,26,43,95,9,87,30,16,84,67,26,17,25,62,14,87,68,81,12,58,17,83,6,60,41,7,77,23,71,74,76,7,0,56,32,29,31,25,24,6,84,85,16,96,9,99,35,61,17,26,88,62,8,21,60,84,97,3,60,16,81,6,53,69,93,77,70,20,53,4,44,36,48,7,33,95,99,1,79,84,49,91,24,81,61,26,32,41,3,86,90,59,55,27,6,40,15,41,59,95,54,6,85,81,67,26,70,28,33,72,58,85,22,80,0,91,66,32,79,98,24,57,74,71,98,2,81,87,61,53,83,48,9,92,59,49,24,7,97,90,5,94,37,63,26,3,50,90,14,27,49,38,48,37,78,83,2,82,90,88,89,4,88,49,17,28,85,11,20,81,79,79,2,62,6,41,63,81,75,82,78,79,32,54,13,58,8,37,54,96,0,27,67,16,99,10,89,2,48,83,73,0,8,8,90,64,82,33,76,12,80,10,26,5,41,17,79,96,2,62,88,13,38,63,44,58,30,32,74,31,64,75,20,67,5,85,54,63,96,75,6,95,4,79,6,97,41,98,45,87,8,98,77,80,23,32,78,90,43,64,86,21,67,0,20,15,73,79,37,48,55,23,78,76,12,27,76,56,84,43,63,56,1,15,78,76,99,63,65,5,56,24,43,73,74,24,33,33,12,25,78,20,66,55,13,41,90,96,19,61,77,29,22,74,4,2,59,31,20,48,60,59,91,70,56,55,80,29,22,32,57,16,85,52,68,38,65,19,69,36,62,13,16,63,75,95,5,51,45,96,27,88,75,55,93,57,27,40,49,15,64,50,24,94,59,76,86,94,80,75,81,50,80,24,11,13,11,37,46,80,61,20,68,91,44,73,1,37,87,73,72,17,83,20,76,75,32,1,25,11,88,67,0,21,85,82,46,8,57,31,98,1,60,78,49,89,75,82,8,27,7,3,28,55,12,91,4,48,59,2,8,87,6,56,2,93,53,89,25,43,87,77,26,80,95,85,54,92,64
1,48,24,95,27,75,64,7,68,6,32,84,50,53,77,98,82,40,29,49,40,75,92,42,34,65,34,67,3,23,50,45,97,90,88,88,42,54,56,83,83,34,0,65,73,26,3,9,82,81,85,57,56,42,33,2,17,29,77,31,94,50,96,11,29,60,9,50,71,69,4,51,14,95,10,57,78,23,24,64,8,41,35,17,73,79,73,47,96,99,84,28,79,88,69,63,57,48,47,38,42,6,73,94,0,82,3,71,7,35,37,45,77,16,43,3,57,61,57,36,52,27,52,39,70,99,91,76,44,64,15,56,77,82,67,68,3,26,79,78,38,67,61,6,23,86,10,14,52,6,96,94,20,13,14,64,25,3,18,78,85,10,60,92,89,74,36,55,44,87,90,98,8,77,62,47,23,0,93,44,62,12,28,56,99,80,47,4,64,76,85,69,84,98,24,92,62,23,20,47,77,75,49,53,88,79,79,70,33,92,4,22,55,50,62,60,16,80,6,2,59,58,52,91,92,54,66,82,43,56,81,20,21,60,75,28,53,77,55,63,67,39,23,54,78,85,30,76,63,86,20,39,3,32,2,62,98,83,0,34,81,7,26,72,27,54,28,79,92,51,24,45,76,25,75,8,14,72,48,55,11,35,73,23,82,98,77,42,77,76,93,23,89,10,2,9,93,23,21,72,74,99,69,14,6,3,43,38,91,77,48,62,34,22,1,35,47,72,88,53,62,45,2,92,31,54,40,10,98,3,47,57,94,76,48,49,19,63,7,82,26,95,27,89,91,25,1,48,59,0,20,15,47,7,58,84,76,10,39,83,36,77,76,11,38,44,66,59,10,63,67,82,57,25,38,50,55,40,96,4,66,69,56,62,30,42,67,14,88,99,70,45,84,45,24,21,54,93,54,19,95,78,69,96,8,96,62,55,76,38,9,51,37,46,81,40,80,68,54,97,16,44,7,94,62,19,88,19,71,68,28,56,58,68,32,90,74,26,87,11,18,18,35,53,85,41,89,38,83,52,41,98,25,23,71,71,14,90,37,7,59,83,75,6,2,92,8,80,61,60,21,15,19,80,91,35,81,81,67,34,32,29,44,44,12,55,20,48,72,37,39,66,91,90,87,80,72,15,50,51,96,648
2,74,19,73,64,18,38,42,30,42,93,74,97,79,53,63,50,43,22,58,88,13,31,66,89,38,51,54,22,87,54,82,86,54,5,43,36,55,40,76,28,44,3,14,85,59,68,64,99,99,8,80,69,55,88,69,77,98,88,81,79,50,95,81,46,27,69,34,26,79,5,90,39,23,26,89,92,60,88,21,7,62,42,79,99,38,87,50,97,39,29,43,3,49,94,1,74,74,1,11,68,10,17,39,41,46,81,2,59,49,21,83,80,64,26,98,6,99,10,62,48,69,14,25,95,51,0,23,34,41,0,62,14,62,24,61,65,48,12,68,0,68,10,84,35,72,44,64,27,79,75,77,36,46,99,5,92,98,36,3,85,97,65,63,13,67,47,31,6,47,55,99,37,37,81,36,56,42,35,4,27,24,9,72,6,43,39,14,90,28,12,75,62,10,90,18,6,84,59,27,53,54,71,21,35,47,44,11,87,97,49,0,86,65,72,30,3,84,66,78,23,65,51,98,86,23,33,80,55,80,15,28,68,6,41,93,96,79,36,59,3,57,24,65,68,22,75,3,76,11,95,49,5,59,27,93,99,60,58,18,35,67,26,83,32,49,47,27,5,49,27,86,54,14,98,0,81,11,43,70,43,96,23,94,62,54,90,76,11,88,68,2,0,7,14,35,26,83,43,53,84,82,65,9,26,45,2,46,73,15,87,98,1,82,83,85,23,62,34,79,95,45,66,90,57,54,36,99,22,24,23,74,42,15,94,64,9,58,31,99,21,0,26,4,88,9,52,69,44,52,63,35,15,47,28,42,78,85,87,89,70,47,14,43,21,35,83,51,40,51,55,31,81,94,78,43,57,99,56,7,34,56,56,9,13,65,46,43,64,96,62,38,10,99,12,37,69,37,97,2,98,94,41,82,99,9,82,90,93,41,65,40,70,57,58,41,27,70,12,37,58,99,33,13,24,37,81,95,68,65,15,82,55,39,15,13,21,52,94,64,26,44,34,89,86,23,0,26,55,35,90,42,77,57,76,26,67,73,31,52,14,10,5,40,71,57,95,54,27,35,92,63,0,54,49,30,66,7,81,5,68,18,92,77,35,20,90,0,81,18,20,96,4,20,79,49,26,63,47,65,62,1216
3,50,36,85,47,73,64,89,86,38,35,82,0,92,42,22,95,63,67,16,26,42,10,59,85,44,42,28,89,44,6,74,68,42,42,89,21,58,84,59,92,25,14,59,86,39,94,67,11,71,97,51,65,98,92,17,44,35,19,50,65,57,17,20,6,97,83,25,91,0,94,58,49,60,22,86,9,86,19,40,46,16,69,6,43,34,1,74,75,10,7,4,84,25,89,35,78,22,57,82,10,52,53,13,92,67,28,84,91,59,25,72,0,45,64,86,80,96,87,6,42,39,9,45,55,6,89,12,92,75,97,32,7,47,91,50,2,78,23,47,78,91,17,40,59,1,80,18,80,8,45,41,69,25,62,67,41,18,9,68,50,6,72,11,95,73,97,12,1,4,11,96,33,68,84,68,53,19,20,81,89,93,60,91,44,73,45,69,72,57,99,52,75,54,36,90,62,35,27,97,52,3,58,16,75,92,50,71,74,20,70,28,70,29,90,16,32,22,99,4,82,25,69,89,93,30,80,76,10,64,35,62,33,49,66,1,36,55,52,67,70,67,14,24,34,11,32,35,93,53,38,15,80,62,65,31,13,89,78,58,21,87,81,39,39,81,84,40,54,75,19,34,79,28,1,97,27,83,47,68,40,40,5,85,31,22,65,10,59,87,44,77,67,76,73,39,12,79,95,17,40,38,74,74,20,52,98,30,57,17,55,88,46,64,55,63,55,4,16,60,55,68,39,37,72,56,11,87,97,82,28,61,78,21,35,71,0,22,53,39,10,18,69,9,20,56,31,32,7,23,65,4,9,87,16,32,11,76,33,49,95,71,96,21,64,69,37,39,24,82,67,59,30,45,33,81,90,90,58,27,7,52,3,48,27,71,40,83,56,58,12,90,99,11,41,44,51,80,91,57,75,39,12,73,30,81,39,8,97,45,78,67,92,93,49,45,79,60,47,72,11,74,76,99,32,92,6,11,25,64,80,44,42,68,71,97,11,61,67,78,40,25,16,1,17,4,36,60,18,8,95,77,41,46,32,87,63,15,70,34,32,0,31,79,92,65,60,51,89,63,3,76,79,67,28,19,65,16,4,17,35,98,19,97,33,39,8,17,5,2,30,95,98,29,19,47,71,23,48,92,45,1692
4,10,1,89,4,1,95,61,6,35,80,0,70,54,67,21,42,64,7,44,40,72,62,0,77,8,56,72,19,12,35,28,52,70,64,17,82,2,11,96,27,0,59,18,8,11,59,53,77,95,47,48,72,7,56,34,20,86,26,60,88,57,91,94,22,32,90,47,64,75,71,45,56,40,48,63,15,79,44,25,47,54,0,21,52,0,72,36,97,10,81,68,73,55,80,54,38,20,85,25,0,88,54,98,14,51,3,14,81,14,73,51,68,95,12,68,47,84,33,13,62,61,92,6,64,23,60,65,9,94,73,22,63,12,94,55,4,4,96,94,39,57,48,99,83,89,87,58,92,63,15,79,55,10,3,73,16,82,11,86,11,83,80,28,27,71,31,75,9,51,56,7,73,85,13,26,82,36,29,46,52,89,74,18,60,3,69,28,28,26,16,45,83,59,48,97,66,72,37,10,83,58,22,7,85,32,57,81,40,96,28,5,65,96,37,91,67,56,14,2,10,42,70,64,95,78,41,42,53,59,74,39,77,92,88,83,40,4,10,43,0,51,5,88,41,1,45,63,38,93,94,86,96,92,9,97,40,53,80,5,91,64,26,61,65,51,76,14,20,11,47,33,49,51,43,13,97,29,83,46,66,65,72,96,40,89,8,48,17,55,65,77,37,0,94,37,61,18,85,26,19,61,8,62,9,63,22,90,96,0,39,16,12,21,85,46,1,3,52,28,84,86,53,42,85,18,95,72,23,51,59,97,12,41,92,72,64,85,57,27,8,19,55,14,30,17,98,4,10,7,84,81,21,75,52,44,36,53,48,12,8,6,75,65,3,3,76,86,23,94,57,94,19,39,4,77,84,80,70,22,92,63,85,90,44,17,5,39,21,74,42,27,63,63,1,33,81,58,34,72,48,16,62,62,17,58,19,65,50,82,11,19,73,52,35,0,38,54,87,25,65,4,46,29,36,85,44,59,45,12,33,36,21,89,59,51,57,41,64,31,39,4,95,51,50,20,27,86,78,61,2,69,96,19,97,98,42,13,52,97,69,95,77,93,61,25,48,34,66,25,52,53,48,55,99,37,98,77,58,60,10,8,30,13,22,0,92,77,43,41,59,33,73,55,7,63,76,97,97,86,7,4
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,
...,...
290466,53,58,72,89,26,9,72,97,19,15,50,63,50,31,84,69,84,29,72,39,37,71,77,44,96,93,69,84,71,95,80,51,37,41,36,64,64,40,45,83,75,27,70,33,6,44,42,37,78,52,0,89,34,8,58,75,98,76,83,95,65,45,72,12,16,29,63,35,21,88,48,43,53,30,24,86,65,38,47,48,93,53,89,77,77,84,77,68,71,42,72,73,80,32,8,70,38,95,87,20,36,66,0,56,45,38,89,42,57,17,51,87,56,61,39,17,38,13,60,60,31,44,52,64,21,91,34,69,16,31,40,27,45,32,6,4,59,30,95,69,25,96,36,8,14,92,75,81,14,66,13,85,28,5,9,46,26,71,20,52,23,36,32,80,82,66,30,51,69,27,92,55,88,85,39,46,1,96,13,61,46,18,78,26,92,55,44,51,45,63,10,4,59,28,87,36,16,23,28,70,16,33,18,53,17,26,52,52,41,79,37,4,3,27,62,85,88,94,5,23,90,23,40,41,27,87,41,76,54,42,0,93,78,68,47,20,8,31,21,95,99,43,32,76,10,91,50,43,98,46,17,71,74,52,41,85,46,34,12,41,46,50,6,52,92,81,20,9,89,4,19,95,74,21,54,2,18,32,36,64,48,74,89,48,44,79,6,48,83,37,96,52,67,9,94,80,71,82,13,80,79,7,78,81,19,90,48,95,40,86,90,50,38,5,80,45,33,0,53,68,7,51,62,99,39,3,19,9,30,52,33,72,42,73,2,15,90,30,74,32,61,10,93,60,41,60,53,86,75,41,90,48,64,46,27,62,6,63,71,6,18,60,36,45,86,46,5,54,93,23,96,41,69,99,50,17,68,48,33,54,37,56,86,37,23,31,85,52,38,14,84,82,48,55,58,61,33,64,82,73,60,99,52,81,22,79,74,34,88,92,53,15,26,82,97,82,30,89,87,92,16,78,3,12,18,38,92,99,53,75,0,15,9,46,23,95,39,16,33,63,76,47,79,90,54,61,81,87,43,22,30,62,51,44,26,98,75,5,51,83,20,78,91,18,91,60,8,71,98,80,74,68,12,55,91,91,37,16,74,2,93,14,16,87,18,33,96,71,56,11,48,18,94,81,67,33,15,0,92,89,5162
290467,94,7,90,92,17,79,78,49,10,88,25,11,83,2,89,86,64,84,7,12,90,59,42,51,70,51,73,78,70,28,29,98,86,53,96,71,35,59,15,19,72,0,39,49,67,91,49,10,9,58,46,86,30,96,12,8,76,3,74,86,54,76,17,54,44,22,54,69,10,24,12,87,64,51,1,55,34,72,60,54,69,36,38,3,48,73,48,41,84,30,35,75,67,83,76,28,38,11,78,59,70,95,33,0,3,35,42,12,60,23,64,29,20,74,8,50,15,49,99,82,79,12,79,5,76,74,78,23,36,67,18,56,62,15,25,40,9,99,32,2,30,78,16,5,67,34,23,7,82,35,64,19,60,41,12,13,81,27,48,21,53,40,64,60,74,25,94,43,41,25,58,55,5,87,77,5,46,5,37,52,82,63,66,73,36,92,3,67,63,58,54,99,91,70,87,37,27,63,25,78,33,71,97,73,79,90,46,86,28,77,63,18,0,13,40,5,91,37,2,4,33,7,74,52,15,91,54,3,35,44,33,10,44,28,0,0,28,77,24,42,23,32,3,65,16,56,11,27,62,4,96,81,35,29,12,69,59,46,63,62,73,98,50,48,74,44,65,77,46,99,13,62,28,53,80,15,58,44,7,36,60,1,41,34,45,62,72,92,32,77,3,36,28,98,52,64,72,75,45,15,3,24,9,11,66,34,31,92,17,75,67,66,87,66,91,76,70,40,4,15,98,42,77,92,60,86,67,25,87,4,50,21,3,94,94,98,12,17,49,52,29,48,57,79,50,50,45,41,37,61,24,68,22,90,90,21,51,70,46,98,51,92,27,48,50,10,34,94,9,43,10,14,95,6,67,68,63,82,53,21,25,96,90,29,25,64,82,54,8,70,10,87,22,78,64,70,80,24,62,90,56,65,57,96,41,17,27,87,56,65,88,46,92,12,73,17,27,94,16,10,7,95,8,2,3,9,64,1,40,43,15,43,34,20,79,46,89,25,42,69,99,20,97,16,78,33,48,42,77,98,70,28,88,56,77,9,63,50,94,36,65,39,62,38,78,72,11,23,26,47,60,12,43,43,72,19,80,17,85,33,94,79,3,67,29,45,69,87,63,34,14,83,8,62,84,63,28,87,33,44,644
290468,16,10,83,16,36,0,70,98,44,0,49,22,9,48,23,78,45,59,74,50,53,54,71,57,12,28,99,12,8,41,52,25,88,82,36,45,80,80,95,36,80,94,85,75,76,41,54,74,34,72,40,59,76,10,2,38,27,22,12,51,70,77,9,59,91,37,58,66,80,49,28,41,99,28,4,22,65,12,29,23,16,90,64,84,48,83,5,36,7,2,31,79,57,16,39,44,64,11,58,90,94,29,35,27,8,7,42,83,27,32,82,37,95,45,52,89,56,32,29,64,87,59,68,37,70,47,0,30,54,88,47,63,82,99,75,75,37,81,88,31,33,55,3,71,7,68,35,9,35,72,22,27,2,6,27,53,24,57,36,54,67,74,66,81,77,45,73,95,81,39,36,22,0,44,63,35,28,68,13,15,99,5,63,85,24,93,24,14,28,42,50,45,69,12,31,97,99,53,50,33,91,54,48,53,64,82,35,55,28,0,39,15,61,51,70,81,33,52,35,47,76,63,7,92,42,6,29,33,41,30,45,10,82,20,18,65,77,28,86,98,96,88,77,90,74,2,0,67,4,47,92,13,90,45,90,5,71,66,90,9,69,14,86,70,52,58,75,96,82,94,94,0,42,53,1,6,77,14,0,33,13,56,50,36,23,8,46,19,53,71,32,43,92,72,78,38,74,45,7,47,58,77,1,95,10,93,37,24,56,72,68,82,84,30,75,31,3,65,7,71,8,16,76,33,91,48,19,74,55,99,62,61,76,8,93,71,95,7,80,0,41,97,54,33,29,4,95,70,32,66,94,94,32,14,69,46,9,83,3,20,41,24,87,93,33,88,8,3,58,42,19,19,77,98,16,73,1,23,42,84,2,85,8,36,22,56,96,97,7,5,96,5,5,51,6,73,12,87,58,72,40,82,27,80,71,74,36,31,55,17,38,55,91,73,37,60,6,89,42,84,77,31,92,23,16,98,16,62,43,31,25,67,38,4,77,31,73,46,60,14,78,4,25,42,28,81,41,80,29,18,59,53,16,88,14,99,61,45,15,80,50,24,77,78,91,73,22,3,76,23,95,72,15,73,26,87,97,74,85,94,86,11,47,87,59,44,27,40,30,63,85,61,86,65,49,22,68,19,65,26,160
290469,56,66,79,78,49,34,34,70,15,74,0,33,7,75,20,21,81,73,44,80,11,46,8,64,71,94,48,68,76,97,44,26,12,80,5,73,39,8,1,79,89,31,93,59,90,77,25,28,58,22,98,50,3,45,54,74,53,40,17,15,10,29,5,28,63,77,75,3,89,16,11,22,30,12,61,72,90,93,70,56,71,43,6,29,56,57,95,38,39,55,29,23,15,74,28,55,51,96,55,62,35,34,20,28,13,24,18,82,85,97,80,17,60,91,55,78,97,97,46,92,68,41,78,40,33,49,30,33,49,80,14,47,72,56,41,27,9,26,93,31,65,93,22,31,95,81,19,37,56,78,88,10,20,38,57,79,1,28,34,80,16,47,11,6,90,44,15,91,6,95,77,48,73,81,63,29,88,39,26,61,99,47,31,29,26,9,96,0,8,22,15,94,46,22,76,77,74,32,71,49,24,73,52,66,43,29,45,5,15,73,8,3,55,42,54,4,3,70,87,54,16,23,2,18,39,19,65,49,21,86,76,65,12,79,55,25,61,68,98,15,56,86,35,34,54,76,17,13,50,99,93,99,5,23,62,91,30,60,62,20,42,44,13,60,43,78,76,74,28,29,45,56,62,57,34,29,60,88,48,34,44,56,14,43,99,99,39,8,93,30,16,98,91,42,75,55,92,96,28,29,19,75,61,91,11,86,55,74,48,16,11,15,33,40,3,90,88,52,78,41,92,99,19,17,6,54,62,51,12,74,5,82,77,64,8,3,95,59,11,51,13,7,82,15,86,62,23,76,64,57,37,45,66,98,57,32,77,30,70,11,50,78,37,39,87,24,42,57,80,78,48,95,1,36,89,87,0,24,15,4,51,36,19,87,28,26,48,27,62,11,5,11,64,60,26,43,27,36,44,62,12,40,70,86,52,16,22,83,98,18,5,17,60,80,16,96,16,1,23,80,27,72,52,99,56,44,21,53,46,32,9,95,55,99,97,64,80,63,17,96,46,84,3,34,61,97,21,97,92,48,27,44,89,95,71,20,4,21,37,3,15,32,20,14,41,53,67,94,75,88,31,88,40,55,78,85,64,25,69,26,82,21,31,22,29,13,18,87,5,35,93,18,9,68,13,51,89,45,63,83,5148


In [13]:
dff = vaex_df[vaex_df.col2 > 70]  # Filtering is lazy: it does not make a memory copy

In [14]:
### All the algorithms work out of core; the limit is the size of your hard drive
dff.col2.minmax(progress='widget')

array([71, 99])
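
To make the "out of core" idea concrete, here is a minimal NumPy sketch of a filtered min/max computed chunk by chunk over a memory-mapped file. It illustrates the general technique only and is not Vaex's implementation. The helper `filtered_minmax`, the file name `col2.bin`, and the synthetic data are all assumptions made for this example.

```python
import os
import tempfile

import numpy as np


def filtered_minmax(path, n_rows, threshold, chunk_size=100_000):
    """Return [min, max] of the values > threshold, or None if nothing matches.

    Hypothetical helper: the file is memory-mapped and scanned chunk by chunk,
    so only one chunk's selection is materialized in RAM at a time.
    """
    data = np.memmap(path, dtype=np.int64, mode="r", shape=(n_rows,))
    lo, hi = None, None
    for start in range(0, n_rows, chunk_size):
        chunk = data[start:start + chunk_size]
        selected = chunk[chunk > threshold]  # small per-chunk copy only
        if selected.size == 0:
            continue
        c_lo, c_hi = int(selected.min()), int(selected.max())
        lo = c_lo if lo is None else min(lo, c_lo)
        hi = c_hi if hi is None else max(hi, c_hi)
    del data  # release the memory map before the file is removed
    return None if lo is None else np.array([lo, hi])


# Synthetic stand-in for the notebook's col2 (random integers in [0, 100)).
rng = np.random.default_rng(0)
values = rng.integers(0, 100, size=1_000_000).astype(np.int64)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "col2.bin")
    values.tofile(path)
    result = filtered_minmax(path, values.size, threshold=70)

print(result)
```

The full column is never loaded at once, which is why the data size is bounded by disk rather than RAM in this style of computation.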

####**Fast groupby / aggregations**
Vaex implements parallelized, high-performance groupby operations, which are especially fast when grouping on categorical columns (more than 1 billion rows per second).

In [15]:
%%time
vaex_df_group = vaex_df.groupby(vaex_df.col1, agg=vaex.agg.mean(vaex_df.col4))
vaex_df_group

CPU times: user 32.1 ms, sys: 8.68 ms, total: 40.8 ms
Wall time: 322 ms


#,col1,col4_mean
0,97,49.744904017415394
1,64,48.945076892350706
2,24,49.818863772754554
3,19,49.80906441563662
4,7,49.820070600100856
...,...,...
95,20,49.930369044066445
96,46,49.525142457262824
97,55,49.417780477926286
98,98,49.51452282157676


In [16]:
%%time
vaex_df.groupby(vaex_df.col1, agg='count')

CPU times: user 15.8 ms, sys: 0 ns, total: 15.8 ms
Wall time: 17.4 ms


#,col1,count
0,97,10106
1,64,10014
2,24,9998
3,19,9951
4,7,9915
...,...,...
95,20,10053
96,46,10003
97,55,9876
98,98,9881
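
The two aggregations above (mean of `col4` per `col1` value, and row count per `col1` value) can be sketched in plain NumPy as a way to see what a groupby on a small integer key computes. This is an illustrative sketch on synthetic data with the same value range as the notebook. It is not how Vaex performs the aggregation internally, and the array names are assumptions made for this example.

```python
import numpy as np

# Synthetic stand-ins for the notebook's col1 and col4 (integers in [0, 100)).
rng = np.random.default_rng(42)
n_rows = 1_000_000
n_groups = 100
col1 = rng.integers(0, n_groups, size=n_rows)
col4 = rng.integers(0, n_groups, size=n_rows)

# Count per group: one bin per distinct key value.
counts = np.bincount(col1, minlength=n_groups)

# Sum of col4 per group, then mean = sum / count.
sums = np.bincount(col1, weights=col4, minlength=n_groups)
with np.errstate(invalid="ignore", divide="ignore"):
    means = sums / counts  # NaN for any key that never occurs

# Show the first few groups as (key, count, mean).
for key in range(5):
    print(key, counts[key], means[key])
```

With uniformly random data, each of the 100 keys should hold roughly 10,000 rows and a mean near 49.5, which is consistent with the counts and means in the Vaex output above.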
