Skip to content

boufleur/pydeltalake

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyDeltaLake

Native Python Delta Lake implementation, based on Delta Protocol.

Installation

pip install pydeltalake

Usage

Resolve partitions for current version of the DeltaTable:

>>> from pydeltalake.lib import DeltaLake
>>> dl = DeltaLake('test/data/delta-0.2.0')
>>> dl.files()
{'part-00000-cb6b150b-30b8-4662-ad28-ff32ddab96d2-c000.snappy.parquet', 'part-00000-7c2deba3-1994-4fb8-bc07-d46c948aa415-c000.snappy.parquet', 'part-00001-c373a5bd-85f0-4758-815e-7eb62007a15c-c000.snappy.parquet'}

Time travel:

>>> from deltalake import DeltaTable
>>> from datetime import datetime
>>> timestamp = 1564524298
>>> time_travel = datetime.fromtimestamp(timestamp) # datetime.datetime(2019, 7, 31, 0, 4, 58)
>>> dl = DeltaLake('test/data/delta-0.2.0', time_travel=time_travel)
>>> dl.files()
{'part-00001-4327c977-2734-4477-9507-7ccf67924649-c000.snappy.parquet', 'part-00000-b44fcdb0-8b06-4f3a-8606-f8311a96f6dc-c000.snappy.parquet', 'part-00001-c373a5bd-85f0-4758-815e-7eb62007a15c-c000.snappy.parquet', 'part-00001-185eca06-e017-4dea-ae49-fc48b973e37e-c000.snappy.parquet', 'part-00000-7c2deba3-1994-4fb8-bc07-d46c948aa415-c000.snappy.parquet', 'part-00000-512e1537-8aaa-4193-b8b4-bef3de0de409-c000.snappy.parquet'}

About

A Python library to interface with Delta Lake

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages