Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Arctic datastore as Zipline datasource #660

Open
femtotrader opened this issue Jul 25, 2015 · 9 comments
Open

Arctic datastore as Zipline datasource #660

femtotrader opened this issue Jul 25, 2015 · 9 comments

Comments

@femtotrader
Copy link

Hello,

this datastore may also be worth to be add to zipline
https://github.com/manahl/arctic
at least it should be discussed here

Kind regards

See for arctic side man-group/arctic#19

@femtotrader femtotrader changed the title Arctic Datastore Arctic datastore as Zipline datasource Jul 25, 2015
@twiecki
Copy link
Contributor

twiecki commented Jul 25, 2015

I wonder if perhaps this should be linked to odo instead where it will be available to zipline if #659 gets implemented.

@femtotrader
Copy link
Author

to my understanding that's quite different because with Blaze (and odo) you won't have a Numpy arrays like in this datastore but you are yieding values using a generator but I also understand your pov.

@twiecki
Copy link
Contributor

twiecki commented Jul 25, 2015

I think odo handles streaming data sources too. @mrocklin?

@llllllllll
Copy link
Contributor

odo supports streaming data sources with chunks

@femtotrader
Copy link
Author

see blaze/blaze#1185

@llllllllll
Copy link
Contributor

Sorry, what am I supposed to be seeing in that issue?

@mrocklin
Copy link

Joe is correct that the right approach here is chunks. The brief chunks documentation is here: http://odo.pydata.org/en/latest/type-modifiers.html#chunks

@femtotrader
Copy link
Author

@llllllllll you are supposed to see your own post

for i, chunk in enumerate(odo(dat, chunks(pd.DataFrame), chunksize=10)):
    print("chunk %d" % (i + 1))
    for j, row in chunk.iterrows():
        print(row)

and you are right Blaze wasn't used in this simple example but I wonder if it will be the case if you want to align data.

@llllllllll
Copy link
Contributor

Oh, were you just showing the iterable? I wasn't sure if you were responding to my comment there

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants