Read data from dataframe #69

manycoding · 2019-04-17T14:46:30Z

Support at least a dataframe as input, which would allow reading the data locally from any source (CSV, JSON, whether remote or local).
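To illustrate why a dataframe works as a common input, here is a minimal sketch that loads the same records from CSV and JSON text with pandas. The sample data and variable names are made up for this example and are not part of the library.

```python
import io

import pandas as pd

# Hypothetical sample data; in practice these could be local paths or remote URLs.
csv_text = "_key,name,title\n0,apple,Fruit\n1,pear,Fruit\n"
json_text = (
    '[{"_key": 0, "name": "apple", "title": "Fruit"},'
    ' {"_key": 1, "name": "pear", "title": "Fruit"}]'
)

# Whatever the source format, the result is a dataframe with the same columns.
df_from_csv = pd.read_csv(io.StringIO(csv_text))
df_from_json = pd.read_json(io.StringIO(json_text))
```

Once the data is in a dataframe, the rest of the library would not need to know where it came from.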

Currently the library relies on a _key field to identify items in reports. So the implementation could look like:

Figure out a simple API (similar to fastai's data block API?), like:

items = Items.from_csv (items.from_job)
schema = Schema.get_schema(schema)
items.report_all(schema)

# And to keep it granular enough so it can be used in Spidermon
arche.rules.duplicates.find_by(items.df, ["name", "title"])

Add a _key column. It may be easier to make _key the index when it is present and report items by index.
_type. So far nobody has really needed _type, since filters can be used instead.
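The `_key` handling and the granular duplicates check could be sketched in plain pandas roughly as follows. `with_key_index` and `find_duplicates_by` are hypothetical helpers written for this illustration, not the library's actual API, and the key values are invented.

```python
import pandas as pd


def with_key_index(df: pd.DataFrame) -> pd.DataFrame:
    """Use _key as the index when the column exists; otherwise keep the default index."""
    if "_key" in df.columns:
        return df.set_index("_key")
    return df


def find_duplicates_by(df: pd.DataFrame, columns: list) -> pd.DataFrame:
    """Return every row whose values in `columns` occur more than once."""
    return df[df.duplicated(subset=columns, keep=False)]


# Hypothetical items; the "job/N" key format is an assumption for this example.
df = pd.DataFrame(
    {
        "_key": ["job/0", "job/1", "job/2"],
        "name": ["apple", "apple", "pear"],
        "title": ["Fruit", "Fruit", "Fruit"],
    }
)

items_df = with_key_index(df)
duplicates = find_duplicates_by(items_df, ["name", "title"])

# Duplicates are reported by index, which is _key when that column was present.
duplicate_keys = duplicates.index.tolist()
```

A dataframe without a `_key` column keeps its default integer index, so the same reporting code works in both cases.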


manycoding added the Type: Feature New feature or request label Apr 17, 2019

manycoding added this to the 0.4.0dev milestone Apr 17, 2019

manycoding mentioned this issue Apr 29, 2019

Save items data in df #75

Merged

manycoding added a commit that referenced this issue Apr 30, 2019

Accept data from dataframe, implements #69

8d5deab

manycoding modified the milestones: 0.4.0, 0.3.3 May 3, 2019

manycoding closed this as completed May 3, 2019

manycoding mentioned this issue May 10, 2019

Numpy #85

Merged

manycoding mentioned this issue Jul 1, 2019

High level API redesign #123

Open
