Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add emb module #249

Merged
merged 7 commits into from
Jun 27, 2022
Merged

Add emb module #249

merged 7 commits into from
Jun 27, 2022

Conversation

seyuboglu
Copy link
Collaborator

No description provided.

@seyuboglu seyuboglu merged commit 220cc56 into dev Jun 27, 2022
seyuboglu added a commit that referenced this pull request Jul 22, 2022
* delete nn

* Add support for loading train and test set in cifar10" (#193)

* Fix issue where tensor columns can't be indexed with pandas series (#195)

* Update cifar10 to support test set too (#196)

* Fix bacckwards compat issue with base_dir and gcs_image_column (#197)

* Support backwards compatibility with nn (#198)

* Bump version (#199)

* Update contributing to support new dev main structure (#203)

* Add args, kwargs to ColumnIOMixin._read_data (#204)

Co-authored-by: Jesse Vig <45317205+jessevig@users.noreply.github.com>

* Fix from_huggingface and add tests (#205)

closes #201

* allow_pickle=true when loading numpy block (#206)

* Add downloader to ImageColumn (#207)

* Remove default addition of index (#208)

* Remove default addition of index

* Fix provenance tests

* Add DEW contrib to registry (#209)

* Catch ConnectionResetError (#210)

* Add inaturalist to contrib (#211)

* Add inaturalist to contrib

* Add annotations to intarualist

* Fix issue where arraycolumns can't be saved with jsonlines (#214)

* Update the docs and add user guide. (#215)

* Add contrib for enron (#217)

* Fix PIL attribute error on list column representation (#218)

* mmap path bug fix (#219)

* Downgrade pytorch dependency bound (#220)

* Fix issue with subclassing datapanel _state_keys (#224)

* Use multiple slices instead of pa.Table.take in ArrowBlock (#226)

* Fix issue where boolean list can't index (#227)

* Add support for AudioColumn (#222)

* Add waterbirds (#228)

* Add use guide to indexing and stubs for remaining sections (#225)

* Docs/build fix (#230)

* Bump version (#231)

* Audioset DataPanel (#229)

* Add the audioset dataset

* Add AudioColumn to audioset datapanel

* Fix issue where old datapanels didn't have formatter state (#233)

* Make audioset datapanels relational (#235)

* Add coco, mir, and pascal (#239)

* Make write only write columns in datapanel (#240)

* Enforce contiguous index in pandas columns (#244)

* Fix issue where ray pickle fails on lazy loader (#245)

* Add support for groupby operation

* Reorganize the implementation of datasets (#246)

* Add support for persistent configuration (#247)

* Implement sort for data panel and columns (#237)

* Add emb module (#249)

* clusterby stuff

* Add clusterby

* clusterby stuff

* Add clusterby

* Add embed op (#248)

* Autoformat

Co-authored-by: Sam Randall <1billionmore@gmail.com>

* Reorganize ops code (#250)

* Update CI to include 3.9 and 3.10 and to drop 3.7

* Add sample (#251)

* Update ci.yml

* Add several HAPI datasets  (#252)

* Update styling of docs (#253)

* Bump version (#254)

* Remove fastbpe

Co-authored-by: Karan Goel <kgoel93@gmail.com>
Co-authored-by: Karan Goel <kgoel@cs.stanford.edu>
Co-authored-by: Jesse Vig <45317205+jessevig@users.noreply.github.com>
Co-authored-by: Khaled Saab <36782882+khaledsaab@users.noreply.github.com>
Co-authored-by: Priya2698 <52657555+Priya2698@users.noreply.github.com>
Co-authored-by: sam-randall <38796503+sam-randall@users.noreply.github.com>
Co-authored-by: Hannah Kim <61199762+hannahkim24@users.noreply.github.com>
Co-authored-by: Sam Randall <1billionmore@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants