-
Notifications
You must be signed in to change notification settings - Fork 114
Issues: IBM/data-prep-kit
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] Single Pypi package for python, ray and spark and single pypi package for all transforms
enhancement
New feature or request
#673
opened Oct 5, 2024 by
touma-I
2 tasks done
Add reference in notebooks to specific version of data-prep-toolkit
#669
opened Oct 4, 2024 by
touma-I
[Bug] Semantic sort for repos doesn't seem to do as described
bug
Something isn't working
#668
opened Oct 4, 2024 by
MisterKloudy
1 of 2 tasks
[Bug] pdf2parquet ray version erroring out when downloading models for the very first time
bug
Something isn't working
#667
opened Oct 4, 2024 by
sujee
1 of 2 tasks
[Bug] PII refactor test image and test src fails
bug
Something isn't working
#665
opened Oct 3, 2024 by
revit13
1 of 2 tasks
[Bug] unable to install release 0.2.1 on windows
bug
Something isn't working
#644
opened Sep 30, 2024 by
sujee
1 of 2 tasks
[Feature] Extend document chunker transform to support fixed-size token window chunker with overlap
enhancement
New feature or request
#641
opened Sep 29, 2024 by
juancappi
2 tasks done
[Feature] Relax release rules so MICRO version (0.0.X) can be increased for transforms inpdendent of library/runtime
enhancement
New feature or request
#639
opened Sep 29, 2024 by
touma-I
2 tasks done
[Feature] Update RAG examples for release 0.2.1
enhancement
New feature or request
#636
opened Sep 27, 2024 by
sujee
2 tasks done
Create the Ray version of the html2parquet transform
current-priority
enhancement
New feature or request
#635
opened Sep 27, 2024 by
shahrokhDaijavad
1 of 2 tasks
[Feature] Extend DPK capabilities to cover the full life cycle for data acquisition and data processing
enhancement
New feature or request
#633
opened Sep 27, 2024 by
touma-I
2 tasks done
[Feature] separate examples into another repo
enhancement
New feature or request
#628
opened Sep 25, 2024 by
sujee
1 of 2 tasks
[Bug] Fail to build transform spark image when using a release tag x.y.z (without any .devN suffix)
bug
Something isn't working
#625
opened Sep 25, 2024 by
touma-I
1 of 2 tasks
Support for Python 3.12
current-priority
enhancement
New feature or request
#618
opened Sep 24, 2024 by
shahrokhDaijavad
1 of 2 tasks
[Feature] Capability to specify the paths where multiple output tables will be saved
enhancement
New feature or request
#609
opened Sep 21, 2024 by
cmadam
2 tasks done
[Feature] Capability to distribute during initialization to a large binary object (e.g.a table) to all the transform instances
enhancement
New feature or request
#608
opened Sep 21, 2024 by
cmadam
2 tasks done
[Bug] header_cleanser intermittently failing ci/cd when building python venv
bug
Something isn't working
#607
opened Sep 20, 2024 by
daw3rd
1 of 2 tasks
[Feature] Base spark image build is very slow and impacting ci/cd
enhancement
New feature or request
#606
opened Sep 20, 2024 by
daw3rd
1 of 2 tasks
[Bug] pdf2parquet must calculate hash and size on the file
bug
Something isn't working
#605
opened Sep 20, 2024 by
sujee
1 of 2 tasks
[Feature] Enable pure python transforms in new spark runtime.
enhancement
New feature or request
#586
opened Sep 12, 2024 by
daw3rd
1 of 17 tasks
[Bug] Testing Rag notebook with latest release of pdf2Parquet, eDedup and DocID
bug
Something isn't working
#583
opened Sep 10, 2024 by
touma-I
1 of 2 tasks
[Bug] issues running ray transformations on Google colab
bug
Something isn't working
#582
opened Sep 10, 2024 by
sujee
1 of 2 tasks
[Feature] Need better documentation of fuzzy dedupe
enhancement
New feature or request
#578
opened Sep 6, 2024 by
sujee
2 tasks done
[Feature] need an example of using doc_quality plugin with installed pypi packages
enhancement
New feature or request
#575
opened Sep 6, 2024 by
sujee
1 of 2 tasks
[Bug] Intermittent doc_id test-src failures in ci/cd.
bug
Something isn't working
#574
opened Sep 5, 2024 by
daw3rd
2 tasks done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.