big-data
Repositories 1,389
Good first issues
For hindi text period '.' punctuation not splitting text
help wanted (easy) feat / tokenizer#3625 opened 13 days ago by gauravgr
Convert command line on .iob Conll-2003 file not working
help wanted (easy) bug#3620 opened 15 days ago by AlexisPister
💫 Adding models for new languages master thread
help wanted (easy) enhancement#3056 opened 5 months ago by ines
Good first issues
Complete RowExpressionFormatter to print pretty plans
beginner-task planner#12714 opened 12 days ago by highker
Add sanity checker to check no expression left after optimization
beginner-task planner#12713 opened 12 days ago by highker
Backport "Create Block directly if the Data stream in ORC file is null"
beginner-task backport#12413 opened 2 months ago by wenleix
Good first issues
Easy introspection of detached parts.
easy task feature#5164 opened 4 days ago by alexey-milovidov
Add parseDateTimeBestEffortUS function.
easy task feature#5157 opened 4 days ago by alexey-milovidov
Add data checksums to system.parts_columns table.
easy task feature#5151 opened 5 days ago by alexey-milovidov
Good first issues
Cython mishandles PEP-3135 __class__ cell in methods
good first issue P: blocker#2912 opened about 1 month ago by MisterKeefe
Cythonize/Cython compilation fails to import/recognize `.pxd` from external modules that only have `__init__.so`
good first issue Build System#2886 opened 2 months ago by JonasT
Good first issues
Add an integration guideline with AWS EMR
area-docs priority-high#8758 opened 23 days ago by apc999
Good first issues
Predict on a single object with iloc
good first issue help wanted#785 opened 24 days ago by annaveronika
CatBoostRanker class in Python
good first issue help wanted#766 opened 30 days ago by annaveronika
Predict on a single object
good first issue help wanted#763 opened about 1 month ago by annaveronika
Good first issues
#3665 opened 15 days ago by kanzure
Code layout docs
docs#3647 opened 23 days ago by ysimonson
#3648 opened 23 days ago by ysimonson
Good first issues
Support feeding from Spark
good first issue enhancement#9158 opened 11 days ago by lesters
Add onear grammar value to userInput()
good first issue enhancement#7294 opened 7 months ago by dkurzaj
Write Datadog integration for Vespa metrics
good first issue enhancement#5322 opened about 1 year ago by frodelu
Good first issues
Operations with fields of type char fail in the rest api
beginner good first issue bug#2152 opened 3 days ago by m316257
Move OperationDetail and OperationField into own classes
beginner good first issue p:low#1793 opened 12 months ago by m55624