Release date: April 29, 2021
- Added/updated/fixed regressions for MS MARCO doc ranking, TREC 2019 DL, and TREC 2020 DL.
- Added regressions for TREC 2020 background linking.
- Added support for C4 Corpus.
- Added ability to index and search pre-tokenized documents.
- Improved end-to-end test harness.
- Cleaned up LTR code (renamed features), improved documentation.
- Implemented
getDocumentTokens
inIndexReaderUtils
- Removed code related to knowledge graphs.
- Refactored
TopicReader
, improved building ofTOPIC_FILE_TO_TYPE
mapping. - Fixed bug in parsing of multi-line TREC topics (impact on regressions for Disks 1 & 2).
- Fixed bug where
DocumentCollection
was not following symlinks properly.
Sorted by number of commits:
- Jimmy Lin (lintool)
- Stephanie Hu (stephaniewhoo)
- Arthur Chen (ArthurChen189)
- Chris Kamphuis (Chriskamphuis)
- Ronak Pradeep (ronakice)
- Sailesh Nankani (saileshnankani)
- Stephen Green (eelstretching)
- Calvin Wang (printfCalvin)
- Shane Ding (shaneding)
All contributors with more than one commit, sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Peilin Yang (Peilin-Yang)
- Ryan Clancy (r-clancy)
- Ahmet Arslan (iorixxx)
- Edwin Zhang (edwinzhng)
- Rodrigo Nogueira (rodrigonogueira4)
- Emily Wang (emmileaf)
- Royal Sequiera (rosequ)
- Chris Kamphuis (Chriskamphuis)
- Victor Yang (Victor0118)
- Boris Lin (borislin)
- Tommaso Teofili (tteofili)
- Nikhil Gupta (nikhilro)
- Stephanie Hu (stephaniewhoo)
- Yuhao Xie (Kytabyte)
- Shane Ding (shaneding)
- Xueguang Ma (MXueguang)
- Kuang Lu (lukuang)
- Adam Yang (adamyy)
- Luchen Tan (LuchenTan)
- Xinyu Mavis Liu (x389liu)
- Ronak Pradeep (ronakice)
- Salman Mohammed (salman1993)
- Zhiying Jiang (bazingagin)
- Johnson Han (x65han)
- Hang Cui (HangCui0510)
- Yuqi Liu (yuki617)
- Dayang Shi (dyshi)
- Michael Tu (tuzhucheng)
- Peng Shi (Impavidity)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Xin Qian (xeniaqian94)
- Joel Mackenzie (JMMackenzie)
- Estella Liu (estella98)
- Pepijn Boers (PepijnBoers)
- Justin Borromeo (justinborromeo)
- Kelvin Jiang (kelvin-jiang)
- Yuxin (Vicky) Zhu (yxzhu16)
- Lizzy Zhang (LizzyZhang-tutu)
- Adam Roegiest (aroegies)
- Weihua Li (w329li)
- Yue Zhang (nsndimt)
- Julie Tibshirani (jtibshirani)
- Toke Eskildsen (tokee)
- Zhaohao Zeng (matthew-z)
- Xing Niu (xingniu)
- Alex Dou (YimingDou)
- Adrien Grand (jpountz)
- Mengfei Liu (meng-f)
- Mina Farid (minafarid)
- Adrien Pouyet (Ricocotam)
- Edward Lu (edwardhdlu)
- Gaurav Baruah (gauravbaruah)
- Mustafa Abualsaud (ammsa)
- Jiarui Zhang (jrzhang12)
- Stephen Green (eelstretching)
- Maik Fröbe (mam10eks)