[Feature]: Support Double, Float16 and BF16 vectors #22837

xiaofan-luan · 2023-03-19T01:30:55Z

Is there an existing issue for this?

I have searched the existing issues

Is your feature request related to a problem? Please describe.

There are many different vector types based on models.
So far what we received most is double, float16, BF16, double and BF16 is on top priority.
Anyone interested on it please help

Describe the solution you'd like.

No response

Describe an alternate solution.

No response

Anything else? (Additional Context)

No response

jon-chuang · 2023-04-01T16:04:47Z

/assign

jon-chuang · 2023-04-01T16:07:53Z

@xiaofan-luan could I ask if it is wrong to convert the embedding to a float32, which I think has better numerical performance on most CPU unless hardware support exists?

Or, is the purpose of this issue to support storage of such formats assuming that the compute nodes have the correct compute (e.g. GPU or the right Xeon chipset) to handle operations in those datatypes?

If so, do we need to implement fallback by e.g. emulation or casting when the appropriate compute support is missing? Pytorch handles by autocasting.

jon-chuang · 2023-04-01T16:10:37Z

Duplicate: #22132

jon-chuang · 2023-04-01T16:46:15Z

btw, bfloat16 does not exist on faiss: https://github.com/facebookresearch/faiss/wiki/How-to-make-Faiss-run-faster, and I believe not in Annoy or HNSWLib either

But it supports float16 and we can compile it back in: #2828

jiaoew1991 · 2023-04-02T00:48:49Z

Welcome @jon-chuang , You can implement float16 first, we can discuss about bf16 later. 😄

jiaoew1991 · 2023-07-18T01:15:59Z

/unassign @jon-chuang

jiaoew1991 · 2023-07-18T01:18:35Z

We can break down the steps into the following:

segcore supports brute-force search for float16.
Go distributed layer supports float16.
Knowhere supports float16 type indexing.
SDKs support float16.

…e as BinaryVector (milvus-io#33760) Issue: milvus-io#22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

issue #22837 pr: #33750 Signed-off-by: chasingegg <chao.gao@zilliz.com>

issue:#22837 related:#33653 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

…e as BinaryVector (#33760) (#33788) pr: #33760 Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

…pe (milvus-io#33625) issue: milvus-io#22837 pr: milvus-io#33624 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>

) issue: #22837 related: #33880 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

issue: #22837 pr: #33868 - opensource autoindex support - metric type check for different data types - autoindex data type for search param Signed-off-by: chasingegg <chao.gao@zilliz.com>

issue: #22837 pr: #33868 Use primitive type instead of proto enum type for queryHook to recognize Signed-off-by: chasingegg <chao.gao@zilliz.com>

issue: #22837 contain #33625 #33867 #33911 which already merged to 2.4 branch Signed-off-by: chasingegg <chao.gao@zilliz.com> Co-authored-by: foxspy <xianliang.li@zilliz.com>

issue: #22837 - fix byte size wrong for binary vectors - fix the expect/actual error msg Signed-off-by: chasingegg <chao.gao@zilliz.com>

issue: #22837 related: #33878 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

…move some cgo call (#34102) issue: #22837 related pr: #34104 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

…in pkg (#34104) issue: #22837 related pr: #34102 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

…-io#33377) related milvus-io#22837 Signed-off-by: chasingegg <chao.gao@zilliz.com>

issue:milvus-io#22837 related milvus-io#33575 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

…e as BinaryVector (milvus-io#33760) Issue: milvus-io#22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

issue:milvus-io#22837 related:milvus-io#33653 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

issue: milvus-io#22837 contain milvus-io#33625 milvus-io#33867 milvus-io#33911 which already merged to 2.4 branch Signed-off-by: chasingegg <chao.gao@zilliz.com> Co-authored-by: foxspy <xianliang.li@zilliz.com>

issue: milvus-io#22837 - fix byte size wrong for binary vectors - fix the expect/actual error msg Signed-off-by: chasingegg <chao.gao@zilliz.com>

issue: milvus-io#22837 related: milvus-io#33878 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

…move some cgo call (milvus-io#34102) issue: milvus-io#22837 related pr: milvus-io#34104 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

xiaofan-luan added the kind/feature Issues related to feature request from users label Mar 19, 2023

xiaofan-luan added this to the 2.0-Backlog milestone Mar 19, 2023

jiaoew1991 added the good first issue Good for newcomers label Mar 19, 2023

sre-ci-robot assigned jon-chuang Apr 1, 2023

Writer-X mentioned this issue Jul 17, 2023

Add float16 vector for milvus #25647

Closed

Writer-X mentioned this issue Jul 17, 2023

Add float16 vector #25657

Closed

Writer-X mentioned this issue Jul 17, 2023

Add float16 vector #25673

Closed

sre-ci-robot unassigned jon-chuang Jul 18, 2023

Writer-X mentioned this issue Jul 21, 2023

Add float16 vector #25816

Closed

Writer-X mentioned this issue Jul 24, 2023

Add float16 vector #25852

Merged

cydrain added a commit to cydrain/milvus that referenced this issue Jun 12, 2024

enhance: Handle Float16Vector/BFloat16Vector numpy bulk insert as sam…

e5669f1

…e as BinaryVector (milvus-io#33760) Issue: milvus-io#22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

cydrain mentioned this issue Jun 12, 2024

enhance: Handle Float16Vector/BFloat16Vector numpy bulk insert as same as BinaryVector (#33760) #33788

Merged

sre-ci-robot pushed a commit that referenced this issue Jun 12, 2024

fix: [2.4] fix binary vector data size (#33751)

7ef2892

issue #22837 pr: #33750 Signed-off-by: chasingegg <chao.gao@zilliz.com>

sre-ci-robot pushed a commit that referenced this issue Jun 12, 2024

enhance: proxy check hnsw with sparse is legal (#33697)

be3559e

issue:#22837 related:#33653 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

sre-ci-robot pushed a commit that referenced this issue Jun 13, 2024

enhance: Handle Float16Vector/BFloat16Vector numpy bulk insert as sam…

ebd0af1

…e as BinaryVector (#33760) (#33788) pr: #33760 Issue: #22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

chasingegg mentioned this issue Jun 14, 2024

enhance: [2.4] autoindex for multi data type #33867

Merged

chasingegg pushed a commit to chasingegg/milvus that referenced this issue Jun 14, 2024

enhance: [cherry-pick] add autoindex mapping for binary/sparse dataty…

ae89971

…pe (milvus-io#33625) issue: milvus-io#22837 pr: milvus-io#33624 Signed-off-by: xianliang.li <xianliang.li@zilliz.com>

chasingegg mentioned this issue Jun 14, 2024

enhance: autoindex for multi data type #33868

Merged

This was referenced Jun 14, 2024

enhance: [cherry-pick]check index with data type in knowhere api #33878

Merged

enhance: check index with data type #33880

Merged

sre-ci-robot pushed a commit that referenced this issue Jun 14, 2024

enhance: [cherry-pick]check index with data type in knowhere api (#33878

5623696

) issue: #22837 related: #33880 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

chasingegg mentioned this issue Jun 17, 2024

enhance: Use primitive type for vectorType #33911

Merged

sre-ci-robot pushed a commit that referenced this issue Jun 17, 2024

enhance: Use primitive type for vectorType (#33911)

08c096c

issue: #22837 pr: #33868 Use primitive type instead of proto enum type for queryHook to recognize Signed-off-by: chasingegg <chao.gao@zilliz.com>

sre-ci-robot pushed a commit that referenced this issue Jun 18, 2024

fix: fix binary vector data size (#33750)

0d20303

issue: #22837 - fix byte size wrong for binary vectors - fix the expect/actual error msg Signed-off-by: chasingegg <chao.gao@zilliz.com>

sre-ci-robot pushed a commit that referenced this issue Jun 19, 2024

enhance: check index with data type (#33880)

298e50b

issue: #22837 related: #33878 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

This was referenced Jun 24, 2024

enhance: remove CheckVecIndexWithDataTypeExist function in pkg and remove some cgo call #34102

Merged

enhance: [cherry-pick]remove CheckVecIndexWithDataTypeExist function in pkg #34104

Merged

enhance: update new mmap config parmeters version #34143

Merged

jaime0815 pushed a commit that referenced this issue Jun 26, 2024

enhance: remove CheckVecIndexWithDataTypeExist function in pkg and re…

51ebe95

…move some cgo call (#34102) issue: #22837 related pr: #34104 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

jaime0815 pushed a commit that referenced this issue Jun 26, 2024

enhance: [cherry-pick]remove CheckVecIndexWithDataTypeExist function …

0bd93de

…in pkg (#34104) issue: #22837 related pr: #34102 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

yellow-shine pushed a commit to yellow-shine/milvus that referenced this issue Jul 2, 2024

fix: correct get vector data size for bf16/fp16/binary vector (milvus…

58def0c

…-io#33377) related milvus-io#22837 Signed-off-by: chasingegg <chao.gao@zilliz.com>

yellow-shine pushed a commit to yellow-shine/milvus that referenced this issue Jul 2, 2024

enhance: disk index support binary vector (milvus-io#33631)

677eafb

issue:milvus-io#22837 related milvus-io#33575 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

yellow-shine pushed a commit to yellow-shine/milvus that referenced this issue Jul 2, 2024

enhance: Handle Float16Vector/BFloat16Vector numpy bulk insert as sam…

6113a8a

…e as BinaryVector (milvus-io#33760) Issue: milvus-io#22837 Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

yellow-shine pushed a commit to yellow-shine/milvus that referenced this issue Jul 2, 2024

enhance: proxy check hnsw with sparse is legal (milvus-io#33697)

7047aa7

issue:milvus-io#22837 related:milvus-io#33653 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

yellow-shine pushed a commit to yellow-shine/milvus that referenced this issue Jul 2, 2024

enhance: check index with data type (milvus-io#33880)

671a1c2

issue: milvus-io#22837 related: milvus-io#33878 Signed-off-by: cqy123456 <qianya.cheng@zilliz.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Support Double, Float16 and BF16 vectors #22837

[Feature]: Support Double, Float16 and BF16 vectors #22837

xiaofan-luan commented Mar 19, 2023

jon-chuang commented Apr 1, 2023

jon-chuang commented Apr 1, 2023 •

edited

Loading

jon-chuang commented Apr 1, 2023

jon-chuang commented Apr 1, 2023 •

edited

Loading

jiaoew1991 commented Apr 2, 2023

jiaoew1991 commented Jul 18, 2023

jiaoew1991 commented Jul 18, 2023

[Feature]: Support Double, Float16 and BF16 vectors #22837

[Feature]: Support Double, Float16 and BF16 vectors #22837

Comments

xiaofan-luan commented Mar 19, 2023

Is there an existing issue for this?

Is your feature request related to a problem? Please describe.

Describe the solution you'd like.

Describe an alternate solution.

Anything else? (Additional Context)

jon-chuang commented Apr 1, 2023

jon-chuang commented Apr 1, 2023 • edited Loading

jon-chuang commented Apr 1, 2023

jon-chuang commented Apr 1, 2023 • edited Loading

jiaoew1991 commented Apr 2, 2023

jiaoew1991 commented Jul 18, 2023

jiaoew1991 commented Jul 18, 2023

jon-chuang commented Apr 1, 2023 •

edited

Loading

jon-chuang commented Apr 1, 2023 •

edited

Loading