[Feature] support function hll_from_base64 #31320
Comments
Could you give us more info? For example, how does this function behave in other OLAP engines, MySQL, or other systems?
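Our current scenario: before importing into Doris, we pre-aggregate the data by different dimensions with Spark or Flink to reduce the data volume. Our metrics also rely on approximate computation to calculate UV values. Right now the only option for this scenario is to import the detail data directly into Doris, which puts very heavy pressure on Doris.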
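To illustrate why pre-aggregated UV needs a mergeable sketch rather than a plain count, here is a toy sketch in Python. Plain sets stand in for HLL sketches (an assumption for illustration only; a real HLL is approximate and fixed-size), and the rows and dimension names are hypothetical.

```python
from collections import defaultdict

# Hypothetical detail rows: (day, city, user_id)
rows = [
    ("2024-01-01", "beijing", "u1"),
    ("2024-01-01", "beijing", "u2"),
    ("2024-01-01", "shanghai", "u2"),
    ("2024-01-01", "shanghai", "u3"),
]

def pre_aggregate(rows):
    """Group users per (day, city); each set stands in for one HLL sketch."""
    groups = defaultdict(set)
    for day, city, user in rows:
        groups[(day, city)].add(user)
    return dict(groups)

def rollup_by_day(groups):
    """Merge the per-(day, city) sketches into per-day sketches via set union."""
    merged = defaultdict(set)
    for (day, _city), users in groups.items():
        merged[day] |= users
    return dict(merged)

groups = pre_aggregate(rows)

# Adding up per-group distinct counts double-counts users seen in several groups.
sum_of_counts = sum(len(users) for users in groups.values())

# Merging the sketches first and counting afterwards gives the correct daily UV.
daily_uv = {day: len(users) for day, users in rollup_by_day(groups).items()}
```

Here `u2` appears in both cities, so the summed per-city counts overstate the daily UV, while the merged sketch does not. That is why the pre-aggregated result has to be shipped as a serialized sketch instead of a number.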
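Doris also supports UDFs. Have you tried that?
The supported types listed in the official documentation don't include a binary type, so I assume that isn't supported?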
I can't find this function in any other OLAP engine. I need to convert a base64 string (from a Spark/Flink aggregation result) into an HLL binary. We found that Doris supports the function bitmap_from_base64, which converts a base64 string into a bitmap binary, and that looks similar to what we are looking for.
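For context, the base64 step itself is only a text encoding of the serialized sketch bytes. A minimal Python sketch of that round trip follows; the byte values are hypothetical placeholders and do not represent Doris's actual HLL serialization format, and the helper names are made up for illustration.

```python
import base64

# Hypothetical placeholder bytes; NOT Doris's actual HLL serialization format.
sketch_bytes = bytes([0x01, 0x02, 0xFF, 0x00, 0x10])

def to_base64_text(raw: bytes) -> str:
    """Encode serialized sketch bytes as ASCII text, e.g. for a CSV or JSON field."""
    return base64.b64encode(raw).decode("ascii")

def from_base64_text(text: str) -> bytes:
    """Decode the text back to bytes, rejecting characters outside the base64 alphabet."""
    return base64.b64decode(text, validate=True)

encoded = to_base64_text(sketch_bytes)
decoded = from_base64_text(encoded)
```

The decoded bytes are only usable on the receiving side if both systems agree on the sketch's binary layout, which is a separate concern from the base64 encoding.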
I have a similar issue: we want to import data from Apache Druid, which stores HLL and Theta sketch columns as base64-encoded strings.
OK, I will do it. Please assign this to me.
@morningman Can you assign this issue to him?
In addition, after PR #33896 you can use our HLL UDFs, including to_hll, in Hive/Spark to load data into Doris.
When I use hll_from_base64 in Stream Load, I get an error: reason: [ANALYSIS_ERROR]TStatus: errCode = 2, detailMessage = HLL column must use hll_hash function, like online_cnt=hll_hash(xxx) or online_cnt=hll_empty()
Search before asking
Description
Doris has the bitmap function bitmap_from_base64. Could HLL get an equivalent function, hll_from_base64?
Use case
No response
Related issues
No response
Are you willing to submit PR?
Code of Conduct