Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[问题]如果Source是数据库,像MSSQL、MySql,怎么去处理数据的读取 #2

Closed
Cloud33 opened this issue Dec 9, 2020 · 6 comments

Comments

@Cloud33
Copy link

Cloud33 commented Dec 9, 2020

如果Source是数据库,像MSSQL、MySql,怎么去处理数据的读取,让CSharpFlink持续的去处理业务数据,并能保证业务数据不被重复处理呢?

常用的场景:
1.计算总销售金额;
2.计算报表数据

是否能提供一下思路或CSharpFlink最佳实践

谢谢。

@wxzz
Copy link
Owner

wxzz commented Dec 9, 2020 via email

@Cloud33
Copy link
Author

Cloud33 commented Dec 9, 2020

明白,有一个难点在于,我怎么去订阅数据?因为不能去重复处理数据,所以我要知道我处理在哪个位置,然后从这个位置开始拉取数据继续处理。

我看flink方案,好像是把数据同步到Kafka中,然后flink订阅Kafka进行处理。

有没有更好的方式?

其实核心问题就是,怎么流式的去处理数据,不依赖过多的中间件

@wxzz
Copy link
Owner

wxzz commented Dec 9, 2020 via email

@Cloud33
Copy link
Author

Cloud33 commented Dec 11, 2020

没有看到相关代码

@wxzz
Copy link
Owner

wxzz commented Dec 14, 2020

没有看到相关代码

重复数据的处理在WindowsTask的AddMetaData中。

@Cloud33
Copy link
Author

Cloud33 commented Dec 22, 2020

ok

@Cloud33 Cloud33 closed this as completed Dec 22, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants