The data pipeline needs to be sorted out. #2

Open
hanzug opened this issue Apr 12, 2024 · 0 comments
Labels
invalid This doesn't seem right

Comments

Owner

hanzug commented Apr 12, 2024

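There is a problem with the Kafka data flow. The original plan: after the raw web page data is crawled, it is first sent to Kafka (a stream processing framework could probably be used at this step; to be investigated). MapReduce and MySQL then asynchronously read the CSV-format data from Kafka to build the index and persist it to the database, respectively.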

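However, because the crawler has not been written yet, the logic of this part of the data flow is a bit muddled and needs to be sorted out.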
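
To make the intended flow concrete, here is a minimal sketch of it, not the project's actual code. Everything in it is an assumption for illustration: the `Topic` class is an in-memory stand-in for a Kafka topic (an append-only log with one offset per consumer group), the CSV layout `doc_id,url,text` is hypothetical, and `sqlite3` stands in for MySQL so the example runs without any external service. It shows the crawler side producing CSV records and two independent consumers, one building an inverted index and one persisting rows, each reading the same records at its own pace.

```python
import csv
import io
import sqlite3
from collections import defaultdict


def encode_page(doc_id, url, text):
    """Serialize one crawled page as a CSV record (hypothetical layout: doc_id,url,text)."""
    buf = io.StringIO()
    csv.writer(buf).writerow([doc_id, url, text])
    return buf.getvalue()


def decode_page(record):
    """Parse one CSV record back into (doc_id, url, text)."""
    doc_id, url, text = next(csv.reader(io.StringIO(record)))
    return int(doc_id), url, text


class Topic:
    """In-memory stand-in for a Kafka topic: an append-only log with per-group offsets."""

    def __init__(self):
        self.log = []
        self.offsets = defaultdict(int)

    def produce(self, record):
        self.log.append(record)

    def poll(self, group):
        # Each consumer group sees every record once, independently of other groups.
        start = self.offsets[group]
        records = self.log[start:]
        self.offsets[group] = len(self.log)
        return records


def build_index(topic, index):
    """Index-building consumer: maps each whitespace-separated term to the doc ids containing it."""
    for record in topic.poll("indexer"):
        doc_id, _url, text = decode_page(record)
        for term in text.lower().split():
            index[term].add(doc_id)


def store_pages(topic, conn):
    """Persistence consumer: writes each page to the database (sqlite3 standing in for MySQL)."""
    rows = [decode_page(r) for r in topic.poll("storage")]
    conn.executemany("INSERT INTO pages (doc_id, url, text) VALUES (?, ?, ?)", rows)
    conn.commit()


# Crawler side: publish raw pages to the topic.
topic = Topic()
topic.produce(encode_page(1, "https://example.com/a", "kafka stream processing"))
topic.produce(encode_page(2, "https://example.com/b", "stream storage, mysql"))

# Consumer side: both consumers read the same records without affecting each other.
index = defaultdict(set)
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE pages (doc_id INTEGER PRIMARY KEY, url TEXT, text TEXT)")
build_index(topic, index)
store_pages(topic, conn)
```

With a real broker, the two consumers would map to two consumer groups subscribed to the same topic, which is what lets index building and persistence stay decoupled from the crawler and from each other.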

@hanzug hanzug added the invalid This doesn't seem right label Apr 12, 2024