Twitter Streaming Data

Files

Start by launching the server from stream.ipynb. The port by default is 4040 but you can change it. Also you can change the Bearer_token if you want to. Run it and make sure you're server is listening to the port you choose (it shows a message).
Launch a server Elasticsearch (in a docker container or not). We assume that you'll use the port 9200, but you can choose another one.
Put the file process.ipynb in an environment where spark is installed, we used a docker container, but you can use what you want. Change the ports in the top according to the ports you chose previously (STREAM_PORT, ELASTIC_PORT). No need to change them if you used the default ones.
In the same place, change the host to localhost if you're not running the code in a docker container. if you're, keep it at host.docker.internal
You can now run the code of process.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
process.ipynb		process.ipynb
spark.pdf		spark.pdf
stream.ipynb		stream.ipynb