kafka-streams-movies-aggregator

transform normalized movies data from multiple kafka topics(send by cdc postgres) into denormalized movies data, then send it to movies-output and consumed by es connector. Then the data stored/updarted in elasticsearch index.

Prequisite

download elasticseaerch-sink-connector version 14.0.6 in https://www.confluent.io/hub/confluentinc/kafka-connect-elasticsearch copy and paste zip file into docker/kafka-connect directory
download zip file in https://drive.google.com/drive/folders/1zQD_gCFQ8yK2V-7K46a2gqxhh2gu3XDJ?usp=sharing , copy and paste all json files in root project directory
download & install apache maven https://maven.apache.org/download.cgi

run the application in docker:

    1. ./mvnw  package -DskipTests
    2. docker compose up -d

      wait for all container running & kafka-connect loaded all plugin 
    3. bash create-topics-2.sh
    4. bash kafka-connect-2.sh
    5. bash connect-health-2.sh
    6. docker-compose -f docker-compose-app.yml up -d, wait until all container up & running (5-10 minutes, due to building multistage image movie-search), building & creating container movie-service,movie-streams, and movie-search
    7. python3 insertMovieToMovieService.py, wait for 10-15 minutes, wait until "done" message printed. inserting movies data  with releases between 2020-2023 to postgresql from movie-service
    8. docker-compose -f docker-compose-stream.yml up -d, wait about 5-6 minutes for the data stream to process
    9. import postman collection file in config/movie-test-stream.postman_collection.json
   10. test the query for movies with releases between 2020-2023(ex. Oppenheimer, Dune,etc. Not all movies available)  using the movie-elasticsearch(port 8080) folder in the postman collection (dont search by query release year due to a timestamp stream conversion failure)

run the application locally:

    1. ./mvnw  package -DskipTests
    2. docker compose up -d
  
      wait for all container running & kafka-connect loaded all plugin 
    3. bash create-topics-2.sh
    4. bash kafka-connect-2.sh
    5. bash connect-health-2.sh
    6. run movie-service application
    7. import postman collection file in config/movie-test-stream.postman_collection.json
    8. run movie-streams application
    9.  go to localhost:9001 to see message in each topic
    10. test query with elasticserach index "movieswiki" in localhost:9200
    11. python3 insertMovieToMovieService.py, wait for 12 minutes. inserting movies data to postgresql
    13. test the query for movies with releases between 2020-2023(ex. Oppenheimer, Dune ,etc. Not all movies available)  using the movie-elasticsearch(port 8080) folder in the postman collection (dont search by query release year due to a timestamp stream conversion failure)

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.idea		.idea
.mvn/wrapper		.mvn/wrapper
config		config
docker/kafka-connect		docker/kafka-connect
movie-elasticsearch		movie-elasticsearch
movie-module		movie-module
movie-service		movie-service
movie-streams		movie-streams
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
check-isi-topic-2.sh		check-isi-topic-2.sh
check-isi-topics.sh		check-isi-topics.sh
connect-health-2.sh		connect-health-2.sh
connect-health.sh		connect-health.sh
create-topics-2.sh		create-topics-2.sh
create-topics.sh		create-topics.sh
delete-connector.sh		delete-connector.sh
docker-compose-app.yml		docker-compose-app.yml
docker-compose-stream.yml		docker-compose-stream.yml
docker-compose.yml		docker-compose.yml
insertMovieToMovieService.py		insertMovieToMovieService.py
kafka-connect-2.sh		kafka-connect-2.sh
kafka-connect.sh		kafka-connect.sh
movies.json		movies.json
msg.txt		msg.txt
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml
reset-topics.sh		reset-topics.sh
tes-produce.txt		tes-produce.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kafka-streams-movies-aggregator

Prequisite

run the application in docker:

run the application locally:

Architecture

About

Releases

Packages

Languages

License

lintang-b-s/kafka-streams-movies-aggregator

Folders and files

Latest commit

History

Repository files navigation

kafka-streams-movies-aggregator

Prequisite

run the application in docker:

run the application locally:

Architecture

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages