GitHub

How to using corona project ingest data from S3 to Elasticsearch.

Goal:

A csv row data file is stored in S3, using a batch ingest data into Elasticsearch, and then visualize by Kibana tool.

Step 1. Create new AWS EC2

Step 2. Install OpenCOBOL, git, cron on EC2

Step 3. Get source from git

cd /home/ec2-user/
git clone https://github.com/pkm0306pkm/corona.git 
cobc ./cobol/CZZ1230.cbl

Step 4. Create new AWS Elasticsearch service

Guideline: https://docs.aws.amazon.com/elasticsearch-service/latest/developerguide/es-gsg.html
Setting "corona-index" mapping:

curl -X PUT -H "Content-Type: application/json" /corona-index/ -d '
{
   "mappings":{
      "properties":{
         "Country":{"type":"text","fields":{"keyword":{"type":"keyword","ignore_above":256}}}, 
         "Province":{"type":"text","fields":{"keyword":{"type":"keyword","ignore_above":256}}}, 
         "Date":{ "type":"date"}, 
         "LastUpdate":{ "type":"date"},
         "Confirmed":{ "type":"double"}, 
         "Deaths":{ "type":"double"}, 
         "Recovered":{ "type":"double"}
      }
   }
}'

Step 5. Create new bucket in AWS S3

Step 6. Edit environment variable in batch.sh file

export S3_DATA_PATH="s3://fpt-corona-data"
export EC2DATA_PATH="/home/ec2-user/corona/data"
export FDDOMAIN="https://search-corona-5sbkgdsencyyorxubck7wdg6s4.ap-northeast-1.es.amazonaws.com"
export COB_LIBRARY_PATH="/home/ec2-user/corona/cobol"

Step 7. Setting cron:

Perform a "batch" schedule every 5 minutes

*/5 * * * * /home/ec2-user/corona/cronjob/batch.sh >> /home/ec2-user/corona/cronjob/batch.log 2>&1

Refer: https://crontab-generator.org/

Step 8. Upload s3_datarow.csv to S3

Cron will execute schedule (JOB: batch.sh) in every 5 minutes

Step 9. Confirm log result at batch.log file.

AWS CLI: copy file s3_datarow.csv to EC2
COBOL: convert to Elasticsearch json format
REST API: index json data to Elasticsearch service

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
cobol		cobol
cronjob		cronjob
data		data
images		images
COBOLハッカソン2020.pdf		COBOLハッカソン2020.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to using corona project ingest data from S3 to Elasticsearch.

Goal:

Step 1. Create new AWS EC2

Step 2. Install OpenCOBOL, git, cron on EC2

Step 3. Get source from git

Step 4. Create new AWS Elasticsearch service

Step 5. Create new bucket in AWS S3

Step 6. Edit environment variable in batch.sh file

Step 7. Setting cron:

Step 8. Upload s3_datarow.csv to S3

Step 9. Confirm log result at batch.log file.

Step 10. Usign Kibana visualize data.

About

Releases

Packages

Languages

pkm0306pkm/corona

Folders and files

Latest commit

History

Repository files navigation

How to using corona project ingest data from S3 to Elasticsearch.

Goal:

Step 1. Create new AWS EC2

Step 2. Install OpenCOBOL, git, cron on EC2

Step 3. Get source from git

Step 4. Create new AWS Elasticsearch service

Step 5. Create new bucket in AWS S3

Step 6. Edit environment variable in batch.sh file

Step 7. Setting cron:

Step 8. Upload s3_datarow.csv to S3

Step 9. Confirm log result at batch.log file.

Step 10. Usign Kibana visualize data.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages