Skip to content

"Apache Iceberg Connector for AWS Glue를 이용하여 데이터레이크 CRUD 하기" 포스팅 내용 실습 프로젝트

Notifications You must be signed in to change notification settings

ksmin23/aws-glue-iceberg-blog

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

aws-glue-iceberg-blog

MySQL 테이블

  • Full-Load
CREATE TABLE human_resources.employee_details (emp_no BIGINT(20) AUTO_INCREMENT, name varchar(30), department varchar(30), city varchar(50), salary int, m_time DATETIME DEFAULT CURRENT_TIMESTAMP, PRIMARY KEY(emp_no) ) ENGINE=InnoDB AUTO_INCREMENT=0;

INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (1, 'Adam', 'IT', 'SFO', 50000);
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (2, 'Susan', 'Sales', 'NY', 60000);
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (3, 'Jeff', 'Finance', 'Tokyo', 55000);
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (4, 'Bill', 'Manufacturing', 'New Delhi', 70000);
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (5, 'Joe', 'IT', 'Chicago', 45000);
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (6, 'Steve', 'Finance', 'NY', 60000);
  • CDC-Load
UPDATE human_resources.employee_details SET city='New Delhi' WHERE emp_no = 2;
UPDATE human_resources.employee_details SET salary=70000 WHERE emp_no = 5;
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (8, 'John', 'Sales', 'SFO', 90000);
INSERT INTO human_resources.employee_details (emp_no, name, department, city, salary) VALUES (9, 'Eli', 'Purchasing', 'Chicago', 90000);
DELETE FROM human_resources.employee_details WHERE emp_no = 3;

Sample data

  • full-load: 처음 Glue Job을 이용해서 Apache Iceberg 테이블을 생성할 때 사용하는 데이터
  • cdc-load: RDS에서 Insert/Update/Delete를 수행한 데이터로 full-load로 만든 Apache Iceberg 테이블을 업데이트 하는데 사용하는 데이터

About

"Apache Iceberg Connector for AWS Glue를 이용하여 데이터레이크 CRUD 하기" 포스팅 내용 실습 프로젝트

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages