Flight Data Analysis Designed and implemented a distributed data processing workflow using Hadoop and Apache Oozie to analyze over 20 years of U.S. flight data (~100M+ records). Developed and orchestrated multiple MapReduce jobs to identify top-performing airlines, analyze airport taxi times, and determine common flight cancellation reasons. Deployed the solution on AWS EC2 instances, performed scalability testing across varying VM counts, and benchmarked performance over incremental time spans. Demonstrated strong skills in big data processing, cluster configuration, and workflow automation.
-
Notifications
You must be signed in to change notification settings - Fork 0
JimSab068/Flight_Data
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Flight Data Analysis
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published