Big data analytics
for delivery companies is always very necessary to uncover customer trends and patterns. Trends help these companies in optimizing their services to ensure a smooth flow of operations between riders as well as forecast future demand of various products and services.
This project aimed to analyze the menus of various restaurants that use uber-eats courier
as their supply company. As this is big data, the use of OLAP-DATABASES
comes in handy to perfrom heavy analytics using SQL
. These databases offer:
Scalability
: Can easily manage massive amounts of data efficiently.Low latency
: Optimized for complex analytical queries due to their latency structure and optimized data structures and indexing.Batch processing
: Good forlarge-scale
data processing tasks efficiently.
Uber-Eats
customers also have the right to investigate the prices of various items on the menu to find the cheapest and most expensive restaurants to save on expenditure. The data was sourced from kaggle
and can be accessed by using the following link
- Investigate the
number of restaurants
that useUBER-EATS
. - Investigate the
top restaurants with the largest number of items.
- Investigate the
price range of items
for the restaurant with the highest number of items. - Investigate the
average price of items
across various restaurants - Investigate the
most expensive restaurant.
Snacks items
analysis. Analyze the price of snack items across various restaurants.
- Investigate the correlation between location i.e
latitude
&longitude
andmenu-prices
. - Outsource data for
customer orders
and analyzeconsumer purchasing trends
.
All the requirements to run this project are listed in the file requirements-file together with the library versions.
- Download the
zip-file
fromkaggle
- Make a project directory using the
mkdir
command on theterminal
and extract the filesin the zip file. - The files are in zip formart having a size of
896 mb
. - Follow along through the parquet convert file which uses
apache pyarrow
. - Load the parquet file into an
OLAP database
namelyDuckDB
in this case for quick analytical queries.