# Top 100 Companies in the US by Revenue in 2023

**Name**: Ahmed Kadar
**Affiliation**: GISC 4317, University of Texas at Dallas
**Date**: August 15, 2025
**GitHub Repository Link**: https://github.com/ahmedkadar64/Final-Project

### Abstract
<pre> This project analyzes data on the top 100 companies in the US by revenue in 2023. To search through and organize this data, I use the 'database' module I created for the midterm, adapted to work with this new dataset ("companies.csv"). </pre>

### Table of Contents
1. Introduction
2. Data
3. Results
4. Conclusion
5. References

## 1. Introduction
### 1.1 Background/context
The dataset provided contains information on the top 100 companies in the US by revenue in 2023. This data is crucial for understanding the economic landscape of the country, identifying key players in various industries, and analyzing trends in corporate performance. The dataset includes company names, revenues, and other relevant details that can be used for further analysis.

### 1.2 Purpose of the analysis
The purpose of this analysis is to explore the dataset of the top 100 companies in the US by revenue in 2023, identify key trends, and provide insights into the performance of these companies. By analyzing this data, we can gain a better understanding of the economic environment, the industries that are thriving, and the factors contributing to the success of these companies such as headquarters location, industry, and number of employees. This analysis can also help in making informed decisions for business strategies, investments, and policy-making.

## 2. Data
### 2.1 Data Description
* Sources of data: Fortune 500 and Forbes
* Types of data:
  - Quantitative: Revenue, revenue growth, number of employees
  - Qualitative: Company names, industries, headquarters locations
* Variables overview
  - `Rank`: Rank of the company in the list
  - `Name`: Name of the company
  - `Industry`: Industry in which the company operates
  - `Revenue (USD millions)`: Revenue of the company in millions of USD
  - `Revenue growth`: Percentage growth in revenue compared to the previous year
  - `Employees`: Number of employees in the company
  - `Headquarters`: Location of the company's headquarters
* Data collection methods: The figures originate from the Fortune 500 and Forbes websites, which publish annual lists of the largest US companies by revenue. I downloaded the compiled dataset from Kaggle, a popular platform for data science and machine learning projects, and then cleaned and organized it for analysis. The dataset link is in the References section below.
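Because every field is stored as text (for example `'611,289'` and `'6.70%'`), the quantitative columns need converting before they can be compared or sorted numerically. The following is a minimal sketch of that step, not the code used in this project. The helper names `parse_number` and `load_rows` are hypothetical, and the inlined CSV text assumes the file quotes fields that contain commas; the three rows are copied from the dataset output shown in this report.

```python
import csv
import io

# A few rows copied from the dataset, inlined so the sketch runs without the file.
# Assumption: fields containing commas are quoted in companies.csv.
SAMPLE_CSV = '''Rank,Name,Industry,Revenue (USD millions),Revenue growth,Employees,Headquarters
1,Walmart,Retail,"611,289",6.70%,"2,100,000","Bentonville, Arkansas"
2,Amazon,Retail and Cloud Computing,"513,983",9.40%,"1,540,000","Seattle, Washington"
3,Exxon Mobil,Petroleum industry,"413,680",44.80%,"62,000","Spring, Texas"
'''


def parse_number(text):
    """Convert strings such as '611,289' or '6.70%' to a float."""
    return float(text.replace(",", "").rstrip("%"))


def load_rows(handle):
    """Read CSV rows as dicts and add numeric copies of the quantitative fields."""
    rows = []
    for row in csv.DictReader(handle):
        row["revenue_musd"] = parse_number(row["Revenue (USD millions)"])
        row["growth_pct"] = parse_number(row["Revenue growth"])
        row["employees"] = int(parse_number(row["Employees"]))
        rows.append(row)
    return rows


rows = load_rows(io.StringIO(SAMPLE_CSV))
for row in rows:
    print(row["Name"], row["revenue_musd"], row["growth_pct"], row["employees"])
```

Keeping the original string columns alongside the numeric copies means the records still print exactly as they appear in the file.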

### 2.2 Data exploration and descriptive statistics
For an overview of the data, I will use the 'database' module I created for the midterm project. This module allows me to search through and organize the data effectively. Below is the code that demonstrates how to use this module to explore the dataset.

In [2]:
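# Make the project folder importable so the custom module can be found
import sys
sys.path.append(r"C:\Users\HP\PycharmProjects\Final-Project")
import database as db  # module written for the midterm project
import csv
# Path to the dataset
companies = r"C:\Users\HP\PycharmProjects\Final-Project\companies.csv"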

This cell imports the `database` module and stores the path to the CSV file containing the company data. The `database` module contains functions to search and organize the data.

In [3]:
db.show_all(companies)

{'Rank': '1', 'Name': 'Walmart', 'Industry': 'Retail', 'Revenue (USD millions)': '611,289', 'Revenue growth': '6.70%', 'Employees': '2,100,000', 'Headquarters': 'Bentonville, Arkansas'}
{'Rank': '2', 'Name': 'Amazon', 'Industry': 'Retail and Cloud Computing', 'Revenue (USD millions)': '513,983', 'Revenue growth': '9.40%', 'Employees': '1,540,000', 'Headquarters': 'Seattle, Washington'}
{'Rank': '3', 'Name': 'Exxon Mobil', 'Industry': 'Petroleum industry', 'Revenue (USD millions)': '413,680', 'Revenue growth': '44.80%', 'Employees': '62,000', 'Headquarters': 'Spring, Texas'}
{'Rank': '4', 'Name': 'Apple', 'Industry': 'Electronics industry', 'Revenue (USD millions)': '394,328', 'Revenue growth': '7.80%', 'Employees': '164,000', 'Headquarters': 'Cupertino, California'}
{'Rank': '5', 'Name': 'UnitedHealth Group', 'Industry': 'Healthcare', 'Revenue (USD millions)': '324,162', 'Revenue growth': '12.70%', 'Employees': '400,000', 'Headquarters': 'Minnetonka, Minnesota'}
{'Rank': '6', 'Name': '

The `show_all` function from the `database` module displays all the records in the dataset. This is useful for getting an initial overview of the data, including the rank, name, industry, revenue, revenue growth, number of employees, and headquarters location for each company.
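For readers without access to the `database` module, the sketch below shows one way a function like `show_all` could be written with the standard `csv` module. This is an assumption about its behavior based on the output above, not the module's actual source; the temporary file and its two rows (copied from the dataset) exist only to make the example runnable.

```python
import csv
import os
import tempfile


def show_all(path):
    """Print every record in the CSV as a dict and return the records."""
    with open(path, newline="", encoding="utf-8") as handle:
        records = list(csv.DictReader(handle))
    for record in records:
        print(record)
    return records


# Demo on a temporary file holding two rows copied from the dataset.
# Assumption: fields containing commas are quoted in the real file.
sample = (
    "Rank,Name,Industry,Revenue (USD millions),Revenue growth,Employees,Headquarters\n"
    '1,Walmart,Retail,"611,289",6.70%,"2,100,000","Bentonville, Arkansas"\n'
    '2,Amazon,Retail and Cloud Computing,"513,983",9.40%,"1,540,000","Seattle, Washington"\n'
)
with tempfile.NamedTemporaryFile(
    "w", suffix=".csv", delete=False, newline="", encoding="utf-8"
) as tmp:
    tmp.write(sample)
    tmp_path = tmp.name

records = show_all(tmp_path)
os.remove(tmp_path)  # clean up the demo file
```

Returning the records as well as printing them is a design choice made here so that other functions can reuse the loaded data.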

In [4]:
db.search(companies)

{'Rank': '1', 'Name': 'Walmart', 'Industry': 'Retail', 'Revenue (USD millions)': '611,289', 'Revenue growth': '6.70%', 'Employees': '2,100,000', 'Headquarters': 'Bentonville, Arkansas'}


The `search` function from the `database` module looks up records by rank, name, or industry. For example, we can list all companies in the "Retail" industry or find a specific company by name or rank.
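The lookup described above can be sketched as a small, non-interactive function. This is a hypothetical version for illustration only: the real `db.search` may prompt for input and match differently. Here, rank is matched exactly (so that searching for `6` does not return rank 60) and the other fields are matched as case-insensitive substrings. The records are a subset of columns copied from the dataset.

```python
# Fields where a partial match would be misleading (assumption for this sketch).
EXACT_FIELDS = {"Rank"}

records = [
    {"Rank": "1", "Name": "Walmart", "Industry": "Retail"},
    {"Rank": "2", "Name": "Amazon", "Industry": "Retail and Cloud Computing"},
    {"Rank": "3", "Name": "Exxon Mobil", "Industry": "Petroleum industry"},
    {"Rank": "60", "Name": "Boeing", "Industry": "Aerospace and defense"},
]


def search(rows, field, query):
    """Return the rows whose `field` matches `query`, ignoring case."""
    needle = query.strip().casefold()
    if field in EXACT_FIELDS:
        return [r for r in rows if r[field].strip().casefold() == needle]
    return [r for r in rows if needle in r[field].casefold()]


retail = search(records, "Industry", "retail")
print([r["Name"] for r in retail])

first = search(records, "Rank", "1")
print([r["Name"] for r in first])
```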

In [5]:
db.organize(companies)

{'Rank': '60', 'Name': 'Boeing', 'Industry': 'Aerospace and defense', 'Revenue (USD millions)': '66,608', 'Revenue growth': '6.90%', 'Employees': '156,000', 'Headquarters': 'Chicago, Illinois'}
{'Rank': '62', 'Name': 'Lockheed Martin', 'Industry': 'Aerospace and Defense', 'Revenue (USD millions)': '65,984', 'Revenue growth': '1.60%', 'Employees': '116,000', 'Headquarters': 'Bethesda, Maryland'}
{'Rank': '92', 'Name': 'CHS', 'Industry': 'Agriculture cooperative', 'Revenue (USD millions)': '47,194', 'Revenue growth': '24.30%', 'Employees': '10,014', 'Headquarters': 'Inver Grove Heights, Minnesota'}
{'Rank': '83', 'Name': 'John Deere', 'Industry': 'Agriculture manufacturing', 'Revenue (USD millions)': '52,577', 'Revenue growth': '19.40%', 'Employees': '82,239', 'Headquarters': 'Moline, Illinois'}
{'Rank': '87', 'Name': 'Delta Air Lines', 'Industry': 'Airline', 'Revenue (USD millions)': '50,582', 'Revenue growth': '69.20%', 'Employees': '95,000', 'Headquarters': 'Atlanta, Georgia'}
{'Rank'

The `organize` function from the `database` module sorts the dataset by a chosen column, which can be any column in the CSV (for example, employee count or revenue). This helps identify the leading companies on each measure.
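A sort like this has to handle two kinds of columns: text (such as `Industry`) and numbers stored as text (such as `Employees`). The sketch below is one possible approach, assuming a case-insensitive alphabetical sort for text and a numeric sort once commas and percent signs are removed. The names `sort_key` and `organize` here are illustrative and do not reproduce the module's real implementation. The five records are copied from the output above.

```python
records = [
    {"Rank": "60", "Name": "Boeing", "Industry": "Aerospace and defense", "Employees": "156,000"},
    {"Rank": "62", "Name": "Lockheed Martin", "Industry": "Aerospace and Defense", "Employees": "116,000"},
    {"Rank": "83", "Name": "John Deere", "Industry": "Agriculture manufacturing", "Employees": "82,239"},
    {"Rank": "87", "Name": "Delta Air Lines", "Industry": "Airline", "Employees": "95,000"},
    {"Rank": "92", "Name": "CHS", "Industry": "Agriculture cooperative", "Employees": "10,014"},
]


def sort_key(value):
    """Sort numerically when the value looks like a number, else alphabetically."""
    cleaned = value.replace(",", "").rstrip("%")
    try:
        return (0, float(cleaned), "")
    except ValueError:
        return (1, 0.0, value.casefold())


def organize(rows, column, descending=False):
    """Return a new list of rows sorted by the given column."""
    return sorted(rows, key=lambda r: sort_key(r[column]), reverse=descending)


by_industry = [r["Name"] for r in organize(records, "Industry")]
by_staff = [r["Name"] for r in organize(records, "Employees", descending=True)]
print(by_industry)
print(by_staff)
```

Because Python's `sorted` is stable, records with the same key (such as the two aerospace companies once case is ignored) keep their original relative order.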

## 3. Results

### 3.1 Key Findings
* The top 100 companies in the US by revenue in 2023 span various industries, including technology, retail, healthcare, and finance. Technology and retail giants account for most of the top 10.
* The top 5 companies by revenue are:
  1. Walmart
  2. Amazon
  3. Exxon Mobil
  4. Apple
  5. UnitedHealth Group
* The top 5 companies by revenue growth are:
  1. TD Synnex (97.2%)
  2. World Fuel Services (88.4%)
  3. United Airlines (82.5%)
  4. PBF Energy (71.8%)
  5. ConocoPhillips (69.9%)
* The top 5 companies by number of employees are:
  1. Walmart (2,100,000 employees)
  2. Amazon (1,540,000 employees)
  3. United States Postal Service (576,000 employees)
  4. FedEx (518,249 employees)
  5. The Home Depot (471,600 employees)
* The industries with the highest representation in the top 100 list are financial services, retail, and petroleum. This indicates a strong presence of these industries in the US economy.
* The headquarters of these companies are primarily located in major cities such as New York, San Francisco, and Seattle, indicating a concentration of corporate headquarters in urban areas.
* The city with the most companies in the top 100 list is New York, with 12 companies headquartered there, followed by Houston with 5, and Atlanta and Chicago tied at 3 each.
* The dataset also shows that the majority of the top 100 companies are publicly traded, with only a few being privately held or government-owned. This indicates a strong presence of publicly traded companies in the US economy.
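The rankings and counts above can be reproduced with a few lines of standard-library Python. The sketch below is illustrative only and runs on six rows copied from the dataset rather than the full file, so its results describe that sample and not the full top 100. The helper names `to_number`, `top_n`, and `count_by` are hypothetical.

```python
from collections import Counter

# Six rows copied from the dataset (subset of columns).
ROWS = [
    {"Name": "Walmart", "Industry": "Retail", "Revenue growth": "6.70%",
     "Employees": "2,100,000", "Headquarters": "Bentonville, Arkansas"},
    {"Name": "Amazon", "Industry": "Retail and Cloud Computing", "Revenue growth": "9.40%",
     "Employees": "1,540,000", "Headquarters": "Seattle, Washington"},
    {"Name": "Exxon Mobil", "Industry": "Petroleum industry", "Revenue growth": "44.80%",
     "Employees": "62,000", "Headquarters": "Spring, Texas"},
    {"Name": "Boeing", "Industry": "Aerospace and defense", "Revenue growth": "6.90%",
     "Employees": "156,000", "Headquarters": "Chicago, Illinois"},
    {"Name": "Lockheed Martin", "Industry": "Aerospace and Defense", "Revenue growth": "1.60%",
     "Employees": "116,000", "Headquarters": "Bethesda, Maryland"},
    {"Name": "Delta Air Lines", "Industry": "Airline", "Revenue growth": "69.20%",
     "Employees": "95,000", "Headquarters": "Atlanta, Georgia"},
]


def to_number(text):
    """Convert strings such as '2,100,000' or '6.70%' to a float."""
    return float(text.replace(",", "").rstrip("%"))


def top_n(rows, column, n=5):
    """Return the n rows with the largest numeric value in `column`."""
    return sorted(rows, key=lambda r: to_number(r[column]), reverse=True)[:n]


def count_by(rows, key_func):
    """Count rows grouped by whatever `key_func` extracts from each row."""
    return Counter(key_func(r) for r in rows)


top_growth = [r["Name"] for r in top_n(ROWS, "Revenue growth", 3)]
top_staff = [r["Name"] for r in top_n(ROWS, "Employees", 3)]
# Lower-casing merges spellings such as "Aerospace and defense" / "Aerospace and Defense".
industries = count_by(ROWS, lambda r: r["Industry"].casefold())
# The city is the part of "City, State" before the comma.
cities = count_by(ROWS, lambda r: r["Headquarters"].split(",")[0].strip())

print(top_growth)
print(top_staff)
print(industries.most_common(1))
```

Normalizing the industry labels before counting matters for this dataset, since the same industry appears with different capitalization.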

## 4. Conclusion
* Recap of goals and findings:
  - The goal of this analysis was to explore the dataset of the top 100 companies in the US by revenue in 2023, identify key trends, and provide insights into the performance of these companies. The analysis found that technology and retail giants lead the top 10, while financial services, retail, and petroleum are the most represented industries across the full list. It also highlighted the concentration of corporate headquarters in major urban areas and the strong presence of publicly traded companies in the US economy.
* Main takeaway points: Financial services, retail, and petroleum are the most common industries in the top 100 list, and financial and retail giants stand out for their significant revenue growth and large workforces.
* Recommendations for action or further analysis in the future: Future analysis could focus on the impact of economic factors such as inflation, interest rates, and global trade on the performance of these companies. Additionally, analyzing the impact of corporate social responsibility and sustainability practices on the performance of these companies could provide valuable insights. It would also be interesting to compare the performance of these companies over time to identify trends and patterns in their growth and success.

## 5. References
https://www.kaggle.com/datasets/neel1611/top-100-largest-companies-in-the-us-by-revenue