This dataset combines LeetCode problem ratings with company-specific interview questions to provide comprehensive insights into the difficulty and frequency of problems asked by different companies.
This dataset is created by merging two primary data sources:
- Source: zerotrac/leetcode_problem_rating
- Location:
leetcode_problem_rating/ratings.txt
- Description: Contains difficulty ratings for 2,293 LeetCode problems
- Key Columns:
ID
: Problem IDTitle
: Problem titleRating
: Numerical difficulty rating (1084-3774)Title Slug
: URL-friendly problem identifierContest Slug
: Associated contest informationProblem Index
: Contest problem index
- Source: snehasishroy/leetcode-companywise-interview-questions
- Location:
leetcode-companywise-interview-questions/
- Description: Collection of LeetCode problems organized by 470 companies
- Organization: Each company has 5 CSV files based on time periods:
1. Thirty Days.csv
- Problems from the last 30 days2. Three Months.csv
- Problems from the last 3 months3. Six Months.csv
- Problems from the last 6 months4. More Than Six Months.csv
- Older problems5. All.csv
- All problems for that company
- Key Columns:
ID
: Problem IDDifficulty
: Easy/Medium/Hard classificationTitle
: Problem titleFrequency
: How often the problem appearsAcceptance Rate
: Problem acceptance percentageLink
: LeetCode problem URLTopics
: Problem topic tags
The merged dataset is created using the main.py script, which:
- Loads the problem ratings from
leetcode_problem_rating/ratings.txt
- Iterates through all 2,350 company-specific CSV files
- Merges rating data with company problem data based on problem
ID
- Outputs simplified CSV files containing only:
ID
: Problem identifierTitle
: Problem nameRating
: Difficulty rating
output/
{Company Name}/
README.md
all.csv
more-than-six-months.csv
six-months.csv
three-months.csv
thirty-days.csv
Each company's README.md contains:
- Company name as title
- Statistical summary (mean, median, min, max, etc.) for each time period
To regenerate the dataset:
python main.py
To load individual datasets:
import pandas as pd
# Load a specific company's data
df = pd.read_csv('output/Google/all.csv')
# View rating statistics
print(df['Rating'].describe())
- Problem ratings by zerotrac
- Company-wise questions by snehasishroy