📊 Dive into the data job market! Focusing on data analyst roles, this project explores 💰 top-paying jobs, 🔥 in-demand skills, and 📈 where high demand meets high salary in data analytics.
🔍 SQL queries? Check them out here: project_sql folder
Driven by the perspective of a data analyst job seeker, I created this project to better understand the job market—specifically identifying high-paying data analayst roles and the skills required to get there.
Using a real-world dataset from Luke Barousse’s SQL course, I analyzed job postings to uncover insights on salaries, job titles, locations, and in-demand skills, with the aim of making my job search more targeted and data-driven.
- What are the top-paying data analyst jobs?
- What skills are required for these top-paying jobs?
- What skills are most in demand for data analysts?
- Which skills are associated with higher salaries?
- What are the most optimal skills to learn?
For my deep dive into the data analyst job market, I leveraged the power of several key tools:
- SQL: The backbone of my analysis, allowing me to query the database and uncover critical insights.
- PostgreSQL: The database system used to manage the large-scale job dataset.
- Visual Studio Code: My primary environment for writing and testing complex queries.
- Git & GitHub: Essential for version control and sharing my SQL scripts and analysis, ensuring collaboration and project tracking.
Each query for this project aimed at investigating specific aspects of the data analyst job market. Here’s how I approached each question:
To identify the highest-paying roles, I filtered data analyst positions by average yearly salary and location, focusing on remote jobs. This query highlights the high paying opportunities in the field.
SELECT
job_id,
job_title,
job_location,
job_schedule_type,
salary_year_avg,
job_posted_date,
name AS company_name
FROM
job_postings_fact
LEFT JOIN company_dim ON job_postings_fact.company_id = company_dim.company_id
WHERE
job_title_short = 'Data Analyst' AND
job_location = 'Anywhere' AND
salary_year_avg IS NOT NULL
ORDER BY
salary_year_avg DESC
LIMIT 10Here's the breakdown of the top data analyst jobs in 2023:
- Wide Salary Range: Top 10 paying data analyst roles span from $184,000 to $650,000, indicating significant salary potential in the field.
- Diverse Employers: Companies like SmartAsset, Meta, and AT&T are among those offering high salaries, showing a broad interest across different industries.
- Job Title Variety: There's a high diversity in job titles, from Data Analyst to Director of Analytics, reflecting varied roles and specializations within data analytics.

- Bar graph visualizing the salary for the top 10 salaries for data analysts
To understand what skills are required for the top-paying jobs, I joined the job postings with the skills data, providing insights into what employers value for high-compensation roles.
WITH top_paying_jobs AS (
SELECT
job_id,
job_title,
salary_year_avg,
name AS company_name
FROM
job_postings_fact
LEFT JOIN company_dim ON job_postings_fact.company_id = company_dim.company_id
WHERE
job_title_short = 'Data Analyst' AND
job_location = 'Anywhere' AND
salary_year_avg IS NOT NULL
ORDER BY
salary_year_avg DESC
LIMIT 10
)
SELECT
top_paying_jobs.*,
skills
FROM top_paying_jobs
INNER JOIN skills_job_dim ON top_paying_jobs.job_id = skills_job_dim.job_id
INNER JOIN skills_dim ON skills_job_dim.skill_id = skills_dim.skill_id
ORDER BY
salary_year_avg DESC;Here’s the breakdown of the most demanded skills for the specific highest-paying data analyst jobs (with listed skills) analyzed from the dataset:
-
SQL is leading the way, appearing in 8 out of the 8 analyzed roles (100% frequency).
-
Python and Tableau follow closely, each being required by 7 of the top roles.
-
R is also highly sought after with a count of 5.
-
Other notable skills showing significant demand include Snowflake (3), Pandas (3), and Excel (3), followed by a long tail of niche technical and cloud skills.
- Bar graph visualizing the count of skills for the top 10 paying jobs for data analysts; Google Gemini AI generated this graph from my SQL query results.
This query helped identify the skills most frequently requested in job postings, directing focus to areas with high demand.
SELECT
skills,
COUNT(skills_job_dim.job_id) AS demand_count
FROM job_postings_fact
INNER JOIN skills_job_dim ON job_postings_fact.job_id = skills_job_dim.job_id
INNER JOIN skills_dim ON skills_job_dim.skill_id = skills_dim.skill_id
WHERE
job_title_short = 'Data Analyst' AND
job_work_from_home = 'True'
GROUP BY
skills
ORDER BY
demand_count DESC
LIMIT 5;- SQL is the undisputed leader, appearing in 7,291 postings.
- Excel and Python follow closely, highlighting the importance of both spreadsheet proficiency and programming.
- Tableau and Power BI represent the critical need for data visualization expertise.
| Skills | Demand Count |
|---|---|
| SQL | 7291 |
| Excel | 4611 |
| Python | 4330 |
| Tableau | 3745 |
| Power BI | 2609 |
Table of the demand for the top 5 skills in data analyst job postings
Exploring the average salaries associated with different skills revealed which skills are the highest paying.
SELECT
skills,
ROUND(AVG(salary_year_avg), 0) AS avg_salary
FROM job_postings_fact
INNER JOIN skills_job_dim ON job_postings_fact.job_id = skills_job_dim.job_id
INNER JOIN skills_dim ON skills_job_dim.skill_id = skills_dim.skill_id
WHERE
job_title_short = 'Data Analyst'
AND salary_year_avg IS NOT NULL
AND job_work_from_home = 'True'
GROUP BY
skills
ORDER BY
avg_salary DESC
LIMIT 25;Here's a breakdown of the results for top paying skills for Data Analysts:
- High Demand for Big Data & ML Skills: Top salaries are commanded by analysts skilled in big data technologies (PySpark, Couchbase), machine learning tools (DataRobot, Jupyter), and Python libraries (Pandas, NumPy), reflecting the industry's high valuation of data processing and predictive modeling capabilities.
- Software Development & Deployment Proficiency: Knowledge in development and deployment tools (GitLab, Kubernetes, Airflow) indicates a lucrative crossover between data analysis and engineering, with a premium on skills that facilitate automation and efficient data pipeline management.
- Cloud Computing Expertise: Familiarity with cloud and data engineering tools (Elasticsearch, Databricks, GCP) underscores the growing importance of cloud-based analytics environments, suggesting that cloud proficiency significantly boosts earning potential in data analytics.
| Skills | Average Salary ($) |
|---|---|
| pyspark | 208,172 |
| bitbucket | 189,155 |
| couchbase | 160,515 |
| watson | 160,515 |
| datarobot | 155,486 |
| gitlab | 154,500 |
| swift | 153,750 |
| jupyter | 152,777 |
| pandas | 151,821 |
| elasticsearch | 145,000 |
Table of the average salary for the top 10 paying skills for data analysts
Combining insights from demand and salary data, this query aimed to pinpoint skills that are both in high demand and have high salaries, offering a strategic focus for skill development.
WITH skills_demand AS (
SELECT
skills_dim.skill_id,
skills_dim.skills,
COUNT(skills_job_dim.job_id) AS demand_count
FROM job_postings_fact
INNER JOIN skills_job_dim ON job_postings_fact.job_id = skills_job_dim.job_id
INNER JOIN skills_dim ON skills_job_dim.skill_id = skills_dim.skill_id
WHERE
job_title_short = 'Data Analyst'
AND salary_year_avg IS NOT NULL
AND job_location = 'Anywhere'
GROUP BY
skills_dim.skill_id
), average_salary AS (
SELECT
skills_dim.skill_id,
ROUND(AVG(salary_year_avg), 0) AS avg_salary
FROM job_postings_fact
INNER JOIN skills_job_dim ON job_postings_fact.job_id = skills_job_dim.job_id
INNER JOIN skills_dim ON skills_job_dim.skill_id = skills_dim.skill_id
WHERE
job_title_short = 'Data Analyst'
AND salary_year_avg IS NOT NULL
AND job_location = 'Anywhere'
GROUP BY
skills_dim.skill_id
)
SELECT
skills_demand.skills,
demand_count,
avg_salary
FROM
skills_demand
INNER JOIN average_salary ON skills_demand.skill_id = average_salary.skill_id
WHERE
demand_count > 10
ORDER BY
avg_salary DESC,
demand_count DESC
LIMIT 25;- High-Demand Programming Languages: Python and R stand out for their high demand, with demand counts of 236 and 148 respectively. Despite their high demand, their average salaries are around $101,397 for Python and $100,499 for R, indicating that proficiency in these languages is highly valued but also widely available.
- Cloud Tools and Technologies: Skills in specialized technologies such as Snowflake, Azure, AWS, and BigQuery show significant demand with relatively high average salaries, pointing towards the growing importance of cloud platforms and big data technologies in data analysis.
- Business Intelligence and Visualization Tools: Tableau and Looker, with demand counts of 230 and 49 respectively, and average salaries around $99,288 and $103,795, highlight the critical role of data visualization and business intelligence in deriving actionable insights from data.
- Database Technologies: The demand for skills in traditional and NoSQL databases (Oracle, SQL Server, NoSQL) with average salaries ranging from $97,786 to $104,534, reflects the enduring need for data storage, retrieval, and management expertise.
This table summarizes the "Optimal" skills for Data Analysts, combining high average salaries with a significant number of job postings. This represents the best return on investment for skill development.
| Skill | Demand Count | Average Salary ($) |
|---|---|---|
| Go | 27 | 115,320 |
| Confluence | 11 | 114,210 |
| Hadoop | 22 | 113,193 |
| Snowflake | 37 | 112,948 |
| Azure | 34 | 111,225 |
| BigQuery | 13 | 109,654 |
| AWS | 32 | 108,317 |
| Java | 17 | 106,906 |
| SSIS | 12 | 106,683 |
| Jira | 20 | 104,918 |
| Oracle | 37 | 104,534 |
| Looker | 49 | 103,795 |
| NoSQL | 13 | 101,414 |
| Python | 236 | 101,397 |
| R | 148 | 100,499 |
From a complete beginner to a confident Data Analyst, this project allowed me to master the technicalities of SQL while developing a data-driven mindset.
I transitioned from basic syntax to advanced analytical techniques, focusing on writing clean, efficient, and readable code.
-
Query Foundations: I mastered the essential building blocks of a query and, more importantly, the Logical Order of Execution (how the SQL engine actually processes the data):
FROM/JOIN(Gathering data)WHERE(Filtering raw rows)GROUP BY(Aggregating data)HAVING(Filtering aggregated results)SELECT(Finalizing columns and calculations)ORDER BY(Sorting for clarity)LIMIT(Optimizing output size)
-
Advanced Techniques:
- Multi-Table Joins: Successfully merged Fact and Dimension tables to provide context to raw numbers.
- CTEs & Subqueries: Used Common Table Expressions to simplify complex logic into modular, readable steps.
- Case Expressions: Created custom data categories (e.g., salary brackets) to make analysis more actionable.
- Window Functions: Applied
RANK()andROW_NUMBER()to identify top-tier opportunities efficiently. - Unions: Combined datasets for a holistic view of the job market.
- ROI Analysis: Identified "Optimal Skills" by intersecting high demand with high average salaries.
- Remote Market Intelligence: Focused on "Anywhere" and "WFH" roles to understand the global compensation landscape for Data Analysts.
- Professional Workflow: Integrated PostgreSQL, VS Code, and Git/GitHub into a unified analytical environment.
This project enhanced my SQL skills and provided valuable insights into the data analyst job market. The findings from the analysis serve as a guide to prioritizing skill development and job search efforts. Aspiring data analysts can better position themselves in a competitive job market by focusing on high-demand, high-salary skills. This exploration highlights the importance of continuous learning and adaptation to emerging trends in the field of data analytics.
