Career Pathways for Data Analysts: SQL-Based Salary and Skill Insights
19 Jun 2024
Introduction
In today’s data-driven world, the field of data analytics is booming, with a rapidly growing demand for skilled professionals. This project aims to analyze a dataset of job postings to uncover valuable insights about the most in-demand skills, top-paying jobs, leading companies, and emerging trends in the data analytics job market.
Objectives
As an aspiring Data Analyst, my goal for this project is to gain valuable insights into the Data Analytics job market. By analyzing job postings, I aim to understand the skill sets I should focus on, identify companies that offer higher compensation, and recognize the roles that are in high demand, especially for remote work opportunities. Based on these criteria, this project seeks to answer the following questions:
- What are the top-paying jobs for Data Analyst role?
- Identifying roles within the data analytics field that offer the highest salaries.
- What are the skills required for these top-paying jobs?
- Determining the key skills and qualifications that are essential for securing high-paying positions in data analytics.
- What are the most in-demand and optimal skills for my role?
- Highlighting the skills that are currently most sought after in the job market and those that provide the best career opportunities for Data Analysts.
Methodology
Data Source
The dataset used in this project was collected by Luke Barousse. It contains job postings data sourced from several websites, offering a comprehensive view of the job market. You can download the original dataset, set up for PostgreSQL, from here.
Since I work with MySQL, I made some tweaks to make the dataset compatible. If you are interested in using MySQL for your analysis, you can find the necessary conversion code here.
Data Cleaning and Processing
The dataset provided by Luke Barousse was fairly clean and well-structured. Consequently, minimal data cleaning was required. This allowed me to focus more on the analysis and insights without the need for extensive preprocessing.
Data Analysis and Findings
-- What are the top paying jobs for "Data Analyst"?
SELECT
j.job_title,
c.name AS company_name, -- Company name
j.salary_year_avg
FROM
job_postings_fact AS j
LEFT JOIN
company_dim AS c -- Company data table
USING(company_id)
WHERE
job_title_short = "Data Analyst" AND job_location = "Anywhere"
ORDER BY 3 DESC
LIMIT 10;
job_title | company_name | salary_year_avg | |
---|---|---|---|
0 | Data Analyst | Mantys | 650000 |
1 | Director of Analytics | Meta | 336500 |
2 | Associate Director- Data Insights | AT&T | 255830 |
3 | Data Analyst, Marketing | Pinterest Job Advertisements | 232423 |
4 | Data Analyst (Hybrid/Remote) | Uclahealthcareers | 217000 |
5 | Principal Data Analyst (Remote) | SmartAsset | 205000 |
6 | Director, Data Analyst - HYBRID | Inclusively | 189309 |
7 | Principal Data Analyst, AV Performance Analysis | Motional | 189000 |
8 | Principal Data Analyst | SmartAsset | 186000 |
9 | ERM Data Analyst | Get It Recruit - Information Technology | 184000 |
-- What are the skills required for these top-paying jobs?
WITH top_paying_jobs AS (
SELECT
j.job_ids,
j.job_title,
c.name AS company_name,
j.salary_year_avg
FROM
job_postings_fact AS j
LEFT JOIN company_dim AS c
USING(company_id)
WHERE
job_title_short = "Data Analyst" AND job_location = "Anywhere"
ORDER BY j.salary_year_avg DESC
LIMIT 10
)
SELECT
t.company_name,
t.job_title,
s.skills
FROM top_paying_jobs AS t
INNER JOIN skills_job_dim AS sj ON t.job_id = sj.job_id
INNER JOIN skills_dim AS s ON sj.skill_id = s.skill_id
ORDER BY t.salary_year_avg DESC;
skills | demand_count | avg_salary | bar_chart | |
---|---|---|---|---|
0 | sql | 7291 | 97237 | !!!!!!!!!!!!!!!!!!!! |
1 | excel | 4611 | 87288 | !!!!!!!!!!!!! |
2 | python | 4330 | 101397 | !!!!!!!!!!!! |
3 | tableau | 3745 | 99288 | !!!!!!!!!! |
4 | power bi | 2609 | 97431 | !!!!!!! |
5 | r | 2142 | 100499 | !!!!!! |