Nazmul Islam

A continuous learner

Career Pathways for Data Analysts: SQL-Based Salary and Skill Insights

Career Pathways for Data Analysts: SQL-Based Salary and Skill Insights

Share:

19 Jun 2024

Introduction

In today’s data-driven world, the field of data analytics is booming, with a rapidly growing demand for skilled professionals. This project aims to analyze a dataset of job postings to uncover valuable insights about the most in-demand skills, top-paying jobs, leading companies, and emerging trends in the data analytics job market.

Objectives

As an aspiring Data Analyst, my goal for this project is to gain valuable insights into the Data Analytics job market. By analyzing job postings, I aim to understand the skill sets I should focus on, identify companies that offer higher compensation, and recognize the roles that are in high demand, especially for remote work opportunities. Based on these criteria, this project seeks to answer the following questions:

  1. What are the top-paying jobs for Data Analyst role?
    • Identifying roles within the data analytics field that offer the highest salaries.
  2. What are the skills required for these top-paying jobs?
    • Determining the key skills and qualifications that are essential for securing high-paying positions in data analytics.
  3. What are the most in-demand and optimal skills for my role?
    • Highlighting the skills that are currently most sought after in the job market and those that provide the best career opportunities for Data Analysts.

Methodology

Data Source

The dataset used in this project was collected by Luke Barousse. It contains job postings data sourced from several websites, offering a comprehensive view of the job market. You can download the original dataset, set up for PostgreSQL, from here.

Since I work with MySQL, I made some tweaks to make the dataset compatible. If you are interested in using MySQL for your analysis, you can find the necessary conversion code here.

Data Cleaning and Processing

The dataset provided by Luke Barousse was fairly clean and well-structured. Consequently, minimal data cleaning was required. This allowed me to focus more on the analysis and insights without the need for extensive preprocessing.

Data Analysis and Findings

-- What are the top paying jobs for "Data Analyst"?
SELECT
    j.job_title,
    c.name AS company_name, -- Company name
    j.salary_year_avg
FROM
    job_postings_fact AS j
LEFT JOIN
    company_dim AS c        -- Company data table
USING(company_id)
WHERE
    job_title_short = "Data Analyst" AND job_location = "Anywhere"
ORDER BY 3 DESC
LIMIT 10;

Top Paying Jobs

  job_title company_name salary_year_avg
0 Data Analyst Mantys 650000
1 Director of Analytics Meta 336500
2 Associate Director- Data Insights AT&T 255830
3 Data Analyst, Marketing Pinterest Job Advertisements 232423
4 Data Analyst (Hybrid/Remote) Uclahealthcareers 217000
5 Principal Data Analyst (Remote) SmartAsset 205000
6 Director, Data Analyst - HYBRID Inclusively 189309
7 Principal Data Analyst, AV Performance Analysis Motional 189000
8 Principal Data Analyst SmartAsset 186000
9 ERM Data Analyst Get It Recruit - Information Technology 184000
-- What are the skills required for these top-paying jobs?
WITH top_paying_jobs AS (
    SELECT
        j.job_ids,
        j.job_title,
        c.name AS company_name,
        j.salary_year_avg
    FROM
        job_postings_fact AS j
    LEFT JOIN company_dim AS c
    USING(company_id)
    WHERE
        job_title_short = "Data Analyst" AND job_location = "Anywhere"
    ORDER BY j.salary_year_avg DESC
    LIMIT 10
)
SELECT
    t.company_name,
    t.job_title,
    s.skills
FROM top_paying_jobs AS t
INNER JOIN skills_job_dim AS sj ON t.job_id = sj.job_id
INNER JOIN skills_dim AS s ON sj.skill_id = s.skill_id
ORDER BY t.salary_year_avg DESC;

Most required skills

In-demand Skills for Data Analytics

  skills demand_count avg_salary bar_chart
0 sql 7291 97237 !!!!!!!!!!!!!!!!!!!!
1 excel 4611 87288 !!!!!!!!!!!!!
2 python 4330 101397 !!!!!!!!!!!!
3 tableau 3745 99288 !!!!!!!!!!
4 power bi 2609 97431 !!!!!!!
5 r 2142 100499 !!!!!!

Acknowledgement

References

1. Dataset

2. Tools and Technologies

3. Libraries and Resources

4. Online Resources

Previous post:

This is a demo project.