mplaine/analyzing-job-postings-with-sql

Analyzing Job Postings in Data Science with SQL

SQL for Data Analytics Capstone Project by Markku Laine (2024)

Overview

The goal of this project is to explore the data science job market in Finland by analyzing job postings published in 2023 using SQL.

The questions I wanted answers to are:

  1. Which cities have the most data science jobs in Finland?
  2. Which companies recruit the most data professionals in Finland?
  3. What are the most in-demand data science job roles in Finland?
  4. What are the highest-paying data science job roles in the world?
  5. What are the most in-demand skills for Data Analysts and Data Engineers in Finland?
  6. What are the most optimal skills to learn for Data Analysts and Data Engineers in the world?

Tools: SQL, PostgreSQL, DrawSQL, Visual Studio Code (with SQLTools extension), Tableau, Docker Compose, git, GitHub

Dataset

This project uses a modified version of the original Job Postings in 2023 dataset from Luke Barousse's SQL for Data Analytics course. The dataset consists of four CSV files, each representing a table in the Job Postings database (see the ER diagram below): companies (140,033 rows), job_skills (3,669,604 rows), jobs (787,686 rows), and skills (256 rows).

ER diagram of the Job Postings database
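
The table structure implied by the queries in this README can be sketched roughly as follows. The table and column names are taken from those queries; the column types, the key constraints, and the use of an in-memory SQLite database instead of PostgreSQL are assumptions made only so that the sketch runs on its own. The actual tables may contain more columns.

```python
import sqlite3

# Assumed schema: names come from the queries in this README; types and
# constraints are guesses, and SQLite stands in for PostgreSQL.
SCHEMA = """
CREATE TABLE companies (
  id   INTEGER PRIMARY KEY,
  name TEXT
);
CREATE TABLE jobs (
  id              INTEGER PRIMARY KEY,
  company_id      INTEGER REFERENCES companies (id),
  job_title_short TEXT,
  job_location    TEXT,
  job_country     TEXT,
  salary_year_avg REAL
);
CREATE TABLE skills (
  id   INTEGER PRIMARY KEY,
  name TEXT,
  type TEXT
);
CREATE TABLE job_skills (
  job_id   INTEGER REFERENCES jobs (id),
  skill_id INTEGER REFERENCES skills (id),
  PRIMARY KEY (job_id, skill_id)
);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# List the tables that were created.
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
)]
print(tables)
```

Here job_skills acts as the link table between jobs and skills, which is why the skill queries further below need two joins.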

Data Analysis

1. Which cities have the most data science jobs in Finland?

To identify the largest data science hubs in Finland, I filtered jobs by country and then grouped them by location.

SELECT
  job_location AS location,
  COUNT(*) AS job_count
FROM
  jobs
WHERE
  job_country = 'Finland'
GROUP BY
  job_location
ORDER BY
  job_count DESC, location
LIMIT
  10;

Top 10 locations with data science jobs in Finland, 2023

Insights:

  • Helsinki dominates the data science job market in Finland with 1,345 job postings, reflecting its status as the country's capital and major economic hub.
  • Espoo and Tampere, the second and third largest cities in Finland, feature 241 and 110 job postings, respectively. Both cities have a university of technology, which likely increases the number of tech companies and the demand for data professionals in the region.
  • Data science job opportunities are also abundant in smaller cities across the country, exemplified by Oulu (43), Jyväskylä (17), and Lahti (11), as well as through remote positions (Anywhere), accounting for 195 job postings.

2. Which companies recruit the most data professionals in Finland?

To identify the biggest recruiters in Finland, I joined companies with the jobs data and then filtered and grouped them by country and company, respectively.

SELECT
  name AS company,
  COUNT(*) AS job_count
FROM
  companies AS c
LEFT JOIN
  jobs AS j ON c.id = j.company_id
WHERE
  job_country = 'Finland'
GROUP BY
  name
ORDER BY
  job_count DESC, name
LIMIT
  25;

Top 25 companies recruiting data professionals in Finland, 2023

Insights:

  • Wolt leads with 155 job postings, signaling its strong emphasis on data-driven decision-making in the food delivery industry.
  • Technology recruitment companies such as Nigel Frank International (109), Technojobs (46), and Academic Work (22) are heavily represented on the list, indicating a rising demand for data professionals across various sectors.
  • Financial institutions like Nordea Bank (95) and OP Financial Group (26) actively seek data professionals, highlighting the ever-growing role of data analytics in the banking and finance sectors.
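
A note on the join in this query: because the WHERE clause filters on job_country, a column of the jobs table, companies without any matching job are discarded either way, so LEFT JOIN and INNER JOIN return the same rows here. The sketch below checks this on a few made-up rows, using an in-memory SQLite database as a stand-in for PostgreSQL; the company names and rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE companies (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE jobs (id INTEGER PRIMARY KEY, company_id INTEGER, job_country TEXT);
-- Made-up rows for illustration only; Gamma has no job postings at all.
INSERT INTO companies VALUES (1, 'Alpha'), (2, 'Beta'), (3, 'Gamma');
INSERT INTO jobs VALUES
  (1, 1, 'Finland'), (2, 1, 'Finland'), (3, 2, 'Sweden');
""")

# Same query as above, with the join type left as a placeholder.
QUERY = """
SELECT name AS company, COUNT(*) AS job_count
FROM companies AS c
{join} jobs AS j ON c.id = j.company_id
WHERE job_country = 'Finland'
GROUP BY name
ORDER BY job_count DESC, name
"""

left_rows = conn.execute(QUERY.format(join="LEFT JOIN")).fetchall()
inner_rows = conn.execute(QUERY.format(join="INNER JOIN")).fetchall()
print(left_rows)
print(inner_rows)
```

Both variants return only Alpha with two postings: Beta is dropped because its posting is in another country, and Gamma because its job_country is NULL after the left join.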

3. What are the most in-demand data science job roles in Finland?

To identify the most demanded job roles in Finland, I filtered jobs by country and then grouped them by role.

SELECT
  job_title_short AS role,
  COUNT(*) AS job_count
FROM
  jobs
WHERE
  job_country = 'Finland'
GROUP BY
  job_title_short
ORDER BY
  job_count DESC;

Data science job roles by demand in Finland, 2023

Insights:

  • (Senior) Data Engineer roles are the most in-demand in Finland, with 870 job postings, indicating that there is a high demand for data professionals to build and maintain automated data processing pipelines and data storage systems within organizations.
  • (Senior) Data Analyst and (Senior) Data Scientist positions are also highly sought-after, with 460 and 458 job postings respectively, suggesting a growing need for professionals skilled in data analysis and advanced data modeling techniques.
  • Software Engineer roles follow closely behind, with 397 job postings, highlighting the intersection of software development and data science in the Finnish job market.

4. What are the highest-paying data science job roles in the world?

To identify the highest-paying job roles in the world, I grouped jobs with salary information by role and then computed their average yearly salary.

SELECT
  job_title_short AS role,
  COUNT(salary_year_avg) AS job_count,
  ROUND(AVG(salary_year_avg), 0) AS average_salary,
  ROUND(MIN(salary_year_avg), 0) AS min_salary,
  ROUND(MAX(salary_year_avg), 0) AS max_salary
FROM
  jobs
WHERE
  --  job_country = 'Finland' AND
  salary_year_avg IS NOT NULL
GROUP BY
  job_title_short
ORDER BY
  average_salary DESC;

Highest-paying data science job roles worldwide, 2023

Insights:

  • Experience: Senior-level Data Scientists ($154K), Data Engineers ($146K), and Data Analysts ($114K) enjoy average yearly salaries that are 12-21% higher than those of their less seasoned peers, underscoring the importance of experience in data science.
  • Skills: Roles demanding a diverse set of skills such as Data Scientist, Data Engineer, and Machine Learning Engineer are generally associated with higher annual salaries.

5. What are the most in-demand skills for Data Analysts and Data Engineers in Finland?

To identify the most in-demand skills for Data Analysts and Data Engineers in Finland, I joined jobs with the associated skills data and then filtered them by country and role, and finally grouped skills by their id.

SELECT
  name AS skill,
  type,
  COUNT(*) AS job_count
FROM
  jobs AS j
INNER JOIN
  job_skills AS js ON j.id = js.job_id
INNER JOIN
  skills AS s ON js.skill_id = s.id
WHERE
  job_country = 'Finland' AND
  job_title_short IN ('Data Analyst', 'Data Engineer')
GROUP BY
  s.id
ORDER BY
  job_count DESC
LIMIT
  10;

Top 10 in-demand skills for Data Analysts and Data Engineers in Finland, 2023

Insights:

  • Programming: SQL and Python top the list, with 584 and 537 job postings respectively, highlighting the importance of programming proficiency in both Data Analyst and Data Engineer roles.
  • Cloud: Cloud-related skills such as Azure, AWS, and Snowflake are in high demand, with 383, 273, and 186 job postings respectively, indicating a growing reliance on cloud-based data solutions and infrastructure among Finnish companies.
  • Analyst tools: Mastering tools like Power BI (231) and Tableau (150) is essential, especially for Data Analysts, suggesting a strong emphasis on data visualization for effective analysis and decision-making.
  • Libraries: Open-source libraries such as Apache Spark and Apache Kafka are featured prominently, with 145 and 129 job postings respectively, underscoring their vital roles in data processing and streaming tasks for Data Engineers in Finland.
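
To make the counting logic concrete: each posting contributes one joined row per skill it lists, so COUNT(*) per skill is the number of matching postings that mention that skill, and postings without any listed skills drop out of the inner join. The sketch below runs the same query shape on a few made-up rows in an in-memory SQLite database (a stand-in for PostgreSQL). The extra tie-breaker on skill in ORDER BY is an addition here to make the output order deterministic.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE jobs (id INTEGER PRIMARY KEY, job_title_short TEXT, job_country TEXT);
CREATE TABLE skills (id INTEGER PRIMARY KEY, name TEXT, type TEXT);
CREATE TABLE job_skills (job_id INTEGER, skill_id INTEGER);
-- Made-up rows for illustration only.
INSERT INTO jobs VALUES
  (1, 'Data Analyst', 'Finland'),
  (2, 'Data Engineer', 'Finland'),
  (3, 'Data Engineer', 'Finland'),   -- lists no skills
  (4, 'Data Scientist', 'Finland');  -- excluded by the role filter
INSERT INTO skills VALUES
  (1, 'sql', 'programming'),
  (2, 'python', 'programming'),
  (3, 'azure', 'cloud');
INSERT INTO job_skills VALUES (1, 1), (1, 2), (2, 1), (2, 3), (4, 2);
""")

rows = conn.execute("""
SELECT name AS skill, type, COUNT(*) AS job_count
FROM jobs AS j
INNER JOIN job_skills AS js ON j.id = js.job_id
INNER JOIN skills AS s ON js.skill_id = s.id
WHERE job_country = 'Finland'
  AND job_title_short IN ('Data Analyst', 'Data Engineer')
GROUP BY s.id
ORDER BY job_count DESC, skill
""").fetchall()
print(rows)
```

In this toy data, python is counted once rather than twice because its second mention belongs to a Data Scientist posting, and posting 3 does not affect any count.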

6. What are the most optimal skills to learn for Data Analysts and Data Engineers in the world?

To identify the most optimal skills to learn for Data Analysts and Data Engineers in the world, I joined jobs having salary information with the associated skills data, filtered them by role, and finally grouped the skills by their id, keeping only popular skills that appear in more than 500 job postings.

SELECT
  name AS skill,
  type,
  COUNT(*) AS job_count,
  ROUND(AVG(salary_year_avg), 0) AS average_salary
FROM
  jobs AS j
INNER JOIN
  job_skills AS js ON j.id = js.job_id
INNER JOIN
  skills AS s ON js.skill_id = s.id
WHERE
  -- job_country = 'Finland' AND
  job_title_short IN ('Data Analyst', 'Data Engineer') AND
  salary_year_avg IS NOT NULL
GROUP BY
  s.id
HAVING
  COUNT(*) > 500
ORDER BY
  average_salary DESC,
  job_count DESC
LIMIT
  10;

Top 10 optimal skills for Data Analysts and Data Engineers worldwide, 2023

Insights:

  • Programming: Less popular programming languages for data science, such as Scala and Java, show significant demand, which is reflected in their relatively high average yearly salaries of up to $141K and $133K, respectively.
  • Cloud: Skills related to cloud platforms and services like Redshift, Snowflake, and AWS are in high demand, offering lucrative average yearly salaries ranging from $131K to $136K.
  • Libraries: Mastery of Apache Kafka ($143K), Airflow ($135K), Spark ($134K), and Hadoop ($132K) is also highly valued, highlighting their significance in data engineering roles worldwide.
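
The HAVING clause is what keeps rarely requested skills out of the ranking: a skill mentioned in only a handful of well-paid postings could otherwise top the list on average salary alone. The sketch below illustrates this on made-up rows in an in-memory SQLite database (a stand-in for PostgreSQL). The threshold MIN_POSTINGS is a hypothetical stand-in for the 500 used in the query above, and the skill named rare_skill is invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE jobs (id INTEGER PRIMARY KEY, job_title_short TEXT, salary_year_avg REAL);
CREATE TABLE skills (id INTEGER PRIMARY KEY, name TEXT, type TEXT);
CREATE TABLE job_skills (job_id INTEGER, skill_id INTEGER);
-- Made-up rows for illustration only.
INSERT INTO jobs VALUES
  (1, 'Data Engineer', 140000),
  (2, 'Data Engineer', 120000),
  (3, 'Data Analyst', 90000),
  (4, 'Data Analyst', NULL);
INSERT INTO skills VALUES
  (1, 'sql', 'programming'),
  (2, 'scala', 'programming'),
  (3, 'rare_skill', 'other');
INSERT INTO job_skills VALUES
  (1, 1), (2, 1), (3, 1), (4, 1),  -- sql
  (1, 2), (2, 2),                  -- scala
  (1, 3);                          -- rare_skill, one posting only
""")

MIN_POSTINGS = 1  # toy stand-in for the threshold of 500

rows = conn.execute("""
SELECT name AS skill,
       COUNT(*) AS job_count,
       ROUND(AVG(salary_year_avg), 0) AS average_salary
FROM jobs AS j
INNER JOIN job_skills AS js ON j.id = js.job_id
INNER JOIN skills AS s ON js.skill_id = s.id
WHERE job_title_short IN ('Data Analyst', 'Data Engineer')
  AND salary_year_avg IS NOT NULL
GROUP BY s.id
HAVING COUNT(*) > ?
ORDER BY average_salary DESC, job_count DESC
""", (MIN_POSTINGS,)).fetchall()
print(rows)
```

Although rare_skill has the highest average salary in the toy data, it appears in a single salaried posting and is therefore filtered out, leaving scala ahead of sql.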

Conclusions

In 2023, a total of 2,557 data science job postings were published in Finland. The following summarizes the insights derived from the data analysis:

  1. Largest data science hubs: Large cities like Helsinki, Espoo, and Tampere boast many data science job opportunities, but smaller cities such as Jyväskylä and Lahti, along with remote positions, also offer good employment opportunities in the field.
  2. Biggest recruiters: Wolt is Finland's biggest recruiter in the field of data science with 155 job postings, followed by major technology recruitment firms, financial institutions, and tech companies.
  3. Most in-demand roles: In Finland, the most in-demand roles include (Senior) Data Engineer, (Senior) Data Analyst, (Senior) Data Scientist, and Software Engineer, suggesting a growing need for data professionals capable of working at the intersection of data science and software development.
  4. Highest-paying roles: Experienced data professionals, such as Senior-level Data Scientists and Data Engineers, enjoy higher average yearly salaries, reflecting the value placed on both experience and diverse skills in the global job market.
  5. Most in-demand skills: In Finland, SQL and Python proficiency is crucial for Data Analyst and Data Engineer roles, while demand for cloud skills like Azure and AWS is rising, alongside essential data analytics tools and libraries, such as Power BI and Apache Spark.
  6. Most optimal skills: Proficiency in Scala and Java programming languages commands high salaries in the global job market, as do cloud skills like Redshift, Snowflake, and AWS, along with mastery of various Apache libraries for data engineering.

In summary, the Finnish job market for data science looks very promising, with large urban centers serving as hubs for opportunities. Wolt emerges as a major player in recruitment, while demand remains high for experienced data professionals adept in diverse skills and technologies.

Database Setup Guide

Prerequisites

Usage

To start a PostgreSQL database service with the job postings data, run:

docker compose up

Note: The database initialization process takes several minutes due to the large amount of data.

To stop and remove the database service, press Ctrl+C and then run:

docker compose down

License

Copyright (c) 2024 Markku Laine

This software is distributed under the terms of the MIT License. See LICENSE for details.
