Skip to content

Applying various data analytics methodologies and techniques in data collection, data wrangling, exploratory data analysis, and creation of data visualization.

Notifications You must be signed in to change notification settings

CelestialArcadia/EDA-Technology-Trends

Repository files navigation

EDA-Technology-Trends

Applications of various data analytics skills and techniques in data collection, data wrangling, exploratory data analysis, and creation of data visualization.

Project Overview

  • Executive Summary
  • Introduction
  • Methodology
  • Results -
    • Data Visualization & Dashboard
    • Discussion - Findings & Implications
  • Conclusion
  • Appendix

I. Executive Summary

The first step in solving any data analysis problem is Data Collection. It can be done through various forms and sources. In this project, it will be collected through Web Scraping and Web APIs.

Sources including: * Surveys * Job Postings * Training portals

Data collected from the web is often if not, almost always raw and unclean. So, the data need to be prepared for the analysis.

Achieving this will be using Data Wrangling Techniques; which means that the data needs to be consistent, and without missing values and errors.

After finishing up with the data cleaning, next comes the application of statistical techniques to analyze the data, and identify insights and trends. Posing questions are important in order to find accurate solutions, and these questions are: * What are the top programming languages in demand? * What are the top database skills in demand? * What are the highest paying programming jobs?

Consequently, the findings will be put together through creating a Data Visualization utilizing the IBM Cognos Analystics to build an interactive dashboard.

Finally, the presention through storytelling using a written report or as a presentation.

II. Introduction

A global IT, and business consulting services firm needed to identify future skill requirements in order to keep pace with the constantly changing technologies and remain competitive.

The principle of the project is to answer one key question - What are the current skill requirement in technology field

After going through the whole process of analysing data, the result will inform the consulting firm of the key technical skills needed at the moment so the firm can focus on putting effort and resource into developing valuable technology assets. This is essential in the success of a business because, it's only possible to understand the market's needs through knowing the current trends, otherwise, the firm might slowly slide into obscurity, as well as it will cost tons of money if the firm works in the wrong direction.

As well as it may help new graduates or any person interested building a career based on the current industries' trends.

III. Methodology

Data sources and methods implemented in this project are mentioned in the "project-eda-main" file.

IV. Results - Data Visualization & Dashboard

After the data was cleaned, a dashboard was created on the IBM Cognos Analytics platform. Interactive-Dashboard

IV. Results - Findings & Implications

Here are some charts showing comparisons of the technology trends:

Programming Language Trends Comparison

Top 5 Programming Language in 2019

Based on "Stack Overflow Developer Survey 2019

Top 5 Programming Languages in 2020

Based on "Stack Overflow Developer Survey 2019

Javascript and HTML/CSS remain in the top two regardless of 2019 or 2020, which implies that web development is still the highest in demand. Python jumps from the top five currently to the top three in the future which implies a rapid increase in the popularity of a general-purpose programming language that's easy to learn. SQL is floating around the top three in 2019 and the top four in 202 potentially, which suggests that although it’s not at the highest ranking, it still plays a significant role in the tech field. TypeScript is not on the top five in 2019 chart but making it to the number five in 2020 implies that it’s expected to be more popular in the future.

Database Trends Comparison

Top 5 Databases in 2019

Based on "Stack Overflow Developer Survey 2019

Top 5 Databases in 2020

MySQL is the most popular database in 2019 and still maintains in the top four in 2020. Microsoft SQL is the second most popular, however, it falls out of the top five list in 2020, losing its popularity to other databases. PostgreSQL is seemingly gaining popularity, jumping from the top three in 2019 to the top one in the year 2020. It will stay competitive for quite some time. MongoDB is making its way up as well, from the top five of the list and making it to the top two in 2020.

Survey Respondent Demographic Below is a demographic dashboard that I developed based on the developer survey data. It showcases the gender, age, education level, and country of origin of the survey respondents.

Survey Respondent Demographic

Based on "Stack Overflow Developer Survey 2019"

Key information about the developers’ demographic:

  • 93.5% of the population are men, leaving only 6.5% women in the field. There is a huge gender gap in the technology workforce.

  • The map where it’s the darkest blue demonstrates that most of the population is heavily concentrated in the United States.

  • Respondent's age range is roughly in the 20s to 30s.

  • The education level lies mostly in the attainment of a Bachelor's degree or a Master's degree.

From the demographic data analyzed, there are large gaps in the technology industry. More opportunities can be given to women and people in other geographic locations besides the United States. so the technology workforce would be more diversified.

V. Conclusion

Based on programming languages' and databases' trends out there, and collerating it to the current technical skills needed for the firm. Python, Swift, and Javascript seem to be widely used in recent years. Python is known for its readability. It allows programmers to write clear and logical codes without complexity. Swift is purposed for developing mobile applications. Javascript is exceptionally flexible as it is used for both web frontend and backend applications. They all seemingly carry the characteristics of mobility, simplicity, and flexibility. The firm can certainly focus on investing more resources into these areas.

VI. Appendix

While exploring the data, I also created a chart based on the top five most popular development environments that respondents picked. Visual Studio Code seems to be the most popular one that engineers use.

Top 5 Development Environments Based on "Stack Overflow Developer Survey 2019"

About

Applying various data analytics methodologies and techniques in data collection, data wrangling, exploratory data analysis, and creation of data visualization.

Topics

Resources

Stars

Watchers

Forks