1.0 STATEMENT OF THE PROBLEM:

The Stack Overflow survey offers a wealth of information, but the breadth and complexity of the data make analysis difficult. New data is also added every year, so the dataset keeps changing. Software development companies and other industry stakeholders therefore need efficient and effective tools and processes for cleaning, processing, and analyzing this data, so that they can extract insights that support informed decisions.

1.1 AIM AND OBJECTIVES:

The aim of this project is to investigate how Python can be used for data analysis, and specifically to analyze the results of the 2022 Stack Overflow survey. The objective is to uncover patterns and trends in the software development sector, including gender distribution, the use of programming languages and frameworks, and the level of job compensation among developers.

These objectives were achieved by carrying out the following steps:

  • Clean and preprocess the 2022 Stack Overflow survey data using Python and Pandas
  • Analyze the cleaned survey data with Pandas, NumPy and SciPy to identify patterns in the software development industry (gender distribution, programming languages and frameworks, and the level of job compensation among developers)
  • Visualize the cleaned survey data using Python libraries such as Matplotlib and Seaborn
  • Generate insights and recommendations that can inform decision-making in the software development industry

By meeting these objectives, this study offers a road map for using Python and data analysis methods to derive insights from large and complex datasets.
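
The cleaning and analysis steps listed above might look roughly like the following. This is a minimal sketch under stated assumptions, not the project's actual code: the column names (YearsCodePro, ConvertedCompYearly, LanguageHaveWorkedWith) are assumed to match the public survey export, the helper names clean_survey and language_counts are hypothetical, and the sample rows are invented purely for illustration.

```python
import pandas as pd

# Invented sample rows for illustration only; these are NOT survey results.
# Column names are assumptions modeled on the public survey export.
raw = pd.DataFrame({
    "YearsCodePro": ["5", "Less than 1 year", "More than 50 years", "12", None],
    "ConvertedCompYearly": [60000, 25000, None, 110000, 48000],
    "LanguageHaveWorkedWith": ["Python;SQL", "JavaScript", "C;C++",
                               "Python;JavaScript", None],
})

# Assumed numeric stand-ins for the two free-text experience answers.
TEXT_TO_YEARS = {"Less than 1 year": "0.5", "More than 50 years": "51"}


def clean_survey(df):
    """Hypothetical helper: return a cleaned copy of the survey frame."""
    out = df.copy()
    # Replace the text answers, then convert the column to numbers.
    years = out["YearsCodePro"].map(lambda v: TEXT_TO_YEARS.get(v, v))
    out["YearsCodePro"] = pd.to_numeric(years, errors="coerce")
    # Keep only respondents who reported a yearly compensation.
    return out.dropna(subset=["ConvertedCompYearly"])


def language_counts(df):
    """Hypothetical helper: count each language in the multi-select column."""
    # Multi-select answers are assumed to be semicolon-separated.
    langs = df["LanguageHaveWorkedWith"].dropna().str.split(";").explode()
    return langs.value_counts()


cleaned = clean_survey(raw)
print(cleaned["ConvertedCompYearly"].describe())
print(language_counts(cleaned))
```

With the real survey file, raw would instead be loaded with pd.read_csv, and the resulting counts and summary statistics could be passed on to Matplotlib or Seaborn for plotting.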


1.2 RESEARCH QUESTIONS AND HYPOTHESES:

The following research questions guided this study:

  • What are the demographics of the survey participants?
  • What is the level of job compensation among developers?
  • How does job compensation among developers vary by demographic factors such as age and work experience?
  • What programming languages are most commonly used by developers?
  • What programming frameworks are most commonly used by developers?

Hypotheses:

  • There is no strong relationship between job compensation and developers' work experience.
  • There is no correlation between job compensation and developers' age.
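
Both hypotheses are stated as null hypotheses about correlation, so one way to examine them is a correlation test from SciPy. The sketch below is illustrative only: the function name correlation_test, the 0.05 significance level, and the column names are assumptions, and the sample values are invented rather than taken from the survey. Pearson's r measures linear association; Spearman's rank correlation is included as an alternative because compensation data are often skewed.

```python
import pandas as pd
from scipy import stats

# Invented sample values for illustration only; NOT survey results.
sample = pd.DataFrame({
    "YearsCodePro": [1, 3, 5, 8, 12, 20, None],
    "ConvertedCompYearly": [30000, 42000, 55000, 70000, 90000, 125000, 60000],
})


def correlation_test(df, x, y, alpha=0.05):
    """Hypothetical helper: test the null hypothesis of no correlation."""
    # Use only rows where both values are present.
    pair = df[[x, y]].dropna()
    pearson_r, pearson_p = stats.pearsonr(pair[x], pair[y])
    spearman_rho, spearman_p = stats.spearmanr(pair[x], pair[y])
    return {
        "n": len(pair),
        "pearson_r": float(pearson_r),
        "pearson_p": float(pearson_p),
        "spearman_rho": float(spearman_rho),
        "spearman_p": float(spearman_p),
        # Reject the null hypothesis when the Pearson p-value is below alpha.
        "reject_null": bool(pearson_p < alpha),
    }


result = correlation_test(sample, "YearsCodePro", "ConvertedCompYearly")
print(result)
```

The same function could be applied to age, assuming age is available as a number; if the survey records age as ranges, the ranges would first need to be mapped to ordered numeric values, and the rank-based Spearman result would be the more appropriate one to read.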

