... with focus on Indonesia, diverse representation in tech, and xkcd-style visualizations.
This is my attempt to gain insights from Stackoverflow Developer Survey in 2020
. They have been doing it since 2011, but currently I’m focusing on 2020 first. In fact, I’m trying to compare the trends in Indonesia with what’s happening to the rest of the world. The data for each year can be obtained here
.
In any case you’re interested to play around on your machine, just do the following inside the terminal:
cd ~
git clone https://github.com/ledwindra/stackoverflow-survey.git
cd stackoverflow-survey
You may need a virtual environment in case you don’t want to mess up with the existing modules inside your machine because here we need external modules such as pandas
, matplotlib
, and seaborn
. Just do the following inside the terminal:
python -m venv .venv # .venv can be changed to whatever it is as you like
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt # you're all set
Just type and press deactivate
when you want to exit from the virtual environment. You can just install the requirements if you don’t care with virtual environment at all
Now we’re ready to explore the data. There are two main components here. First, it’s the Python script that will be used in the notebook. It’s located inside src/stackoverflow.py
. Its task is to manipulate the DataFrame and create visualization methods. The main purpose is to make the notebook looks cleaner. The notebook itself is located inside index.ipynb
.
You can contribute to the analysis by editing this repo inside your branch and making a pull request. Moreover, you can also file issues
in case you have any questions or find bugs.
I hope you enjoy this repository and cheers!