Name	Name	Last commit message	Last commit date
Latest commit History 68 Commits
data	data
.gitignore	.gitignore
Data_clustering_countries.ipynb	Data_clustering_countries.ipynb
Data_clustering_indicators.ipynb	Data_clustering_indicators.ipynb
Data_filling_NaN_values.ipynb	Data_filling_NaN_values.ipynb
Data_load.ipynb	Data_load.ipynb
Data_normalization_outliers.ipynb	Data_normalization_outliers.ipynb
LICENSE	LICENSE
README.md	README.md

Name

Last commit message

Last commit date

data

.gitignore

Data_clustering_countries.ipynb

Data_clustering_indicators.ipynb

Data_filling_NaN_values.ipynb

Data_load.ipynb

Data_normalization_outliers.ipynb

LICENSE

README.md

Data Driven Decisions

Use Python, Pandas, Spark etc to demontrate that correlation can be used as a basis for decision making.

This project consists of finding the correlation between the GDP (Gross Domestic Product) and social and economical indicators, such as population growth, fertility rates, investment in specific sectors or prices.

The project will be developed by 2 teams in parallel. You can find more information in their main branches:

Execute the project

Execute the notebooks in the following order:

Data_load
Data_normalization
Data_outliers, Data_filling, Data_visualization.

This will create a series of output DataFrames as .csv files.

Explanation of the followed process

The Hypothesis: It is assumed that there exists a correlation between economic growth and indicators as infant mortality, access to education... We want to demonstarte the validity of this assumption based on available datasets.

In order to check the veracity of this hypothesis the following steps are going to be followed:

First step : Choose the indicators

In order to study the correlation between the economic indicators and some socio-demographic indicators, we have to choose the different indicators :

Gdp from 1850 to 2020 in pounds
Infant mortality of children under 5 years old
Percentage of population age 15+ with tertiary schooling.
Fertility rate
gender inequality
Life expectancy

I choose to measure the economic growth to compare the indicators with the GDP of the country.

2nd step : Select source of information

I chose to extract datasets about these indicators from the website Our world in data

Contributors 4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data Driven Decisions

Execute the project

Explanation of the followed process

First step : Choose the indicators

2nd step : Select source of information

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

License

devonfw-forge/python-data-driven-decisions

Folders and files

Latest commit

History

Repository files navigation

Data Driven Decisions

Execute the project

Explanation of the followed process

First step : Choose the indicators

2nd step : Select source of information

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Packages