brunoOnm
diff --git a/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise1.ipynb
Lines changed: 205 additions & 0 deletions b/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise1.ipynb
Lines changed: 205 additions & 0 deletions
diff --git a/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise2.ipynb
Lines changed: 168 additions & 0 deletions b/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise2.ipynb
Lines changed: 168 additions & 0 deletions
diff --git a/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise3.ipynb
Lines changed: 170 additions & 0 deletions b/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise3.ipynb
Lines changed: 170 additions & 0 deletions
diff --git a/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise4.ipynb
Lines changed: 217 additions & 0 deletions b/‎Cap09/Mini-Projeto/Mini-Projeto2 - Analise4.ipynb
Lines changed: 217 additions & 0 deletions
diff --git a/‎Cap09/Mini-Projeto/dataset/autos.csv
Lines changed: 313984 additions & 0 deletions b/‎Cap09/Mini-Projeto/dataset/autos.csv
Lines changed: 313984 additions & 0 deletions
diff --git a/‎Cap09/Mini-Projeto/plots/Analise1/count-vehicleType.png
29.6 KB b/‎Cap09/Mini-Projeto/plots/Analise1/count-vehicleType.png
29.6 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise1/price-vehicleType-boxplot.png
17.6 KB b/‎Cap09/Mini-Projeto/plots/Analise1/price-vehicleType-boxplot.png
17.6 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise1/vehicle-distribution.png
26.7 KB b/‎Cap09/Mini-Projeto/plots/Analise1/vehicle-distribution.png
26.7 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise2/brand-vehicleCount.png
33.7 KB b/‎Cap09/Mini-Projeto/plots/Analise2/brand-vehicleCount.png
33.7 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise2/vehicletype-gearbox-price.png
19.9 KB b/‎Cap09/Mini-Projeto/plots/Analise2/vehicletype-gearbox-price.png
19.9 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise3/fueltype-vehicleType-price.png
19.8 KB b/‎Cap09/Mini-Projeto/plots/Analise3/fueltype-vehicleType-price.png
19.8 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise3/vehicletype-fueltype-power.png
20.7 KB b/‎Cap09/Mini-Projeto/plots/Analise3/vehicletype-fueltype-power.png
20.7 KB
diff --git a/‎Cap09/Mini-Projeto/plots/Analise4/heatmap-price-brand-vehicleType.png
207 KB b/‎Cap09/Mini-Projeto/plots/Analise4/heatmap-price-brand-vehicleType.png
207 KB
diff --git a/‎Cap09/Notebooks/DSA-Python-Cap09-Analise-Exploratoria-de-Dados.ipynb
Lines changed: 2857 additions & 0 deletions b/‎Cap09/Notebooks/DSA-Python-Cap09-Analise-Exploratoria-de-Dados.ipynb
Lines changed: 2857 additions & 0 deletions
diff --git a/‎Cap09/Notebooks/DSA-Python-Cap09-Exercicio-Solucao.ipynb
Lines changed: 1164 additions & 0 deletions b/‎Cap09/Notebooks/DSA-Python-Cap09-Exercicio-Solucao.ipynb
Lines changed: 1164 additions & 0 deletions
diff --git a/‎Cap09/Notebooks/DSA-Python-Cap09-Exercicio.ipynb
Lines changed: 358 additions & 0 deletions b/‎Cap09/Notebooks/DSA-Python-Cap09-Exercicio.ipynb
Lines changed: 358 additions & 0 deletions
@@ -0,0 +1,358 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# <font color='blue'>Data Science Academy - Python Fundamentos - Capítulo 9</font>\n",
+    "\n",
+    "## Download: http://github.com/dsacademybr\n",
+    "\n",
+    "## Exercício: Análise Exploratória de Dados com Python\n",
+    "\n",
+    "Neste exercício, você vai realizar uma análise exploratória em um dos mais famosos datasets para Machine Learning, o dataset iris com informações sobre 3 tipos de plantas. Esse dataset é comumente usado em problemas de Machine Learning de classificação, quando nosso objetivo é prever a classe dos dados. No caso deste dataset, prever a categoria de uma planta a partir de medidas da planta (sepal e petal).\n",
+    "\n",
+    "Em cada célula, você encontra a tarefa a ser realizada. Faça todo o exercício e depois compare com a solução proposta.\n",
+    "\n",
+    "Dataset (já disponível com o Scikit-Learn): https://archive.ics.uci.edu/ml/datasets/iris"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Imports\n",
+    "import time\n",
+    "import numpy as np\n",
+    "import pandas as pd\n",
+    "from matplotlib import pyplot as plt\n",
+    "from sklearn.datasets import load_iris\n",
+    "%matplotlib inline\n",
+    "\n",
+    "fontsize = 14\n",
+    "ticklabelsize = 14"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "150\n"
+     ]
+    },
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr style=\"text-align: right;\">\n",
+       "      <th></th>\n",
+       "      <th>sepal length (cm)</th>\n",
+       "      <th>sepal width (cm)</th>\n",
+       "      <th>petal length (cm)</th>\n",
+       "      <th>petal width (cm)</th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>0</th>\n",
+       "      <td>5.1</td>\n",
+       "      <td>3.5</td>\n",
+       "      <td>1.4</td>\n",
+       "      <td>0.2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>1</th>\n",
+       "      <td>4.9</td>\n",
+       "      <td>3.0</td>\n",
+       "      <td>1.4</td>\n",
+       "      <td>0.2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>2</th>\n",
+       "      <td>4.7</td>\n",
+       "      <td>3.2</td>\n",
+       "      <td>1.3</td>\n",
+       "      <td>0.2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>3</th>\n",
+       "      <td>4.6</td>\n",
+       "      <td>3.1</td>\n",
+       "      <td>1.5</td>\n",
+       "      <td>0.2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4</th>\n",
+       "      <td>5.0</td>\n",
+       "      <td>3.6</td>\n",
+       "      <td>1.4</td>\n",
+       "      <td>0.2</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "   sepal length (cm)  sepal width (cm)  petal length (cm)  petal width (cm)\n",
+       "0                5.1               3.5                1.4               0.2\n",
+       "1                4.9               3.0                1.4               0.2\n",
+       "2                4.7               3.2                1.3               0.2\n",
+       "3                4.6               3.1                1.5               0.2\n",
+       "4                5.0               3.6                1.4               0.2"
+      ]
+     },
+     "execution_count": 2,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# Carregando o dataset\n",
+    "iris = load_iris()\n",
+    "df = pd.DataFrame(iris.data, columns=iris.feature_names)\n",
+    "print(len(df))\n",
+    "df.head()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Extração e Transformação de Dados"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Imprima os valores numéricos da Variável target (o que queremos prever), \n",
+    "# uma de 3 possíveis categorias de plantas: setosa, versicolor ou virginica\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Imprima os valores numéricos da Variável target (o que queremos prever), \n",
+    "# uma de 3 possíveis categorias de plantas: 0, 1 ou 2\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Adicione ao dataset uma nova coluna com os nomes das espécies, pois é isso que vamos tentar prever (variável target)\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Inclua no dataset uma coluna com os valores numéricos da variável target\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Extraia as features (atributos) do dataset e imprima \n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Calcule a média de cada feature para as 3 classes\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exploração de Dados"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Imprima uma Transposta do dataset (transforme linhas e colunas e colunas em linhas)\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Utilize a função Info do dataset para obter um resumo sobre o dataset \n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Faça um resumo estatístico do dataset\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Verifique se existem valores nulos no dataset\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "metadata": {
+    "scrolled": true
+   },
+   "outputs": [],
+   "source": [
+    "# Faça uma contagem de valores de sepal length\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Plot"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {
+    "scrolled": true
+   },
+   "outputs": [],
+   "source": [
+    "# Crie um Histograma de sepal length\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Crie um Gráficos de Dispersão (scatter Plot) da variável sepal length versus número da linha, \n",
+    "# colorido por marcadores da variável target\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Crie um Scatter Plot de 2 Features (atributos)\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Crie um Scatter Matrix das Features (atributos)\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 18,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Crie um Histograma de todas as features\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Conheça a Formação Cientista de Dados, um programa completo, 100% online e 100% em português, com 340 horas, mais de 1.200 aulas em vídeos e 26 projetos, que vão ajudá-lo a se tornar um dos profissionais mais cobiçados do mercado de análise de dados. Clique no link abaixo, faça sua inscrição, comece hoje mesmo e aumente sua empregabilidade:\n",
+    "\n",
+    "https://www.datascienceacademy.com.br/pages/formacao-cientista-de-dados"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "collapsed": true
+   },
+   "source": [
+    "# Fim"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Obrigado - Data Science Academy - <a href=http://facebook.com/dsacademy>facebook.com/dsacademybr</a>"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.6.4"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 1
+}