|
30 | 30 | }, |
31 | 31 | { |
32 | 32 | "cell_type": "code", |
33 | | - "execution_count": 2, |
| 33 | + "execution_count": 3, |
34 | 34 | "metadata": {}, |
35 | 35 | "outputs": [ |
36 | 36 | { |
37 | 37 | "data": { |
38 | 38 | "application/vnd.jupyter.widget-view+json": { |
39 | | - "model_id": "d2cf7a35-5540-4e74-ac94-07cb97cbaf68", |
| 39 | + "model_id": "05dc6698-2418-49a5-89a2-606bc131197f", |
40 | 40 | "version_major": 2, |
41 | 41 | "version_minor": 0 |
42 | 42 | }, |
|
167 | 167 | }, |
168 | 168 | { |
169 | 169 | "cell_type": "code", |
170 | | - "execution_count": 12, |
| 170 | + "execution_count": 4, |
171 | 171 | "metadata": {}, |
172 | 172 | "outputs": [ |
173 | 173 | { |
174 | 174 | "name": "stdout", |
175 | 175 | "output_type": "stream", |
176 | 176 | "text": [ |
177 | 177 | "Random Variables = [Cavity, Toothache, Catch]\n", |
178 | | - "The cavity Node: Cavity\n", |
179 | | - "The toothache Node: Toothache\n", |
180 | | - "The catch Node: Catch\n" |
| 178 | + "The cavity Node: Cavity\n" |
181 | 179 | ] |
182 | 180 | }, |
183 | 181 | { |
184 | 182 | "data": { |
185 | 183 | "text/plain": [ |
186 | | - "aima.core.probability.bayes.impl.BayesNet@5a94841e" |
| 184 | + "aima.core.probability.bayes.impl.BayesNet@3e549709" |
187 | 185 | ] |
188 | 186 | }, |
189 | | - "execution_count": 12, |
| 187 | + "execution_count": 4, |
190 | 188 | "metadata": {}, |
191 | 189 | "output_type": "execute_result" |
192 | 190 | } |
|
299 | 297 | }, |
300 | 298 | { |
301 | 299 | "cell_type": "code", |
302 | | - "execution_count": 18, |
| 300 | + "execution_count": 5, |
303 | 301 | "metadata": {}, |
304 | 302 | "outputs": [ |
305 | 303 | { |
|
320 | 318 | "null" |
321 | 319 | ] |
322 | 320 | }, |
323 | | - "execution_count": 18, |
| 321 | + "execution_count": 5, |
324 | 322 | "metadata": {}, |
325 | 323 | "output_type": "execute_result" |
326 | 324 | } |
|
359 | 357 | " + model.posterior(cavity,toothache));" |
360 | 358 | ] |
361 | 359 | }, |
| 360 | + { |
| 361 | + "cell_type": "markdown", |
| 362 | + "metadata": {}, |
| 363 | + "source": [ |
| 364 | + "## Exact Inference in Bayesian Networks\n", |
| 365 | + "The basic task for any probabilistic inference system is to compute the posterior probability\n", |
| 366 | + "distribution for a set of query variables, given some observed event, that is, some assignment of values to a set of evidence variables. We will use the notation from the previous notebook: X denotes the query variable; **E**\n", |
| 367 | + "denotes the set of evidence variables $E_1 , . . . , E_m$ , and **e** is a particular observed event; Y will\n", |
| 368 | + "denote the nonevidence, nonquery variables $Y_1 , . . . , Y_l$ (called the hidden variables). Thus,\n", |
| 369 | + "the complete set of variables is $\\textbf{X} = \\{X\\} \\cup \\textbf{E} \\cup \\textbf{Y}$. A typical query asks for the posterior\n", |
| 370 | + "probability distribution $P(X | \\textbf{e})$." |
| 371 | + ] |
| 372 | + }, |
| 373 | + { |
| 374 | + "cell_type": "markdown", |
| 375 | + "metadata": {}, |
| 376 | + "source": [ |
| 377 | + "### Inference by enumeration\n", |
| 378 | + "We proved in the previous notebook that any conditional probability can be computed by using the full joint distribution. Mathematically:\n", |
| 379 | + "$$ \\textbf{P}(X|\\textbf{e}) = \\alpha \\textbf{P}(X,\\textbf{e}) = \\alpha \\sum_{\\textbf{y}}\\textbf{P}(X,\\textbf{e},\\textbf{y})$$" |
| 380 | + ] |
| 381 | + }, |
| 382 | + { |
| 383 | + "cell_type": "markdown", |
| 384 | + "metadata": {}, |
| 385 | + "source": [ |
| 386 | + "Also, we know that a Bayesian network gives a complete representation of the full joint distribution. Hence, the above sum can be computed from a Bayesian network. Now, let's walk through the entire process. Let b, j, and m be particular values of the random variables *B* (Burglary), *J* (JohnCalls), and *M* (MaryCalls) in the burglary network. Let *E* (Earthquake) and *A* (Alarm) be the hidden variables. Then, by using the sum and product rules of probability, we get:\n", |
| 387 | + "$$P(b|j,m) = \\alpha P(b) \\sum_{e}P(e)\\sum_{a}P(a|b,e)P(j|a)P(m|a)$$" |
| 388 | + ] |
| 389 | + }, |
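The nested sum above can be evaluated directly. The following standalone sketch (my own illustration, independent of the aima-core classes; the CPT values are the standard AIMA burglary-network figures) computes the unnormalized terms of $P(b|j,m)$ by brute-force enumeration over the hidden variables *E* and *A*, then normalizes:

```java
public class BurglaryEnumeration {
    // CPTs of the standard AIMA burglary network
    static final double P_B = 0.001, P_E = 0.002;

    static double pA(boolean b, boolean e) {   // P(Alarm = true | B, E)
        if (b && e) return 0.95;
        if (b)      return 0.94;
        if (e)      return 0.29;
        return 0.001;
    }
    static double pJ(boolean a) { return a ? 0.90 : 0.05; }  // P(JohnCalls = true | A)
    static double pM(boolean a) { return a ? 0.70 : 0.01; }  // P(MaryCalls = true | A)

    // Unnormalized term: P(b) * sum_e P(e) * sum_a P(a|b,e) P(j|a) P(m|a)
    static double unnormalized(boolean b) {
        double outer = 0.0;
        for (boolean e : new boolean[]{true, false}) {
            double inner = 0.0;
            for (boolean a : new boolean[]{true, false}) {
                double pa = a ? pA(b, e) : 1 - pA(b, e);
                inner += pa * pJ(a) * pM(a);       // evidence j = true, m = true
            }
            outer += (e ? P_E : 1 - P_E) * inner;
        }
        return (b ? P_B : 1 - P_B) * outer;
    }

    // Normalize the two unnormalized values (this is the alpha in the formula).
    public static double posteriorBurglary() {
        double t = unnormalized(true), f = unnormalized(false);
        return t / (t + f);
    }

    public static void main(String[] args) {
        System.out.printf("P(b | j, m) = %.4f%n", posteriorBurglary()); // ~0.2842
    }
}
```

Normalizing the two unnormalized values yields $P(b|j,m) \approx 0.284$, the familiar textbook result.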
| 390 | + { |
| 391 | + "cell_type": "markdown", |
| 392 | + "metadata": {}, |
| 393 | + "source": [ |
| 394 | + "The above calculation can be represented as the calculation tree shown below. The order of evaluation corresponds to a depth-first traversal of this tree; in fact, the ENUMERATION-ASK algorithm uses a depth-first approach to solve the inference problem." |
| 395 | + ] |
| 396 | + }, |
| 397 | + { |
| 398 | + "cell_type": "code", |
| 399 | + "execution_count": 6, |
| 400 | + "metadata": {}, |
| 401 | + "outputs": [ |
| 402 | + { |
| 403 | + "data": { |
| 404 | + "text/markdown": [ |
| 405 | + "### AIMA3e\n", |
| 406 | + "__function__ ENUMERATION-ASK(_X_, __e__, _bn_) __returns__ a distribution over _X_ \n", |
| 407 | + " __inputs__: _X_, the query variable \n", |
| 408 | + "     __e__, observed values for variables __E__ \n", |
| 409 | + "     _bn_, a Bayes net with variables \\{_X_\\} ⋃ __E__ ⋃ __Y__ /\\* __Y__ = hidden variables \\*/ \n", |
| 410 | + "\n", |
| 411 | + " __Q__(_X_) ← a distribution over _X_, initially empty \n", |
| 412 | + " __for each__ value _x<sub>i</sub>_ of _X_ __do__ \n", |
| 413 | + "   __Q__(_x<sub>i</sub>_) ← ENUMERATE\\-ALL(_bn_.VARS, __e__<sub>_x_<sub>_i_</sub></sub>) \n", |
| 414 | + "     where __e__<sub>_x_<sub>_i_</sub></sub> is __e__ extended with _X_ = _x<sub>i</sub>_ \n", |
| 415 | + " __return__ NORMALIZE(__Q__(_X_)) \n", |
| 416 | + "\n", |
| 417 | + "---\n", |
| 418 | + "__function__ ENUMERATE\\-ALL(_vars_, __e__) __returns__ a real number \n", |
| 419 | + " __if__ EMPTY?(_vars_) __then return__ 1.0 \n", |
| 420 | + " _Y_ ← FIRST(_vars_) \n", |
| 421 | + " __if__ _Y_ has value _y_ in __e__ \n", |
| 422 | + "   __then return__ _P_(_y_ | _parents_(_Y_)) × ENUMERATE\\-ALL(REST(_vars_), __e__) \n", |
| 423 | + "   __else return__ ∑<sub>_y_</sub> _P_(_y_ | _parents_(_Y_)) × ENUMERATE\\-ALL(REST(_vars_), __e__<sub>_y_</sub>) \n", |
| 424 | + "     where __e__<sub>_y_</sub> is __e__ extended with _Y_ = _y_ \n", |
| 425 | + "\n", |
| 426 | + "---\n", |
| 427 | + "__Figure__ ?? The enumeration algorithm for answering queries on Bayesian networks." |
| 428 | + ], |
| 429 | + "text/plain": [ |
| 430 | + "<IPython.core.display.Markdown object>" |
| 431 | + ] |
| 432 | + }, |
| 433 | + "execution_count": 6, |
| 434 | + "metadata": {}, |
| 435 | + "output_type": "execute_result" |
| 436 | + } |
| 437 | + ], |
| 438 | + "source": [ |
| 439 | + "%%python\n", |
| 440 | + "from notebookUtils import *\n", |
| 441 | + "pseudocode('Enumeration Ask')" |
| 442 | + ] |
| 443 | + }, |
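The pseudocode above translates almost line for line into code. Below is a minimal sketch (my own illustration; the class and method names are hypothetical and not part of aima-core) that runs ENUMERATE-ALL on the Toothache/Cavity/Catch network used throughout this notebook, with Catch as the hidden variable:

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class EnumerateAllSketch {
    // Variables in topological order; Catch ends up hidden for this query.
    static final List<String> VARS = List.of("Cavity", "Toothache", "Catch");

    // P(var = value | parents(var)); Cavity has no parents.
    static double prob(String var, boolean value, Map<String, Boolean> e) {
        double pTrue;
        switch (var) {
            case "Cavity":    pTrue = 0.2; break;
            case "Toothache": pTrue = e.get("Cavity") ? 0.6 : 0.1; break;
            case "Catch":     pTrue = e.get("Cavity") ? 0.9 : 0.2; break;
            default: throw new IllegalArgumentException(var);
        }
        return value ? pTrue : 1 - pTrue;
    }

    // Direct transcription of ENUMERATE-ALL from the pseudocode above.
    static double enumerateAll(List<String> vars, Map<String, Boolean> e) {
        if (vars.isEmpty()) return 1.0;
        String y = vars.get(0);
        List<String> rest = vars.subList(1, vars.size());
        if (e.containsKey(y)) {                       // Y has a value in e
            return prob(y, e.get(y), e) * enumerateAll(rest, e);
        }
        double sum = 0.0;                             // else: sum out the hidden variable
        for (boolean val : new boolean[]{true, false}) {
            Map<String, Boolean> ey = new HashMap<>(e);  // e extended with Y = y
            ey.put(y, val);
            sum += prob(y, val, ey) * enumerateAll(rest, ey);
        }
        return sum;
    }

    // ENUMERATION-ASK for a boolean query variable: extend e with each value, then normalize.
    public static double ask(String x, Map<String, Boolean> e) {
        Map<String, Boolean> et = new HashMap<>(e), ef = new HashMap<>(e);
        et.put(x, true);
        ef.put(x, false);
        double qt = enumerateAll(VARS, et), qf = enumerateAll(VARS, ef);
        return qt / (qt + qf);                        // NORMALIZE, returning P(X = true | e)
    }

    public static void main(String[] args) {
        Map<String, Boolean> evidence = new HashMap<>();
        evidence.put("Toothache", true);
        System.out.println("P(cavity | toothache) = " + ask("Cavity", evidence)); // ~0.6
    }
}
```

Because Catch is not in the evidence, its branch sums to 1 and contributes nothing to the query, which is exactly the kind of repeated work that motivates variable elimination.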
| 444 | + { |
| 445 | + "cell_type": "markdown", |
| 446 | + "metadata": {}, |
| 447 | + "source": [ |
| 448 | + "The above algorithm calculates the desired conditional distribution. It is implemented in the [`EnumerationAsk`](/aima-core/src/main/java/aima/core/probability/bayes/exact/EnumerationAsk.java) class in the repository. The algorithm takes as input a query variable, a set of evidence variables, and a Bayesian network. However, we will not call the algorithm directly. Instead, we will pass `EnumerationAsk` as the inference procedure to our `FiniteBayesModel`. The cell below shows the steps." |
| 449 | + ] |
| 450 | + }, |
| 451 | + { |
| 452 | + "cell_type": "code", |
| 453 | + "execution_count": 20, |
| 454 | + "metadata": {}, |
| 455 | + "outputs": [ |
| 456 | + { |
| 457 | + "name": "stdout", |
| 458 | + "output_type": "stream", |
| 459 | + "text": [ |
| 460 | + "The prior distribution for toothache is <0.2, 0.8>\n", |
| 461 | + "The prior distribution for cavity is <0.2, 0.8>\n", |
| 462 | + "The prior distribution for catch is <0.34, 0.66>\n", |
| 463 | + "The posterior distribution for toothache given cavity is \n", |
| 464 | + " \t <0.6, 0.10000000000000002, 0.4000000000000001, 0.9>\n", |
| 465 | + "The posterior distribution for catch given cavity is \n", |
| 466 | + " \t <0.9, 0.19999999999999998, 0.09999999999999999, 0.7999999999999999>\n", |
| 467 | + "The prior probability of having a cavity is 0.2\n", |
| 468 | + "The posterior probability of having a cavity given a toothache is 0.6\n", |
| 469 | + "The prior probability of having a cavity or a toothache is 0.28\n", |
| 470 | + "The posterior probability of not having a cavity given a toothache is 0.4000000000000001\n", |
| 471 | + "The prior probability of not having a cavity but having a toothache is 0.08000000000000002\n", |
| 472 | + "The prior probability of having a cavity or a toothache is 0.28\n", |
| 473 | + "The posterior probability of having a cavity given that the patient has a cavity or a toothache is 0.7142857142857143\n" |
| 474 | + ] |
| 475 | + }, |
| 476 | + { |
| 477 | + "data": { |
| 478 | + "text/plain": [ |
| 479 | + "null" |
| 480 | + ] |
| 481 | + }, |
| 482 | + "execution_count": 20, |
| 483 | + "metadata": {}, |
| 484 | + "output_type": "execute_result" |
| 485 | + } |
| 486 | + ], |
| 487 | + "source": [ |
| 488 | + "package aima.notebooks.probabilisticreasoning;\n", |
| 489 | + "\n", |
| 490 | + "import aima.core.probability.example.*;\n", |
| 491 | + "import aima.core.probability.bayes.*;\n", |
| 492 | + "import aima.core.probability.bayes.exact.*;\n", |
| 493 | + "import aima.core.probability.bayes.impl.*;\n", |
| 495 | + "import aima.core.probability.bayes.model.*;\n", |
| 496 | + "import aima.core.probability.proposition.*;\n", |
| 497 | + "\n", |
| 498 | + "// Load the network from the network factory.\n", |
| 499 | + "BayesianNetwork cavityNet = BayesNetExampleFactory.constructToothacheCavityCatchNetwork();\n", |
| 500 | + "// Construct the BayesModel from the BayesNet\n", |
| 501 | + "// We will pass EnumerationAsk as the new inference procedure\n", |
| 502 | + "FiniteBayesModel model = new FiniteBayesModel(cavityNet, new EnumerationAsk());\n", |
| 503 | + "\n", |
| 504 | + "// Now we will fully exhaust this model to extract as much information as we can\n", |
| 505 | + "\n", |
| 506 | + "// First let us define some assignment propositions\n", |
| 507 | + "AssignmentProposition atoothache = new AssignmentProposition(ExampleRV.TOOTHACHE_RV, true);\n", |
| 508 | + "AssignmentProposition anottoothache = new AssignmentProposition(ExampleRV.TOOTHACHE_RV, false);\n", |
| 509 | + "AssignmentProposition acavity = new AssignmentProposition(ExampleRV.CAVITY_RV, true);\n", |
| 510 | + "AssignmentProposition anotcavity = new AssignmentProposition(ExampleRV.CAVITY_RV, false);\n", |
| 511 | + "AssignmentProposition acatch = new AssignmentProposition(ExampleRV.CATCH_RV, true);\n", |
| 512 | + "AssignmentProposition anotcatch = new AssignmentProposition(ExampleRV.CATCH_RV, false);\n", |
| 519 | + "\n", |
| 520 | + "// Now let us define some propositions which are conjunctions and/or disjunctions of the above propositions\n", |
| 521 | + "ConjunctiveProposition toothacheAndNotCavity = new ConjunctiveProposition(atoothache, anotcavity);\n", |
| 522 | + "DisjunctiveProposition cavityOrToothache = new DisjunctiveProposition(acavity, atoothache);\n", |
| 525 | + "\n", |
| 526 | + "// First let us calculate the prior probabilities of our random variables\n", |
| 527 | + "// The probabilities in the distribution are returned in the order <True, False>\n", |
| 528 | + "System.out.println(\"The prior distribution for toothache is \"+ model.priorDistribution(ExampleRV.TOOTHACHE_RV));\n", |
| 529 | + "System.out.println(\"The prior distribution for cavity is \"+ model.priorDistribution(ExampleRV.CAVITY_RV));\n", |
| 530 | + "System.out.println(\"The prior distribution for catch is \"+ model.priorDistribution(ExampleRV.CATCH_RV));\n", |
| 531 | + "// Now let us calculate some posterior distributions.\n", |
| 532 | + "// A posterior distribution enumerates all value combinations of the query and evidence variables.\n", |
| 533 | + "System.out.println(\"The posterior distribution for toothache given cavity is \\n \\t \"+ model.posteriorDistribution(ExampleRV.TOOTHACHE_RV,\n", |
| 534 | + " ExampleRV.CAVITY_RV).toString());\n", |
| 535 | + "\n", |
| 536 | + "System.out.println(\"The posterior distribution for catch given cavity is \\n \\t \"+ model.posteriorDistribution(ExampleRV.CATCH_RV,\n", |
| 537 | + " ExampleRV.CAVITY_RV).toString());\n", |
| 538 | + "\n", |
| 539 | + "// Now let us have a look at some individual probabilities\n", |
| 540 | + "System.out.println(\"The prior probability of having a cavity is \"+model.prior(acavity));\n", |
| 541 | + "System.out.println(\"The posterior probability of having a cavity given a toothache is \"+ model.posterior(acavity, atoothache));\n", |
| 542 | + "System.out.println(\"The prior probability of having a cavity or a toothache is \"+model.prior(cavityOrToothache));\n", |
| 543 | + "System.out.println(\"The posterior probability of not having a cavity given a toothache is \"+model.posterior(anotcavity, atoothache));\n", |
| 544 | + "System.out.println(\"The prior probability of not having a cavity but having a toothache is \"+model.prior(toothacheAndNotCavity));\n", |
| 545 | + "System.out.println(\"The prior probability of having a cavity or a toothache is \"+model.prior(cavityOrToothache));\n", |
| 546 | + "System.out.println(\"The posterior probability of having a cavity given that the patient has a cavity or a toothache is \"+\n", |
| 547 | + " model.posterior(acavity,cavityOrToothache));\n" |
| 548 | + ] |
| 549 | + }, |
| 550 | + { |
| 551 | + "cell_type": "markdown", |
| 552 | + "metadata": {}, |
| 553 | + "source": [ |
| 554 | + "There are a large number of inferences that can be derived from a probability model. For the sake of conciseness, we will focus only on a few prior and posterior distributions in the upcoming examples." |
| 555 | + ] |
| 556 | + }, |
362 | 557 | { |
363 | 558 | "cell_type": "code", |
364 | 559 | "execution_count": null, |
|