We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 93a22f0 commit 937d8ccCopy full SHA for 937d8cc
analysis/cleansing.md
@@ -0,0 +1,23 @@
1
+Data cleansing
2
+==============
3
+
4
+Often, the data you have will have errors, and you'll need to clean it up.
5
+Missing data, inconsistent labels, duplicates and misspellings are common
6
+examples of errors. The [Wikipedia article on data
7
+cleansing](http://en.wikipedia.org/wiki/Data_cleansing) talks more about this.
8
9
+The typical cycle of data cleansing is:
10
11
+1. Find errors
12
+2. Correct the errors
13
+3. Repeat 1 & 2 as often required
14
+4. Automate the correction process for new data feeds
15
16
+Finding errors
17
+--------------
18
19
+Correcting errors
20
+-----------------
21
22
+Automating correction
23
+---------------------
0 commit comments