Knime Project Report
Knime Project Report
Manual
CSL 232
Semester: 5th
Group: DS B
Table of Contents
S.No Page No.
1. Project Description 2
2. Problem Statement
3
3. Analysis
6. Output (Screenshots) 5
About Dataset
The dataset contains 768 rows and 9 columns, some of which are Glucose, Insulin,
Pregnancies, BMI and Outcome. Given with these details we have to predict
whether the Patient is Diabetic or not .
DD
3. Analysis
4. Design
The following steps were taken to get the best model accuracy:
a) Excel Reader
Reading the excel file using this node.
b) Column Filter
Removing unnecessary columns
c) Normalizer
Normalizing the data using min-max normalization
d) Partitioning
Dividing the dataset into two parts: 80% of training data and 20% of test data
Normalized table
Partitioning
DD
-test data
DD
Statistics:
7. Conclusion
Firstly, we applied both the techniques (Logistic Regression and Decision Tree)
on our dataset without normalization. The accuracy was:
We can clearly see from the above accuracy scores that Decision Tree is better.