0% found this document useful (0 votes)

83 views15 pages

ML Tools: Weka & RapidMiner Guide

This document provides an introduction and overview of the Weka and Rapid Miner machine learning software. Weka is an open source collection of machine learning algorithms for data pre-processing, classification, clustering, and association rule mining. It was created at the University of Waikato and uses the ARFF data format. Rapid Miner is a similar open source tool that implements data mining and analytics operators in a workflow format to build knowledge discovery processes. Both tools provide GUI and programmatic interfaces for applying machine learning.

Uploaded by

annamyem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

83 views15 pages

ML Tools: Weka & RapidMiner Guide

Uploaded by

annamyem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 15

Weka & Rapid Miner Tutorial

By Chibuike Muoh

WEKA:: Introduction

A collection of open source ML algorithms

pre-processing classifiers clustering association rule

Created by researchers at the University of Waikato in New Zealand Java based

WEKA:: Installation

Download software from http://www.cs.waikato.ac.nz/ml/weka/

If you are interested in modifying/extending weka there is a developer version that includes the source code
setenv WEKAHOME /usr/local/weka/weka-3-0-2 setenv CLASSPATH $WEKAHOME/weka.jar:$CLASSPATH

Set the weka environment variable for java

Download some ML data from http://mlearn.ics.uci.edu/MLRepository.html

WEKA:: Introduction .contd

Routines are implemented as classes and logically arranged in packages Comes with an extensive GUI interface

Weka routines can be used stand alone via the command line

Eg. java weka.classifiers.j48.J48 -t $WEKAHOME/data/iris.arff

WEKA:: Interface

WEKA:: Data format

Uses flat text files to describe the data Can work with a wide variety of data files including its own .arff format and C4.5 file formats Data can be imported from a file in various formats:

ARFF, CSV, C4.5, binary

Data can also be read from a URL or from an SQL database (using JDBC)

WEKA:: ARRF file format

@relation heart-disease-simplified @attribute @attribute @attribute @attribute @attribute @attribute age numeric sex { female, male} chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina} cholesterol numeric exercise_induced_angina { no, yes} class { present, not_present}

@data 63,male,typ_angina,233,no,not_present 67,male,asympt,286,yes,present 67,male,asympt,229,yes,present 38,female,non_anginal,?,no,not_present

...

A more thorough description is available here http://www.cs.waikato.ac.nz/~ml/weka/arff.html

WEKA:: Explorer: Preprocessing

Pre-processing tools in WEKA are called filters WEKA contains filters for:

Discretization, normalization, resampling, attribute selection, transforming, combining attributes, etc

WEKA:: Explorer: building classifiers

Classifiers in WEKA are models for predicting nominal or numeric quantities Implemented learning schemes include:

Decision trees and lists, instance-based classifiers, support vector machines, multi-layer perceptrons, logistic regression, Bayes nets, Bagging, boosting, stacking, error-correcting output codes, locally weighted learning,

Meta-classifiers include:

WEKA:: Explorer: Clustering

Example showing simple K-means on the Iris dataset

RapidMiner:: Introduction

A very comprehensive open-source software implementing tools for

intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI).

Is implemented in Java and available under GPL among other licenses Available from http://rapid-i.com

RapidMiner:: Intro. Contd.

Is similar in spirit to Wekas Knowledge flow Data mining processes/routines are views as sequential operators

Knowledge discovery process are modeled as operator chains/trees

Operators define their expected inputs and delivered outputs as well as their parameters

Has over 400 data mining operators

RapidMiner:: Intro. Contd.

Uses XML for describing operator trees in the KD process Alternatively can be started through the command line and passed the XML process file

ML Tools: Weka & RapidMiner Guide
No ratings yet
ML Tools: Weka & RapidMiner Guide
15 pages
Group 3: Elhaine, Jai, Icelle and Marianne
No ratings yet
Group 3: Elhaine, Jai, Icelle and Marianne
17 pages
WEKA: ML Tool for Data Scientists
No ratings yet
WEKA: ML Tool for Data Scientists
23 pages
Appendix Weka
No ratings yet
Appendix Weka
17 pages
Weka Tutorial
No ratings yet
Weka Tutorial
32 pages
Weka Overview Slides
No ratings yet
Weka Overview Slides
31 pages
Data Warehousing and Data Mining Lab Manual
0% (1)
Data Warehousing and Data Mining Lab Manual
30 pages
DWDM Lab Manual
No ratings yet
DWDM Lab Manual
55 pages
Data Mining Complete Lab Manual - DRSNR
No ratings yet
Data Mining Complete Lab Manual - DRSNR
27 pages
WEKA Guide for ML Practitioners
No ratings yet
WEKA Guide for ML Practitioners
58 pages
Weka Data Miningvsem
No ratings yet
Weka Data Miningvsem
7 pages
Lecture 7 - Weka
No ratings yet
Lecture 7 - Weka
69 pages
WEKA Toolkit: Machine Learning Guide
No ratings yet
WEKA Toolkit: Machine Learning Guide
8 pages
Data Mining Lab Manual for CSE
No ratings yet
Data Mining Lab Manual for CSE
50 pages
Introduction To Weka: Xingquan (Hill) Zhu
No ratings yet
Introduction To Weka: Xingquan (Hill) Zhu
63 pages
DHW Lab (Ex1 To 3)
No ratings yet
DHW Lab (Ex1 To 3)
18 pages
Weka Software Manuala
No ratings yet
Weka Software Manuala
20 pages
Lab Manual (2024)
No ratings yet
Lab Manual (2024)
56 pages
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
No ratings yet
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
42 pages
WEKA Data Mining Tool Guide
No ratings yet
WEKA Data Mining Tool Guide
19 pages
Introduction To Weka
No ratings yet
Introduction To Weka
38 pages
Rintro Wekacomplete
No ratings yet
Rintro Wekacomplete
135 pages
Introduction To WEKA: Data Mining WEKA - What Is It? Weka Uis Integration With Pentaho Projects Based On Weka
No ratings yet
Introduction To WEKA: Data Mining WEKA - What Is It? Weka Uis Integration With Pentaho Projects Based On Weka
27 pages
Weka Guide for Data Scientists
No ratings yet
Weka Guide for Data Scientists
5 pages
WEKA Intro
No ratings yet
WEKA Intro
17 pages
Overview: Data Mining Methods: WEKA: A Machine Learning Toolkit The Explorer
No ratings yet
Overview: Data Mining Methods: WEKA: A Machine Learning Toolkit The Explorer
41 pages
Datawarehouse Pract 2
No ratings yet
Datawarehouse Pract 2
7 pages
Aiml Manual
No ratings yet
Aiml Manual
27 pages
DWBI Lab Manual 2023-24 Final
No ratings yet
DWBI Lab Manual 2023-24 Final
40 pages
Data Warehousing and Data Mining Lab Manual
100% (1)
Data Warehousing and Data Mining Lab Manual
30 pages
Lab 02
No ratings yet
Lab 02
4 pages
Mooc On Weka
No ratings yet
Mooc On Weka
59 pages
WEKA Data Mining Lab Manual
100% (1)
WEKA Data Mining Lab Manual
8 pages
Result Prediction Using Weka: An Effort by - Shlok Tibrewal (14bit0088) Siddarth Nyati (14bit0074)
No ratings yet
Result Prediction Using Weka: An Effort by - Shlok Tibrewal (14bit0088) Siddarth Nyati (14bit0074)
11 pages
Chapter 5 - The Application of WEKA Software
No ratings yet
Chapter 5 - The Application of WEKA Software
80 pages
WEKA Practical Protocol
No ratings yet
WEKA Practical Protocol
40 pages
2.3 Weka Tool
No ratings yet
2.3 Weka Tool
84 pages
Introduction To Weka
No ratings yet
Introduction To Weka
39 pages
Weka A Tool For Exploratory Data Mining
No ratings yet
Weka A Tool For Exploratory Data Mining
157 pages
Data Mining Unit 5
No ratings yet
Data Mining Unit 5
12 pages
Weka Data Mining Lab Guide
No ratings yet
Weka Data Mining Lab Guide
20 pages
Weka (20030421-Version1 by Kdelab)
No ratings yet
Weka (20030421-Version1 by Kdelab)
51 pages
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
No ratings yet
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
31 pages
DM Lab Material
No ratings yet
DM Lab Material
88 pages
Weka Tutorial
No ratings yet
Weka Tutorial
45 pages
WEKA Explorer Tutorial
No ratings yet
WEKA Explorer Tutorial
45 pages
Lab Manual - DM
No ratings yet
Lab Manual - DM
56 pages
Weka Tutorial
No ratings yet
Weka Tutorial
8 pages
DW Lab Manual
No ratings yet
DW Lab Manual
44 pages
Lab 04
No ratings yet
Lab 04
7 pages
DWM1 Riya
No ratings yet
DWM1 Riya
16 pages
Dinesh DM
No ratings yet
Dinesh DM
34 pages
Bioinformatics: Applications Note
No ratings yet
Bioinformatics: Applications Note
3 pages
Weka DW&DM Lab Notes
No ratings yet
Weka DW&DM Lab Notes
37 pages
Weka
No ratings yet
Weka
99 pages
A Simple Introduction: To Weka
No ratings yet
A Simple Introduction: To Weka
83 pages
Experiment WEKA
No ratings yet
Experiment WEKA
16 pages
Ayusante Catalogue
No ratings yet
Ayusante Catalogue
12 pages
Pakkam Vara Thudithen
100% (3)
Pakkam Vara Thudithen
88 pages
Mistral of Milan
No ratings yet
Mistral of Milan
16 pages
AagaayaGangai LakshmiPrabha
75% (4)
AagaayaGangai LakshmiPrabha
63 pages
Murugan 123
No ratings yet
Murugan 123
1 page
FGBFCB Nbbbnvbvbnmju, Lukhgnggfht56mhuh Rhgrtyjmnthk - Mge
No ratings yet
FGBFCB Nbbbnvbvbnmju, Lukhgnggfht56mhuh Rhgrtyjmnthk - Mge
4 pages
Cars
No ratings yet
Cars
1 page
Na Chuku Tty 123
No ratings yet
Na Chuku Tty 123
1 page
SQM Notes Unit II
No ratings yet
SQM Notes Unit II
25 pages
Mc9280 Data Mining and Data Warehousing
No ratings yet
Mc9280 Data Mining and Data Warehousing
1 page
Wedding Santorini Mykonos Sifnos Folegandros: Cosmopolitan Mýkonos
No ratings yet
Wedding Santorini Mykonos Sifnos Folegandros: Cosmopolitan Mýkonos
2 pages
Data Mining: Outlier Analysis - Presentation Transcript
No ratings yet
Data Mining: Outlier Analysis - Presentation Transcript
1 page
Array vs Linked List Operations
No ratings yet
Array vs Linked List Operations
5 pages
Difference Equation
No ratings yet
Difference Equation
1 page
Change The Positions of The Frogs On Right and Left. Usually It Can Be Done in 3 Minutes If Your IQ Is Not Under 50
No ratings yet
Change The Positions of The Frogs On Right and Left. Usually It Can Be Done in 3 Minutes If Your IQ Is Not Under 50
2 pages