DSTK - Data Science Toolkit 3 is a set of data and text mining softwares, following the CRISP DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, modeling and evaluation for machine learning and algorithms. It is based on the old version DSTK at https://sourceforge.net/projects/dstk2/



DSTK Engine is like R. DSTK ScriptWriter offers GUI to write DSTK script. DSTK Studio offers SPSS Statistics like GUI for data mining, and DSTK Text Explorer offers GUI for Text Mining. DSTK Engine and DSTK ScriptWriter are opensource, but DSTK Studio and Text Explorer requires small amount of payment. DSTK Studio and Text Explorer are free to use 10 times

Features

  • Data and Text Preprocessing (Normalization, stemming, stopwords, ...)
  • Data Exploration and Visualizations (Bar, Line, Scatter, BoxPlot, Histogram)
  • Data Understanding using Statistics (Chi Square, TTest, Descriptives, ...)
  • Text Analytics (Text Link Analysis, Sentiments Analysis, POS Tagging, Name Entity, ...)
  • Predictive Analytics (Neural Network, Naive Bayes, Linear Regression, Mulitple Linear Regression, KNN, Bags of Words, ...)
  • Expandable with plugins using R Scripts
  • Will improve over time, including Deep Neural Network and etc...

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow DSTK - Data Science TooKit 3

DSTK - Data Science TooKit 3 Web Site

Other Useful Business Software
Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar Icon
Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar

See what hackers already know about your organization – and stop them from getting in.

Protect your organization from cyber threats with SOCRadar’s cutting-edge threat intelligence. Gain 360° visibility into your digital assets, monitor the dark web, and stay ahead of hackers with real-time insights. Start for free and transform your cybersecurity today.
Free Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DSTK - Data Science TooKit 3!

Additional Project Details

Operating Systems

Windows

Intended Audience

Non-Profit Organizations, Information Technology, Science/Research

User Interface

.NET/Mono

Programming Language

C#, Java

Related Categories

C# Artificial Intelligence Software, C# Business Intelligence Software, C# Data Science Tool, Java Artificial Intelligence Software, Java Business Intelligence Software, Java Data Science Tool

Registered

2018-05-08