Introduction to Machine Learning with KNIME

0
Join & Subscribe
LinkedIn Learning
Free Trial Available
English
Certificate Available
1-2 hours worth of material
selfpaced

Overview

Learn KNIME, a popular open-source platform for predictive analytics and machine learning. Discover how to use KNIME for merging and aggregation, modeling, data scoring, and more.

KNIME is an open-source workbench-style tool for predictive analytics and machine learning. It is highly compatible with numerous data science technologies, including R, Python, Scala, and Spark. With KNIME, you can produce solutions that are virtually self-documenting and ready for use. These reasons and more make KNIME one of the most popular and fastest-growing analytics platforms around. In this course, expert Keith McCormick shows how KNIME supports all the phases of the Cross Industry Standard Process for Data Mining (CRISP-DM) in one platform. Get up and running quickly—in 15 minutes or less—or stick around for the more in-depth training covering merging and aggregation, modeling, and data scoring. Plus, learn how to increase the power of KNIME with extensions and integrate R and Python.

Syllabus

Introduction
  • Open-source machine learning with KNIME
  • Who is this course for?
1. How Does KNIME Complement Your Existing Analytics Toolkit?
  • Why use an Analytics Workbench?
  • Using CRISP-DM to evaluate tools
  • Why choose KNIME?
2. Getting Comfortable with KNIME
  • The KNIME interface
  • Find case studies on the Examples Server
  • Add thousands of nodes with Extensions
  • Search and Help
3. Accessing Data
  • Accessing data
  • File reader node
4. Data Understanding
  • Describe data and verify data quality
  • Explore data: Scatterplot
  • Explore data: Boxplot
5. Data Integration and Merging
  • Merging with the Joiner node
  • Aggregating with the GroupBy node
  • Creating new variables with Construct
  • Select data with Column Filter
  • Balancing data with Row Sampling node
  • Clean data with the Missing Value node
  • Format with Cell Splitter
6. Modeling
  • KNIME modeling options
  • Regression example
  • Decision tree
  • Decision tree: Scoring new data
7. A World of Possibilities
  • PMML
  • R and GGPLOT2
  • Other options
Conclusion
  • Next steps

Taught by

Keith McCormick