KNIME is an enterprise-grade software platform for cutting-edge analytics. It was named a Visionary in Gartner’s 2020 Magic Quadrant for Data Science and Machine Learning Platforms.
The KNIME portfolio consists of the open-source KNIME Analytics Platform, complemented by the commercial extension KNIME Server, which adds advanced functionality such as team collaboration, model management, deployment and automation.
The KNIME offering is particularly valuable for “citizen” data scientists who want to quickly build and test custom ML models. In this article, we will build a custom ML scenario on KNIME Analytics Platform.
Scenario: Credit Card Default Prediction for Banking Industry
Download the KNIME Analytics Platform here: https://www.knime.com/downloads/download-knime
Once installed, fire up the platform and you should see the GUI below:
The key windows that will be used extensively are KNIME Explorer, Node Repository and Console.
With the environment set up successfully, let’s dive straight into building our prediction workflow.
The data fields in our dataset are as follows:
As we can see, there are several relevant predictor variables, such as “age”, “job” and “marital”, that can be instrumental in predicting our outcome variable “y”, i.e. default status, where 1 → Default and 0 → No Default.
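For readers who think in code, the split between predictor variables and the outcome variable can be sketched in a few lines of Python. This is only an illustration and is not part of the KNIME workflow: the column names “age”, “job”, “marital” and “y” come from the dataset description above, while the sample rows and the helper name `split_predictors_target` are made up for the example.

```python
# Hypothetical sample rows: only the column names are taken from the article,
# the values are invented for illustration.
records = [
    {"age": 35, "job": "technician", "marital": "married", "y": 0},
    {"age": 52, "job": "retired", "marital": "divorced", "y": 1},
    {"age": 28, "job": "student", "marital": "single", "y": 0},
]

TARGET = "y"
# Encoding described in the article: 1 means Default, 0 means No Default.
LABELS = {1: "Default", 0: "No Default"}


def split_predictors_target(rows, target=TARGET):
    """Separate each row into its predictor columns and its outcome value."""
    predictors = [{k: v for k, v in row.items() if k != target} for row in rows]
    outcome = [row[target] for row in rows]
    return predictors, outcome


X, y = split_predictors_target(records)
decoded = [LABELS[value] for value in y]
print(decoded)
```

Every column other than “y” ends up in the predictor set, which mirrors how the outcome column is singled out as the target when a learner is configured.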
Let’s open KNIME Analytics Platform and get started by importing nodes from the repository:
And with that, in a matter of minutes, we trained and tested our Credit Card Default prediction scenario without depending on any custom coding. Once you have tested and validated the performance of the model on the hold-out group, you can deploy it with the click of a button on KNIME Server.
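The hold-out validation mentioned above can be illustrated outside KNIME with a minimal Python sketch. It is an assumption-laden toy, not the workflow built in this article: the labels are synthetic, the 80/20 split and the seed are arbitrary choices, and a majority-class baseline stands in for whatever learner the workflow uses. The helper names `holdout_split`, `majority_class` and `accuracy` are hypothetical.

```python
import random


def holdout_split(rows, test_fraction=0.2, seed=42):
    """Shuffle a copy of the rows and set aside a fraction as the hold-out group."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(round(len(shuffled) * test_fraction)))
    return shuffled[n_test:], shuffled[:n_test]


def majority_class(labels):
    """Return the most frequent label: a stand-in 'model' for this sketch."""
    return max(set(labels), key=labels.count)


def accuracy(y_true, y_pred):
    """Share of hold-out rows whose prediction matches the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Synthetic outcome labels (1 = Default, 0 = No Default), invented for illustration.
labels = [0] * 8 + [1] * 2

train, test = holdout_split(labels)
baseline = majority_class(train)      # "training" on the training partition only
predictions = [baseline] * len(test)  # scoring the unseen hold-out partition
score = accuracy(test, predictions)
print(f"hold-out accuracy: {score:.2f}")
```

The point of the sketch is the separation of concerns: the model only ever sees the training partition, and the reported score comes from rows it has not seen.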
If this was useful and you liked what you saw, please follow us on our page for similar content. We intend to publish follow-up posts on the next steps, i.e. deployment on KNIME Server and a complete overview of model management.
We would love to hear from you. How would you rate KNIME in terms of “ease of adoption” and “algorithmic sophistication” in comparison to other tools out there? Please comment below.
Also, in case you’d like to sign up for our full program on KNIME Analytics Platform, connect with us at the coordinates below:
Disclaimer: The author works with a KNIME Partner company named Nobleprog
Acknowledgement: KNIME official documentation