We use KNIME for a lot of predictive modeling. We use it to grab data, prepare it for modeling, do automated machine learning analysis, sometimes forecasting, and then try to deploy the models into production.
External reviews
External reviews are not included in the AWS star rating for the product.
A no-code platform that can be used for a lot of predictive modeling
What is our primary use case?
What is most valuable?
Since KNIME is a no-code platform, it is easy to work with. You don't have to write any codes and try to fix all the bits and pieces of coding or the intricacies of the programming language. Instead, getting a quick data prep or big data and eventually running it through your hypothesis is pretty fast. It's not ideal for huge data sets worth gigabytes, but it's okay since very few people have big data sets.
What needs improvement?
KNIME is not good at visualization. I would like to see NLQ (Natural language query) and automated visualizations added to KNIME.
For how long have I used the solution?
I have been using KNIME for two to three years.
What do I think about the stability of the solution?
Unless you are working with terabytes worth of data, KNIME is a stable solution.
What do I think about the scalability of the solution?
The solution is scalable and can be used up to terabytes of data. Around two to three people are using the solution in our organization.
How was the initial setup?
The solution’s initial setup is quick and easy.
What about the implementation team?
One person can deploy the solution within ten minutes.
What other advice do I have?
The solution is very essential when we require an explainable data modeling pipeline. We can show the workflows of KNIME to our customers and talk about it instead of showing the code and expecting them to read, which they can never do.
The process of providing KNIME to the client, how it works, where we get the data, what the initial data statistics were, and what we get in return are pretty explainable. We worked on multiple retail projects and insurance scoring projects.
KNIME is perfect for data pre-processing projects. The important thing is that when someone builds a KNIME workflow, we can quickly onboard and change it for something else. It means that we don't need to read and understand the code. It means that it's replicable and reusable.
If somebody does something, somebody else can quickly onboard and enhance, improve, or totally change the workflow from scratch. It's pretty hard and time-consuming for typical use cases where we utilize coding. KNIME's open-source nature has a good impact on our analytics work.
Recently, KNIME added something relevant to generative AI integration, which was a good move. Alteryx is slightly more powerful than KNIME, and Dataiku is more powerful than both KNIME and Alteryx. I sometimes work with the on-premises version of KNIME and sometimes the cloud version. The solution does not need any maintenance.
Users should quickly start using KNIME for whatever they want to do, and they'll learn it on the go easily. I would recommend the solution to other users.
Overall, I rate the solution an eight out of ten.
Stable, pretty straightforward to understand and offers drag-and-drop functionality
What is our primary use case?
I'm a professor at the local university. So, I used it to train virtual students in mechanical engineering.
I'm training a class for mechanical engineers on factory utilization and the basics of data science. That's what I use it for.
What is most valuable?
It's pretty straightforward to understand. So, if you understand what the pipeline is, you can use the drag-and-drop functionality without much training. Doing the same thing in Python requires so much more training. That's why I use KNIME.
What needs improvement?
In the last update, KNIME started hiding a lot of the nodes. It doesn't mean hiding, but you need to know what you're looking for. Before that, you had just a tree that you could click, and you could get an overview of what kind of nodes do I have.
Right now, it's like you need to know which node you need, and then you can start typing, but it's actually more difficult to find them.
For how long have I used the solution?
I have been using it for four years.
What do I think about the stability of the solution?
I've never had any problems with it, so it's a ten out of ten.
What do I think about the scalability of the solution?
I would rate the scalability a nine out of ten. For a basic training course, it's still fine. But I'm not a professional in using KNIME.
Which solution did I use previously and why did I switch?
I used RapidMiner. I have not been using it in six years. I used to use it six years ago. Then I switched to KNIME because a lot of my colleagues are using KNIME, so it felt like the right way to do it.
Moreover, I switched from one university to another, and at my new university, other colleagues are using KNIME as well. So, for the students, it's easier to go just with one product.
How was the initial setup?
Overall, it's still easier than using Python, so it's still fine. But, actually, they made it more complex by switching from the last version to the one before.
What's my experience with pricing, setup cost, and licensing?
We're using the free academic license just locally. I went for KNIME because they have a free academic license. And to be honest, I never bothered to check the prices.
What other advice do I have?
I like it a lot. I would advise that you shouldn't be afraid of data science. It's actually straightforward.
Overall, I would rate the solution a nine out of ten.
An easy-to-learn solution that can be used for analyzing data and machine learning
What is our primary use case?
We use KNIME for analyzing data, for ETLs, and analyzing for machine learning.
What is most valuable?
KNIME is easy to learn. You can code with KNIME using the visual coding platform if you know how to code. If you're working in an account management or financial department, you can use KNIME to work with a huge amount of data quickly. You can use KNIME to schedule your workflows, send emails, and write codes.
What needs improvement?
The main issue with KNIME is that it sometimes uses too much CPU and RAM when working with large amounts of data.
For how long have I used the solution?
I have been using KNIME for eight years.
What do I think about the stability of the solution?
KNIME is a stable solution. In the previous version, sometimes KNIME would get stuck, and we had to restart the server too many times. Sometimes, we faced a lack of memory issues with the solution.
I rate KNIME an eight out of ten for stability.
What do I think about the scalability of the solution?
Less than ten users are using KNIME in our organization.
I rate KNIME an eight out of ten for scalability.
How are customer service and support?
KNIME’s technical support team responds quickly. You can write your problems in the solution's forum, and they will answer you.
How was the initial setup?
KNIME's initial setup is not easy and needs someone who knows Linux to do it.
What about the implementation team?
A Linux engineer can deploy KNIME quickly, whereas someone who doesn't know Linux will take longer.
What's my experience with pricing, setup cost, and licensing?
There is no cost for using KNIME because it is an open-source solution, but you have to pay if you need a server.
What other advice do I have?
KNIME is a perfect solution for small and big companies, especially people who are using Excel. KNIME is very easy to learn and implement, and doctors and lab personnel can use it. Lots of companies are supporting KNIME and writing their own extensions. Data analysts and data scientists are using the solution for ETI processes.
Overall, I rate KNIME an eight out of ten.
An excellent choice for users seeking a powerful and flexible platform for data analytics and machine learning offering user-friendly visual interface, extensive library of plugins, and robust support
What is our primary use case?
As a university professor instructing courses on data mining and machine learning, I incorporate both KNIME and another software application into my teaching. This approach allows me to demonstrate various use cases effectively. I actively engage my students by having them utilize both software applications, providing practical hands-on experience in the areas of data mining and machine learning.
What is most valuable?
The most valuable is the ability to seamlessly connect operators without the need for extensive programming.
What needs improvement?
To enhance accessibility and user-friendliness, there is a need for improvements in the interface and usability of deep learning and large-scale learning languages.
For how long have I used the solution?
I have been using it for more than ten years.
What do I think about the stability of the solution?
I would rate its stability capabilities nine out of ten.
What do I think about the scalability of the solution?
It provides good scalability abilities, I would rate it eight out of ten. Currently, more than sixty individuals use it on a daily basis.
How are customer service and support?
They are helpful and I am highly satisfied with their customer support services. I would rate it nine out of ten.
Which solution did I use previously and why did I switch?
We use Orange as well.
How was the initial setup?
The initial setup is straightforward.
What's my experience with pricing, setup cost, and licensing?
While there are certain limitations in functionality, you can still utilize it efficiently free of charge.
What other advice do I have?
I would recommend it, especially for those who prefer not to program or have limited coding intervention. Overall, I would rate it nine out of ten.
Is user friendly and you can simply drag and drop elements to create your model
What is our primary use case?
I encountered a problem that I managed to resolve effectively. I documented the issue in a paper and aimed to determine if the issue was due to normal network behavior or an anomaly. To investigate, I employed machine learning models and used the KNIME’s database. I gathered a significant amount of data and extensively applied machine learning models. Ultimately, I achieved improved data accuracy, especially in the context of network data.
What is most valuable?
I believe that some individuals may not be skilled programmers, and this is where the agenda comes in. It allows for a user-friendly approach where you can simply drag and drop elements to create your model, which is a convenient and effective idea.
What needs improvement?
The pricing needs improvement.
For how long have I used the solution?
I have been using KNIME for the past six months.
What do I think about the stability of the solution?
It is stable solution.
What do I think about the scalability of the solution?
It is scalable solution.
How was the initial setup?
The initial setup is straightforward.
What's my experience with pricing, setup cost, and licensing?
It is expensive to procure the license.
What other advice do I have?
I would rate the overall solution an eight out of ten. I would suggest this application to individuals involved in auditing. It's user-friendly and makes it easy to initiate model creation.
A user-friendly tool that offers an open-source version
What is our primary use case?
I use KNIME for analysis-related purposes. I am currently in the process of developing some models for analysis.
What is most valuable?
The most valuable feature of the solution stems from the fact that it is a user-friendly tool where a person doesn't have to get involved with codes since you just need to drag the nodes to create your model, which is a very easy process for me.
What needs improvement?
The most difficult part of the solution revolves around its areas concerning machine learning and deep learning. The aforementioned area can be considered for improvement.
For how long have I used the solution?
I have been using KNIME since 2019. I am an end user of the solution.
What do I think about the stability of the solution?
It is a stable solution.
What do I think about the scalability of the solution?
It is a scalable solution.
I am the only user of the solution in my company. I do provide training to other employees in my company on how to use KNIME.
Which solution did I use previously and why did I switch?
I have experience with Excel, and I faced some limitations since my company had loads of data to analyze. Considering that my company had loads of data to analyze, I would say I find KNIME to be very useful.
How was the initial setup?
My company has some problems related to the solution's updates. I don't know if there are some restrictions from my organization because of which I cannot update or install some extensions.
The solution can be deployed in a few minutes.
The solution is currently deployed only on my personal computer, which I use in my company.
Only one person or an IT administrator is required to take care of the installation phase of the product.
What's my experience with pricing, setup cost, and licensing?
KNIME is a cheap product. I currently use KNIME's open-source version.
Which other solutions did I evaluate?
I have experience with Python. Compared to Python, KNIME is better because of the user-friendliness it provides. With KNIME, you don't have to get involved with codes. KNIME provides nodes, making it a very easy tool to use.
What other advice do I have?
I have not received any response from my company, though I had proposed to my organization to buy KNIME so that we can use it on the servers since, right now, it is like a standalone tool used on my personal computer only. I am just a basic and not an advanced user of KNIME. I find KNIME to be a very useful tool.
Speaking about the maintenance phase of the product, I would like to say that I cannot update the solution. If a new version is released, I cannot update the product. I always have to request my organization and the IT team to download and install the product's new version for me.
I recommend others to use KNIME. I have recommended KNIME to my colleagues.
I rate the overall solution an eight out of ten.
Allows you to easily tidy up your data, make lots of changes internally, and has good machine learning
What is our primary use case?
We have been using the most recent version. It's version 4.6.
10 August 2023 - It has now been upgraded to 5.0 and is, if anything, even more impressive, especially in its ability to use Python and its libraries.
How has it helped my organization?
Knime seems to keep getting better. Their open-source model seems to be working. The addition of AI both to help in the building of workflows and as a facility within a workflow once it is up and running seems to add a dimension. At the moment, though, the system is so rich and fully featured that I have explored only the surface of the new version (5.4).
To date, all my needs have been met by earlier versions of Knime. I am, though, confident that should I need to start using version 5.4, the process will be smooth, and the new functionality fit for purpose. Upgrades to Knime have always worked like that in the past and I would expect them to do so in the future.
What is most valuable?
I used to be a Pascal programmer, and then I did a bit of Python. It does many of the things that I would've had to do in code, but does so without using code. I don't think it does everything, but it does most of what I need to do.
It can read many different file formats. It can very easily tidy up your data, deleting blank rows, and deleting rows where certain columns are missing. It allows you to make lots of changes internally, which you do using JavaScript to put in the conditional.
For example, I have one data set whereby all of the data is encoded and there was one variable called opinion or something like that and it had codes for what the topic was, which was being discussed, whether it was positive or negative, whether it was strongly worded or weakly worded, and so many other things like that.
I had to transfer those into columns, like sentiment, the strength of sentiments, topic being discussed. I had to split it up into columns, and I could do that very easily, like simple JavaScript, in their column expressions.
It also has very good fundamental machine learning. It has decision trees, linear regression, and neural nets. It has a lot of text mining facilities as well. It's fairly fully-featured.
They are also very careful with things like lab variants and issued variants because they have some labs that develop nodes, and new chunks of code which are represented as an icon. They make it very clear that those lab ones are not fully tested, and they're very glad to get comments back if you have problems.
I haven't had that difficulty myself. They seem to be aware that they have the community there as their testing base, and they seem not to be embarrassed about that. They will tell you when they go wrong and try to put it right.
What needs improvement?
So far, I haven't had problems with it, so I haven't really thought about room for improvement. It's so much better than many other things. It's useful in that you can at least get people who are pretty averse to programming to start thinking about putting something into a program of any kind, because they can see what's happening.
It's visual. It's codeless. For some purposes, I'd want to add Python or R, but I haven't had to do that so far, so I haven't seen the shortcomings of it. There must be some. All software has shortcomings, but I haven't recognized any myself.
Not just for KNIME, but generally for software and analyzing data, I would welcome facilities for analyzing different sorts of scale data like Likert scales, Thurstone scales, magnitude ratio scales, and Guttman scales, which I don't use myself.
I use both Thurstone scales and magnitude ratio scales quite a bit, and they're very powerful. But I've always had to do all the analysis myself in some simple code. I don't think that's provided. You could probably include it in KNIME, but I haven't tried to do it.
If it just said, "Analyze scales," and you'd choose which sort of scale you want to analyze and it gave you the options of normalizing or reversing or whatever it happens to be, that would be helpful. There are lots of simple functions that you want to apply to scales, which would be useful in any software, including KNIME.
For how long have I used the solution?
I have been using this solution for about a year, but most particularly in the last six months.
What do I think about the stability of the solution?
It's been remarkably stable, much more so than most software. They have an active community forum. Problems seem to get fixed pretty quickly. I haven't had problems, but other people do report problems. So, there must be problems there, I just haven't had any.
How are customer service and support?
On the very rare occasions that I have to seek advice, I just post it to the forum and someone will offer advice.
Which solution did I use previously and why did I switch?
Compared to RapidMiner, at the moment I would go for KNIME, but that's largely because I haven't used RapidMiner much for the last year. It may have improved enormously since then. It was a very good package. They do much the same thing.
I'm more familiar with KNIME, so I would be able to talk more about it, whereas for RapidMiner, I was very enthusiastic when I used it. KNIME is a bit cheaper in a sense.
In RapidMiner, you can have up to 10,000 rows of data free of charge. For many things that I do, 10,000 rows of data is enough. I use quite a few UK government surveys, and I get the raw data from the UK Data Archive. They're often of the order of 10,000, 8,000. So, under 10,000 rows. I could use it free of charge.
How was the initial setup?
I just downloaded it and then ran it. The process really was that simple. If I need one of the extensions (e.g. text mining), the process is just as simple.
What about the implementation team?
We implemented the solution in-house.
What was our ROI?
I have not formally calculated it, but it must be substantial.
What's my experience with pricing, setup cost, and licensing?
With KNIME, you can use the desktop version free of charge as much as you like. I've yet to hit its limits. If I did, I'd have to go to the server version, and for that you have to pay. Fortunately, I don't have to at the moment.
What other advice do I have?
I would rate this solution an eight out of ten.
I'm unwilling to give anything a ten because everything can be improved. But it's been very useful so far to me and has saved me many hours of work. I could have written it all in Python if necessary, but it would have taken me weeks for what would be a few days of work.
My advice is to just download it and use it. The documentation is pretty good. There are many good videos online for it. If you go to YouTube, you can get pages and pages of KNIME tutorials. They're pretty clear, and they are produced by people who've used it. It's not just company advertising, as far as I can see.