
ByteHouse
Data platform has accelerated machine learning workflows and delivers faster, cheaper deployments
What is our primary use case?
I use ByteHouse to store data, which helps data scientists and data analysts securely store valid data.
Data scientists receive data sets that they analyze, clean, transform, and perform calculations on in order to make the data set valid for machine learning purposes. To store it, we need a platform with high scalability. They use ByteHouse to store the data.
ByteHouse is an enterprise tool that helps us manage everything easily because all aspects are managed from ByteHouse itself, including maintenance, scalability, and speed. Everything is managed at the back end, so we don't need to worry about it in the model era.
What is most valuable?
Scalability and cloud-native support are the best features ByteHouse offers in my experience. These are very effective.
For database integration, particularly for DevOps, integrating it with the CI/CD flow became much easier compared to other database solutions. For data scientists performing analysis, it is significantly easier and faster compared with storing data elsewhere.
ByteHouse has positively impacted my organization by making the machine learning life cycle considerably easier and faster. The machine learning life cycle has become much faster because we usually do five deployments per day, and now using ByteHouse, it is possible we can go beyond seven deployments.
What needs improvement?
ByteHouse is an enterprise solution, so the company has to pay considerably to get it. However, if it were open source, the company could have a trial period and could proceed with confidence to purchase it. Additionally, ByteHouse could still offer a free trial for limited data, such as 1GB or 2GB of data.
Regarding ByteHouse's AI capabilities, some tweaks are still needed. It is still in a growing state from what I have observed.
When considering MLOps or an MLOps life cycle where data sets are stored in ByteHouse, there should be a tool that alerts us when these data sets have missing values, need cleanup, contain duplicate values, or have inconsistent data. Regarding ByteHouse's AI capabilities, I would rate the accuracy as below average, but it is still in a growing state. I would give it a six out of ten.
My advice to others considering ByteHouse is that if you want scalability and your team is quite small, and you want a stable source where you can store your data sets and perform analysis, ByteHouse is a good option.
What do I think about the stability of the solution?
ByteHouse is stable in my experience.
What do I think about the scalability of the solution?
ByteHouse's scalability is production-grade. It is quite sensitive when it comes to scalability.
How are customer service and support?
We did not avail any customer support because we did not face any issues so far.
Which solution did I use previously and why did I switch?
I previously used BigQuery, and we switched for the only reason because of scalability and the pricing accordingly.
What was our ROI?
There is a good return on investment with ByteHouse. We had our own cloud setup and used to store in BigQuery previously. When we switched to ByteHouse, the price was drastically decreased. We paid for BigQuery previously, and the price was quite higher, ranging from $7,000 to $8,000. When we use ByteHouse, the price has significantly decreased.
What's my experience with pricing, setup cost, and licensing?
We paid $4,000 to $5,000 per month for ByteHouse pricing, setup cost, and licensing.
Which other solutions did I evaluate?
We evaluated other options before choosing ByteHouse, looking into Dataproc of GCP and Cloud Spanner. However, we wanted SaaS, so we looked at ByteHouse. Additionally, we received good reviews from ByteHouse.
What other advice do I have?
As I mentioned earlier, ByteHouse is an enterprise solution, not open source. If someone wants to learn it, he will get hands-on experience once he works with it. He cannot gain experience otherwise. If any individuals are interested in learning ByteHouse, they may need to find a corporate organization where it is being used. They cannot directly use it since it is an enterprise solution. From my use cases, I do not find any other improvements needed for data science. However, in different industries, they might have different needs. I would rate this review an eight out of ten.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Analytics platform has accelerated complex reporting and now supports faster data-driven decisions
What is our primary use case?
Our primary use case for ByteHouse is large-scale analytics and reporting. We use it to store and query high volumes of operational and business data, generate reports, support dashboarding, and perform ad hoc analysis. It helps us process large datasets quickly so teams can make data-driven decisions without significant delays.
My main use case is analyzing application and operational data. We use ByteHouse to query large datasets and investigate trends and support reporting requirements.
Recently our team used ByteHouse to analyze large volumes of application and transaction data for operational reporting. We needed to identify usage trends, monitor key performance metrics, and investigate a few anomalies reported by stakeholders. ByteHouse allowed us to run complex queries on a large dataset with relatively fast response time, helping us to generate reports and provide insights much more quickly than if we had relied on traditional reporting workflows or approaches.
What is most valuable?
The best features hosted by ByteHouse are its high-performance analytical query engine, scalability, and ability to handle large volumes of data efficiently. I also appreciate the separation of storage and compute, which allows resources to be scaled based on workload requirements. Another valuable feature is its support for real-time and near-real-time analytics, enabling teams to access insights quickly. Overall, the platform balances performance, scalability, and ease of managing analytical workloads.
My favorite feature for ByteHouse is its query performance because I think it is really impactful for our team currently. Because queries will be performed on large datasets, even when working with substantial amounts of data, queries generally complete quickly, which helps analysts and engineering teams make decisions faster. The high-performance query engine has been the most impactful feature for our team. I think it will be very useful for all the teams who are working with data-heavy workloads and want to query data and get results very quickly.
ByteHouse has been valuable because it combines scalability with good query performance. As our data volumes grew, we were still able to run analytical workloads efficiently without major changes to our processes. It has become an important part of our reporting and analytics workflow, helping teams access insights faster and make more informed decisions.
ByteHouse offers a good balance between performance and scalability. Some analytics platforms perform well initially but become harder to manage as data volumes grow. With ByteHouse, we have been able to continue supporting larger workloads without significant operational overhead. Overall, the platform provides the core capabilities we need for analytics while remaining relatively efficient to operate.
ByteHouse has impacted us overall positively by improving the speed and efficiency of our analytics workflows. Teams can access and analyze large datasets much faster, which has shortened the time required to generate reports and investigate issues. This has helped stakeholders make decisions more quickly and reduced the effort spent on data processing and analysis. The platform's scalability has also allowed us to handle growing data volumes without a proportional increase in operational complexity.
While I cannot share exact internal figures, we observed a noticeable improvement in reporting and analytics efficiency after adopting ByteHouse. For example, analytical queries that previously took several minutes were often completed in seconds, which reduced waiting time for analysts and engineers. We also saw roughly a 30 to 40 percent reduction in the time spent preparing recurring reports because teams could access and process data more quickly.
What needs improvement?
Overall, I have had a positive experience with ByteHouse, but there are a few areas where it could be improved. First, the user experience and administrative workflows could be made more intuitive, especially for new users who are not deeply familiar with data platforms. Secondly, broader integration options and more out-of-the-box connectors would help reduce setup and maintenance effort. Third, more comprehensive documentation and troubleshooting guidance for advanced use cases would make onboarding and issue resolution faster.
I would mainly want to see improvements in ease of use, documentation, and integration. The core analytics performance is strong, which is what it is made for, but enhancing these areas would make the overall experience even better.
There is nothing major to address. The main area I would want to see improved is ease of use. Beyond that, I would want to see continued enhancements around integrations, automations, and observability. More out-of-the-box connectors and streamlined workflows would help teams adopt the platform faster.
For how long have I used the solution?
I have been using ByteHouse for approximately one year.
What do I think about the stability of the solution?
I have not faced any major impact issues. Generally, it is a stable platform, especially for analytics and data warehouse workloads at scale. I have not seen much of a reliability issue or stability issue with it. It also has elastic scaling and monitoring capabilities for handling changing workloads, which is really nice.
ByteHouse has been a stable platform for analytics workloads. The platform handles large volumes of data efficiently and features such as resource isolation and elastic scaling help maintain consistent performance as workloads grow. Proper capacity planning is important, but overall, I have found it reliable for production use.
How are customer service and support?
I have not directly communicated with ByteHouse customer support. However, as I have heard from other teammates, the support team was generally responsive and knowledgeable, especially when it came to platform configuration, troubleshooting, and performance-related questions.
Which solution did I use previously and why did I switch?
We were not using any solution previously. We were using normal traditional approaches.
What was our ROI?
I have seen a positive impact by using this platform because it provides a great query engine and the analytics part is really strong. The reports are being delivered in a very short amount of time. I have seen an overall 40 percent reduction in report generation with the query engine.
Which other solutions did I evaluate?
Currently, I am not in the position of choosing other platforms because my managers and other stakeholders make this kind of big decision regarding which platform employees should use to do their work. I was not involved in this discussion and have not evaluated other options.
What other advice do I have?
My advice would be to start by clearly understanding your analytics and data growth requirements. ByteHouse tends to provide the most value when you are dealing with large-scale analytical workloads and need a platform that can scale as your data volume grows. It will be beneficial for companies that are data-heavy. I would also recommend spending time upfront on data modeling, partition strategies, and workload planning. While the platform is powerful, a well-designed data architecture helps you to get the best performance and cost efficiency. I would recommend ByteHouse to organizations looking for a scalable analytics platform, particularly if they expect significant growth in data volume and analytical workloads. ByteHouse is a very strong tool for organizations looking for a scalable analytics platform. I have given this review a rating of 8.