Sign in Agent Mode
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

DataHub

Datahub

Reviews from AWS customer

4 AWS reviews

External reviews

1 review
from

External reviews are not included in the AWS star rating for the product.


    Henrique dos Anjos

Metadata governance has improved data lineage visibility but still needs simpler integrations

  • March 31, 2026
  • Review provided by PeerSpot

What is our primary use case?

I work with Data Hub as a user, but I also have some administrative responsibilities there. I'm not a final user; the final users are business users, and I play some administrative roles in the tool to have the metadata information available for all Uber users.

I'm a Data Quality Engineer focused on data governance. I manage the metadata information for Uber, and I also use this to apply some data quality rules. My focus in my current job is to apply some rules and manage the metadata information and ensure it is accurate for the end users, which is why I'm using it.

What is most valuable?

One of the biggest advantages of Data Hub is the very good integration, for example, a department focused on development made the integrations between Data Hub and BigQuery. When this integration is very well done, it is possible to check data lineage, which I think is a very important subject in data governance. It's something that cannot be done manually, so having a tool that shows the data lineage from the source until the target tables helps us a lot. I think this is one of the best advantages that we have.

Data Hub helps to analyze data from various sources in my case.

What needs improvement?

I know that the integrations are not easy to do, and I believe it happens because it's a customized solution. There always needs to be software developers to work on this. It's complicated; every time we want to integrate new things or new sources, we need to generate a ticket or a request to another department. When I had my experience with Atlan, for example, I was able to connect different sources in a very user-friendly way. I just needed to set up some configurations and connect to the source without having to be a software developer or develop any code in the back end. It was just a feature in the data catalog that enabled me to connect with different kinds of sources. That's why I think the disadvantage of having a customized solution. Although I think Data Hub itself is a very good tool, years ago I had the opportunity to work with it, but with a clear interface and the open-source solution, which was very clear and easy to connect. At Uber, we need to have a request when we want to integrate new sources.

Regarding Data Hub's intuitiveness, regarding analytics, I would say that some quality dimensions are available for us. For example, for each field name or each column in a table, it's possible to see the frequency, how many values we have for a specific type or category, and we can see if there are new or null values, whether the columns are empty or not, along with some metrics. This is regarding the data quality dimensions, such as nullables and things of that nature. That is all we have for features. I remember when I was working with Atlan, there was a feature I liked very much—the possibility to have a sample. When I clicked on a table, I could see a short sample without needing SQL skills. I just clicked the table and could see some values or what the table represents; the data catalog would show a screen with some rows of the table. This feature was very good, but we don't have it in Data Hub the way it is implemented at Uber. I think it would be a very good feature for analytics, and we don't have it at the moment.

The integration part could be better, but again, it's because it's a customized solution. I think if they used the native version of the tool, it would be simpler. The integration part and the process of setting up new data quality rules would be important for data governance players like me.

For how long have I used the solution?

I've been using Data Hub for one year and a half.

What do I think about the stability of the solution?

Since I've been using Data Hub, it has always been very stable; I can say it was one hundred percent stable. I never encountered issues trying to check datasets or columns and checking their numbers. It has always worked very well in that regard.

What do I think about the scalability of the solution?

I think Data Hub can scale fast in its native way, but with a customized solution, it takes more time.

How are customer service and support?

My support is internal when I have any questions or requests, so I direct it to a support team from Uber and not from the provider. When I was working with Atlan, and needed support, they were very good at attending to my requests directly. I had contact with the provider, so it was very fast. At the moment, I don't have that; I direct my requests to an internal department of Uber.

Which solution did I use previously and why did I switch?

I'm not using Atlan anymore because the company that I was working with, I'm no longer there. I went to another consultancy group and now I'm working with other platforms. Atlan is not the one that I'm working with at the moment.

I am working with a different platform that is also regarding data governance and metadata management. The platform itself, the back end, is Data Hub. But the user interface is customized for this client. I'm currently working for Uber, the Uber company.

How was the initial setup?

Because Data Hub is a customized solution, I don't have many details about the installation and deployment process. However, when I was using Atlan, I saw that they implemented very fast. In this way, I believe both tools have an easy way to implement, but because Uber chose to have a customized solution, it became more difficult and complex. However, in their native way, I think both tools are good.

What was our ROI?

In terms of ROI, I would say that Atlan is better. I had a very good experience using Atlan, and I believe it's faster. Velocity in organizations today is very important; people want to see things very fast. I believe Atlan has a better approach compared to Data Hub.

The way Data Hub is implemented at the moment, Atlan is much better. It's much, much faster.

Which other solutions did I evaluate?

I worked with Databricks, but I'm not sure if it is from Amazon; I don't think so. I think Databricks is from Microsoft.

What other advice do I have?

I have experience with Data Hub to some extent.

I believe Data Hub uses a lot of APIs, but I don't think I'm the right person to answer that because it relies a lot on a technical aspect that I don't understand. I cannot provide you with a curated answer about it, but I know that the software development team that works with this customized solution uses APIs; I just don't know how to speak about their performance, whether it's good or not.

Real-time batch processing is very important for me and my organization because some datasets are very critical for the business. If we have batch processing, it's good for the organization to set up a very large dataset, for example, and have it available on the data catalog in a short time. I agree that this is important.

In both experiences I had, the integration with the catalog was with GCP. I don't have experience working with another data warehouse, so even in Atlan or now in Data Hub, it is connected with GCP.

I don't use anything else like CRM, storage, or any architecture management tools; just Data Hub.

I would give Data Hub a score of seven out of ten, summarizing everything that I've discussed about the product.


    Azhagarasan Annadorai

Catalog has centralized PII ownership and collaboration but still needs better automation and UI

  • February 22, 2026
  • Review from a verified AWS customer

What is our primary use case?

My main use case for Data Hub is to enrich the metadata to classify for PII data. As an administrator, I crawl a number of data sources and bring the metadata into a single place, then assign the ownership, such as a data owner or steward, for all the data assets. With their help, we classify the data into PII direct and indirect, sensitive, non-sensitive, and so on. We add tags and glossary terms onto the data elements. The main use case is for DSAR compliance; for GDPR DSAR compliance, we try to identify the PII data in the catalog so that we know where the PII data is in our data inventory.

How has it helped my organization?

The catalog helps with metadata discovery and to find the owners/ stewards of data sources. Without a data catalog, we scramble around and speak to multiple teams, which is time consuming.

What is most valuable?

The best features that Data Hub offers include the management of ownership, with standard out-of-the-box ownership such as business owner, data steward, or technical owner, which is relevant for us. It also integrates with Active Directory. In our Active Directory, we maintain certain roles based on the scrum teams related to a team member, and by integrating with Active Directory, we are able to bring the same roles and map them to the corresponding ownership roles within Data Hub. Data Hub has integrations with Slack, Snowflake, BigQuery, and so on, which we use.

Data Hub has positively impacted our organization by bringing the tribal knowledge that resides with team members into a single place where users can discover and understand the data elements before they make use of it. Users can ask questions via Slack to understand how a data element is defined and get the answers back. This definitely saves time; without a data catalog in place, users need to ask around to find out what a particular data element means and to find out the owners. Now, with the data catalog, searching and discovering data elements and the corresponding owners is easier, saving approximately thirty to forty percent of the time that would have been spent finding out the owners and definitions of the data elements.

What needs improvement?

Data Hub can be improved with more automation; there are some inbuilt automations, such as documenting definitions of data elements using AI, which is useful. I wonder if it can automate the classification exercise, possibly using AI to auto-classify PII direct and indirect items.

For how long have I used the solution?

Just started using it.

What do I think about the stability of the solution?

Data Hub is stable.

What do I think about the scalability of the solution?

Data Hub shows scalability in terms of the number of users and the number of new databases and data elements.

How are customer service and support?

We have not gone that far with customer support; as far as the POC is concerned, we received good support from the team and the sales team that helped us evaluate the tool.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

We previously used OvalEdge as our data catalog before switching to find a tool that has more AI capability and allows extension of usage to non-technical users, seeking a tool that is less clunky and more intuitive.

How was the initial setup?

slightly techical. But there is enough documentation available.

What about the implementation team?

No

What was our ROI?

I have not yet seen a return on investment, and I do not have that information to share.

What's my experience with pricing, setup cost, and licensing?

Regarding experience with pricing, setup cost, and licensing, I think if we have a budget of one hundred thousand US dollars, we will be able to deploy a reasonable version and connect to a number of data sources.

Which other solutions did I evaluate?

Before choosing Data Hub, we compared it with Atlan and Alation.

What other advice do I have?

I chose seven out of ten because there are better catalogs available in the market that offer more features. The UI, especially when setting up new data sources and crawling them, is a little cumbersome, but it is a one-time activity, so it is manageable; however, the UI could be improved concerning administration.

My advice to others looking into using Data Hub, also known as Acryl, is that it is a reasonably stable product that satisfies most data catalog use cases; however, Atlan appears to be the closest competitor, while Alation is the market leader among the three. Data Hub has an open-source version I believe, and it may be worth considering that option as well.

I rated this review seven out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    reviewer2784462

Centralized metadata has empowered governed data discovery and clarified ownership for all teams

  • December 27, 2025
  • Review from a verified AWS customer

What is our primary use case?

We adopted Data Hub in the context of a large enterprise customer operating in a regulated industry with a strong focus on data governance, data discoverability, and ownership clarity across multiple cloud-native platforms. The solution was deployed on AWS, and the main business problem was the lack of a centralized, reliable view of data assets, including poor data discoverability, unclear data ownership and stewardship, limited lineage visibility across ingestion and transformation layers, and high dependency on tribal knowledge held by a few individuals. Data Hub was selected as an enterprise data catalog and metadata backbone with the goal of enabling both technical teams and business users to easily understand, trust, and reuse data.

We used Data Hub to create very good data discoverability, assign data ownership and stewardship, improve data quality processes, and establish good data governance for our customer in terms of data catalog, data lineage, and metadata management in general.

What is most valuable?

Our key benefits that we achieved include centralized metadata management across multiple AWS services and data platforms and improved data discoverability, significantly reducing the time required to find relevant data sets. Clear data ownership and stewardship improved accountability and collaboration between teams. End-to-end lineage visibility enabled faster impact analysis and safer changes, and faster onboarding of new data users through self-service access to documentation and metadata. From a governance perspective, Data Hub became a single source of truth for metadata, supporting both compliance requirements that are very important in a data governance environment and day-to-day operational needs.

The main strengths we experienced with Data Hub are a strong metadata model and its extensibility because Data Hub offers a rich and flexible metadata model that adapts well to complex enterprise scenarios. Excellent lineage capabilities are provided because the lineage visualization is clear, actionable, and extremely useful for impact analysis and governance workflow. The open source foundation with enterprise readiness is significant because the open architecture avoids vendor lock-in while still being suitable for production-grade environments.

Data Hub is very effective for us because we build the data lineage from the beginning, from origination to visualization, to the final use of the data. We follow and track a path of the data, which improves analysis and enables us to find where data is used and the impact of deleting data. This is also very important in a regulatory environment.

What needs improvement?

The impact is very positive, and there are many benefits for us using Data Hub because it was easier to make data governance, create centralized metadata management, improve data discoverability, and manage data in general. The areas for improvement, in my opinion, are the initial setup and configuration that can be complex without prior experience, especially in large-scale environments. User experience for non-technical users could be further simplified, particularly around advanced metadata concepts. The out-of-the-box governance workflow, for example, approvals and certification, could be more prescriptive for customers at early maturity stages.

Data Hub can be improved in the initial setup and configuration that is somewhat complex, and also in operational monitoring that could benefit from more native dashboards and alerts. However, these are not blockers, but areas where additional guidance or product enhancement would further accelerate adoption.

For how long have I used the solution?

I have been using Data Hub since 2023.

What other advice do I have?

Based on internal measurement and feedback from the data teams, there are many impacts. Time to locate and understand a data set was reduced by approximately 40-50 percent. Manual documentation effort was reduced by around 40 percent. Dependency on senior data engineers for data explanation dropped significantly. Data onboarding time for new team members decreased from weeks to days.

I would rate this product a 9 out of 10. I chose nine because Data Hub proved to be a robust, scalable, enterprise-ready data catalog that is well-suited for AWS-based architecture and complex organizational environments. It is always possible to improve and useful to maintain space for further optimization.

My advice is to use Data Hub to move from fragmented metadata and manual processes to a modern, governed, and self-service data ecosystem, delivering clear value in terms of efficiency, cost saving, and data trust. We would confidently recommend Data Hub to organizations looking to improve data governance, data discovery, and metadata management on AWS.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)


    reviewer2784771

Analytics work has become more efficient and now processes large datasets with flexibility

  • December 04, 2025
  • Review from a verified AWS customer

What is our primary use case?

My main use case for Acryl Data is analytics.

What is most valuable?

Acryl Data helps with processing large amounts of data as it is a very good tool that gives good flexibility to store a huge amount of data and is easier to use. The UI is good.

The best features Acryl Data offers include storage. When I mention storage, I refer to its scalability.

The positive impact of Acryl Data is that it has increased efficiency.

What needs improvement?

I do not have comments on how Acryl Data can be improved.

For how long have I used the solution?

I have been using Acryl Data for two years.

What do I think about the stability of the solution?

Acryl Data is stable.

What do I think about the scalability of the solution?

Acryl Data's scalability is good.

How are customer service and support?

The customer support is good.

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

I did not previously use a different solution.

How was the initial setup?

My experience with pricing and setup was good.

What was our ROI?

I have seen a return on investment as it has saved time.

Which other solutions did I evaluate?

Before choosing Acryl Data, I did not evaluate other options.

What other advice do I have?

My advice to others looking into using Acryl Data is that they can use it. I gave this product a rating of 9.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    reviewer2784384

Simple data insights platform has boosted development speed and revealed top purchasing customers

  • December 04, 2025
  • Review from a verified AWS customer

What is our primary use case?

My main use case for Acryl Data is to extract insights from customer data. I use Acryl Data for a project in order to identify all the customers and find out which customer buys a lot of items.

What is most valuable?

The best feature Acryl Data offers is the simplicity of the UI. The UI is simple for me because it is easy to navigate. Acryl Data has positively impacted my organization by speeding up all the development. It sped up development because the team can access data faster, improving speed by approximately 50%.

What needs improvement?

The product cannot be improved in just one area. There are no points in support or documentation that require improvement. There are no improvements needed for Acryl Data that I have not mentioned yet.

For how long have I used the solution?

I have been using Acryl Data for five months.

What do I think about the stability of the solution?

Acryl Data is stable.

What do I think about the scalability of the solution?

I think the scalability of Acryl Data is a good point.

How are customer service and support?

The customer support is fine; we do not need any customer support, but I think it was fine.

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

I did not previously use a different solution; I have no experience with any other solutions.

What was our ROI?

I have seen a return on investment through time saved and also money saved. I do not have specific numbers or examples about the time or money saved.

Which other solutions did I evaluate?

I did not evaluate other options before choosing Acryl Data; I evaluated only this option.

What other advice do I have?

My advice to others looking into using Acryl Data is to start faster with the analytic insights. I would rate this product a 10.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


showing 1 - 5