Overview
IBM watsonx.data is an open, hybrid, and governed data store built on an open data lakehouse architecture. The data lakehouse is an emerging architecture that offers the flexibility of a data lake with the performance and structure of a data warehouse. Watsonx.data is an enterprise-ready data store that enables hybrid cloud analytics workloads such as data engineering, data science and business intelligence, through open-source components with integrated IBM innovation.
Watsonx.data will allow users to access their data through a single point of entry and run multiple fit-for-purpose query engines such as Presto and Spark across IT environments.With the integration of DataStax Astra DB, watsonx.data now extends beyond analytics to support real time operational workloads and advanced AI applications. Astra DB brings enterprise-grade vector database capabilities and multi-model data support, enabling organizations to build generative AI applications, real time recommendation engines, and high-performance operational systems,all within the same unified platform. This integration eliminates the need for separate operational databases and provides seamless data flow between transactional and analytical workloads. Through workload optimization an organization can reduce data warehouse costs by up to 50 percent by augmenting with this solution. It also offers built-in governance, automation and integrations with an organization's existing databases and tools to simplify setup and user experience.
Db2 Warehouse and Netezza on AWS natively integrate with watsonx.data with shared metadata and support for open formats such as Parquet and Iceberg to share and combine data for new insights without ETL. Watsonx.data allows customers to augment data warehouses such as Db2 Warehouse and Netezza and optimize workloads for performance and cost.
For trials and customized IBM watsonx.data pricing contact your IBM Sales Representative or email us at watsonx_on_AWS@wwpdl.vnet.ibm.com Visit https://www.ibm.com/products/watsonx-data
to learn more about our consumption model and product editions.
For more information on IBM watsonx.data visit https://www.ibm.com/products/watsonx-data
Highlights
- Access all your data across hybrid-cloud: Access all data through a single point of entry with a shared metadata layer across clouds and on-premises environments.
- Get started in minutes: Connect to storage and analytics environments in minutes and enhance trust in data with built-in governance, security, and automation.
- Reduce the cost of your data warehouse by up to 50% through workload optimization: Optimize costly data warehouse workloads across multiple query engines and storage tiers, pairing the right workload with the right engine.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/unit |
|---|---|---|
WXD_PG_SL1 | IBM watsonx.data as service pay per use 1 RU | $1.00 |
Vendor refund policy
Please contact your client account team for refund information
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products



Customer reviews
Effortless Data Management, Inclusive Governance
Flexible, High-Performance Lakehouse for Modern Analytics at Scale
What has been most helpful is the way it reduces complexity when working across multiple data environments. It improves productivity by making data more accessible without creating unnecessary movement or duplication. Performance has been solid for large-scale querying, and the platform’s AI-focused design is a major plus for teams building analytics and machine learning workflows. From an ROI perspective, it can help control costs by improving efficiency and reducing manual effort. Support, documentation, and onboarding are also strong enough to make adoption smoother for enterprise teams.
Integrations are powerful but not always straightforward to set up, and sometimes require extra effort from the data engineering side. Performance is generally good, but in some cases, you still need to fine-tune things manually to get the best results.
Pricing can also be a concern for smaller teams, as the value is more noticeable at scale. During onboarding, documentation is helpful but could be more practical with real-world step-by-step examples.
On the AI side, the foundation is strong, but I feel there’s still room for improvement in terms of smarter automation and more intuitive recommendations.
With watsonx.data, we’re now able to access and query data across multiple environments without heavy data movement. This has simplified our workflow a lot. The UI makes it easier to explore datasets, and integrations with existing tools mean we didn’t have to rebuild our entire setup.
Performance has improved noticeably for large queries, which has reduced turnaround time for analytics. From a business perspective, this means faster decision-making and less dependency on manual data handling.
On the AI side, having data in a more organized and accessible format has made it easier to prepare for analytics and machine learning use cases. It’s not fully automated yet, but it definitely reduces the effort required to get data ready.
Overall, it has helped us save time, reduce complexity, and improve efficiency when working with large-scale data, which directly impacts productivity and long-term cost optimization
Powerful, Secure, and Scalable Platform with Easy Data Migration
There is also infrastructure manager in this platform which enhances visibility of the infrastructure components. It provides better understanding and effectiveness. The capability of its AI assistant in this platform is also good and can ease the task with its assistance. One best part of this platform is the IBM Ecosystem of this platform that makes this platform more robust.
IBM watsonx.data: Solving Data Silos and Accelerating AI with a Unified Lakehouse Platform”
In many organizations, data is stored in multiple silos—different clouds, on-prem databases, and data warehouses. This makes it hard to access, analyze, and use data for AI. watsonx.data brings all that data into one unified lakehouse platform so teams can access it from a single place without constantly moving or duplicating it. IBM designed it to simplify data engineering, analytics, and AI development on top of trusted data.