Listing Thumbnail

    Apache SkyWalking on ObserveAny - Pay As You Go

     Info
    Sold by: ObserveAny 
    Focus on building apps and not managing Apache SkyWalking with a scalable, resilient and secure Observability platform. Observability with SkyWalking made simple on AWS.
    4.2

    Overview

    What is ObserveAny? ObserveAny is a fully managed, truly cloud-native Apache SkyWalking service for integrating and processing all of your data in real-time, no matter where it lives. With ObserveAny fully managed cloud service on AWS, you can eliminate the burdens and risks of self-managing SkyWalking and focus more time on building apps that differentiate your business.

    Pay as you go provides a no commitment, low friction way to quickly get started with ObserveAny by paying only for what you use.

    To learn more about our cluster types, available features, and pricing, go to https://www.observeany.com/ 

    Highlights

    • Observable system based on Apache SkyWalking
    • Quickly deploy modern monitoring and security in one powerful observability platform.
    • Create actionable context to speed up, reduce costs, mitigate security threats and avoid downtime at any scale.

    Details

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Apache SkyWalking on ObserveAny - Pay As You Go

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (1)

     Info
    Dimension
    Cost/unit
    ObserveAny Hosts per hour - 45 Day Retention
    $2.03

    Vendor refund policy

    Pay As You Go

    How can we make this page better?

    Tell us how we can improve this page, or report an issue with this product.
    Tell us how we can improve this page, or report an issue with this product.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Resources

    Support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    25
    In Issue & Bug Tracking
    Top
    10
    In Application Performance and UX Monitoring
    Top
    100
    In Log Analysis

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    1 reviews
    Insufficient data
    Insufficient data
    Insufficient data
    Insufficient data
    2 reviews
    Insufficient data
    Insufficient data
    Insufficient data
    Insufficient data
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Distributed Tracing and Monitoring
    Observable system based on Apache SkyWalking for monitoring distributed applications and microservices
    Cloud-Native Architecture
    Fully managed, cloud-native service deployed on AWS eliminating self-management overhead
    Real-Time Data Processing
    Real-time integration and processing of observability data from multiple sources
    Security Integration
    Integrated security monitoring capabilities within the observability platform for threat mitigation
    Scalable Infrastructure
    Scalable and resilient infrastructure supporting observability operations at any scale
    Data Ingestion and Query Performance
    Ingests petabytes of telemetry per day with capability to process hundreds of terabytes and execute tens of millions of queries daily without performance degradation
    Knowledge Graph Architecture
    Utilizes O11y Knowledge Graph to structure and correlate data across logs, metrics, and traces for fast search and correlation capabilities
    Natural Language Processing for Incident Analysis
    Enables troubleshooting of complex incidents using natural language queries through O11y AI for accelerated root cause analysis
    Open Data Lake Foundation
    Built on Snowflake data lake architecture providing open data storage without vendor lock-in
    Multi-Signal Correlation
    Correlates and correlates telemetry signals across logs, metrics, and traces with context-aware analysis for incident resolution
    Multi-Source Data Integration
    Collects and correlates data from more than 600 vendor-backed technologies and APM libraries in a single platform
    Unified Observability Dashboard
    Provides full visibility into health and performance across all environment layers through a single pane of glass
    Infrastructure and Application Monitoring
    Monitors underlying infrastructure, supporting services, applications, and security data simultaneously within one observability platform
    Rapid Deployment and Setup
    Enables quick deployment through turn-key integrations and easy-to-install agent for monitoring servers and resources
    Security and Performance Analytics
    Delivers actionable insights to reduce costs, mitigate security threats, and prevent downtime across any scale

    Contract

     Info
    Standard contract
    No

    Customer reviews

    Ratings and reviews

     Info
    4.2
    3 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    33%
    67%
    0%
    0%
    0%
    2 AWS reviews
    |
    1 external reviews
    External reviews are from PeerSpot .
    Aditya Bhatt

    Monitoring has transformed issue resolution and now reveals full application and container health

    Reviewed on May 14, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Apache SkyWalking  includes not only monitoring microservices and APIs but also managing the entire health of the application. I will explain the domains and backgrounds where we currently use it. It does more than check microservices or heavy queries. It can be integrated from the IT point of view, where your IT team can easily integrate it on the DevOps side or where applications are being deployed. It is widely used for managing the entire health of the application, checking the current status and health of the application, and how your services are currently running. When I mention services, this essentially means your queries, including database queries or backend logic that has been written to perform either up-syncing or down-syncing of data into the database or retrieving something, updating or inserting queries. Apart from that, it can be used for checking how your APIs are working, such as out of multiple API calls, how many are succeeding versus failing. If there is a particular timeout, I can see the frequency of recurrence and the time duration of the timeout or how long the network is unreachable.

    Even in payment applications where we have multiple applications, some related to payment processing may be failing or insert queries may not be working. There could be multiple layers on the backend side of the architecture design, and during issue resolution, it is very difficult to analyze where the actual pain point area is. Apache SkyWalking  really helps identify that a particular API call failed on the payment side, perhaps deep down three layers in the architecture design, so you can see that it is failing because of a specific reason, such as network timeout, unreachable network, the bank server being down, or a third party payment server integration not responding due to heavy traffic load.

    In some other domains, beyond checking health, if your applications or servers are running on pods or Kubernetes  containers, you can check the health of your pods as well. We have moved from outsystems to Mendix  and other Java hosted applications and .NET, which all utilize Kubernetes  and nodes. You can easily check which node is working fine and which is not in good state, how much traffic is currently passing through those containers or nodes, how they are integrated, and which one is responding fine versus not responding well enough. These are many areas where you can easily identify issues with the help of Apache SkyWalking. Because of its open use case platform, it helps from the licensing point of view and covers a wide area of use cases.

    In terms of projects, I would like to share a couple of examples. One of our patient services applications was facing issues with API failures. It was initially identified that this might be because of Java database upgradation, the fact service getting down, or perhaps a global outage of some database server, so the entire API services was getting affected. Then some fact line services started getting impacted, and because of that, a few of our Mule APIs were not working fine. Since the project had the dependency of cross-functional team members, each team was trying to identify where the actual cause was lying. At a high level, we thought that the Java API might not be connecting properly with the fact API or the Mule API internally calling the fact API, which was not getting reached properly. Someone was trying to reach out to the Mendix  team to see if they could figure out and find the logs, and it could be the .NET or other applications depending on what kind of application the team was currently working on. With the help of Apache SkyWalking, you can definitely have this in place and easily identify that for this particular time duration, this was the API call that went off and this was the feature that got stopped, and these are the documents that did not reach properly. You can easily identify the area and reach out to that team, stating that you need to check out these particular APIs, and you can reach out to the support team or the vendor if needed so that on the particular SLA, those can be taken care of on priority.

    Apart from that, there is one more use case I would like to share regarding one of the applications on the local platform we built. Apache SkyWalking can be integrated there also because most of the time when a lot of traffic is coming for a particular second, there is sometimes a huge spike on Grafana  or the logs and it is very hard to see that for a particular instant this much huge traffic is coming while your CPU or memory is quite low. You need to increase your space, but the logging is not able to maintain properly or pods are getting crashed and new pods are getting recreated. It is very hard to identify the logs to understand what is happening. Even in that area, you can easily integrate Apache SkyWalking and easily identify your Kubernetes containers and node health.

    What is most valuable?

    Apache SkyWalking offers the best features for integrating into the IT department to check microservices, the entire end-to-end health of the application, the node, Kubernetes, which queries are running fine, and which queries are running slow. From the SLA side, most queries should get completed within 200 milliseconds. If it is taking longer than that expected time, someone has to take the initiative to see where the room for improvement is.

    I have been using Apache SkyWalking while encountering a couple of scenarios in the IT department along with a couple of projects we were working on. That is where I was doing some self-exploration to see how we can try to get through the bottlenecks of the root causes and how we can easily identify what the RCA is, why lots of microservices and APIs are getting failed, and what the bottleneck is. Because that project had a dependency of cross-team members, that is where I got to know about Apache SkyWalking and explored it. It is a really wonderful tool to go ahead with the IT team.

    Apache SkyWalking helps me visualize data and performance by easily visualizing how the entire ecosystem is currently working. For example, if we have lots of Kubernetes containers in place and nodes being interconnected to multiple projects or products inside the organization, manually it is very hard to check out and take the export of the health of the containers and see how the traffic is going through which container is fine or bearing a lot of load and how we can shift it. Manually, it is going to take a lot of time. Visualizing it with the help of Apache SkyWalking is going to be a game-changer in such a way, reducing your time on that. You can easily visualize how the entire ecosystem is currently working. You can see where the current health is pretty much good and where the health of the system is degrading so that concern can be put into that sector as soon as possible.

    Apache SkyWalking has positively impacted my organization by reducing the time of the team so that they can put in more efforts into their other tasks, saving a lot of time, improving our SLA in resolving any issue, providing good RCA analysis to the leadership team, and helping us in monitoring the entire health in a shorter time span.

    What needs improvement?

    Apache SkyWalking can be improved by enhancing a few things. The learning curve is definitely there, so it needs a good learning curve. Your engineers or experts need to be pretty much handy and sound on the technical side to use this platform. If it comes to customization, that is where it needs a deep understanding. The normal configuration you can easily do, but on the heavy customization side, it needs a good learning curve.

    Secondly, on storage management, because you are doing the entire health checking and continuously monitoring the entire ecosystem, it occupies a lot of space. You need to either purge that storage and have something that can be recycled or put in some additional space in the archive section, and you can easily retrieve that after a certain period of time. If a million of records or traces are there, it becomes a heavy task. Storage management is something where it can be improved or explored much more to provide much more ease and convenience to the users who are opting for it.

    Thirdly, some UI modifications can also be done to make it much more beautified. I would add that Apache SkyWalking should improve the storage complexity and how we can easily manage that, and some customization and heavy customization learning curve can be reduced. Making the UI much more convenient and much more beautified for the end user would be beneficial.

    What do I think about the stability of the solution?

    Apache SkyWalking performs well in terms of reliability and uptime. The only issue is with managing its storage complexity, which is the major one. Apart from that, it is pretty good.

    What other advice do I have?

    Apache SkyWalking integrates with other tools or platforms in my environment easily if you have a good learning curve on that and are able to understand the technicalities. The DevOps team will be able to do that easily.

    Apache SkyWalking handles security and compliance requirements in my environment effectively, and I have found it helpful in identifying where the pain point area is while helping us in the RCA.

    My advice for others looking into using Apache SkyWalking is that if someone is really interested in identifying the entire health of the application and how the ecosystem is currently working without giving load to the developers to write down any particular code to check health statuses, they can definitely go with Apache SkyWalking. That helps in identifying the entire thing for their application ecosystem, Kubernetes, cloud services, nodes, and monitoring their microservices call, the API calls, and current functioning. Deep down inside the architecture planning, it helps in identifying and monitoring and managing the entire application, helping you to reduce your SLA in solving out your issues, any kind of hot potato incidents or hot fixes or any impediments which the team is getting blocked with. That is where it helps a lot in identifying the area of improvement and also sharing the reports with the higher leadership team members. I would rate this solution an 8.5 out of 10.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Jaroslav Fikker

    Monitoring has accelerated root cause analysis and provides clear dashboards over time

    Reviewed on Feb 07, 2026
    Review provided by PeerSpot

    What is our primary use case?

    Our main use case for Apache SkyWalking  is for monitoring Java application servers such as Tomcat  and WebSphere Liberty.

    How has it helped my organization?

    We have continuous information about the status of the application servers.

    What is most valuable?

    The best features that Apache SkyWalking  offers are its user interface with graphical panels and charts. What I appreciate most about the user interface and panels is the ability to use different time periods in the chart and track resource consumption over time.

    Using Apache SkyWalking has had a positive impact on my organization because it has enabled us to identify the causes of various problems more quickly. Identifying causes is now approximately five times faster than before.

    What needs improvement?

    Apache SkyWalking can be improved by responding more quickly to new versions of monitored products, including, for example, taking into account changes in Java class names.

    For how long have I used the solution?

    I have been using Apache SkyWalking for about two years.

    What do I think about the stability of the solution?

    Apache SkyWalking is stable in my experience.

    What do I think about the scalability of the solution?

    We haven't had any issues with Apache SkyWalking's scalability because we use it in a relatively small environment where we monitor dozens of servers.

    How are customer service and support?

    My experience with Apache SkyWalking customer support has been good. I have contacted the support community several times and the responses have been very quick.

    Which solution did I use previously and why did I switch?

    Before Apache SkyWalking, we used Broadcom CA Spectrum . We decided to switch from Broadcom CA Spectrum  to Apache SkyWalking because of the license cost, as Apache SkyWalking is free.

    How was the initial setup?

    The initial installation of Apache SkyWalking went smoothly according to the available documentation.

    What was our ROI?

    I can't say how much ROI we've gotten from using Apache SkyWalking because I don't know the price of the original product (Broadcom CA Spectrum).

    What's my experience with pricing, setup cost, and licensing?

    Our experience with pricing, setup cost, and licensing for Apache SkyWalking is positive since it is free, which was the reason for our decision to use it.

    Which other solutions did I evaluate?

    We did not evaluate other options before choosing Apache SkyWalking.

    What other advice do I have?

    I recommend others looking into Apache SkyWalking to try it. I have given this review an overall rating of 8.

    reviewer2784462

    Tracing has revealed hybrid bottlenecks and delivers full visibility into critical payment flows

    Reviewed on Jan 05, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Apache SkyWalking  is a project that started in 2023 for a retail client facing serious performance issues on their new distributed architecture on AWS . The technical criticality is clear. We have an observability black hole on a high-traffic payment flow where we cannot distinguish if latencies are caused by microservices on Amazon EKS  or by calls to legacy on-premises databases. We chose Apache SkyWalking  through the AWS Marketplace  to integrate it immediately into the existing infrastructure with the goal of monitoring a massive environment consisting of over 80 microservices and about 600 active pods. This solution allows us to manage and analyze volumes in the order of 50 million traces per day, correlating every single end-to-end transaction in real time from front end to database and pinpointing bottlenecks that are invisible with traditional logging systems.

    How has it helped my organization?

    By analyzing traces provided by Apache SkyWalking, we identified a high volume of redundant and inefficient calls between microservices specifically, and synchronous calls that could be converted to asynchronous processes. Without this insight, we were over-provisioning our AWS  resources just to keep the system stable. Once we applied the optimization suggested by the trace data, the system became significantly more efficient. We were able to process 30% more transactions per second using the same infrastructure, which not only improved the end-to-end user experience during the high-traffic sales peaks but also optimized the client's cloud spend by increasing the overall density and performance of our existing 600 pods.

    What is most valuable?

    Apache SkyWalking provided full visibility into the black hole because before using it, we could not see what was happening when a request left Amazon EKS  and went to our on-premises legacy databases. Apache SkyWalking's distributed tracing correlates these two worlds in a single view, showing us that 40% of the latency was actually happening in the network hop between the cloud and the physical data center, not in the code itself.

    Second, it exposes hidden architectural flaws. By using the automatic dependency mapping, we discovered that some microservices were stuck in a cyclic dependency which was documented nowhere. This visual evidence allowed us to refactor the logic and immediately increased our throughput by 30%.

    Apache SkyWalking gave us database-level insight without database access. Through its slow query monitoring, the Java agents captured the exact SQL statements that were hanging during peak sales hours. This meant our developers could fix the exact line of code or index without needing to wait for a DBA to pull logs, reducing our mean time to resolution.

    There are many features that are useful to mention in this case because we obtained different benefits. Apache SkyWalking automatically drew the topology of the 600 pods where we discovered cyclic dependencies between services that no one had documented before and that were slowing down the system. Another valuable feature is resolving hybrid bottlenecks because we isolated a specific network issue between AWS and the physical data center. Without distributed tracing, infrastructure teams blame Java code and vice versa. Database tuning is also important because thanks to slow query metrics captured by the agent, we identified and rewrote the SQL queries that most impacted performance during sales peaks.

    What needs improvement?

    Apache SkyWalking can be improved with storage management complexity because with this volume of 50 million traces a day, managing data retention on OpenSearch  is critical. We had to implement custom logic to purge old data to prevent storage costs from exploding. Another area for improvement is the alert configuration because the out-of-the-box alerting system is basic, and we had to manually write complex rules to avoid false positives in activities that require expert time.

    For how long have I used the solution?

    Apache SkyWalking is the first solution for this purpose that I have used in my professional experience.

    What do I think about the stability of the solution?

    Apache SkyWalking is really stable for us.

    What do I think about the scalability of the solution?

    Handling 50 million traces a day is not trivial. Apache SkyWalking architecture, backed by a well-sized OpenSearch  cluster on AWS, handled the load without any loss.

    Apache SkyWalking is really scalable because we can break down its scalability into two main areas: horizontal scalability of the backend and storage scalability with Amazon OpenSearch. The real beauty of the system is in its non-blocking nature. The agents use a sampling strategy and asynchronous reporting, meaning that even if the backend is under heavy load, it never slows down the actual retail application. It is built for high-concurrency environments which gives us the confidence to monitor 600 pods simultaneously without fearing a system-wide collapse.

    What was our ROI?

    The biggest impact is the features that allow us to stop the blame game between the network, cloud, and database teams. By looking at the colored lines on the topology map, which turn red when latency exceeds our threshold, we can instantly see exactly where the bottleneck is located. It transforms a four-hour investigation into a five-minute visual check, and that is the key factor in improving our MTTR and increasing the system's overall throughput by 20-30%.

    The mean time to resolution, the time to diagnose a critical incident, drops from four hours to less than one hour and 45 minutes. Coverage also increases; we went from siloed visibility to 100% tracking of critical payment transactions. These are our success metrics.

    What's my experience with pricing, setup cost, and licensing?

    It is a good experience with pricing, setup cost, and licensing, but I am not in charge of this; the customer is in charge of the purchasing of the tool.

    What other advice do I have?

    Apache SkyWalking is a very nice tool and an exceptional tool for managing volume and complex architecture on AWS without the prohibitive cost of commercial suites. I would give this product a rating of 9 out of 10.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    View all reviews