Cisco Cloud Observability Operators

Sold by

Cisco Systems, Inc.

Kubernetes Operator to install Cisco Cloud Observability Collectors

Leave a review

Ratings and reviews

4.2

67 ratings

5 star

4 star

3 star

2 star

1 star

49%

45%

29 AWS reviews

38 external reviews

External reviews are from PeerSpot .

Filters

Review type

AWS Marketplace reviews

External reviews

Reviews (67)

Purnambica Kolavennu

Unified observability has improved real-time governance and now drives data-led decisions

Reviewed on Jun 19, 2026

Review provided by PeerSpot

What is our primary use case?

I am Purnambica Kolavennu and I have been working for the past two to three years with the Agriculture Skills Council of India. I work at the intersection of technology and AI innovation in the agriculture skilling platform. I have been working in a similar portfolio for approximately five to seven years.

We have been using Splunk Observability Cloud for the past two years. At the intersection of agriculture and AI innovation, real-time visibility monitoring and analytics across the entire agriculture skilling ecosystem is extremely critical. We are helping various organizations, clients, training partners, assessment bodies, and government stakeholders improve their service delivery, audit compliance, and ultimate outcomes.

We are doing end-to-end monitoring of skilling platforms through our unified cloud-native SaaS platform, which is Splunk Observability Cloud. We are troubleshooting any kind of foreign incumbents or cyber threats. We are monitoring the health and performance of various systems including training systems, assessment portals, attendance portals, and learning management systems. All these components are monitored end-to-end. The entire monitoring of our training and assessment activities has become extremely easy, and we are able to make our clients happy with this solution. We are monitoring the real-time dashboard for PMKVY and government schemes.

What is most valuable?

Splunk Observability Cloud is helping us derive and fetch data-driven decision-making insights. These fresh insights help us take better decisions and improve scheme monitoring for PMKBY, PM Vishakarma, and other state programs. When monitoring assessment operations, our daily activities are critically handled here, and we have been able to reduce our assessment delays. We are able to make better compliance with guidelines and norms, and we can early detect bottlenecks or foreign threats or cyber threats coming into the system.

Through this platform, we have started doing automated identification of eligible and non-eligible candidates, which has improved the system greatly. Now there is more transparency and credibility and authenticity in the entire process. We have also reduced the manual verification effort, which has greatly helped our entire team. We are doing many activities in predictive analytics where we are trying to identify and gauge early intervention to protect scheme guidelines and enhance infrastructure performance monitoring of our various applications in a unified manner.

We have been able to have more traceability because the Log Observer Connect is a very useful functionality that has been given to our clients and various vendors. Through this, centralized log data for visibility is available to all partners, which allows seamless flow of information and data. We have been able to have more AI and automation integration, through which our manual effort has been reduced. We are doing specialization monitoring of the AI agents and our infrastructure stacks.

Splunk Observability Cloud is a very strong cloud-native SaaS platform designed for monitoring and troubleshooting cyber environments. The mean time to resolution functionality, which is an MTTR functionality, enables different kinds of intervention. Application performance monitoring is a very strong feature providing deep code-level visibility into distributed applications. It is also helping us manage each data flow into the system from one end to another end. Infrastructure monitoring is most important because improving operational efficiency is very critical for our organization. For that, we need real-time streaming analytics and automatic service discovery, which we are able to achieve through infrastructure monitoring. Real user monitoring of various applications is also a very strong feature, capturing the complete end-user experience from both web and mobile applications. The application proactively monitors service performance, APIs, and URLs globally before end users are impacted.

The strongest feature is unified application performance monitoring. We are working on approximately twenty-plus applications at a time, fetching data insights from them, and tracking their performance and identifying any bottlenecks or cyber threats. This would be extremely difficult if done manually, as many eyeballs looking for performance metrics would be necessary. This unified SaaS platform is helping us out and giving us full visibility and full business control, providing full business control in terms of mapping all performance metrics, deriving decision-making insights, and helping our people.

There is a very strong aspect of compliance monitoring and audit readiness for our organization because we are directly governed by the government of India. Fraud detection is helping us a lot because there are many times when confidential information and data flow are attacked by suspicious login patterns, unusual duplicate registrations, unauthorized system access, and anomalies that come into the system in the form of cyber threats. The system is doing twenty-four-seven monitoring of the various applications we are using, and because of that, there is enhanced compliance and improved audit readiness, which has actually reduced the risk of malpractices. The system is keeping us very clean in terms of compliance practices, and our service delivery has been excellent. Our stakeholder service management experience has increased greatly because clients are happy, there is faster issue resolution, and satisfaction is being built with our clients.

There is a very strong performance monitoring feature where we are able to track the entire performance metrics end-to-end because there is a single performance control center through which all dashboards and application visibility comes to us in just one click. There is real-time monitoring happening, which is helping us a lot. Our operational productivity and efficiency has improved. There is better audit and regulatory compliance now. Our clients are happy because they do not have queries raised on their processes. Twenty-four-seven issue resolution metrics are incorporated, which is helping us a lot. There is predictive insights, which is a kind of proactive insight, helping us in decision-making. There is strong unified governance. We are seeing a thirty percent to fifty percent reduction in incident resolution time, which is helping us a lot. There is higher compliance metrics, improved batch completion rates, and better visibility of all processes and systems. The strong data-driven decision-making is creating a very strong ecosystem and environment throughout.

There is a twenty-four-seven ticket resolution system and a very advanced chatbot system, which is a humanized form of chatbot system that works on a four-to-six-hour resolution time. For many of the applications, resolution time is twenty-four to forty-eight hours, but for them, it is approximately four to six hours. There is an embedded feature that if an issue is not addressed within thirty minutes or within one hour, then it is escalated to a higher level authority and quick resolution time is attained. Our manpower in terms of resolution and in terms of following up on tickets has been extremely reduced. Through the chatbot system, strong chatbot system support, and their email support, we have been able to reduce our burden greatly.

What needs improvement?

Log Observer Connect is embedded here, but we are facing some delays in centralized log collection and analysis, which can be further fastened. We are collecting all the data metrics and decision-making insights, but all these data-driven decisions coming from different applications are not connected somewhere. A consolidated form or correlation of these insights is not happening between each other due to which we feel we are missing something significant.

Some generalized feedback includes that predictive alerts or alarms which can be integrated with AI-driven alarms and alerting features should be established so that there is AI-driven intelligence and anomaly detection happening with a complete systematic process in service delivery. Application dependencies are huge, and business and operational dashboards should be improved. Right now there are very interactive custom dashboards, and every now and then, the personalization of enhancements keeps happening. KPI monitoring, executive reporting, and analytics have definitely been introduced to a great extent. There are few things in cloud-native monitoring, such as integration with AWS and Azure, where we sometimes do face lags. Those things can definitely be improved upon.

I have used Datadog and Dynatrace before using Splunk Observability Cloud. Datadog was definitely recommended by most of our peers because of its very strong comprehensive observability and very strong and unique dashboard systems. Dynatrace was also very good because they have offered a lot of AI-driven analysis methods and processes, which was helping our organization a lot. Since our organization has a very strong IT ecosystem for agriculture, very different kinds of customized things are required.

What do I think about the stability of the solution?

Splunk Observability Cloud is very, very stable. We are using approximately twenty-plus applications, and the system has the capacity to increase applications to up to ninety. There was a time when we were having sixty to seventy applications to be monitored in one go, but there was never any outages or downtime. We have never faced any kind of downtime or performance issue. It is highly scalable because it can handle approximately up to one hundred applications at a time without any lapse or lag.

What do I think about the scalability of the solution?

Scalability is huge for our needs. We are able to use it in our native cloud environment, but we also have external cloud environments of our client servers with very different configurations. The API integration is so smooth with those external client servers that there is never a scalability issue or compatibility issue that we have seen. We have never seen any kind of downtime or crashes, as it has been absolutely very easy to scale. If I am working on twenty different applications today and tomorrow I want to scale it up to fifty different applications, everything can be done easily without any downtime or outages.

How are customer service and support?

Customer support is great. The turnaround time for solution is extremely good. They are available twenty-four-seven with an advanced AI-driven chatbox system, and they are resolving issues within four to eight hours, which is commendable. The customer support system is the foundational pillar of any successful business, and the team has greatly excelled at this.

Which solution did I use previously and why did I switch?

I evaluated Datadog and Dynatrace. Datadog was very highly recommended by most of my peers because of its strong comprehensive observability and unique dashboard systems. Dynatrace was also recommended because of the strong AI root cause analysis. We also checked for new solutions but could not find the best deal with them, so we ultimately switched to Splunk Observability Cloud.

What about the implementation team?

Many features keep adding up every now and then as per different requirements and as per the changing business environment. We request their business team and tech team to do capacity planning or capacity development sessions every now and then so that there is uniform training happening across the ecosystem. Our new incumbents, new learners, and new tech executives are learning those new systems every day, and there is no mishap in understanding. This would definitely enhance user experiences and provide better orientation and better understanding of the systems, processes, and how the application actually functions and what the various utilities are.

What was our ROI?

We have reduced our employees from approximately ten to twelve people working in this vertical to five. We have reduced our operational expense by forty percent, and we have reduced our operational burden by nearly ten percent in the form of multitask management, which was done by human intervention or manual intervention. We have been able to save a great deal of money, and our profits have increased by twenty percent. Initially, even after one year of deployment, we were in profits.

What's my experience with pricing, setup cost, and licensing?

The pricing and initial setup cost were a bit pricey for us. However, we have done a lot of negotiations with the business team, and now we have gotten a reduction of approximately ten to fifteen percent. Their licensing has annual renewal, so we are doing every year SLA agreements with them and renewing it.

What other advice do I have?

I would definitely give Splunk Observability Cloud a nine out of ten rating. The unique strengths include strong application monitoring infrastructure, a very comprehensive observability environment, and a very powerful native cloud environment. There are strong dashboards for real-time visibility twenty-four-seven, and it is suitable for large enterprises. I would say it is best for large enterprises because their personalization and customization is extremely good and suited to the requirements and needs. Unlike other observability cloud applications, this is very advanced, and AI root cause analysis keeps happening throughout, due to which even complex IT ecosystems or complex integrations are handled very easily. There is full stack monitoring happening, and there is excellent log analytics, which is actually helping us a lot to make faster and better data-driven decision-making. Splunk Observability Cloud is extremely reliable and an extremely trusted source, and it has definitely gained public faith and public trust. It is a highly recommended application.

With the small enhancements or improvements regarding integration and doing a lot of training and orientation time and again to make the system more compatible and understandable for all, it could definitely be a ten out of ten.

There is very strong governance and security. The policy processes are very strongly governed. Government cloud ecosystems are very susceptible to any kind of threat attacks, and there are a lot of system bridges built there, with a lot of stake involved. The system is giving a lot of advanced use cases, such as Google Cloud-based applications which are very secure. They are hosting the entire program on another platform and also creating a duplicate of it, due to which there are various strong audit processes that have been inbuilt. There has been real-time observability all the time so that there is no such any problem. All of this is very cost-effective, so it is definitely very strongly compliant with built processes.

Accuracy and reliability are excellent. We are dealing with approximately millions of data every week, and the system runs throughout the day continuously running on those data and bringing data and insights to us. In the past two years, I have never seen any data mismatch or inaccuracy. There is strong trust built in where there is no data leak, no data misinformation, and nothing leaking or any kind of information going out of the system. Accuracy and reliability are very strong features. My overall review rating for Splunk Observability Cloud is nine out of ten.

Shrinkhala Singh

Unified monitoring has transformed drone-based agriculture and has improved real-time decisions

Reviewed on Jun 16, 2026

Review provided by PeerSpot

What is our primary use case?

Our main use case for Splunk Observability Cloud revolves around the agriculture scaling ecosystem, which is heavily dependent on advanced automation in terms of predicting climate resilience, agriculture analytics, and predicting IoT activities, where we check all activities through drones. The drones are controlled through various other application platforms, and we are revolutionizing the Indian agriculture ecosystem by introducing Kissan drone operators, also known as Srishi drone operators, where these drone applications are heavily monitored and evaluated. Various application platforms need unified control and monitoring, which is accomplished through Splunk Observability Cloud. Many activities or loopholes go unnoticed and can cause serious issues later, so bridging those gaps and bugs necessitates the introduction of Splunk Observability Cloud across all ecosystems.

What is most valuable?

Splunk Observability Cloud's monitoring has significantly changed our day-to-day operations and decision-making, especially in drone operation and its monitoring, which is heavily dependent on real-time data insights and technological interventions. Many decisions must be made quickly and in a scalable manner, achievable only through a unified platform that delivers all these aspects. We are currently using several metrics to measure outcomes in terms of increased production efficiency and improved operating efficiency, along with insights into activities undertaken by farmers or data monitoring teams, all of which provide us with a transparent system within the ecosystem. Troubleshooting these issues helps our farmers and engineers prevent performance problems and downtimes.

The core capabilities provided by Splunk Observability Cloud application platform have been well-documented, and it is crucial to note that there are no lapses or sampling errors in our organization's performance monitoring. We achieve 100% accuracy when monitoring application performance and providing data dashboards to our senior management. We have never missed a trace while analyzing transactions across different services offered to farmers. Infrastructure monitoring is equally vital for us, given our multiple servers and multi-cloud environments where various agricultural applications operate simultaneously and data flows seamlessly. All of this has only been possible because of Splunk Observability Cloud. Our digital enhancement experience has improved multifold, especially with the introduction of AI-powered insights over the past two to three years, allowing for pinpoint guidance in detecting anomalies, identifying root causes, and significantly reducing alert fatigue while enhancing overall efficiency.

The best features of Splunk Observability Cloud include full-stack application monitoring, which is very easy to navigate. The platform consistently demonstrates no performance downtime, even on weekends, which bolsters our client's trust and confidence. Predictive analytics powered by AI insights provide us with real-time data matrices and insights, significantly improving our customer experiences and accelerating innovation. Identifying potential threats and issues, such as root causes, has become very straightforward, leading to our immense satisfaction and gratitude towards their responsive business team. Their multidimensional features provide unified security and substantially enhance visibility, which perfectly aligns with the concept of observability.

One standout feature is full fidelity monitoring and proactive troubleshooting, especially with approximately twenty to twenty-five applications concurrently used across multi-cloud environments, managing data transfers and inputs efficiently. I recommend that more flexibility be included in launching applications and features. The database standard integration is incredibly beneficial, as is checking each data layer in a full-stack environment, something which Splunk Observability Cloud handles excellently.

Splunk Observability Cloud positively impacts our organization by significantly increasing overall visibility and observability experiences for the entire team through numerous newly introduced features. Previously, we lacked visibility into query logs, but now we can track and trace these logs effectively for problem identification and troubleshooting. As a result, the reoccurrence of similar issues has dramatically decreased. We now have structured logs and tracking that are amazing, and the user experience, especially for our clients—primarily farmers using less developed Android phones—is vastly improved. The application performance monitoring criteria make navigating the platform easy and clear, allowing us to perform hygiene practices for coding. Our on-premises deployment has proven advantageous in monitoring the health of our cloud environment, and we are recommending this to others. The scalability as we have grown from three to twenty-five platforms has been seamless; our system hasn't crashed, indicating stability.

The metrics from utilizing Splunk Observability Cloud clearly show improvement, especially in downtime reduction. Previously, we faced a systematic performance lag of around twenty to thirty percent, which has now reduced to just two to five percent—an improvement we can credibly showcase to our clientele. We now collect and track traces, query logs, and session data effectively, providing us with credible, quantifiable metrics for assessing business enhancements and current operational stages. Real-time visibility and data fetching for dashboards is an extraordinary addition that distinguishes our experience.

There are several performance enhancement areas for Splunk Observability Cloud. For instance, Splunk Observability Cloud's IT service intelligence core part needs improvements as clients request more IT services performance matrices than the current system supports. Certain matrices are still unnoticed, creating false alarms that require enhancement. We previously used Datadog and other AWS observability solutions that were quite affordable. Currently, smaller businesses struggle to reap the benefits. UI navigation is easy but could use polishing for a better experience. Integration issues arise with some services taking longer than expected to connect properly, which is an area for improvement.

An area needing improvement is the AI-driven anomaly and issue detection system, which occasionally generates many false alarms that consume our time. We also face challenges with metrics not communicating across different measurement platforms, which requires addressing regarding log-specific queries. Additionally, I suggest extending the trial period beyond thirty days to forty-five or sixty days, allowing more time for our team to understand the software's functionalities and business use cases.

What needs improvement?

The accuracy and reliability of Splunk Observability Cloud's outputs have been consistently impressive. No one can question accuracy due to its proven record, as many large organizations depend on it for application performance monitoring. Splunk Observability Cloud excels in troubleshooting cloud applications, and whenever customization is needed, it is smoothly introduced. Overall data insights gathered during critical platform phases are near 100% accurate, with no identified lapses in the data monitoring processes we have employed.

Splunk Observability Cloud significantly enhances our operational performance and company resilience. With automation in place, we enjoy improved customer experiences based on impactful business insights that help our clients make sharper decisions. This solution allows us to project future performance accurately and identify data anomalies while managing incoming threats effectively. The integration of this solution is straightforward and open-source, enabling users with basic knowledge to adapt without difficulties. We have also eliminated other monitoring solutions, consolidating everything onto one platform for greater efficiency.

For how long have I used the solution?

I have been using Splunk Observability Cloud specifically for approximately two to three years, utilizing the platform for monitoring and automating the agricultural activities in our various ecosystems.

What do I think about the stability of the solution?

Splunk Observability Cloud is stable and scalable, effectively monitoring and supporting real-time operational needs. The system's steady performance includes customization capabilities tailored to different organizations, enhancing transparency across all systems while remaining highly reliable.

How are customer service and support?

I would rate customer support for Splunk Observability Cloud a perfect ten out of ten. Their team is available twenty-four-seven, with ideal resolutions usually achieved in one to two days, and even complex issues resolved within a week. The accuracy in addressing tickets is commendable, ensuring efficiency in problem-solving.

Which solution did I use previously and why did I switch?

Previously, we used other solutions, including Grafana and Signos for around two to three years before deploying Splunk Observability Cloud.

How was the initial setup?

Pricing and setup costs were reasonable, initially about a hundred to a hundred twenty dollars for a variety of additional features along with the regular offerings. We increased from fifteen to eighteen dollars per month per user, which remains affordable and manageable. Licensing is renewed annually without significant issues over the past three years.

What about the implementation team?

Before deciding on Splunk Observability Cloud, we evaluated other options, particularly Absolute Fleet, which is regarded as stable and scalable. During deployment, we also tested Signos and Datadog, well-known for observability and security platforms offering comprehensive log management.

What was our ROI?

We have seen a return on investment with Splunk Observability Cloud. The solution's affordability and high metric index create substantial benefits for our observability needs, allowing easy configuration and automatic adjustments without requiring excessive API usage. We have saved considerable amounts of money, reducing our expenditures from around three to four crores to approximately one to one point two crores. Our application performance monitoring has shone brightly, leading us to exceed targets and numbers that were previously unattainable.

What's my experience with pricing, setup cost, and licensing?

For our organization, it is crucial that Splunk Observability Cloud provides end-to-end visibility into our cloud-native environments, especially given the government's audit parameters in India. Splunk Observability Cloud's global ratings above 4.3 reflect its excellent service in providing insights across various infrastructure layers. Customization becomes attainable when our team utilizes personalized navigators for better visibility, particularly regarding sensitive user data. Their monitoring does not retain information and maintains confidentiality, adhering to data protection policies, which has contributed positively to our experience.

Which other solutions did I evaluate?

When assessing Splunk Observability Cloud regarding our organization's growth, the initial client base of four to five has expanded to twenty to twenty-two, forecasting further growth shortly. Our turnover time has decreased significantly, allowing us to perform better practices and enhance confidence in our scaling efforts. Our cloud monitoring team feels empowered due to the accuracy in reporting, satisfying clients and promoting a win-win scenario for everyone involved.

What other advice do I have?

Splunk Observability Cloud enables us to transition our team from repetitive tasks to focusing on critical business initiatives. Instead of spending time on trivial activities fetching dashboard reports, our team can now concentrate on creating strategies for the upcoming months and quarters, maximizing the utility of our human resources. This shift has also fostered a research-oriented approach, allowing us to explore advanced cloud infrastructure options beneficial for our ecosystem.

The out-of-the-box dashboards and detectors in Splunk Observability Cloud are exceptionally advanced and enable us to integrate various platforms without experiencing downtime. This maturity facilitates a unified integration approach tailored to our needs and reflects the team's understanding of user cases. The system effectively mitigates challenges faced by other software during integration.

Since introducing Splunk Observability Cloud, the mean time to detect issues has certainly improved, allowing us to identify potential threats and cyber security issues proactively. Before the implementation, identifying issues took a month; now, we can recognize red flags twenty-four to forty-eight hours in advance, facilitating timely strategic adjustments across teams.

My advice for anyone considering Splunk Observability Cloud is to deeply explore the product page before obtaining a license. Understand the features available, and engage with the customization team to address any inquiries. Familiarize yourself with on-their-top guided workflows that clarify processes and enable informed decisions. I would rate my overall experience with this product an eight out of ten.

AmanThakkar

Real-time observability has reduced manual troubleshooting and now optimizes AI workloads

Reviewed on Jun 15, 2026

Review from a verified AWS customer

What is our primary use case?

Splunk Observability Cloud is primarily used for application latency measurement, CPU usage monitoring, and memory usage tracking. It essentially functions as a system monitoring solution.

Splunk Observability Cloud is being used to monitor and optimize AI applications. Every AI model is on one server using Ollama, and Splunk Observability Cloud is deployed to track which AI model is being used more, which is not, at what time, and which prompt has been heated. This integration allows for detailed monitoring and optimization of AI workload distribution.

Previously, issues such as identifying which instance was using more CPU or which was using more GPU were solved manually. With Splunk Observability Cloud, AI assists in these tasks automatically, allowing the team to focus on bigger issues.

The engineering team primarily deals with alerts that come in. The team tries to solve these issues using AI and other tools provided by the platform.

What is most valuable?

What I like about Splunk Observability Cloud are mainly the real-time dashboards. I am getting real-time usage of my CPUs and everything. It also provides end-to-end visibility with the system. I am getting to know what application is using which CPU, everything. Metrics have been set in our system, so I can get very end-to-end visibility into it.

Out of the box, the solution's dashboards and detectors were very helpful. During our last sale on Good Friday, one of my CPUs was being used with very high latency. We got very high alerts from it because we had set that up. Because of it, we are very grateful. We solved that issue in a very short amount of time, and the sale went live. It is basically used to reduce manual work. Earlier, we were not using this platform, so we had to find which CPU usage had increased and which had not. It was a very manual and messy process earlier.

My impression of the No-Sample Tracing feature in Splunk Observability Cloud is that we are collecting from Cribl and we are getting data from it. We have set every metric on the cloud and in Splunk, and it helps to showcase everything in real time.

The AI-powered analytics and guidance provided by Splunk is very good. The AI part is what I was expecting earlier because it was very messy. There is not a lot of information about Splunk in the market, so we required an AI for this. AI-powered analytics help identify any anomalous activities or any spark on any platform and solve issues automatically, resulting in good management of high latency and resource storage.

End-to-end visibility into my cloud-native environments is very useful. Earlier, I had to solve these things manually. I had to check every instance, I had to check every monitoring system, which has been blasted and which has been corrupted. With this solution, I am able to manage it effectively.

What needs improvement?

I would like to see a very detailed tutorial about it and how any newcomer can be able to use it. If there is any tutorial or other resources available, it could be better to use it.

I do not have any missing features that I would like to see included or enhanced in it. We have not faced any technical issues right now.

For how long have I used the solution?

I have been working with Splunk Observability Cloud for more than a year, but I have been using it for the last six to eight weeks.

What do I think about the stability of the solution?

Splunk Observability Cloud is very stable. We have been using it for the last six months and it is very stable.

I do not face any downtime because we do not experience that type of issue.

How are customer service and support?

I would evaluate customer service and technical support as a nine. It is very good.

It is a nine out of ten.

How was the initial setup?

The experience with the deployment is easy.

What about the implementation team?

I purchased Splunk directly through Splunk, not any third party.

I have done it through myself.

What's my experience with pricing, setup cost, and licensing?

I find the experience with the pricing aspect, setup cost, and licensing part to be very less. However, the pricing is not coming under my purview, so I cannot be sure about it.

What other advice do I have?

We are developing AI applications, so every AI model of ours is on one server using Ollama. We have integrated everything into it and we are using it as an API. We have deployed Splunk Observability Cloud and everything. This allows us to know which AI model is being used more, which is not, at what time, and which prompt has been heated. It is very good with the AI models.

The main benefits that Splunk Observability Cloud brings to the table are mainly to reduce the time and to reduce any manual work.

Splunk Observability Cloud has helped to improve my operational performance because it is useful for my company. It helps to solve issues, reduce the manpower, and reduce the man-time to solve it. It is very useful.

Mean time to detect has worked very well with our small application. We have our small application in the cloud, in an AWS instance, and we have connected everything with this. It works very well with that as well. It is working very well even in a very small application.

Previously, we had to solve problems manually, figuring out which instance was using more CPU or which was using more GPU. But with the help of this AI, we have come to know that we do not have to worry about the small things. The AI is solving those things on its own. We just have to focus on the bigger issues and the bigger picture. It is very good.

Overall, I assess Splunk Observability Cloud for helping my organization scale as very helpful because it is very useful and very impactful in my organization.

My impressions of Splunk Observability Cloud for helping my organization focus on its business-critical initiatives are positive. It is very useful because we can have more focus on what the issue is rather than finding what the issue is.

If you are using servers or any cloud, I highly recommend Splunk Observability Cloud. Because of it, we are able to find any issues in the server directly. I highly recommend Splunk Observability Cloud to any organization if they are using servers.

I assess this product with an overall rating of nine out of ten.

Which deployment model are you using for this solution?

On-premises

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Dhaval Bhalgamadiya

Real-time dashboards and AI-driven insights have reduced incident resolution time significantly

Reviewed on Apr 30, 2026

Review from a verified AWS customer

What is our primary use case?

In our organization, we are using Splunk Observability Cloud for real-time monitoring and troubleshooting of our applications and the infrastructure performance, tracking metrics such as CPU usage, memory, latency, and the services of different microservices which we run for our applications and products.

What is most valuable?

The best features from Splunk Observability Cloud include the high-level dashboard for clear visibility of our infrastructure and the product, as well as the detailed traces for the request flow of our APIs and the in-between application communication. From the detailed traces, we can know where our application fails, allowing us to solve incidents very easily, which has drastically reduced the MTTR of our application.

I find the out-of-the-box dashboards very helpful. Although we have not done much customization yet, the out-of-the-box dashboards and detection capabilities include pre-built dashboards for common services and infrastructure components. We have not used them extensively, but we customize them for our organization's needs, and we also adapt the detectors for alerting purposes.

I find the AI-powered analytics very helpful because we have also used other observability platforms such as SignalFX, where the AI-powered analytics is not built into the application. Here, the AI provides intelligent insights and very early anomaly detection and pattern recognition, automatically informing us of highly unusual behavior in the application before any incident or outage occurs during production.

What needs improvement?

One area that has room for improvement is the pricing; as I mentioned, it can be expensive due to large data volumes. Also, the pricing can be unpredictable, and if it were more predictable, the organization would be more comfortable with it. Additionally, I found the learning curve quite steep when I started using Splunk Observability Cloud; it took me some time to learn it. I also think that while our team is large enough to utilize it, smaller teams might not prefer this solution.

We have not started customizing Splunk Observability Cloud yet according to our needs, but we plan to in the next weeks. We have used the basic customization features, and I believe it is customizable.

For how long have I used the solution?

I have been using Splunk Observability Cloud for the last one year; I have joined my recent organization from the last three to four months, where I have been using it from the last three to four months.

What do I think about the stability of the solution?

The stability and reliability of Splunk Observability Cloud is top-notch, as we have not faced much downtime, so I would rate it nine.

What do I think about the scalability of the solution?

The scalability of Splunk Observability Cloud is also very good; we can ingest any data we desire, so I would rate that nine as well.

How are customer service and support?

I rate the technical support as very proactive, and our doubts and queries are resolved properly, so I would give it a rating of five.

Which solution did I use previously and why did I switch?

Before using Splunk Observability Cloud, we had used SignalFX and considered vendors such as Datadog and New Relic. We chose Splunk Observability Cloud because of its vast features, the visibility we gain from the dashboard, the AI integrated into the platform, detailed traces, and logging capabilities. While Datadog and New Relic are also good, Splunk Observability Cloud is better in certain areas.

How was the initial setup?

The deployment part was handled by the other developers and ops engineers in my organization, but I know the initial setup for Splunk Observability Cloud is simple and very easy.

What about the implementation team?

The deployment part was handled by the other developers and ops engineers in my organization.

What was our ROI?

From an ROI perspective, Splunk Observability Cloud offers much higher value because, as I mentioned earlier, our MTTR has reduced by more than 50%, which decreases the overall downtime for our application. When there is an outage, the time to resolve is shorter, and application uptime has also increased because of it. This improvement is the main reason for using Splunk Observability Cloud; we wanted to decrease our application downtime. Additionally, the visibility provided by the dashboard helps us understand where our application has failed.

Which other solutions did I evaluate?

What other advice do I have?

I have not used the no-sample tracing feature yet, so I am not sure about that.

I would say it takes around one month to learn Splunk Observability Cloud; it varies from person to person, but that was my experience in learning all the features and use cases our organization employs.

Our company is not deeply involved in LLMs and GPUs for AI applications; our applications mainly run on normal Java processes on standard servers, not on GPUs or LLMs yet. We are in the process of developing our capabilities in AI later on.

We are using normal servers as a cloud-based solution, but we still have some drawbacks, mainly the pricing part, as smaller teams may not find it suitable, and the pricing model is complex while the learning curve is steep, particularly for the SignalFlow query language.

My advice for anyone considering this solution is to opt for Splunk Observability Cloud without any hesitation, as it can drastically decrease the mean time to resolution and mean time to detect any issues in their applications. The overall visibility of the organization, including application usage and memory metrics, is clearly presented on the dashboard, allowing insights into what went wrong and when. Although the learning curve can be challenging initially, users will adapt and find it very beneficial for their organization.

I would describe the pricing as neither too high nor too low; however, if it could be cheaper, it would be beneficial for us since sometimes due to large data volumes, it can be expensive for the organization to track large datasets, as it charges for large volumes of data. Sometimes it can be costly if the data we are receiving is irrelevant.

Our organization has between 200 to 500 people, and I believe that more than 100 people are using Splunk Observability Cloud, including developers, ops engineers, security engineers, and others. I am not certain of the exact number, but it is definitely more than 50.

I would rate this product overall at a nine.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Ashutosh Parmar

AI-driven observability has reduced resolution times and improves real-time monitoring

Reviewed on Apr 17, 2026

Review provided by PeerSpot

What is our primary use case?

I mostly work with the performance metrics of the CPU, or host metrics, as well as application metrics and traces. Overall, I use these mostly for real-time monitoring based on the application to track application performance.

For the monitoring of infrastructure, it is quite insightful because in-depth, I can see what is going on in the infrastructure. If something goes down or some crons fail inside the infrastructure, the alerts are quite helpful for more visibility on the cloud-native side.

This is quite helpful for improving the application observability and the infrastructure side as well. I would rate observability above an eight.

I am not that much involved in the business side because I work as a DevOps engineer, so I do not know how much it helps on that front. However, it helps in tracking traces and metrics quite generously well and helps us improve the application side for more reliability on the business side.

What is most valuable?

It is very helpful and really enhances the AI-powered analytics, which helps us for troubleshooting the application and to get more insightful information while troubleshooting application error rates.

AI-powered guidance is really helpful because it provides more actionable insights and highlights anomalies automatically. I do not need to go through it manually, and it also helps us with smart alerting and recommendations.

It helped operationally because due to the insights of the applications, I get more insight for our application to enhance it further. It detects anomalies and correlates data while guiding us to the root causes, so we can enhance our application accordingly.

I have seen that mean time to resolution was reduced around 30 to 50 percent. The main reason for this combination is because of real-time monitoring and AI-powered anomaly detection and distributed tracing. Instead of manually checking the logs and metrics across multiple tools, the platform quickly highlights the issues, correlates data, and points us towards the root cause.

After implementing Splunk Observability Cloud, there was a deep learning curve for the new tool. It took one or two months to get proper insights from it. After configuring, I have seen that it is very useful for tracking traces and metrics of our application, servers, and clusters. Adoption time is usually after two months, or after a few weeks of getting Splunk Observability Cloud.

Splunk Observability Cloud is highly effective in improving digital resilience. Real-time visibility and proactive alerting and fast root cause analysis, distributed tracing, and AI-driven insights enable anomaly detection, which allows us to quickly understand failures and recover faster. This is critical for maintaining system availability and helps us handle failures in complex distributed environments since we can see how services interact and where breakdowns occur.

What needs improvement?

Regarding features, it helps us for better understanding of how the application works and in-depth tracking of application monitoring.

It can be more enhanced using additional AI power. I can get more reliability using AI because AI-driven guidance is more useful nowadays. It can really improve more on the AI side because it will help us to reduce manual intervention with the system and root cause analysis will be much better with AI over human analysis.

I would say that it is quite helpful, but for different kinds of applications, it could be improved because sometimes it might provide a cloud judgment of the root cause analysis. I need to do manual intervention using a dedicated human for root cause analysis for better understanding of the root cause. This is how the agentic side can be improved.

For how long have I used the solution?

I have been working with Splunk Observability Cloud for around a year.

What do I think about the scalability of the solution?

It is quite scalable. Right now, it is providing much better insights and can be more enhanced over several aspects. I would rate scalability an eight to eight point five.

Which solution did I use previously and why did I switch?

I have tried other solutions, but they were not that great in terms of functionalities and overall performance. Splunk Observability Cloud is much better than the others because it provides AI alongside the solution. This is very helpful due to the AI-driven solutions and guidance for root cause analysis. Splunk Observability Cloud goes through the details of application traces and metrics in depth, so I get better observability over the application. This is why I have preferred Splunk Observability Cloud over other monitoring tools.

I have tried SignalFx, but it was not quite insightful. I have tried Splunk Observability Cloud over SignalFx.

What other advice do I have?

Splunk Observability Cloud is quite insightful and helpful for improving the observability side. I provide this solution an overall rating of eight.

Aman Dhanesha

Monitoring has reduced API latency and now predicts issues across our cloud infrastructures

Reviewed on Apr 16, 2026

Review provided by PeerSpot

What is our primary use case?

I mainly use Splunk Observability Cloud to monitor the performance of our cloud-native infrastructure. Because we have created multiple infrastructures, we use it to handle and monitor everything.

Splunk Observability Cloud helps us manage latency across any of our projects and APIs. It is particularly valuable for detecting issues before they occur. We can predict features and errors in advance. Recently, we discovered problems in seven of our APIs that we were able to solve because of this predictive capability.

What is most valuable?

The best feature of Splunk Observability Cloud is that I can identify the root cause of any problem, including API latency. The real-time alerts and smart alerting system are exceptional, allowing me to know what is happening in real-time.

Detectors in Splunk Observability Cloud are very useful, and I have recently used them with great results.

Regarding the no-sample tracing feature, we collect multiple data from various sources. This feature is very useful since we recently shifted to it, and it is working very well.

The AI-powered analytics that Splunk provides allows me to get a smart analyzed version of any report.

Splunk Observability Cloud has greatly impacted our operations by reducing timing requirements. We get smarter solutions and overall use cases in a smart way. I have reduced our manpower requirements and time commitment significantly. Splunk Observability Cloud reduces our mean time to detect by approximately one to two hours.

The LLM in Splunk Observability Cloud is very powerful, and the vector database infrastructure is excellent. This is why we switched from our previous tools, and I believe it was a very good decision that has resulted in better outcomes.

What needs improvement?

The AI-powered analytics that Splunk provides delivers a smart analyzed version of reports, and it is quite good, but it is very generic. The issues identified could be better addressed through deeper AI thinking to provide a more effective solution.

For how long have I used the solution?

I have been using Splunk Observability Cloud for more than eight or nine months.

What do I think about the stability of the solution?

Splunk Observability Cloud experienced a significant outage recently when it went down for approximately five to six hours. This impacted us considerably because we were actively working during that time.

How are customer service and support?

I would rate the technical support for Splunk Observability Cloud as 9.5 out of 10 because we received their support during our deployment. They were very helpful in assisting us to create a good infrastructure.

Which solution did I use previously and why did I switch?

I find Splunk Observability Cloud to be very good. I previously used DataDog for observing everything, but Splunk Observability Cloud is more accurate and a better solution.

What was our ROI?

Previously with other applications, analyzing and controlling our API latency required almost five to six hours a day of resources. With Splunk Observability Cloud, I only need to allocate one to two hours maximum per day to accomplish the same tasks.

Which other solutions did I evaluate?

I highly recommend Splunk Observability Cloud. If you are using any other third-party tool, Splunk Observability Cloud is significantly better than the alternatives.

What other advice do I have?

I highly recommend creating better documentation for Splunk Observability Cloud. This documentation could be integrated with AI to provide specific use case solutions so that users do not have to search through Splunk documentation every time. Instead, users could directly ask about the issues they are facing and receive targeted solutions. My overall review rating for Splunk Observability Cloud is 9 out of 10.

Udit Parekh

End-to-end tracing has transformed how we detect failures and optimize critical transactions

Reviewed on Apr 08, 2026

Review from a verified AWS customer

What is our primary use case?

Our primary use case for Splunk Observability Cloud is to monitor our infrastructure and applications, and it helps us troubleshoot issues related to any failures.

What is most valuable?

The feature we appreciate most about Splunk Observability Cloud is their distributed tracing. We also value the ability to create real-time dashboards and their alerting system is exceptional. The main best feature of that observability is their distributed tracing.

We are very satisfied with the out-of-the-box dashboards and detectors in Splunk Observability Cloud. In distributed tracing, we have banks as our clients, so if anything goes wrong with transactions, we directly go to the trace and troubleshoot those issues faster.

The AI-powered analytics and guidance in Splunk Observability Cloud is very useful. You can observe your LLM models and monitor the usage of your APIs in that cloud.

Splunk helps improve our operational performance and resilience significantly. Before we used Splunk Observability Cloud, if any failures occurred, we had to go to servers and check all the log files to find the failure. Now in Splunk, we go to that single dashboard and filter with the timestamp of failure to directly find the log, allowing us to troubleshoot issues faster. In terms of optimization, before using Splunk, we could not measure why our API was taking 100 ms, but now through distributed tracing, we can see where the bottleneck of that API is. If that bottleneck is the database, we optimize our database queries, and our application is now optimized.

Splunk Observability Cloud has reduced our mean time to detect by approximately 25 to 30 percent because it offers real-time monitoring and intelligent alerting, allowing us to troubleshoot issues faster and enhancing detection by approximately 30 to 40 percent.

What needs improvement?

In terms of pricing, I have one issue with Splunk Observability Cloud. In a large-scale organization, it does not have features such as cost optimization or budgeting for observability spend. I think they need to improve that so that I can optimize our observability. For instance, if our thousands of server applications are running, I should be able to set a budget, such as only spending $100 per month for a specific environment. They need to introduce that feature because it is very important for budgeting.

In terms of areas for improvement in Splunk Observability Cloud, the first is cost budgeting. The second is that they have many integrations, but if you are new to Splunk or new to observability, you must dive deep into more concepts. They can improve user-friendly features so that new users can set up their observability in their environment more smoothly. I think they need to improve in that integration part so that end users can onboard their infrastructure or applications very effectively.

I would appreciate more simplicity in the platform.

For how long have I used the solution?

I have been using Splunk Observability Cloud for the past eight or nine months.

What do I think about the stability of the solution?

I rate the stability of Splunk Observability Cloud as ten out of ten because it is very stable, especially since we are using their cloud environment, and Splunk Observability Cloud is built for cloud-native systems.

What do I think about the scalability of the solution?

We have not explored enriching data with custom metrics in Splunk Observability Cloud because their ready-to-use dashboards are well designed, and every organization can benefit from them. However, if you have a very large organization with over ten thousand servers running applications, you may need to build a team to create custom metrics for your specific use case.

How are customer service and support?

I would rate their technical support in Splunk Observability Cloud a nine.

Which solution did I use previously and why did I switch?

I have used other vendors such as Elastic Stack and Grafana Stack, but in Splunk Observability Cloud, there are so many integrations and useful features that no other vendor can offer. In Grafana, the logs and tracing features are almost nonexistent. You can use Grafana only for monitoring your infrastructure, but Splunk provides end-to-end visibility with infrastructure monitoring, tracing, and overall observability of our application.

How was the initial setup?

Deploying Splunk Observability Cloud is an intermediate task for new users, but if you have been in this space for one or two years or longer, then it is easy to deploy their products.

It can take up to one week to deploy Splunk Observability Cloud.

What other advice do I have?

We are not using the NoSample tracing feature in Splunk Observability Cloud.

In our organization, we have approximately 25 to 30 users using the solution daily.

We do not require any maintenance for Splunk Observability Cloud since we are using their cloud solution, which means that all patching and updates are done by them.

I recommend Splunk Observability Cloud to other organizations because we are currently saving our engineers time by 20 to 30 percent, and for infrastructure alerting, we can use it to ensure that servers will not go down. Every organization should use this because it will reduce your engineering team's effort and the downtime of your application, and in terms of any failure or APIs, you can troubleshoot your issues faster.

End-to-end visibility into our cloud-native environment is very important. If an organization is building a SaaS or B2B software, then end-to-end visibility is crucial in terms of security, failures, and compliance. The end-to-end visibility of our infrastructure and applications is extremely important.

I recommend Splunk Observability Cloud to every user because they offer trials. If you do not just read the reviews, you should try it out. Understanding the biggest features and why others are using it can be beneficial, and I always recommend Splunk Observability Cloud for end-to-end visibility in your application.

I gave this review an overall rating of ten out of ten.

Nishith Joshi

Real-time monitoring has improved performance tracking and has simplified analyzing complex metrics

Reviewed on Mar 30, 2026

Review from a verified AWS customer

What is our primary use case?

I work in data analytics with experience in monitoring systems and working with large-scale data. I have used Splunk Observability Cloud in the context of real-time monitoring and performance tracking.

Splunk Observability Cloud works well alongside Splunk Enterprise for logs and integrates with cloud platforms and monitoring tools. It is often used together with other observability solutions. The tracking metrics such as latency, error, and throughput are easily visible. I can also build dashboards for real-time visibility.

We use Splunk Observability Cloud to track latency metrics and identify where slowdowns are happening. We have visualized response time trends and quickly detected performance degradation. We have also used it for infrastructure monitoring. Over the past six months, we have been monitoring metrics such as CPU usage and memory. If there is unusual usage, we identify it quickly using this tool and take action before it impacts our performance.

What is most valuable?

Splunk Observability Cloud has optimized our solutions and helped us understand the metrics. The AI-powered guidance in Splunk Observability Cloud helps us identify patterns and anomalies in system performance data. Instead of manually going through a large volume of metrics, it highlights unusual behavior and potential issues automatically. This makes it easier to detect problems early and understand where to focus, especially in complex systems.

There is definitely log analysis and dashboards. Log monitoring and dashboards have been better using Splunk. Splunk Observability Cloud is the best tool for log monitoring and dashboards. Splunk Observability Cloud feels more focused on real-time metrics and performance tracking compared to some other traditional log-based tools.

What needs improvement?

The learning curve for understanding all features should be improved, and the cost can increase. Splunk Observability Cloud is very costly. Cost is one of the drawbacks.

Sometimes too many alerts, if not configured properly, is a major drawback that could be improved.

The prices are quite high. As I have mentioned earlier, we are Splunk partners, so this has been handled by my other team. However, for other companies and small startups, the prices are very high for them to use Splunk Observability Cloud. Price is a concern.

For how long have I used the solution?

I have been working with Splunk Observability Cloud for the past six to eight months.

What do I think about the scalability of the solution?

We have expanded our team and usage. We are scaling up right now from ten people to twenty-five or thirty. Over time, I expanded my usage by going through basic monitoring and exploring things like setting up custom dashboards. We have gradually expanded our usage from setting up dashboards and alerts.

How are customer service and support?

For customer service, I would rate them eight out of ten because whenever we raise a support case, they are always available for us.

For Splunk real user monitoring, implementation took time because our engineers tried very hard. In case of support, there should be more engineers specifically for this case.

Which solution did I use previously and why did I switch?

We have used different products like Palo Alto and Cribl before moving to Splunk Observability Cloud. As we got a partnership, we have shifted to Splunk Observability Cloud.

What was our ROI?

The information is confidential and I cannot share specific details. However, I can tell you in percentage that fifty to sixty percent of our work has been easy to identify in terms of performance metrics and performance using Splunk Observability Cloud.

It has saved us thirty to forty percent in cost because we used some other tools before that were more costly. As we are Splunk partners, we obtained Splunk Observability Cloud, and our costs have been reduced by thirty to forty percent using this solution.

What other advice do I have?

My overall impression of using Splunk Observability Cloud is that it is a strong tool for real-time monitoring. It does take some time to get fully comfortable with all the features. We have not explored everything right now, but in the future, we are looking forward to using more features.

A part of the implementation has been handled by my other team. I have explored using custom metrics to enrich observability data, mainly by adding application layer or business-related metrics alongside system metrics. I have used custom metrics in a limited way to add more context to monitoring, such as tracking application-specific metrics alongside system data.

Dashboard customization in Splunk Observability Cloud is quite flexible. We care about metrics in different types of visualization, and it helps us organize them in a way that makes sense for monitoring. It allows us to build dashboards tailored to specific use cases. This makes it easier to monitor system performance and quickly identify issues without going through unnecessary data.

The integration in real user monitoring from Splunk Observability Cloud is actually better than from some other tools. If you are looking for the best SIM tool, then Splunk Observability Cloud is for you. If you have funds and capability for the cost, then Splunk Observability Cloud is definitely the best tool you can use.

I have given this review an overall rating of nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Jigar Hirani

End-to-end tracing has improved monitoring and now reduces downtime with proactive alerts

Reviewed on Mar 27, 2026

Review provided by PeerSpot

What is our primary use case?

My experience with Splunk Observability Cloud involves monitoring infrastructure, application performance monitoring, and real-time alerting. Although I am no longer working with Splunk Observability Cloud due to a recent position change that occurred approximately two months ago, I previously monitored servers, containers, Kubernetes, application performance, and Docker images. In terms of monitoring, I tracked response time, error rate, and latency. This capability helped in identifying performance issues or infrastructure issues before users were impacted. For instance, if Kafka failed, we knew about it before users experienced an impact and could resolve it before it caused maximum damage to our systems. I also used dashboards and alerts to monitor critical services and received notifications whenever issues arose.

The features of Splunk Observability Cloud that I found most valuable included application performance monitoring and distributed tracing, particularly when monitoring distributed systems or applications. Real-time alerting and Kubernetes monitoring were essential since Kubernetes is quite complex. I could effectively monitor Kubernetes using Splunk Observability Cloud. Additionally, the Smart Attack Detector, which I tried at the last moment, was a good feature, although I did not work extensively with it. The Log Observer was very fast and reliable, and the dashboards provided good visualization for troubleshooting and monitoring. If there was a network outage, I received notifications very quickly.

What is most valuable?

Splunk Observability Cloud helped me detect performance issues faster and reduce downtime in my organization. Earlier, I had limited visibility into my application performance. After implementing observability, I could see end-to-end transaction tracing and quickly identify where issues arose, which reduced troubleshooting time and improved overall application stability and availability for our customers and systems. This capability also helped in proactive detection.

What needs improvement?

I believe that areas of Splunk Observability Cloud that could be improved include the initial setup and instrumentation costs, which take more time for APM. Some dashboards and detectors require tuning, and I think the visualization needs enhancement. Additionally, alert noise remains an issue, and we need suppressions for when systems go down for short periods. Better integration with third-party tools and easier onboarding of data would also be beneficial.

What do I think about the stability of the solution?

When evaluating the stability and reliability of Splunk Observability Cloud, I can confirm it has been reliable. I would rate it eight out of ten for reliability.

What do I think about the scalability of the solution?

Splunk Observability Cloud scales very well with the growing needs of my organization. I can demonstrate the scalability of our system to our customers, which is advantageous for business. This capability helped us secure business as we provide real insights to customers who were happy to purchase our systems and applications. The ROI has been good for us.

How are customer service and support?

I communicated with the technical support of Splunk Observability Cloud regarding our issues, specifically when I was unable to monitor or set up Kubernetes to monitor our infrastructure. They were able to help us, and we purchased an on-demand call for assistance, which they provided.

How was the initial setup?

I did not participate significantly during the initial setup and deployment of Splunk Observability Cloud, but I was part of the team. I know the process is straightforward. We simply needed to ensure that all data was in the correct format, matched current dashboard setups, and included all necessary fields for insights.

What was our ROI?

My experience with lowering the cost of unplanned digital downtime using Splunk Observability Cloud has been positive, as it helped us significantly. Our system was bottlenecking and consuming excessive resources, but with the ability to detect and resolve that issue, overall system usage was reduced without further bottlenecking.

What's my experience with pricing, setup cost, and licensing?

Regarding metrics or data points confirming performance improvement and resilience, I found that during certain times, we experienced the most significant spike in our systems due to multiple users requesting the same service. We needed to change our overall architecture as we were not scaling adequately, and this was bottlenecking our systems. By observing this from the dashboards, I realized improvements could be made. After implementing the solution, our application's stability improved significantly. I can confidently say our availability improved by forty percent, and downtime was reduced by approximately seventy to eighty percent.

What other advice do I have?

My impression of the No-Sample Tracing feature in Splunk Observability Cloud is that it helped us detect key metrics and real use cases, particularly in tracking and monitoring. I primarily tracked server uptime, application response time, API latency, and similar metrics. Combining these parameters instead of relying on a single factor improved our system. Specifically, I used distributed tracing to understand how requests flowed through our network and how different systems responded, which helped determine if any particular system impacted all our systems.

Regarding the AI-powered analytics and guidance provided by Splunk Observability Cloud, I have not actually used the AI features, particularly with ITSI, as I did not utilize that aspect for observability.

My teams effectively utilized the ability to enrich data with custom metrics in Splunk Observability Cloud. They found valuable insights from our systems and created reports that the application and infrastructure teams used to decide their workarounds and solutions. They developed different solutions, experimenting and improving our systems by relying on observability to understand what happens when we adjust parameters or change configurations.

When evaluating the effectiveness of the out-of-the-box customizable dashboards provided by Splunk Observability Cloud, I note that we mostly used the default dashboards. While we created a custom dashboard to track our overall system flow, we relied on pre-built dashboards for monitoring and representing our business perspective. When we needed to showcase our environment to customers, we demonstrated our scalability and system performance, including response time and downtime, providing insightful details from the dashboards for business use cases.

I would rate Splunk Observability Cloud an eight out of ten, where ten is the best and one is the worst.

RahulMhatre3

Observability has improved anomaly detection and dashboard flexibility but needs simpler licensing

Reviewed on Mar 11, 2026

Review provided by PeerSpot

What is our primary use case?

I work with Splunk Observability Cloud.

What is most valuable?

Splunk Observability Cloud is effective for detecting anomalies and preventing system outages.

There are pre-built dashboards where I can check service centers and monitor spikes in errors and traces. I can also check error logs, and everything is consolidated while providing anomaly alerts in case there is any deviation from the baseline.

The personalized dashboard helps my team. Splunk Observability Cloud has its own query language that can be used to build easy dashboards. Multiple teams can build their own, replicate them, and also have role-based access control, which is beneficial.

The application management feature helps with end-user experiences because front-end monitoring helps track user issues and any back-end issues that may be causing them. It shows how the user experience is overall and identifies any outages. Front-end monitoring is very useful.

What needs improvement?

As an integrator, I think the biggest advantage of Splunk Observability Cloud is because it is part of the Splunk ecosystem, it is good to correlate logs with application data through traces and metrics. Overall, it is an evolving product, not top class, but it is getting there.

I see many good things about the product and many advantages. Regarding the negative side, I think the licensing can be much better because it is based upon host units and there is additional licensing for the number of traces that I can bring in. A simplified licensing model would be much better, similar to what other tools offer. Pricing could be either based upon ingestion or directly based upon host units, rather than multiple different trackers. There are licenses for custom metrics, licenses for the number of traces that I can ingest, and host unit licensing. A better licensing plan would be beneficial.

For how long have I used the solution?

I have been using Splunk Observability Cloud for more than two years.

What do I think about the stability of the solution?

I have not seen any issues with stability. The solution is very stable.

What do I think about the scalability of the solution?

Regarding scalability, I do not think there is an issue with scaling. I have never encountered any issues with that.

How are customer service and support?

Support is good.

Which solution did I use previously and why did I switch?

I have worked on Coralogix, which is also an observability tool. I worked in the product company itself. I have also worked on Dynatrace, and now I am working on Cribl.

How was the initial setup?

The installation and deployment process is somewhat challenging, but there are multiple ways of deployment that give me a lot of options. I would say it is acceptable and not that complicated. I can deploy agents with Splunk deployment server, which is beneficial. However, there is some dependency on the deployment server.

What about the implementation team?

As an integrator, I deployed it and made it workable with OpenTelemetry.

What was our ROI?

I am able to observe significant ROI with Splunk Observability Cloud. When I worked with a previous solution, it was one-third of the cost of Dynatrace, so there was definitely an exceptional return on investment. It helped reduce costs by almost 50%.

What's my experience with pricing, setup cost, and licensing?

Splunk Observability Cloud is affordable. I have visited the PeerSpot website and downloaded reports on Azure, Grafana, and Splunk Observability Cloud.

Which other solutions did I evaluate?

When I compare Splunk Observability Cloud to other vendors, the good part is the branding because the support is good. There is a large community where I can look for known issues. However, experience-wise, DataDog is far more superior and easier to use. DataDog has its own agent for tracing, so I just deploy one trace. With Splunk Observability Cloud, they are dependent upon OpenTelemetry, and there is a learning curve because it is open source. The onboarding is not as smooth as DataDog or Dynatrace.

What other advice do I have?

I deploy both on-cloud and on-premise options for clients. I have deployed Splunk Observability Cloud on Splunk Cloud. I have not used threat detection because there is a separate tool for it. I have not deployed a solution on AWS Cloud or purchased it from AWS Marketplace in my career. I would rate this review 7.5 out of 10.