Blog

Announcing Liberty Guardian: Automated Monitoring and Self-Healing Infrastructure for OpenStack

Automated Monitoring and Self-Healing Infrastructure for OpenStack with a cloud, servers, and shield icon showing automated recovery.

Liberty Center One has released Liberty Guardian, a new automated monitoring and remediation platform built directly into its OpenStack cloud environment. This enhancement introduces real-time health monitoring, automated recovery workflows, and expanded infrastructure visibility across cloud and external systems. 

These capabilities provide IT teams, engineers, and MSP partners with faster issue resolution, reduced operational overhead, and greater control over infrastructure performance without relying on manual intervention. 

Announcing Liberty Guardian: Automated Monitoring and Self-Healing Infrastructure for OpenStack

Liberty Center One is proud to introduce Liberty Guardian, a powerful new feature within the OpenStack cloud platform that brings automated monitoring and self-healing infrastructure directly into the LCO.Cloud environment. 

Built on OpenStack architecture, the Liberty Center One platform continues to evolve alongside its broader cloud solutions, delivering increased control, automation, and operational efficiency for IT teams, engineers, and MSP partners. 

Liberty Guardian shifts monitoring from a reactive process into an automated system that detects issues and resolves them in real time. For many IT teams, monitoring still means waiting for something to break, responding to alerts, and manually working through recovery steps. This introduces delays, increases operational overhead, and creates unnecessary risk during critical incidents. 

Liberty Guardian eliminates that gap by combining monitoring and remediation into a single automated system. 

What Is Liberty Guardian? 

 
Liberty Guardian is an automated monitoring and remediation platform built into the Liberty Center One OpenStack environment that detects, evaluates, and resolves infrastructure issues in real time. 

Here’s how those capabilities are delivered within the platform. 

  • Automated monitoring that detects and fixes issues in real time  
  • Built-in remediation workflows that reduce manual intervention  
  • Multi-protocol health checks for true service-level visibility  
  • 40+ pre-built monitoring templates for faster deployment  
  • Configurable escalation logic through Watch Groups  
  • Performance metrics for SLA tracking and trend analysis  
  • Flexible alerting and enterprise-grade security controls  

Liberty Guardian: Automated Monitoring That Fixes Problems in Real Time 

The flagship capability of this release is Liberty Guardian, built directly into the Liberty Center One OpenStack environment. 

See how Liberty Guardian detects and resolves issues automatically: 

image

Liberty Guardian continuously monitors infrastructure and responds when services become unhealthy—often before users are impacted. This reduces the need for constant manual oversight and allows teams to focus on higher-value work instead of routine incident response. 

Customers can now: 

• Monitor OpenStack instances 24/7 using configurable health probes 
• Automatically execute recovery actions without opening a support ticket 
• Define structured remediation workflows through Watch Groups 
• Track system health across cloud and external environments 
• Maintain a complete audit trail of all actions 

Instead of alerting your team and waiting for action, Liberty Guardian resolves issues automatically and escalates only when needed. 

How Does Liberty Guardian Work? 

Liberty Guardian operates as an automated monitoring and remediation engine that actively maintains infrastructure health. For organizations managing production environments, this means fewer manual interventions, faster recovery times, and more predictable system performance. 

Liberty Guardian replaces traditional alert-based monitoring workflows with an automated evaluation and remediation process. 

In a typical environment, issue resolution follows a manual sequence: a failure is detected, an alert is generated, a team investigates the root cause, and corrective action is taken. 

With Liberty Guardian, that process is streamlined into a controlled, automated workflow. The platform continuously evaluates system health, executes remediation actions when conditions are met, and escalates only when human intervention is required. 

How Liberty Guardian Monitors and Responds to System Health 

Liberty Guardian continuously evaluates system health and executes actions based on a structured decision process. For technical teams, this creates a consistent and repeatable process for evaluating system health. For business stakeholders, it reduces the risk of delayed response during critical incidents. 

Every monitored instance follows a structured decision flow: 

• Continuous health checks using ICMP, TCP, HTTP/HTTPS, and UDP probes 
• Failure thresholds to distinguish real issues from transient events 
• Condition validation to prevent unnecessary remediation during maintenance windows 
• Automated remediation based on predefined escalation logic 
• Logging and notifications for full visibility and compliance tracking 

image

When a failure meets the defined threshold, Liberty Guardian evaluates all conditions before executing the next action, ensuring remediation is fast, accurate, and controlled. 

Watch Groups: Controlling Monitoring and Remediation Behavior 

Watch Groups are the core configuration layer within Liberty Guardian, defining how monitoring and remediation actions are applied to different workloads. 

Instead of applying a one-size-fits-all approach, Watch Groups allow teams to tailor how Guardian responds based on the specific needs of each environment. 

With Watch Groups, teams can configure: 

• Health probe configuration and monitoring intervals   

• Escalation sequences for automated recovery actions   

• Cooldown periods and remediation schedules   

• Limits on actions per instance   

• Notification routing for alerts and escalations   

This level of control allows technical teams to fine-tune system behavior, while giving leadership confidence that monitoring and recovery processes are consistent across environments. In practice, this means production environments, development systems, and client workloads can all follow different recovery strategies without sacrificing consistency or control. 

Automated Recovery: Structured Remediation in Real Time 

When a system becomes unhealthy, Liberty Guardian does not simply generate an alert; it begins executing a structured recovery process designed to restore service as quickly as possible. 

This process follows a predefined escalation sequence, starting with the least disruptive action and progressing only if the issue persists. 

Typical Recovery Sequence 

• A soft reboot to attempt a graceful restart 
• A hard reboot if the system remains unresponsive 
• Additional actions, such as stopping the instance, to prevent further impact 
• Notification escalation if automated recovery is not successful 

image

Before each step is executed, Liberty Guardian evaluates several conditions to ensure the action is appropriate. 

This includes confirming that the failure threshold has been met, the instance is not in a suppression state, the action falls within the defined remediation schedule, and any required cooldown period has passed. 

By validating each step before execution, Liberty Guardian ensures recovery actions are both immediate and controlled. 

Multi-Protocol Monitoring: Beyond Basic Uptime Checks 

Liberty Guardian monitors actual service health, not just whether a system is online. 

A server may appear “up,” even when critical services are degraded or unresponsive. Liberty Guardian addresses this by validating how services are actually performing. 

It supports multiple monitoring protocols, including ICMP, TCP, HTTP/HTTPS, and UDP, allowing teams to detect issues at the service level before they escalate into full outages. 

This provides a more accurate and actionable view of system performance across environments. 

MONITORING SETUP AND SERVICE-LEVEL VISIBILITY 

Liberty Guardian simplifies monitoring by combining pre-configured templates with real-time performance visibility. 

Teams can quickly apply standardized monitoring configurations across common workloads such as web servers, databases, and container environments, reducing setup time, and ensuring consistency. 

At the same time, Liberty Guardian provides service-level metrics including response times, availability trends, and failure history across multiple timeframes. 

This allows teams to both deploy monitoring quickly and maintain clear visibility into system performance without additional configuration overhead. 

Can Liberty Guardian Monitor Systems Outside OpenStack? 

Yes. Liberty Guardian extends monitoring beyond the OpenStack environment, allowing teams to track on-premises systems, network appliances, SaaS platforms, and external dependencies. 

While automated remediation applies to OpenStack workloads, external systems benefit from monitoring, alerting, and performance visibility, creating a unified operational view across environments. 

What Is Dry Run Mode in Liberty Guardian? 

Dry Run mode allows teams to validate monitoring and remediation policies without impacting production systems. 

When enabled, Liberty Guardian evaluates health conditions and records the actions it would take without executing them. 

This allows teams to safely test configurations, refine escalation logic, and deploy automation with confidence. For teams introducing automation into production environments, this provides a critical layer of confidence before enabling live remediation. 

Liberty Guardian vs Traditional Monitoring Tools 

Traditional monitoring tools detect issues and generate alerts but rely on manual intervention to resolve them. 

Liberty Guardian eliminates this gap by combining monitoring and remediation into a single automated system. 

For a broader look at cloud capabilities and monitoring approaches, explore cloud services and solutions offered by Liberty Center One. 

Why Liberty Guardian Matters for IT Teams and MSPs 

Liberty Guardian changes how infrastructure is managed. 

Instead of reacting to failures, teams can rely on automated systems to detect, evaluate, and resolve issues in real time. 

For organizations, this means: 

• Less downtime and faster recovery 
• Reduced operational burden on IT teams 
• More predictable system performance 
• Improved visibility across environments 
• Greater confidence in infrastructure stability 

For MSPs, this creates a more scalable and consistent way to manage multiple client environments without increasing operational complexity. 

Related OpenStack Features in the LCO.Cloud Platform 

Liberty Guardian is part of a broader set of enhancements within the Liberty Center One OpenStack platform, including automated deployment, disaster recovery, and infrastructure management capabilities. 

Together, these features create a more automated, resilient, and manageable cloud environment. 

Continued Investment in the Liberty Center One OpenStack Platform 

Liberty Guardian represents the next phase in the ongoing evolution of Liberty Center One’s OpenStack platform. 

Liberty Center One continues to invest in automation, performance, and infrastructure control to help businesses reduce complexity and improve reliability. 

To learn more about Liberty Guardian or see how it applies to your environment, contact Liberty Center One at 248-336-7809 or visit www.libertycenterone.com

Frequently Asked Questions About Liberty Guardian 

Q. What is Liberty Guardian? 

 A. Liberty Guardian is an automated monitoring and remediation platform built into the Liberty Center One OpenStack environment. 

Q. How does Liberty Guardian’s remediation process work? 

 A. It evaluates failures against thresholds and executes a predefined escalation sequence including reboot, stop, or notification. 

Q. What is a Watch Group? 

 A. A Watch Group defines monitoring rules, thresholds, escalation actions, cooldowns, and notification routing. 

Q. Can I test Liberty Guardian before enabling remediation? 

 A. Yes. Dry Run mode allows you to simulate actions without impacting production systems. 

Q. Can Liberty Guardian monitor systems outside OpenStack? 

 A. Yes. It can monitor external systems, though remediation is limited to notifications. 

Q. How does Liberty Guardian support compliance? 

 A. It includes audit logs, RBAC, 2FA, and SSO integration for enterprise compliance. 

About the Author 

Jason Huebner is the Managing Director at Liberty Center One.      

Liberty Center One brings decades of experience providing secure cloud hosting and datacenter services for businesses. As a regional IT infrastructure solutions provider, Liberty Center One specializes in data protection, colocation, white-glove cloud migration, and backup and disaster recovery solutions backed by highly skilled professionals ready to support critical business needs.     

Contact Liberty Center One at 248-336-7809 or visit https://www.libertycenterone.com/.     

Facebook
Twitter
LinkedIn
Archives