High-Availability Application Services Platform for Distributed Networking Systems
Element is a suite of high-availability, standards-based middleware services that sit between the operating system and applications. Built atop Enea’s LINX interprocess communications foundation, Element extends the functionality of embedded operating systems, providing advanced distributed communications and management functionality typically associated with large enterprise operating systems
Element provides core services for synchronizing, instrumenting, monitoring, and establishing communications between applications spread across multiple operating systems and processors. Element also provides network supervision, fault management, and shelf management services that make it easy to monitor, repair, configure, provision, and upgrade live systems as they operate in the field.
LINX Communications Foundation
Element’s LINX-based communications provide a foundation for building complex distributed applications that span multiple devices and operating systems. These services make it easy for processes to discover other processes and services, communicate with those processes, advertise service availability, broadcast information to and share information with other processes (publish/subscribe), and monitor other processes for readiness. Element’s system-wide communications services are device and OS independent, providing transparency that enables systems to be scaled and reconfigured without changes to the application code.
Event Logging and Application Monitoring
Element’s event logging and application monitoring services make it easy for developers to instrument their applications, greatly increasing visibility into system operation. Event logging enables application processes to log and report event and state information such as slot/service availability, diagnostics and critical network events such as alarm conditions. Event logs can also be aggregated system wide and archived to persistent media (including remote file systems) for live or post-mortem analysis. Developers and network operators can interactively obtain this information using standard Web browsers.
HA Fault Management
Element’s High Availability Framework (HAF) delivers all of the services required to build a true non-stop computing platform. The HAF provides full fault management, including monitoring, detection, recovery, and reporting, for every resource in the system. The HAF employs active heartbeat monitoring and reactive error detection schemes to ensure the health of key hardware and software components at the system, slot and application levels. The HAF also provides a lightweight state checkpoint service, which preserves state information in the event of failover to a redundant blade.
HA Shelf Management
Element also provides a comprehensive suite of shelf management services for configuring and monitoring individual slots, blades and chassis. Element works with shelf management controllers to detect and configure blades, track revision numbers, and monitor key blade-level parameters like temperature, voltage and fan speed. Element also supports alarm management and hot swap, which enables individual blades to be inserted and removed from a live chassis.
Supported Platforms
- OSE™
- RedHat Enterprise Linux 4.0
- Fedora Core 4.0
- Monta Vista Linux 3.1/4.0
CPUs:
- Intel Architecture (x86 and compatibles)
-
PowerPC