CUC 2004 F3 Zdenko Škiljan, Branimir Radić - Monitoring Systems: Concepts and Tools
Every computer system has to be systematically supervised on account of recognition critical circumstances that need troubleshooting, system/application tuning or in the end, upgrade of a system. During years on UNIX/Linux platform there has been developed great deal of tools for that purpose. I this presentation an overview will be made of both traditional tools for monitoring UNIX/Linux systems and complex tools for monitoring distributed systems. Among traditional tools, different set of tools according to their specific field of usage will be taken in consideration; basic system monitoring tools, system integrity monitoring tools, system performance monitoring tools and services activity monitoring tools. Very important role in long term diagnostics and decision making have visualization of data collected through monitoring system, so the most prominent solutions in that area will be commented. In the end, special attention will be paid to the concept of cluster monitoring and its fundamental principles. Some consideration of active response and job distribution based on of monitoring systems will also be made.