CAST
latest
  • Cluster System Management (CSM)
    • User Guide
    • Installation and Configuration
    • APIs
    • Database
    • Infrastructure
    • Inventory
    • External Integration
    • Tools
  • Burst Buffer
  • Big Data Store
  • Releases
CAST
  • Docs »
  • Cluster System Management (CSM)
  • Edit on GitHub

Cluster System Management (CSM)ΒΆ

https://user-images.githubusercontent.com/4662139/49670811-e958ff00-fa33-11e8-92c2-3bf00e8d1001.png

CSM is a cognitive self learning system for managing and overseeing a HPC cluster. CSM interacts with a variety of open source IBM tools for supporting and maintaining a cluster, such as:

  • Discovery and management of system resources
  • Database integration (PostgreSQL)
  • Job launch support (workload management, cluster, and allocation APIs)
  • Node diagnostics (diag APIs and scripts)
  • RAS events and actions
  • Infrastructure Health checks
  • Python Bindings for C APIs

Table of Contents

  • User Guide
    • Introduction
    • CSM Database
    • CSM Infrastructure
    • Compute node states
    • Job Launch
    • CSM REST Daemon
  • Installation and Configuration
    • Introduction
    • Pre-Requisites
    • Updating from a previous version
    • Installation
    • Configuration
    • Uninstallation
    • Appendices
  • APIs
    • Installation
    • Configuration
    • List of CSM APIs
    • Implementing New CSM APIs
    • CSM API Python Bindings Guide
    • Soft Failure Recovery
    • Change Log
  • Database
    • CSM Database Appendix
    • Using csm_db_history_archive.py
    • Using csm_db_backup_script_v1.sh
    • Using csm_db_connections_script.sh
    • Using csm_db_history_delete.py
    • Using csm_db_schema_version_upgrade_19_0.sh
    • Using csm_db_script.sh
    • Using csm_db_stats.sh script
    • Using csm_ras_type_script.sh
  • Infrastructure
    • CSMD Executable
    • CSMD Configuration
  • Inventory
    • GPU Inventory
    • Network Inventory
    • Node Inventory
  • External Integration
    • Mellanox and Infiniband
  • Tools
    • CSM logging tools
    • CSM standalone inventory collection
Next Previous

© Copyright 2020, IBM Corporation Revision 315963e0.

Built with Sphinx using a theme provided by Read the Docs.