CloudManager module diagnostics

Important

All commands are executed only from superuser.

Mode superuser:

sudo -i

Services logging used by CloudManager module

Logging happens using logging module.

Log files are located in /var/log/aos/cloud-manager/ directory.

Directory contains the following files:

  • api.log is a aos-cloud-manager-api command message collection file;
  • beat.log is a aos-cloud-manager-beat command message collection file;
  • node-tracker.log is a file for collecting messages for monitoring compute nodes;
  • node-syncer.log is a file for collecting messages for synchronizing computational nodes;
  • storage-syncer.log is a file for collecting messages for synchronizing storages;
  • drs-trigger.log is a file for collecting auditing applying messages;
  • worker.log is a aos-cloud-manager-worker command message collection file.

Note

Description of the logging process provided by the OpenStack platform can be found in the corresponding section of the official documentation.

CloudManager debugging

There are the following solutions in case of problems in the operation of the module:

  1. Set log level to value DEBUG which will allow recording event messages in log files with maximum detail for diagnosing and solving problems.

  2. Run openstack aos tests module self-diagnosis utility.

    Example of using:

    openstack aos tests --names cloud_manager --modules
    Modules diagnostic started.
    Running tests for: cloud_manager
    
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Start date and time      | Thu Dec 24 17:03:53 2020 GMT                                                                  |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Inactive services        | apache-htcacheclean, nova-novncproxy, nova-serialproxy, nova-xenvncproxy, rtslib-fb-targetctl |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Database names           | Failed to get database names. Please run command as root.                                     |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Rabbit users             | Failed to get RabbitMQ info. Please run command as root.                                      |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Rabbit virtual hosts     | Failed to get RabbitMQ info. Please run command as root.                                      |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Rabbit limits            | Failed to get RabbitMQ info. Please run command as root.                                      |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Apache config parameters | Failed to get Apache configs info. Please run command as root.                                |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    | Apache MPM               | worker                                                                                        |
    +--------------------------+-----------------------------------------------------------------------------------------------+
    
    ============
    CloudManager
    Version: 1.4.1.dev15 (latest: 1.4.1.dev8)
    License: 46a5c6f069514b2eb23844ac338bed5b (valid until 15.02.2021 03:00:00)
    ============
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | N  | Test name                               | Status  | Failure reason                                                    |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 1  | test api connection                     | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 2  | test api service registration           | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 3  | test arp command                        | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 4  | test config file existence              | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 5  | test connection to keystone             | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 6  | test connection to nova                 | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 7  | test drs service availability           | FAIL    | DRS service is not available.                                     |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 8  | test leases files exist                 | SKIPPED | No leases file found. Looks like you don't have a DHCP server.    |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 9  | test license validity                   | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 10 | test migrations applied                 | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 11 | test ping command                       | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 12 | test pxe folder owner write permissions | SKIPPED | Folder for PXE images does not exist, check config PXE parameter. |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 13 | test registered celery tasks            | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    | 14 | test systemd services                   | OK      |                                                                   |
    +----+-----------------------------------------+---------+-------------------------------------------------------------------+
    Ran 14 in 1.714s
    FAIL (success=11, failures=1, errors=0))
    

Note

Detailed description of utility is available in section «Self-diagnosis utility».