Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: crmsh 4.6.0 support and stonith-enabled workflow update #232

Merged
merged 3 commits into from
Oct 16, 2024
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 9 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,8 @@ An Ansible role for managing High Availability Clustering.

* Compatible OS
* RHEL 8.3+, Fedora 31+
* SLES 15 SP5+ with HA extension, SLES4SAP 15 SP5+
* SUSE Linux Enterprise Server 15 with HA extension
* SUSE Linux Enterprise Server for SAP Applications 15
* Systems running RHEL are expected to be registered and have High-Availability
repositories accessible, and ResilientStorage repositories accessible if using
`ha_cluster_enable_repos_resilient_storage`
Expand Down Expand Up @@ -153,6 +154,13 @@ It is possible to specify fence agents here as well. However,
[`ha_cluster_fence_agent_packages`](#ha_cluster_fence_agent_packages) is
preferred for that, so that its default value is overridden.

#### `ha_cluster_use_latest_packages`

boolean, default: `false`

If set to `true`, all packages will be installed with latest version.</br>
richm marked this conversation as resolved.
Show resolved Hide resolved
If set to `false`, existing packages will not be updated.

#### `ha_cluster_hacluster_password`

string, no default - must be specified
Expand Down
1 change: 1 addition & 0 deletions defaults/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ ha_cluster_extra_packages: []
# Default fence agent packages are defined in respective os_family var files
ha_cluster_fence_agent_packages:
"{{ __ha_cluster_fence_agent_packages_default }}"
ha_cluster_use_latest_packages: false

ha_cluster_hacluster_password: ""
ha_cluster_regenerate_keys: false
Expand Down
4 changes: 2 additions & 2 deletions tasks/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@
__ha_cluster_role_essential_packages
+
ha_cluster_extra_packages }}"
state: present
state: "{{ 'latest' if ha_cluster_use_latest_packages else 'present' }}"
use: "{{ (__ha_cluster_is_ostree | d(false)) |
ternary('ansible.posix.rhel_rpm_ostree', omit) }}"

Expand Down Expand Up @@ -73,7 +73,7 @@
ternary(__ha_cluster_cloud_agents_packages, [])
+
ha_cluster_fence_agent_packages }}"
state: present
state: "{{ 'latest' if ha_cluster_use_latest_packages else 'present' }}"
use: "{{ (__ha_cluster_is_ostree | d(false)) |
ternary('ansible.posix.rhel_rpm_ostree', omit) }}"

Expand Down
90 changes: 69 additions & 21 deletions tasks/shell_crmsh/create-and-push-cib.yml
Original file line number Diff line number Diff line change
Expand Up @@ -27,23 +27,28 @@
check_mode: false
changed_when: not ansible_check_mode

# Maintenance mode is required, because CIB version changes with cluster
# status changes, resulting in shadow CIB outdated and unable to patch.
# Sleep is implemented to ensure that cluster have enough time to freeze
# to ensure CIB export consistency.
# Meta-attrs is-managed will conflict with maintenance mode as well as
# individual resource maintenance attributes. Expect will skip their deletion.
# - name: Put cluster in maintenance mode to freeze cib changes
# ansible.builtin.expect:
# command: crm --force configure property maintenance-mode=true
# responses:
# ".*is-managed.*": "n"
# ".*already.*": "n"
# run_once: true # noqa: run_once[task]
# check_mode: false
# changed_when: true

# Maintenance mode is required, because CIB version changes with cluster
# status changes, resulting in shadow CIB outdated and unable to patch.
- name: Put cluster in maintenance mode to freeze cib changes
ansible.builtin.expect:
command: crm configure property maintenance-mode=true
responses:
".*is-managed.*": "n"
".*already.*": "n"
ansible.builtin.command:
cmd: crm --force configure property maintenance-mode=true
run_once: true # noqa: run_once[task]
check_mode: false
changed_when: true

- name: Verify that maintenace-mode is true
- name: Verify that maintenance-mode is true
ansible.builtin.command:
cmd: crm status
register: __ha_cluster_crm_status_maint
Expand Down Expand Up @@ -116,12 +121,16 @@
# Build the new CIB
- name: Build the new CIB
block:
## Cluster properties
- name: Configure cluster properties
ansible.builtin.include_tasks: crm-cluster-properties.yml
vars:
properties_set: "{{ ha_cluster_cluster_properties[0] }}"
when: ha_cluster_cluster_properties[0].attrs | d([])
## Ensure that stonith is disabled before executing crm configure.
## This is usually disabled by running crm init.
## Executing crm configure without stonith results in "config not valid".
- name: Set property stonith-enabled to false
ansible.builtin.command:
cmd: >-
crm -c {{ __ha_cluster_crm_shadow }}
marcelmamula marked this conversation as resolved.
Show resolved Hide resolved
configure property stonith-enabled=false
check_mode: false
changed_when: not ansible_check_mode

## Resource defaults
- name: Configure resource defaults
Expand Down Expand Up @@ -229,6 +238,38 @@
index_var: constraint_index
loop_var: constraint

## Cluster properties
- name: Configure cluster properties
ansible.builtin.include_tasks: crm-cluster-properties.yml
vars:
properties_set: "{{ ha_cluster_cluster_properties[0] }}"
when: ha_cluster_cluster_properties[0].attrs | d([])

# Verify CIB to ensure that there are no errors before applying.
- name: Verify shadow CIB
ansible.builtin.command:
cmd: >-
crm_verify -V -x
/var/lib/pacemaker/cib/shadow.{{ __ha_cluster_crm_shadow }}
marcelmamula marked this conversation as resolved.
Show resolved Hide resolved
register: __ha_cluster_crm_verify
ignore_errors: true
check_mode: false
changed_when: false

## Fail execution if shadow CIB is not valid.
## Example: No STONITH resources were defined while stonith-enabled is true
- name: Fail if shadow CIB is invalid
ansible.builtin.fail:
msg:
- "ERROR: Cluster configuration was invalid."
- Following errors have to be remediated before retrying.
- "{{ __ha_cluster_crm_verify.stdout_lines | d('') }}"
- "{{ __ha_cluster_crm_verify.stderr_lines | d('') }}"
when:
- __ha_cluster_crm_verify.rc != 0
check_mode: false


# Push the new CIB into the cluster
- name: Copy shadow cib to temp
ansible.builtin.copy:
Expand Down Expand Up @@ -290,12 +331,19 @@

# Meta-attrs is-managed will conflict with maintenance mode as well as
# individual resource maintenance attributes. Expect will skip their deletion.
# - name: Disable maintenance mode
# ansible.builtin.expect:
# command: crm --force configure property maintenance-mode=false
# responses:
# ".*is-managed.*": "n"
# ".*already.*": "n"
# check_mode: false
# changed_when: true
# run_once: true # noqa: run_once[task]

- name: Disable maintenance mode
ansible.builtin.expect:
command: crm configure property maintenance-mode=false
responses:
".*is-managed.*": "n"
".*already.*": "n"
ansible.builtin.command:
cmd: crm --force configure property maintenance-mode=false
check_mode: false
changed_when: true
run_once: true # noqa: run_once[task]
Expand Down
36 changes: 34 additions & 2 deletions tasks/shell_crmsh/crm-cluster-properties.yml
Original file line number Diff line number Diff line change
@@ -1,15 +1,47 @@
# SPDX-License-Identifier: MIT
---
# stonith-enabled=true is required for clusters, but this task
# allows for creation of cluster without stonith enabled.
# It also ensures that stonith-enabled is always present.
- name: Append stonith-enabled to ha_cluster_cluster_properties
ansible.builtin.set_fact:
__ha_cluster_property_attr: "{{
(properties_set.attrs + [{'name': 'stonith-enabled', 'value': 'true'}])
if properties_set.attrs | selectattr('name', 'equalto', 'stonith-enabled') | list | length == 0
richm marked this conversation as resolved.
Show resolved Hide resolved
else properties_set.attrs }}" # yamllint disable-line rule:line-length
richm marked this conversation as resolved.
Show resolved Hide resolved

- name: Configure cluster properties set
ansible.builtin.command:
cmd: >-
crm -c {{ __ha_cluster_crm_shadow }}
crm --force -c {{ __ha_cluster_crm_shadow }}
configure property {{ item.name | quote }}={{ item.value | quote }}
loop: "{{ properties_set.attrs }}"
loop: "{{ __ha_cluster_property_attr }}"
# Pause ensures that cluster is consistent for further property changes.
# Setting up crm properties without pause resulted in unstable cluster.
loop_control:
pause: 5
retries: 10
check_mode: false
changed_when: not ansible_check_mode

# Get status of cluster properties for validation of stonith-enabled
- name: Get status of cluster properties
ansible.builtin.command:
cmd: >-
crm --force -c {{ __ha_cluster_crm_shadow }}
configure show type:property
register: __ha_cluster_crm_show_property
check_mode: false
changed_when: false

# Warning that stonith-enabled is false and it is not recommended.
- name: Show warning if stonith-enabled=false
ansible.builtin.debug:
msg:
- "Warning: Property stonith-enabled is set to false."
- Property stonith-enabled should be always true for Live clusters!
- Set stonith-enabled as true in variable ha_cluster_cluster_properties.
when:
- __ha_cluster_crm_show_property is defined
- "'stonith-enabled=false'
in __ha_cluster_crm_show_property.stdout | lower"
Loading