@enrico-usai
Contributor

Enrico Usai added 2 commits August 31, 2023 14:52
…Lustre

The latest RHEL 8.7 kernel, 4.18.0-425.19.2.el8_7, introduced a kABI regression,
which led Lustre to re-release its client compiled against this latest kernel.

As a result, the re-released client can no longer be loaded with modprobe on kernel 4.18.0-425.13.1.el8_7.

With this patch we skip the Lustre installation on that kernel and print a warning in the log,
as we already do for 4.18.0-425.3.1.el8, which has a similar issue.
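
The skip-and-warn behavior described above can be sketched as a small shell check. This is only an illustration under stated assumptions: the function names `lustre_supported` and `check_lustre`, the INFO message, and the idea of matching the kernel release by prefix are hypothetical and are not taken from the cookbook code changed by this PR. Only the two kernel versions and the WARN text come from this description.

```shell
#!/bin/sh
# Sketch only: function names, the INFO message, and prefix matching are
# illustrative assumptions, not the cookbook's actual implementation.

# Return 0 (true) when the given kernel release is expected to load the Lustre client.
lustre_supported() {
  case "$1" in
    # Kernels named in this PR as unable to load the current client.
    4.18.0-425.3.1.el8*|4.18.0-425.13.1.el8_7*) return 1 ;;
    *) return 0 ;;
  esac
}

# Print the install decision for a kernel release (defaults to the running kernel).
check_lustre() {
  release="${1:-$(uname -r)}"
  if lustre_supported "$release"; then
    echo "INFO: installing Lustre client for kernel $release"
  else
    echo "WARN: FSx for Lustre is not supported in kernel version $release of RHEL 8.7, please update the kernel version"
  fi
}

check_lustre "4.18.0-425.13.1.el8_7.x86_64"
```

Calling `check_lustre` with no argument would apply the same decision to the output of `uname -r` on the running instance.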

### Tests

EC2:
```
# RHEL-8.7.0_HVM-20230330-x86_64-56-Hourly2-GP2 in eu-west-1
# with kernel 4.18.0-425.13.1.el8_7
export KITCHEN_RHEL8_AMI=ami-0fa2f7b35eeb82b7a
bash kitchen.ec2.sh environment-install test lustre-rhel8
```
Before the patch, this test failed in the converge phase.
After the patch, converge passes and prints the following WARN message:
```
  WARN: FSx for Lustre is not supported in kernel version 4.18.0-425.13.1.el8_7.x86_64 of RHEL 8.7, please update the kernel version
```

Docker:
```
bash kitchen.docker.sh environment-install test lustre-rhel8
```
This test passes because the Docker image runs RHEL 8.8 with a fake value for the kernel version, and we skip the modprobe command there.
The kernel release test variable now uses a 477 version to match the RHEL 8.8 kernel available on Docker.
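
As a rough illustration of the Docker setup, the sketch below shows one way a test could substitute a placeholder kernel release and avoid loading the module. Everything named here is hypothetical: the `KERNEL_RELEASE_OVERRIDE` variable, both functions, and the example release string are assumptions made for the sketch. How the cookbook actually decides to skip modprobe on Docker is not shown in this PR, so keying the skip on the "fake-value" marker is purely illustrative.

```shell
#!/bin/sh
# Sketch only: KERNEL_RELEASE_OVERRIDE, both functions, and the example
# release string are hypothetical, not part of the cookbook.

# Return the kernel release, preferring a test-provided override when set.
kernel_release() {
  if [ -n "${KERNEL_RELEASE_OVERRIDE:-}" ]; then
    printf '%s\n' "$KERNEL_RELEASE_OVERRIDE"
  else
    uname -r
  fi
}

# Return 0 when the release is a placeholder, so the module load should be skipped.
is_fake_release() {
  case "$1" in
    *fake-value*) return 0 ;;
    *) return 1 ;;
  esac
}

# Illustrative placeholder using a 477 (RHEL 8.8) kernel series.
KERNEL_RELEASE_OVERRIDE="fake-value-4.18.0-477.el8_8.x86_64"
release="$(kernel_release)"
if is_fake_release "$release"; then
  echo "skipping modprobe for placeholder kernel release $release"
else
  echo "would run: modprobe lustre"
fi
```

Keeping a recognizable marker in the placeholder makes it obvious in logs that the value did not come from a real kernel.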

### References
* 4.18.0-425.3.1 issue: https://access.redhat.com/solutions/6985596
* 4.18.0-425.19.2 issue: openzfs/zfs#14724

Signed-off-by: Enrico Usai <usai@amazon.com>
We override the kernel_release variable on Docker because
its value there does not have the same format as on EC2.

Adding the word "fake-value" to the kernel version simplifies debugging on Docker.

Signed-off-by: Enrico Usai <usai@amazon.com>
@enrico-usai enrico-usai requested review from a team as code owners August 31, 2023 12:54
@enrico-usai enrico-usai changed the base branch from develop to release-3.7 August 31, 2023 12:54
enrico-usai pushed a commit to enrico-usai/aws-parallelcluster that referenced this pull request Aug 31, 2023
We improved the Lustre code to avoid build failures
when Lustre is not compatible with the RHEL 8.7 kernel.

### References
* aws/aws-parallelcluster-cookbook#2442

Signed-off-by: Enrico Usai <usai@amazon.com>
@enrico-usai enrico-usai enabled auto-merge (rebase) August 31, 2023 14:37
auto-merge was automatically disabled August 31, 2023 14:37

Pull request was closed

@enrico-usai enrico-usai reopened this Aug 31, 2023
@enrico-usai enrico-usai enabled auto-merge (rebase) August 31, 2023 14:37
@enrico-usai enrico-usai merged commit 217d961 into aws:release-3.7 Aug 31, 2023
@enrico-usai enrico-usai deleted the wip/rhel8-lustre-37 branch August 31, 2023 14:39
enrico-usai pushed a commit to aws/aws-parallelcluster that referenced this pull request Aug 31, 2023
We improved the Lustre code to avoid build failures
when Lustre is not compatible with the RHEL 8.7 kernel.

### References
* aws/aws-parallelcluster-cookbook#2442

Signed-off-by: Enrico Usai <usai@amazon.com>