Feature branch sync - pub/q2_dev to staging_q2 #4383

Merged
abhishek-sa1 merged 274 commits into staging_q2 from pub/q2_dev
May 7, 2026

Conversation

@abhishek-sa1
Collaborator

Feature branch sync - pub/q2_dev to staging_q2

priti-parate and others added 30 commits April 13, 2026 12:05
Signed-off-by: priti-parate <140157516+priti-parate@users.noreply.github.com>
victoria operator and victoria log input validation
support for Victoria log input variables and input validation.
Signed-off-by: priti-parate <140157516+priti-parate@users.noreply.github.com>
VictoriaLogs container image references and default variable
Signed-off-by: priti-parate <140157516+priti-parate@users.noreply.github.com>
merge pub/q2 dev with pub/telemetry
Powerscale telemetry input validation
jagadeeshnv and others added 28 commits May 5, 2026 23:55
Signed-off-by: Jagadeesh N V <39791839+jagadeeshnv@users.noreply.github.com>
Update provision_mapping_nodes.yml
Signed-off-by: Jagadeesh N V <39791839+jagadeeshnv@users.noreply.github.com>
- Replace nfs_client_params with mounts list for consistency
- Extract k8s_nfs_server_ip from mounts source field (split on ':')
- Extract k8s_client_mount_path from mounts mount_point field
- Introduce k8s_nfs_server_path as unified source reference
- Update error message to use k8s_nfs_server_path for clarity
- Comment out deprecated k8s_server_share_path and nfs_server_ip references
- Consolidate 'mount -av' into single set_fact call instead of separate task
- Remove redundant debug task for single_mnt_runcmd
- Add loop_control label for better Ansible output readability
- Improve code formatting for regex_replace operations (no functional change)
- Streamline permission and bind mount runcmd list construction
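
The mounts-based extraction listed above could be sketched in Python roughly as follows. This is an illustration only: the real change lives in Ansible tasks that are not shown in this conversation. The function name, the dict shape of a mounts entry (`source` and `mount_point` keys), and the reading of `k8s_nfs_server_path` as the export-path half of `source` are all assumptions.

```python
def split_nfs_mount(mount):
    """Derive NFS server, export path, and client mount path from one mounts entry.

    Assumption: `source` has the form "<server>:<export path>".
    """
    source = mount["source"]
    # Split on the first ':' only, so the export path is kept whole.
    server_ip, sep, server_path = source.partition(":")
    if not sep or not server_ip or not server_path:
        raise ValueError(f"expected '<server>:<path>' in source, got {source!r}")
    return {
        "k8s_nfs_server_ip": server_ip,
        # Assumption: the unified source reference is the export path.
        "k8s_nfs_server_path": server_path,
        "k8s_client_mount_path": mount["mount_point"],
    }
```

Splitting on the first ':' suits an IPv4 address or hostname; an IPv6 server address would need different handling.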
remove default dns entry for ib network
Telemetry enabled flag set for creating and executing the telemetry.sh
…m support detection

- Move mkdir for K8s NFS mount directories before fstab entries to prevent race condition
  where mount attempts occur before target directories exist
- Fix duplicate mount point names in storage_config.yml (ps3: slurm_login -> slurm_login_kube_node,
  ps4: login_node -> login_kubectrl) to avoid conflicts between functional groups

- Add mkdir -p for /tmp/crio-storage, /var/lib/etcd, /var/lib/kubelet, /etc/kubernetes,
  /var/log/pods, /var/lib/packages BEFORE adding NFS entries to /etc/fstab
- Affected templates:
  - ci-group-service_kube_control_plane_first_x86_64.yaml.j2
  - ci-group-service_kube_control_plane_x86_64.yaml.j2
  - ci-group-service_kube_node_x86_64.yaml.j2
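
The ordering fix described above (create the target directories first, then add the NFS entries to /etc/fstab) could be sketched like this. The helper name, the generated shell commands, and the fstab line format are hypothetical; the actual templates are cloud-init Jinja2 files that are not shown here.

```python
import shlex

# Directories named in the commit message above.
K8S_NFS_DIRS = [
    "/tmp/crio-storage",
    "/var/lib/etcd",
    "/var/lib/kubelet",
    "/etc/kubernetes",
    "/var/log/pods",
    "/var/lib/packages",
]


def build_runcmd(mount_dirs, fstab_lines):
    """Return shell commands with every mkdir ahead of any fstab append."""
    # Step 1: make sure each mount target exists.
    cmds = [f"mkdir -p {shlex.quote(d)}" for d in mount_dirs]
    # Step 2: only then append the NFS entries to /etc/fstab.
    cmds += [f"echo {shlex.quote(line)} >> /etc/fstab" for line in fstab_lines]
    return cmds
```

Building the list in two passes keeps the ordering guarantee in one place, so a later mount attempt never sees an fstab entry whose target directory is missing.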

- Remove redundant 'mount -av' from bind mount runcmd sequence (process_single_mount.yml)
- Functional groups now handle mount -av independently

- Fix slurm_support detection logic to check both software_config AND functional_groups
- Initialize slurm_support to false, then set true only if:
  1. slurm_custom is in software_config.softwares, AND
  2. slurm_control_node_x86_64 exists in functional_groups
- Apply fix to both mount_config and slurm_config roles
- Remove unused slurm_support var from mount_config/vars/main.yml
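
The detection rule above amounts to a two-condition check. A minimal sketch, assuming `software_config` is a dict whose `softwares` list holds entries with a `name` key and `functional_groups` is a collection of group names (both shapes are assumptions, as is the function name):

```python
def detect_slurm_support(software_config, functional_groups):
    """Return True only when both conditions from the commit message hold."""
    # Start from false, as the commit message describes.
    slurm_support = False
    software_names = {
        entry.get("name") for entry in software_config.get("softwares", [])
    }
    if (
        "slurm_custom" in software_names
        and "slurm_control_node_x86_64" in functional_groups
    ):
        slurm_support = True
    return slurm_support
```

Under this sketch, having `slurm_custom` in the software list without a `slurm_control_node_x86_64` functional group leaves `slurm_support` false, and vice versa.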

- Remove verbose flags from swap commands (fallocate -v, chmod -v, mkswap -v, swapon -v)
  in ci-group-login_compiler_node_aarch64.yaml.j2

- Update storage_config.json schema to reflect new mount point naming conventions

This commit addresses Issue #3 (race condition) and Issue #4 (missing directory creation)
from the cloud-init analysis, preventing silent NFS mount failures on K8s nodes.
Signed-off-by: sakshi-singla-1735 <sakshi.s@dell.com>
gpu defect- peermem defect- prolog removal
Signed-off-by: Jagadeesh N V <39791839+jagadeeshnv@users.noreply.github.com>
@abhishek-sa1 abhishek-sa1 marked this pull request as ready for review May 7, 2026 17:40
@abhishek-sa1 abhishek-sa1 merged commit fd38580 into staging_q2 May 7, 2026
4 checks passed

8 participants