Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Fixed
Priority: P1: Critical
Fix Version/s: None
Affects Version/s: production
Component/s: CI / Test infrastructure
Labels:
None

Commits:
2bc191db8 (dev), 9adb368f5 (master)

Description

We’re currently experiencing a bottleneck in the CI pipeline caused by 32 vCPU jobs waiting excessively long to acquire VMs. This leads to a situation where only a small number of VMs (~50) are running, while over 1000 jobs are queued, despite there being available capacity on the hosts.

NUMA pinning contributes to this a lot: a 32 vCPU job requires all 32 vCPUs to be available on a single physical CPU. For instance, on an 80 vCPU host with 2 physical CPUs (40 vCPUs each), 2 vCPUs are reserved for the host, leaving only 38 per physical CPU for VMs. As a result, the host must be nearly empty to accommodate a 32 vCPU job.

This constraint significantly limits scheduling flexibility and needs to be addressed to prevent CI delays and better utilize available resources.

Attachments

Issue Links

resulted in

QTQAINFRA-7193 OpenNebula not scheduling VMs fast enough

Closed

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: Jukka Jokiniva

Reporter:: Jukka Jokiniva

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 22 May '25 07:57

Updated:: 12 Jun '25 09:53

Resolved:: 26 May '25 06:28

Gerrit Reviews

There are no open Gerrit changes

Show There are 2 closed Gerrit changes

Hide There are 2 closed Gerrit changes

Add ability to specify per config VM sizes in modules: Gerrit Review:

Limit 32 core VM to Windows cross-compilation target: Gerrit Review: