Revisions of slurm
Egbert Eich (eeich)
accepted
request 990637
from
Bernhard Wiedemann (bmwiedemann)
(revision 213)
make slurmtest.tar reproducible
buildservice-autocommit
accepted
request 990643
from
Factory Maintainer (factory-maintainer)
(revision 212)
baserev update by copy to link target
Egbert Eich (eeich)
committed
(revision 211)
- Fix a typo which prevented the nproc limit for slurmd to be up-ed for the test suite.
Egbert Eich (eeich)
accepted
request 989256
from
Egbert Eich (eeich)
(revision 210)
- Improve check for mpicc in testsuite package: if binary isn't found, don't crash.
Egbert Eich (eeich)
committed
(revision 209)
- Fix a typo
buildservice-autocommit
accepted
request 988733
from
Egbert Eich (eeich)
(revision 208)
baserev update by copy to link target
Egbert Eich (eeich)
accepted
request 988732
from
Egbert Eich (eeich)
(revision 207)
- Package the Slurm testsuite for QA purposes. * Fixes for test suite: Keep-logs-of-skipped-test-when-running-test-cases-sequentially.patch Fix-test-21.41.patch Fix-test-38.11.patch Fix-test-32.8.patch Fix-test-3.13.patch Fix-test7.2-to-find-libpmix-under-lib64-as-well.patch * Add documentation: README_Testsuite.md - Allow log in as user 'slurm'. This allows admins to run certain priviledged commands more easily without becoming root.
Christian Goll (mslacken)
accepted
request 983910
from
Christian Goll (mslacken)
(revision 206)
- update to 22.05.2 with following fixes: * Fix regression which allowed the oversubscription of licenses. * Fix a segfault in slurmctld when requesting gres in job arrays.
Egbert Eich (eeich)
committed
(revision 205)
- Package the Slrum testsuite for QA purposes. NOTE: This package is not meant to be used for testing by the user but rather for testing by the maintainers to ensure the package is working properly. DO NOT report test suite failures unless you are able to confirm that the failure is really a bug.
buildservice-autocommit
accepted
request 980097
from
Christian Goll (mslacken)
(revision 204)
baserev update by copy to link target
Christian Goll (mslacken)
accepted
request 980093
from
Christian Goll (mslacken)
(revision 203)
- update to 22.05.0 with following changes: - Support for dynamic node addition and removal - Support for native Linux cgroup v2 operation - Newly added plugins to support HPE Slingshot 11 networks (switch/hpe_slingshot), and Intel Xe GPUs (gpu/oneapi) - Added new acct_gather_interconnect/sysfs plugin to collect statistics from arbitrary network interfaces. - Expanded and synced set of environment variables available in the Prolog/Epilog/PrologSlurmctld/EpilogSlurmctld scripts. - New "--prefer" option to job submissions to allow for a "soft constraint" request to influence node selection. - Optional support for license planning in the backfill scheduler with "bf_licenses" option in SchedulerParameters. - removed file slurm-2.4.4-init.patch as sysvinit is now realy deprecated - removed file load-pmix-major-version.patch as fixed upstream
buildservice-autocommit
accepted
request 976280
from
Egbert Eich (eeich)
(revision 202)
baserev update by copy to link target
Egbert Eich (eeich)
committed
(revision 201)
- Update to 21.08.8 which fixes CVE-2022-29500 (bsc#1199278), CVE-2022-29501 (bsc#1199279), and CVE-2022-29502 (bsc#1199281).
Egbert Eich (eeich)
accepted
request 976056
from
Egbert Eich (eeich)
(revision 200)
- Add a comment about the CommunicationParameters=block_null_hash option warning users who migrate - just in case.
buildservice-autocommit
accepted
request 975440
from
Christian Goll (mslacken)
(revision 199)
baserev update by copy to link target
Christian Goll (mslacken)
accepted
request 975374
from
Christian Goll (mslacken)
(revision 198)
- Update to 21.08.8 which fixes CVE-2022-29500, CVE-2022-29501 and CVE-2022-29502 - Added 'CommunicationParameters=block_null_hash' to slurm.conf, please add this parameter to existing configurations.
buildservice-autocommit
accepted
request 974456
from
Christian Goll (mslacken)
(revision 197)
baserev update by copy to link target
Christian Goll (mslacken)
accepted
request 974433
from
Christian Goll (mslacken)
(revision 196)
- Update to 21.08.7 with following changes: * openapi/v0.0.37 - correct calculation for bf_queue_len_mean in /diag. * Avoid shrinking a reservation when overlapping with downed nodes. * Only check TRES limits against current usage for TRES requested by the job. * Do not allocate shared gres (MPS) in whole-node allocations * Constrain slurmstepd to job/step cgroup like in previous versions of Slurm. * Fix warnings on 32-bit compilers related to printf() formats. * Fix reconfigure issues after disabling/reenabling the GANG PreemptMode. * Fix race condition where a cgroup was being deleted while another step was creating it. * Set the slurmd port correctly if multi-slurmd * Fix FAIL mail not being sent if a job was cancelled due to preemption. * slurmrestd - move debug logs for HTTP handling to be gated by debugflag NETWORK to avoid unnecessary logging of communication contents. * Fix issue with bad memory access when shrinking running steps. * Fix various issues with internal job accounting with GRES when jobs are shrunk. * Fix ipmi polling on slurmd reconfig or restart. * Fix srun crash when reserved ports are being used and het step fails to launch. * openapi/dbv0.0.37 - fix DELETE execution path on /user/{user_name}. * slurmctld - Properly requeue all components of a het job if PrologSlurmctld fails. * rlimits - remove final calls to limit nofiles to 4096 but to instead use the max possible nofiles in slurmd and slurmdbd. * Allow the DBD agent to load large messages (up to MAX_BUF_SIZE) from state. * Fix potential deadlock during slurmctld restart when there is a completing job. * slurmstepd - reduce user requested soft rlimits when they are above max hard rlimits to avoid rlimit request being completely ignored and
Christian Goll (mslacken)
accepted
request 942081
from
Christian Goll (mslacken)
(revision 195)
- update to 21.08.5 with following changes: * Fix issue where typeless GRES node updates were not immediately reflected. * Fix setting the default scrontab job working directory so that it's the home of the different user (*u <user>) and not that of root or SlurmUser editor. * Fix stepd not respecting SlurmdSyslogDebug. * Fix concurrency issue with squeue. * Fix job start time not being reset after launch when job is packed onto already booting node. * Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes. * Cray - Fix issues with starting hetjobs. * auth/jwks - Print fatal() message when jwks is configured but file could not be opened. * If sacctmgr has an association with an unknown qos as the default qos print 'UNKN*###' instead of leaving a blank name. * Correctly determine task count when giving --cpus-per-gpu, --gpus and *-ntasks-per-node without task count. * slurmctld - Fix places where the global last_job_update was not being set to the time of update when a job's reason and description were updated. * slurmctld - Fix case where a job submitted with more than one partition would not have its reason updated while waiting to start. * Fix memory leak in node feature rebooting. * Fix time limit permanetly set to 1 minute by backfill for job array tasks higher than the first with QOS NoReserve flag and PreemptMode configured. * Fix sacct -N to show jobs that started in the current second * Fix issue on running steps where both SLURM_NTASKS_PER_TRES and SLURM_NTASKS_PER_GPU are set. * Handle oversubscription request correctly when also requesting *-ntasks-per-tres. * Correctly detect when a step requests bad gres inside an allocation. * slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.
Christian Goll (mslacken)
accepted
request 932063
from
Antoine Ginies (aginies)
(revision 194)
add a ref to SLE-22741 (firewall config) in changelog
Displaying revisions 81 - 100 of 293