Revisions of slurm

Egbert Eich's avatar Egbert Eich (eeich) accepted request 990637 from Bernhard Wiedemann's avatar Bernhard Wiedemann (bmwiedemann) (revision 213)
make slurmtest.tar reproducible
buildservice-autocommit accepted request 990643 from Factory Maintainer's avatar Factory Maintainer (factory-maintainer) (revision 212)
baserev update by copy to link target
Egbert Eich's avatar Egbert Eich (eeich) committed (revision 211)
- Fix a typo which prevented the nproc limit for slurmd to be
  up-ed for the test suite.
Egbert Eich's avatar Egbert Eich (eeich) accepted request 989256 from Egbert Eich's avatar Egbert Eich (eeich) (revision 210)
- Improve check for mpicc in testsuite package: if binary isn't
  found, don't crash.
Egbert Eich's avatar Egbert Eich (eeich) committed (revision 209)
- Fix a typo
buildservice-autocommit accepted request 988733 from Egbert Eich's avatar Egbert Eich (eeich) (revision 208)
baserev update by copy to link target
Egbert Eich's avatar Egbert Eich (eeich) accepted request 988732 from Egbert Eich's avatar Egbert Eich (eeich) (revision 207)
- Package the Slurm testsuite for QA purposes.
  * Fixes for test suite:
    Keep-logs-of-skipped-test-when-running-test-cases-sequentially.patch
    Fix-test-21.41.patch
    Fix-test-38.11.patch
    Fix-test-32.8.patch
    Fix-test-3.13.patch
    Fix-test7.2-to-find-libpmix-under-lib64-as-well.patch
  * Add documentation:
    README_Testsuite.md
- Allow log in as user 'slurm'. This allows admins to run certain
  priviledged commands more easily without becoming root.
Christian Goll's avatar Christian Goll (mslacken) accepted request 983910 from Christian Goll's avatar Christian Goll (mslacken) (revision 206)
- update to 22.05.2 with following fixes:
  * Fix regression which allowed the oversubscription of licenses.
  * Fix a segfault in slurmctld when requesting gres in job arrays.
Egbert Eich's avatar Egbert Eich (eeich) committed (revision 205)
- Package the Slrum testsuite for QA purposes.
  NOTE: This package is not meant to be used for testing by the
  user but rather for testing by the maintainers to ensure the
  package is working properly.
  DO NOT report test suite failures unless you are able to confirm
  that the failure is really a bug.
buildservice-autocommit accepted request 980097 from Christian Goll's avatar Christian Goll (mslacken) (revision 204)
baserev update by copy to link target
Christian Goll's avatar Christian Goll (mslacken) accepted request 980093 from Christian Goll's avatar Christian Goll (mslacken) (revision 203)
- update to 22.05.0 with following changes:
- Support for dynamic node addition and removal
- Support for native Linux cgroup v2 operation
- Newly added plugins to support HPE Slingshot 11 networks
  (switch/hpe_slingshot), and Intel Xe GPUs (gpu/oneapi)
- Added new acct_gather_interconnect/sysfs plugin to collect statistics
  from arbitrary network interfaces.
- Expanded and synced set of environment variables available in the
  Prolog/Epilog/PrologSlurmctld/EpilogSlurmctld scripts.
- New "--prefer" option to job submissions to allow for a "soft
  constraint" request to influence node selection.
- Optional support for license planning in the backfill scheduler with
  "bf_licenses" option in SchedulerParameters.
- removed file slurm-2.4.4-init.patch as sysvinit is now realy deprecated
- removed file load-pmix-major-version.patch as fixed upstream
buildservice-autocommit accepted request 976280 from Egbert Eich's avatar Egbert Eich (eeich) (revision 202)
baserev update by copy to link target
Egbert Eich's avatar Egbert Eich (eeich) committed (revision 201)
- Update to 21.08.8 which fixes CVE-2022-29500 (bsc#1199278),
  CVE-2022-29501 (bsc#1199279), and CVE-2022-29502 (bsc#1199281).
Egbert Eich's avatar Egbert Eich (eeich) accepted request 976056 from Egbert Eich's avatar Egbert Eich (eeich) (revision 200)
- Add a comment about the CommunicationParameters=block_null_hash
  option warning users who migrate - just in case.
buildservice-autocommit accepted request 975440 from Christian Goll's avatar Christian Goll (mslacken) (revision 199)
baserev update by copy to link target
Christian Goll's avatar Christian Goll (mslacken) accepted request 975374 from Christian Goll's avatar Christian Goll (mslacken) (revision 198)
- Update to 21.08.8 which fixes CVE-2022-29500, CVE-2022-29501
  and CVE-2022-29502
- Added 'CommunicationParameters=block_null_hash' to slurm.conf, please
  add this parameter to existing configurations.
buildservice-autocommit accepted request 974456 from Christian Goll's avatar Christian Goll (mslacken) (revision 197)
baserev update by copy to link target
Christian Goll's avatar Christian Goll (mslacken) accepted request 974433 from Christian Goll's avatar Christian Goll (mslacken) (revision 196)
 - Update to 21.08.7 with following changes:
  * openapi/v0.0.37 - correct calculation for bf_queue_len_mean in /diag.
  * Avoid shrinking a reservation when overlapping with downed nodes.
  * Only check TRES limits against current usage for TRES requested by the job.
  * Do not allocate shared gres (MPS) in whole-node allocations
  * Constrain slurmstepd to job/step cgroup like in previous versions of Slurm.
  * Fix warnings on 32-bit compilers related to printf() formats.
  * Fix reconfigure issues after disabling/reenabling the GANG PreemptMode.
  * Fix race condition where a cgroup was being deleted while another step
    was creating it.
  * Set the slurmd port correctly if multi-slurmd
  * Fix FAIL mail not being sent if a job was cancelled due to preemption.
  * slurmrestd - move debug logs for HTTP handling to be gated by debugflag
    NETWORK to avoid unnecessary logging of communication contents.
  * Fix issue with bad memory access when shrinking running steps.
  * Fix various issues with internal job accounting with GRES when jobs are
    shrunk.
  * Fix ipmi polling on slurmd reconfig or restart.
  * Fix srun crash when reserved ports are being used and het step fails
    to launch.
  * openapi/dbv0.0.37 - fix DELETE execution path on /user/{user_name}.
  * slurmctld - Properly requeue all components of a het job if PrologSlurmctld
    fails.
  * rlimits - remove final calls to limit nofiles to 4096 but to instead use
    the max possible nofiles in slurmd and slurmdbd.
  * Allow the DBD agent to load large messages (up to MAX_BUF_SIZE) from state.
  * Fix potential deadlock during slurmctld restart when there is a completing
    job.
  * slurmstepd - reduce user requested soft rlimits when they are above max
    hard rlimits to avoid rlimit request being completely ignored and
Christian Goll's avatar Christian Goll (mslacken) accepted request 942081 from Christian Goll's avatar Christian Goll (mslacken) (revision 195)
- update to 21.08.5 with following changes:
  * Fix issue where typeless GRES node updates were not immediately reflected.
  * Fix setting the default scrontab job working directory so that it's the home
    of the different user (*u <user>) and not that of root or SlurmUser editor.
  * Fix stepd not respecting SlurmdSyslogDebug.
  * Fix concurrency issue with squeue.
  * Fix job start time not being reset after launch when job is packed onto
    already booting node.
  * Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes.
  * Cray - Fix issues with starting hetjobs.
  * auth/jwks - Print fatal() message when jwks is configured but file could
    not be opened.
  * If sacctmgr has an association with an unknown qos as the default qos
    print 'UNKN*###' instead of leaving a blank name.
  * Correctly determine task count when giving --cpus-per-gpu, --gpus and
    *-ntasks-per-node without task count.
  * slurmctld - Fix places where the global last_job_update was not being set
    to the time of update when a job's reason and description were updated.
  * slurmctld - Fix case where a job submitted with more than one partition
    would not have its reason updated while waiting to start.
  * Fix memory leak in node feature rebooting.
  * Fix time limit permanetly set to 1 minute by backfill for job array tasks
    higher than the first with QOS NoReserve flag and PreemptMode configured.
  * Fix sacct -N to show jobs that started in the current second
  * Fix issue on running steps where both SLURM_NTASKS_PER_TRES and
    SLURM_NTASKS_PER_GPU are set.
  * Handle oversubscription request correctly when also requesting
    *-ntasks-per-tres.
  * Correctly detect when a step requests bad gres inside an allocation.
  * slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.
Christian Goll's avatar Christian Goll (mslacken) accepted request 932063 from Antoine Ginies's avatar Antoine Ginies (aginies) (revision 194)
add a ref to SLE-22741 (firewall config) in changelog
Displaying revisions 81 - 100 of 293
openSUSE Build Service is sponsored by