Revisions of hwloc

buildservice-autocommit accepted request 999885 from Marcus Meissner's avatar Marcus Meissner (msmeissn) (revision 51)
baserev update by copy to link target
Marcus Meissner's avatar Marcus Meissner (msmeissn) accepted request 991745 from Callum Farmer's avatar Callum Farmer (gmbr3) (revision 50)
- Add libXNVCtrl support on TW
buildservice-autocommit accepted request 990738 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 49)
baserev update by copy to link target
Dirk Mueller's avatar Dirk Mueller (dirkmueller) accepted request 988289 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 48)
- update to 2.8.0:
  * API
    + Add HWLOC_TOPOLOGY_FLAG_NO_DISTANCES, _NO_MEMATTRS and _NO_CPUKINDS
      to reduce the overhead when unneeded.
    + Add separate Read/Write Bandwidth/Latency memory attributes and
      implement them on Linux.
  * Backends
    + NUMA nodes may now have a subtype such as DRAM, HBM, SPM, or NVM
      on heterogeneous memory platforms on Linux.
      - Add DAXType and DAXParent attributes on Linux to tell where a
	DAX device or its corresponding NUMA node come from (SPM for
	Specific-Purpose or NVM for Non-Volatile Memory).
    + Detect heterogeneous caches in hybrid CPUs on MacOS X,
      thanks to Paul Bone for the help.
    + Max frequencies are not ignored in Linux cpukinds anymore (they were
      ignored in hwloc 2.7.0), but they may be slightly adjusted to avoid
      reporting hybrid CPUs because Intel Turbo Boost Max 3.0.
      - See the documentation of environment variable HWLOC_CPUKINDS_MAXFREQ.
    + Hardwire the PCI locality of HPE Cray EX235a nodes.
  * Tools
    + lstopo and other tools may now load Linux and x86 cpuid topology files
      from a tarball.
    + lstopo may now replace the P# and L# index prefixes with custom strings
      thanks to --os-index-prefix and --logical-index-prefix options.
  * Misc
    + Add --disable-readme to avoid regenerating the top-level hwloc README
      file from the documentation.
buildservice-autocommit accepted request 970777 from Marcus Meissner's avatar Marcus Meissner (msmeissn) (revision 47)
baserev update by copy to link target
Marcus Meissner's avatar Marcus Meissner (msmeissn) accepted request 967881 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 46)
- update to 2.7.1:
  * Workaround crashes when virtual machines report incoherent x86 CPUID
    information about numbers of cores and threads.
    Thanks to Peter Bense for the report.
  * Use setenv() instead of putenv() when trying to force enable oneAPI L0
    support, to avoid issues with applications that touch the environment,
    thanks to Josh Hursey for the patch.
  * Add some warnings at the end of configure when GPU libraries are
    missing on the system or their path is missing in the environment.
  * Backends
    + Add support for NUMA nodes and caches with more than 64 PUs across
      multiple processor groups on Windows 11 and Windows Server 2022.
    + Group objects are not created for Windows processor groups anymore,
      except if HWLOC_WINDOWS_PROCESSOR_GROUP_OBJS=1 in the environment.
    + Expose "Cluster" group objects on Linux kernel 5.16+ for CPUs
      that share some internal cache or bus. This can be equivalent
      to the L2 Cache level on some platforms (e.g. x86) or a specific
      level between L2 and L3 on others (e.g. ARM Kungpeng 920).
      Thanks to Jonathan Cameron for the help.
      - HWLOC_DONT_MERGE_CLUSTER_GROUPS=1 may be set in the environment
	to prevent these groups from being merged with identical caches, etc.
    + Improve the oneAPI LevelZero backend:
      - Expose subdevices such as "ze0.1" inside root OS devices ("ze0")
	when the hardware contains multiple subdevices.
      - Add many new attributes to describe device type, and the
	numbers of slices, subslices, execution units and threads.
      - Expose the memory information as LevelZeroHBM/DDR/MemorySize infos.
    + Ignore the max frequencies of cores in Linux cpukinds when the
      base frequencies are available (to avoid exposing hybrid CPUs
      when Intel Turbo Boost Max 3.0 gives slightly different max
Thomas Blume's avatar Thomas Blume (tsaupe) accepted request 935865 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 45)
- update to 2.6.0:
  * Backends
    + Expose two cpukinds for energy-efficient cores (icestorm) and
      high-performance cores (firestorm) on Apple M1 on Mac OS X.
    + Use sysfs CPU "capacity" to rank hybrid cores by efficiency
      on Linux when available (mostly on recent ARM platforms for now).
    + Improve HWLOC_MEMBIND_BIND (without the STRICT flag) on Linux kernel
      >= 5.15: If more than one node is given, the kernel may now use all
      of them instead of only the first one before falling back to others.
    + Expose cache os_index when available on Linux, it may be needed
      when using resctrl to configure cache partitioning, memory bandwidth
      monitoring, etc.
    + Add a "XGMIHops" distances matrix in the RSMI backend for AMD GPU
      interconnected through XGMI links.
    + Expose AMD GPU memory information (VRAM and GTT) in the RSMI backend.
    + Add OS devices such as "bxi0" for Atos/Bull BXI HCAs on Linux.
  * Tools
    + lstopo has a better placement algorithm with respect to I/O
      objects, see --children-order in the manpage for details.
    + hwloc-annotate may now change object subtypes and cache or memory
      sizes.
  * Build
    + Allow to specify the ROCm installation for building the RSMI backend:
      - Use a custom installation path if specified with --with-rocm=<dir>.
      - Use /opt/rocm-<version> if specified with --with-rocm-version=<version>
	or the ROCM_VERSION environment variable.
      - Try /opt/rocm if it exists.
      - See "How do I enable ROCm SMI and select which version to use?"
	in the FAQ for details.
    + Add a CMakeLists for Windows under contrib/windows-cmake/ .
Thomas Blume's avatar Thomas Blume (tsaupe) accepted request 906822 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 44)
- update to 2.5.0: 
  + Add hwloc/windows.h to query Windows processor groups.
  + Add hwloc_get_obj_with_same_locality() to convert between objects
    with same locality, for instance NUMA nodes and Packages,
    or OS devices within a PCI device.
  + Add hwloc_distances_transform() to modify distances structures.
    - hwloc-annotate and lstopo have new distances-transform options.
  + hwloc_distances_add() is replaced with _add_create() followed by
    _add_values() and _add_commit(). See hwloc/distances.h for details.
  + Add topology flags to mitigate binding modifications during
    hwloc discovery, especially on Windows:
    - HWLOC_TOPOLOGY_FLAG_RESTRICT_TO_CPUBINDING and _MEMBINDING
      restrict discovery to PUs and NUMA nodes inside the binding.
    - HWLOC_TOPOLOGY_FLAG_DONT_CHANGE_BINDING prevents from ever
      changing the binding during discovery.
  + Add a levelzero backend for oneAPI L0 devices, exposed as OS devices
    of subtype "LevelZero" and name such as "ze0".
    - Add hwloc/levelzero.h for interoperability between converting
      between L0 API devices and hwloc cpusets or OS devices.
  + Expose NEC Vector Engine cards on Linux as OS devices of subtype
    "VectorEngine" and name "ve0", etc.
    Thanks to Anara Kozhokanova, Tim Cramer and Erich Focht for the help.
  + Add a NVLinkBandwidth distances structure between NVIDIA GPUs
    (and POWER processor or NVSwitches) in the NVML backend,
    and a XGMIBandwidth distances structure between AMD GPUs
    in the RSMI backends.
    - See "Topology Attributes: Distances, Memory Attributes and CPU Kinds"
      in the documentation for details about these new distances.
  + Add support for NUMA node 0 being offline in Linux, thanks to Jirka Hladky.
  + Add --with-cuda-version=<version> or look at the CUDA_VERSION
buildservice-autocommit accepted request 880105 from Thomas Blume's avatar Thomas Blume (tsaupe) (revision 43)
baserev update by copy to link target
Thomas Blume's avatar Thomas Blume (tsaupe) accepted request 879506 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 42)
- update to 2.4.1:
  * Fix AMD OpenCL device locality when PCI bus or device number >= 128.
    Thanks to Edgar Leon for reporting the issue.
    + Applications using any of the following inline functions must
      be recompiled to get the fix: hwloc_opencl_get_device_pci_busid()
      hwloc_opencl_get_device_cpuset(), hwloc_opencl_get_device_osdev().
  * Fix the ranking of cpukinds on non-Windows systems,
    thanks to Ivan Kochin for the report.
  * Fix the insertion of custom Groups after loading the topology,
    thanks to Scott Hicks.
  * Add support for CPU0 being offline in Linux, thanks to Garrett Clay.
  * Fix missing x86 Package and Core objects FreeBSD/NetBSD.
    Thanks to Thibault Payet and Yuri Victorovich for the report.
  * Fix the import of very large distances with heterogeneous object types.
  * Fix a memory leak in the Linux backend,
    thanks to Perceval Anichini.
buildservice-autocommit accepted request 866970 from Thomas Blume's avatar Thomas Blume (tsaupe) (revision 41)
baserev update by copy to link target
Thomas Blume's avatar Thomas Blume (tsaupe) accepted request 866383 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 40)
- update to 2.4.0:
  + Add hwloc/cpukinds.h for reporting information about hybrid CPUs.
    - Use Linux cpufreq frequencies to rank cores by efficiency.
    - Use x86 CPUID hybrid leaf and future Linux kernels sysfs CPU type
      files to identify Intel Atom and Core cores.
    - Use the Windows native EfficiencyClass to separate kinds.
  + Properly handle Linux kernel 5.10+ exposing ACPI HMAT information
    with knowledge of Generic Initiators.
  + lstopo has new --cpukinds and --no-cpukinds options for showing
    CPU kinds or not in textual and graphical modes respectively.
  + hwloc-calc has a new --cpukind option for filtering PUs by kind.
  + hwloc-annotate has a new cpukind command for modifying CPU kinds.
  + Fix hwloc_bitmap_nr_ulongs(), thanks to Norbert Eicker.
  + Add a documentation section about
    "Topology Attributes: Distances, Memory Attributes and CPU Kinds".
  + Silence some spurious warnings in the OpenCL backend and when showing
    process binding with lstopo --ps.
  + Add hwloc/memattrs.h for exposing latency/bandwidth information
    between initiators (CPU sets for now) and target NUMA nodes,
    typically on heterogeneous platforms.
    - When available, bandwidths and latencies are read from the ACPI HMAT
      table exposed by Linux kernel 5.2+.
    - Attributes may also be customized to expose user-defined performance
      information.
  + Add hwloc_get_local_numanode_objs() for listing NUMA nodes that are
    local to some locality.
  + The new topology flag HWLOC_TOPOLOGY_FLAG_IMPORT_SUPPORT causes
    support arrays to be loaded from XML exported with hwloc 2.3+.
    - hwloc_topology_get_support() now returns an additional "misc"
      array with feature "imported_support" set when support was imported.
buildservice-autocommit accepted request 850238 from Adam Majer's avatar Adam Majer (adamm) (revision 39)
baserev update by copy to link target
Adam Majer's avatar Adam Majer (adamm) accepted request 850138 from Thomas Blume's avatar Thomas Blume (tsaupe) (revision 38)
move hwloc manpage to main package (bsc#1178802)
buildservice-autocommit accepted request 828324 from Dirk Mueller's avatar Dirk Mueller (dirkmueller) (revision 37)
baserev update by copy to link target
Dirk Mueller's avatar Dirk Mueller (dirkmueller) committed (revision 36)
Dirk Mueller's avatar Dirk Mueller (dirkmueller) committed (revision 35)
Dirk Mueller's avatar Dirk Mueller (dirkmueller) committed (revision 34)
- update to 2.2.0:
  * API
    + Add hwloc_bitmap_singlify_by_core() to remove SMT from a given cpuset,
      thanks to Florian Reynier for the suggestion.
    + Add --enable-32bits-pci-domain to stop ignoring PCI devices with domain
      >16bits (e.g. 10000:02:03.4). Enabling this option breaks the library ABI.
      Thanks to Dylan Simon for the help.
  * Backends
    + Add support for Linux cgroups v2.
    + Add NUMA support for FreeBSD.
    + Add get_last_cpu_location support for FreeBSD.
    + Remove support for Intel Xeon Phi (MIC, Knights Corner) co-processors.
  * Tools
    + Add --uid to filter the hwloc-ps output by uid on Linux.
    + Add a GRAPHICAL OUTPUT section in the manpage of lstopo.
  * Misc
    + Use the native dlopen instead of libltdl,
      unless --disable-plugin-dlopen is passed at configure time. 
- install systemd files using systemd macros and register it on install
  with systemd
- build against libnuma on all architectures
buildservice-autocommit accepted request 744097 from Adam Majer's avatar Adam Majer (adamm) (revision 33)
baserev update by copy to link target
Adam Majer's avatar Adam Majer (adamm) accepted request 738580 from Thomas Blume's avatar Thomas Blume (tsaupe) (revision 32)
update to version 2.1 (jsc#SLE-8583)
Displaying revisions 21 - 40 of 71
openSUSE Build Service is sponsored by