* Add cpu_used (all-cpu_idle) to CpustatCollector
* Update to line-protocol/v2
* Update runonce.yml with Golang 1.20
* Update fsnotify in LIKWID Collector
* Use not a pointer to line-protocol.Encoder
* Simplify Makefile
* Use only as many arguments as required
* Allow sum function to handle non float types
* Allow values to be a slice of type float64, float32, int, int64, int32, bool
* Use generic function to simplify code
* Add missing case for type []int32
* Use generic function to compute minimum
* Use generic function to compute maximum
* Use generic function to compute average
* Add error value to sumAnyType
* Use generic function to compute median
* For older versions of go slices is not part of the installation
* Remove old entries from go.sum
* Use simpler sort function
* Compute metrics ib_total and ib_total_pkts
* Add aggregated metrics.
Add missing units
* Update likwidMetric.go
Fixes a potential bug when `fsnotify.NewWatcher()` fails with an error
* Completly avoid memory allocations in infinibandMetric read()
* Fixed initialization: Initalization and measurements should run in the same thread
---------
Co-authored-by: Holger Obermaier <40787752+ho-ob@users.noreply.github.com>
* Cleanup: Remove unused code
* Use Golang duration parser for 'interval' and 'duration'
in main config
* Update handling of LIKWID headers. Download only if not already present in the system. Fixes#73
* Units with cc-units (#64)
* Add option to normalize units with cc-unit
* Add unit conversion to router
* Add option to change unit prefix in the router
* Add to MetricRouter README
* Add order of operations in router to README
* Use second add_tags/del_tags only if metric gets renamed
* Skip disks in DiskstatCollector that have size=0
* Check readability of sensor files in TempCollector
* Fix for --once option
* Rename `cpu` type to `hwthread` (#69)
* Rename 'cpu' type to 'hwthread' to avoid naming clashes with MetricStore and CC-Webfrontend
* Collectors in parallel (#74)
* Provide info to CollectorManager whether the collector can be executed in parallel with others
* Split serial and parallel collectors. Read in parallel first
* Update NvidiaCollector with new metrics, MIG and NvLink support (#75)
* CC topology module update (#76)
* Rename CPU to hardware thread, write some comments
* Do renaming in other parts
* Remove CpuList and SocketList function from metricCollector. Available in ccTopology
* Option to use MIG UUID as subtype-id in NvidiaCollector
* Option to use MIG slice name as subtype-id in NvidiaCollector
* MetricRouter: Fix JSON in README
* Fix for Github Action to really use the selected version
* Remove Ganglia installation in runonce Action and add Go 1.18
* Fix daemon options in init script
* Add separate go.mod files to use it with deprecated 1.16
* Minor updates for Makefiles
* fix string comparison
* AMD ROCm SMI collector (#77)
* Add collector for AMD ROCm SMI metrics
* Fix import path
* Fix imports
* Remove Board Number
* store GPU index explicitly
* Remove board number from description
* Use http instead of ftp to download likwid
* Fix serial number in rocmCollector
* Improved http sink (#78)
* automatic flush in NatsSink
* tweak default options of HttpSink
* shorter cirt. section and retries for HttpSink
* fix error handling
* Remove file added by mistake.
* Use http instead of ftp to download likwid
* Fix serial number in rocmCollector
Co-authored-by: Thomas Roehl <thomas.roehl@fau.de>
Co-authored-by: Holger Obermaier <40787752+ho-ob@users.noreply.github.com>
Co-authored-by: Lou <lou.knauer@gmx.de>
* Update configuration.md
Add an additional receiver to have better alignment of components
* Change default GpfsCollector command to `mmpmon` (#53)
* Set default cmd to 'mmpmon'
* Reuse looked up path
* Cast const to string
* Just download LIKWID to get the headers (#54)
* Just download LIKWID to get the headers
* Remove perl-Data-Dumper from BuildRequires, only required by LIKWID build
* Add HttpReceiver as counterpart to the HttpSink (#49)
* Use GBytes as unit for large memory numbers
* Make maxForward configurable, save old name in meta in rename metrics and make the hostname tag key configurable
* Single release action (#55)
Building all RPMs and releasing in a single workflow
* Makefile target to build binary-only Debian packages (#61)
* Add 'install' and 'DEB' make targets to build binary-only Debian packages
* Add control file for DEB builds
* Use a single line for bash loop in make clean
* Add config options for retry intervals of InfluxDB clients (#59)
* Refactoring of LikwidCollector and metric units (#62)
* Reduce complexity of LikwidCollector and allow metric units
* Add unit to LikwidCollector docu and fix some typos
* Make library path configurable
* Use old metric name in Ganglia if rename has happened in the router (#60)
* Use old metric name if rename has happened in the router
* Also check for Ganglia renames for the oldname
* Derived metrics (#57)
* Add time-based derivatived (e.g. bandwidth) to some collectors
* Add documentation
* Add comments
* Fix: Only compute rates with a valid previous state
* Only compute rates with a valid previous state
* Define const values for net/dev fields
* Set default config values
* Add comments
* Refactor: Consolidate data structures
* Refactor: Consolidate data structures
* Refactor: Avoid struct deep copy
* Refactor: Avoid redundant tag maps
* Refactor: Use int64 type for absolut values
Co-authored-by: Holger Obermaier <40787752+ho-ob@users.noreply.github.com>
* Simplified iota usage
* Move unit tag to meta data tags
* Derived metrics (#65)
* Add time-based derivatived (e.g. bandwidth) to some collectors
* Add documentation
* Add comments
* Fix: Only compute rates with a valid previous state
* Only compute rates with a valid previous state
* Define const values for net/dev fields
* Set default config values
* Add comments
* Refactor: Consolidate data structures
* Refactor: Consolidate data structures
* Refactor: Avoid struct deep copy
* Refactor: Avoid redundant tag maps
* Refactor: Use int64 type for absolut values
* Update LustreCollector
Co-authored-by: Holger Obermaier <40787752+ho-ob@users.noreply.github.com>
* Meta to tags list and map for sinks (#63)
* Change ccMetric->Influx functions
* Use a meta_as_tags string list in config but create a lookup map afterwards
* Add meta as tag logic to sampleSink
* Fix staticcheck warnings (#66)
Co-authored-by: Holger Obermaier <40787752+ho-ob@users.noreply.github.com>
* Add sink directly using libganglia.so
* Remove unneeded confuse header
* add submodule init to build action
* add submodule init to runonce action
* add installation og ganglia to runonce
* add installation of ganglia to runonce
* add installation of ganglia to runonce
* libconfuse not required
* Remove ganglia submodule
* Remove ganglia.h
* Add Makefile to help creating the libganglia.so link
* Fix cgo header
* Rename new Ganglia sink to 'libgangliaSink'
* Add documentation for libgangliaSink
* Extend make buildsystem with find&symlink helper for libgangliaSink
* Add metric renaming function
* Add build tag 'ganglia' and create corresponding files
* Use sink-specific configurations to have more flexibility. Adjust sample sink configuration files
* Add documentation
* Add links to individual sink readmes
* Fix link in README
* HTTPS for HttpSink
* If no CPU die id available, use the socket id instead
* Fix config for Github Actions
* Fix paths
* Add CentOS Latest and AlmaLinux 8.5 to RPM action
* Fix ID
* Reduce min Go version to 1.16 and use time.Unix in gpfsMetric