Holger Obermaier
580d21d8bb
Fix: When sending metrics failed the batch size could be exceeded
2022-06-20 18:06:27 +02:00
Lou
b732b2d739
Improved http sink ( #78 )
* automatic flush in NatsSink
* tweak default options of HttpSink
* shorter critical section and retries for HttpSink
* fix error handling
* Remove file added by mistake.
* Use http instead of ftp to download likwid
* Fix serial number in rocmCollector
Co-authored-by: Thomas Roehl <thomas.roehl@fau.de>
2022-06-08 14:12:35 +02:00
Thomas Roehl
bef807dd44
Fix serial number in rocmCollector
2022-06-05 15:53:39 +02:00
Thomas Roehl
659d0115c0
Use http instead of ftp to download likwid
2022-06-05 15:50:04 +02:00
Thomas Gruber
e13695307f
AMD ROCm SMI collector ( #77 )
* Add collector for AMD ROCm SMI metrics
* Fix import path
* Fix imports
* Remove Board Number
* store GPU index explicitly
* Remove board number from description
2022-05-25 15:55:43 +02:00
Thomas Roehl
4ed07cad77
fix string comparison
2022-05-25 15:48:55 +02:00
Thomas Roehl
ad5dbd85ea
Minor updates for Makefiles
2022-05-25 15:45:21 +02:00
Thomas Roehl
132ebabd45
Add separate go.mod files to use it with deprecated 1.16
2022-05-25 15:35:11 +02:00
Thomas Roehl
f8d91d9cf1
Fix daemon options in init script
2022-05-25 15:16:01 +02:00
Thomas Roehl
cc84a94647
Remove Ganglia installation in runonce Action and add Go 1.18
2022-05-23 17:37:14 +02:00
Thomas Roehl
838b8d824d
Fix for Github Action to really use the selected version
2022-05-23 16:50:58 +02:00
Thomas Roehl
7ddc889f06
MetricRouter: Fix JSON in README
2022-05-20 16:06:54 +02:00
Thomas Roehl
500685672b
Option to use MIG slice name as subtype-id in NvidiaCollector
2022-05-13 15:26:47 +02:00
Thomas Roehl
d4c89a4206
Option to use MIG UUID as subtype-id in NvidiaCollector
2022-05-13 14:34:32 +02:00
Thomas Gruber
826f364772
CC topology module update ( #76 )
* Rename CPU to hardware thread, write some comments
* Do renaming in other parts
* Remove CpuList and SocketList function from metricCollector. Available in ccTopology
2022-05-13 14:28:07 +02:00
Thomas Gruber
5df550b208
Update NvidiaCollector with new metrics, MIG and NvLink support ( #75 )
2022-05-13 14:11:55 +02:00
Thomas Gruber
5c34805918
Collectors in parallel ( #74 )
* Provide info to CollectorManager whether the collector can be executed in parallel with others
* Split serial and parallel collectors. Read in parallel first
2022-05-13 14:10:39 +02:00
Thomas Gruber
1db5f3b29a
Rename cpu type to hwthread ( #69 )
* Rename 'cpu' type to 'hwthread' to avoid naming clashes with MetricStore and CC-Webfrontend
2022-05-13 14:09:45 +02:00
Thomas Roehl
0623691bab
Fix for --once option
2022-05-13 13:50:19 +02:00
Thomas Roehl
9886f14d14
Check readability of sensor files in TempCollector
2022-05-13 13:32:54 +02:00
Thomas Roehl
857903be2b
Skip disks in DiskstatCollector that have size=0
2022-05-13 13:31:22 +02:00
Thomas Gruber
80d92d6d28
Units with cc-units ( #64 )
* Add option to normalize units with cc-unit
* Add unit conversion to router
* Add option to change unit prefix in the router
* Add to MetricRouter README
* Add order of operations in router to README
* Use second add_tags/del_tags only if metric gets renamed
2022-05-13 13:30:02 +02:00
Thomas Roehl
8068e59818
Update handling of LIKWID headers. Download only if not already present in the system. Fixes #73
2022-05-13 13:14:47 +02:00
Thomas Roehl
8abedac0fe
Use Golang duration parser for 'interval' and 'duration' in main config
2022-05-13 12:33:33 +02:00
Holger Obermaier
ee4bd558f1
Cleanup: Remove unused code
2022-05-06 11:44:57 +02:00
Holger Obermaier
e098c33179
Add some golang debug options
2022-05-04 12:48:46 +02:00
Thomas Roehl
38d4e0a730
Merge branch 'develop' of github.com:ClusterCockpit/cc-metric-collector into develop
2022-05-04 11:54:55 +02:00
Thomas Roehl
54d14519ca
Skip mount points in DiskstatCollector if statfs() call does not work (bind mounts, ...)
2022-05-04 11:54:34 +02:00
Holger Obermaier
c35ac9dba8
Flush if batch size is reached
2022-05-04 11:28:06 +02:00
Holger Obermaier
c019f8e7ad
Reuse tags and meta data tags
2022-05-03 17:55:33 +02:00
Holger Obermaier
fb6f6a4daa
Fix GPFS collector last state handling
2022-05-02 16:57:19 +02:00
Holger Obermaier
9d6d0dbd93
Delete empty tags and meta data tags
2022-04-20 14:39:26 +02:00
Holger Obermaier
c2d4272fdf
Clear workerInput channel after done event
2022-04-20 12:36:45 +02:00
Holger Obermaier
8c73095548
Allow shutting down the redfish receiver during metric read
2022-04-20 09:58:02 +02:00
Holger Obermaier
31c5c89a5a
Fix: Close done channel
2022-04-19 14:01:23 +02:00
Holger Obermaier
bf9c7e1830
Update requirements
2022-04-19 12:15:51 +02:00
Holger Obermaier
48d34bf564
Adapt sinks.json for new meta_as_tags usage
2022-04-19 12:06:53 +02:00
Holger Obermaier
a1d85fa886
Add redfish receiver
2022-04-19 12:05:03 +02:00
Holger Obermaier
96ee16398e
Removed unused done channel and wg wait group
2022-04-19 11:53:11 +02:00
Holger Obermaier
e7b8088c41
Extended go routine use case in sample receiver
2022-04-19 11:42:46 +02:00
Thomas Roehl
017cd58247
Updating page for LikwidCollector
2022-04-05 10:57:09 +02:00
Thomas Roehl
7b098e0b1b
Fix for missing metrics in LikwidCollector if hwthread is inactive
2022-04-04 15:16:11 +02:00
Thomas Roehl
70a9530aba
Set WriteFailedCallback to get some error message
2022-04-04 11:48:54 +02:00
Thomas Roehl
69f7c19659
InfluxAsyncSink: Add custom flush mechanism
2022-04-04 02:56:23 +02:00
Thomas Roehl
ecdb4c1bcf
Add debug message when updating interval_timestamp
2022-04-04 02:55:44 +02:00
Thomas Roehl
4d5b1adbc8
Fix for interval_timestamp option
2022-04-04 02:26:04 +02:00
Thomas Roehl
28348bd108
InfluxSink: Use batch&flush logic from HttpSink
2022-04-01 18:37:45 +02:00
Thomas Roehl
a3b9d8a90b
HttpSink: Use sink name in error outputs
2022-04-01 18:36:54 +02:00
Thomas Roehl
7e43e9171e
Use default options. Overwrite if anything is configured differently. Use seconds as precision
2022-04-01 17:26:56 +02:00
Thomas Roehl
5d25a7bf12
Add units to InfiniBandCollector
2022-04-01 17:14:26 +02:00