948 Commits

Author SHA1 Message Date
9af44779aa Add error handling in tagger initialization 2026-02-17 08:13:50 +01:00
Aditya Ujeniya
2eeefc2720 Add healthCheck support for external CCMS 2026-02-16 16:57:17 +01:00
25ff094bdf Fix log endpoint 2026-02-16 09:10:08 +01:00
98f9c18f72 Merge branch 'dev' into log-aggregator 2026-02-15 19:53:34 +01:00
2c8608f5a4 Update job archive retention to uniform policy with json and parquet target format 2026-02-13 12:19:31 +01:00
c15f1117f5 Review and improve node repo queries 2026-02-12 15:45:15 +01:00
f016bd4232 Extend node repository unit tests 2026-02-12 09:30:14 +01:00
54ea5d7900 Add nodestate retention and archiving 2026-02-12 09:21:44 +01:00
865cd3db54 Prersist faulty nodestate metric lists to db 2026-02-12 08:48:15 +01:00
90c8fbf07c Merge branch 'dev' of github.com:ClusterCockpit/cc-backend into dev 2026-02-12 07:34:26 +01:00
f4ee0d1042 Update cc-lib and extend nodestate sql schema 2026-02-12 07:34:24 +01:00
Christoph Kluge
12e9f6700e fix nodeList resolver data handling, increase nodestate filter cutoff
- add comment on cutoff
2026-02-11 16:16:09 +01:00
bca7dd743b Merge branch 'dev' of github.com:ClusterCockpit/cc-backend into dev 2026-02-11 07:06:33 +01:00
8d6c6b819b Update and port to cc-lib 2026-02-11 07:06:06 +01:00
Christoph Kluge
5e4994a64c revert external config supply for nodeList component 2026-02-10 16:46:18 +01:00
0dff9fa07f Update docs and agent files 2026-02-10 09:17:34 +01:00
1feb3baf68 Create copy of test db before unit tests 2026-02-10 07:53:30 +01:00
ac7eb93141 fix: Transfer always to main job table before archiving 2026-02-09 19:57:46 +01:00
1980ef5f43 Renaming due to linter complaints 2026-02-09 09:17:01 +01:00
fd9b76c6a7 Security hardening of ldap and oicd auth implementations 2026-02-09 09:12:06 +01:00
2b395a94e6 Fix setup issue with chi router 2026-02-07 18:02:48 +01:00
f6aa40d927 Migrate from gorilla to chi web framework. add 404 handler 2026-02-07 17:48:12 +01:00
c920c57f5d Add parquet file job archiving target 2026-02-07 10:51:56 +01:00
363e839c49 Add simple log viewer in web frontend 2026-02-07 07:05:33 +01:00
6294f8e263 Review and improve detectApp implementation 2026-02-06 14:53:05 +01:00
0adf2bad92 Add info log about applied tag 2026-02-06 14:50:13 +01:00
a85f72fccd Change log level to debug for nodestate API endpoint 2026-02-06 14:30:04 +01:00
db8772dc0b Merge branch 'dev' of github.com:ClusterCockpit/cc-backend into dev 2026-02-06 14:06:59 +01:00
fa7727c6ca Print job db id instead of its address 2026-02-06 14:06:56 +01:00
Christoph Kluge
5655639320 add subCluster jobFilter for statusDetail queries 2026-02-06 11:10:08 +01:00
Christoph Kluge
276559d120 revert endpoint change 2026-02-05 15:25:49 +01:00
Christoph Kluge
e3148b16eb add timers to updateNodeStates 2026-02-05 15:24:11 +01:00
Christoph Kluge
4d13c37008 remove non-required return in updatenodehandler 2026-02-05 15:18:01 +01:00
Christoph Kluge
a7a95bb866 add shortjobs and resource sums to project and user lists 2026-02-04 13:01:30 +01:00
Aditya Ujeniya
39b8356683 Optimized CCMS healthcheck 2026-02-04 10:24:45 +01:00
0d62a300e7 Intermediate state of node Healthcheck
TODOS:
* Remove error handling from routine and simplify API call
* Use map for hardware level metrics
2026-02-03 18:35:17 +01:00
248f11f4f8 Change API of Node HealthState 2026-02-03 14:55:12 +01:00
00a41373e8 Add monitoring healthstate support in nodestate API. 2026-02-03 12:23:24 +01:00
Christoph Kluge
e9cd6b4225 set updateNodeStates timeStamp once per request
-prevents per-host timestamp mismatches due to handler iteration duration
2026-02-02 17:51:41 +01:00
Christoph Kluge
13cca1ee62 change log msg on clusterMetrics 2026-02-02 14:45:19 +01:00
Christoph Kluge
7b4e2fcf59 add isNan to clusterMetric aggregation 2026-02-02 14:34:49 +01:00
Aditya Ujeniya
a71341064e Update to MetricStore HealthCheck API 2026-01-30 23:24:16 +01:00
Christoph Kluge
1ffcc5e241 apply ccUnit for clusterMetrics normalitazion 2026-01-30 15:52:59 +01:00
Christoph Kluge
e8c81ba7d4 various small dashboard fixes
- piechart color, idle count cap, metricHistoMaximum increased
2026-01-29 17:46:01 +01:00
Christoph Kluge
f26cabbdf1 Streamline missing data warnings, review logging 2026-01-29 15:17:33 +01:00
69739ffdfd Merge branch 'dev' of github.com:ClusterCockpit/cc-backend into dev 2026-01-28 07:05:32 +01:00
95689e3c99 Add API endpoint for getUsedNodes
Needed by dynamic memory management for external ccms
2026-01-28 07:05:29 +01:00
Christoph Kluge
9d9babe94d review clusterMetrics aggregation handling, fixes index error 2026-01-27 19:04:29 +01:00
28a3ff8d67 Upgrade cc-lib. Change nodestate log levels 2026-01-27 14:26:41 +01:00
Christoph Kluge
2173d3527d add rounding to statsTable data 2026-01-27 13:31:16 +01:00