Commit Graph

2732 Commits

Author SHA1 Message Date
Jan Eitzinger
d4ee937115 Merge pull request #531 from ClusterCockpit/hotfix
Fix goreleaser config. Cleanup.
2026-03-20 09:25:04 +01:00
999d93efc3 Fix goreleaser config. Cleanup. 2026-03-20 09:19:13 +01:00
Jan Eitzinger
4ce0cfb686 Merge pull request #530 from ClusterCockpit/hotfix
Hotfix
2026-03-20 08:41:27 +01:00
359962d166 Fix typo 2026-03-20 08:23:46 +01:00
60554896d5 Update ReleaseNote for upcoming release
Entire-Checkpoint: 30099a746fc7
2026-03-20 08:21:16 +01:00
Jan Eitzinger
a9f335d910 Merge pull request #529 from ClusterCockpit/hotfix
Hotfix
2026-03-20 05:50:18 +01:00
bf48389aeb Optimize sortby in stats queries
Entire-Checkpoint: 9b5b833472e1
2026-03-20 05:39:22 +01:00
Christoph Kluge
10b4fa5a06 change: remove heuristic metricHealth, replace with DB metricHealth
- add metricHealth to single Node view
2026-03-19 15:55:58 +01:00
Christoph Kluge
886791cf8a remove deprecated minRunningFor filter remnants 2026-03-19 14:09:10 +01:00
Christoph Kluge
6cad2ee1f0 bump frontend dependencies, increase version to match release 2026-03-19 13:56:46 +01:00
Christoph Kluge
16ec1e69d9 streamline and unify statsSeries calc and render 2026-03-19 13:30:38 +01:00
Jan Eitzinger
676025adfe Merge pull request #528 from ClusterCockpit/hotfix
further clarify plot titles
2026-03-19 11:44:12 +01:00
Christoph Kluge
c42898bd99 fix: add top list query fixes to analysis and dashboard 2026-03-19 11:31:40 +01:00
22057ff281 Pass reqKey as CacheKey
Entire-Checkpoint: b95ef43221bb
2026-03-19 11:04:32 +01:00
Christoph Kluge
30b8ca4a1a further clarify plot titles 2026-03-19 10:45:55 +01:00
Jan Eitzinger
d4a0ae173f Merge pull request #525 from ClusterCockpit/hotfix
Hotfix
2026-03-18 19:31:05 +01:00
09501df3c2 fix: reduce memory usage in parquet checkpoint archiver
Stream CheckpointFile trees directly to parquet rows instead of
materializing all rows in a giant intermediate slice. This eliminates
~1.9GB per host of redundant allocations (repeated string headers)
and removes the expensive sort on millions of 104-byte structs.

Key changes:
- Replace flattenCheckpointFile + sortParquetRows + WriteHostRows with
  streaming WriteCheckpointFile that walks the tree with sorted keys
- Reduce results channel buffer from len(hostEntries) to 2 for
  back-pressure (at most NumWorkers+2 results in flight)
- Workers send CheckpointFile trees instead of []ParquetMetricRow
- Write rows in small 1024-element batches via reusable buffer

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Entire-Checkpoint: f31dc1847539
2026-03-18 17:32:16 +01:00
Christoph Kluge
bb6915771d fix: clarify title 2026-03-18 13:23:33 +01:00
8b0881fb17 Exclude down nodes from HealthCheck
Entire-Checkpoint: 0c3347168c79
2026-03-18 11:20:12 +01:00
Christoph Kluge
33beb3c806 fix: simplify stats query condition
- caused expensive subquery without need in frontend
2026-03-18 11:07:57 +01:00
c1d51959d5 Change dtermineState to enforce priority order
Make exception if node is idle + down, then final state is idle

Entire-Checkpoint: 92c797737df8
2026-03-18 10:57:06 +01:00
3328d2ca11 Update go version in CLAUDE.md 2026-03-18 10:37:32 +01:00
8f10eba771 Extend CLAUDE.md
Entire-Checkpoint: 17cdf997acff
2026-03-18 10:05:09 +01:00
c449996559 Add context to log message
Entire-Checkpoint: 55d95cdef0d4
2026-03-18 09:43:41 +01:00
Jan Eitzinger
a7e5ecaf6c Merge pull request #524 from ClusterCockpit/hotfix
Remove tracked .entire/metadata/ files from git
2026-03-18 07:15:36 +01:00
51ae2a5d10 Remove tracked .entire/metadata/ files from git
These conversation transcript files were committed before the gitignore
rule existed. They are now properly ignored via .entire/.gitignore.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-18 07:10:01 +01:00
Jan Eitzinger
965e2007fb Merge pull request #523 from ClusterCockpit/hotfix
Hotfix
2026-03-18 07:04:12 +01:00
6ebc9e88fa Add more context information to auth failed log
Entire-Checkpoint: 2187cd89cb78
2026-03-18 06:56:01 +01:00
8b132ed7f8 fix: Blocking ReceiveNats call
Entire-Checkpoint: 38a235c86ceb
2026-03-18 06:47:45 +01:00
bf1a8a174e fix: Shard WAL consumer for higher throughput
Entire-Checkpoint: e583b7b11439
2026-03-18 06:32:14 +01:00
50aed595cf fix: metricstore NATS contention
Entire-Checkpoint: 7e68050cab59
2026-03-18 06:14:15 +01:00
33bc19c732 Upgrade cc-lib 2026-03-18 05:52:58 +01:00
045f81f985 Prepare release v1.5.2
Entire-Checkpoint: 9286f4c43ab5
2026-03-18 05:31:49 +01:00
d46e6371fc Add log about checkpoint archiving
Entire-Checkpoint: bf29af79b268
2026-03-18 05:22:39 +01:00
02f82c2c0b fix: Prevent memory spikes in parquet writer for metricstore move policy
Entire-Checkpoint: 4a675b8352a2
2026-03-18 05:08:37 +01:00
Jan Eitzinger
6a29faf460 Merge pull request #521 from ClusterCockpit/hotfix
Hotfix
2026-03-17 09:23:59 +01:00
3314b8e284 Ignore ErrNoRows error. Include calling function in log.
Entire-Checkpoint: 20746187d135
2026-03-16 20:09:44 +01:00
6855d62bf2 Make log in scanRow more descriptive. No log for common no rows error
Entire-Checkpoint: 858b34ef56b8
2026-03-16 20:03:27 +01:00
7f3eb443d9 Include calling function in error message
Entire-Checkpoint: a4948d0fe7a3
2026-03-16 15:42:38 +01:00
bab6eb4c3a Convert Warn message on missing metrics to Debug level 2026-03-16 15:35:24 +01:00
09d0ba71d2 Provide idential nodestate functionality in NATS API
Entire-Checkpoint: 3a40b75edd68
2026-03-16 12:13:14 +01:00
df93dbed63 Add busyTimeout config setting
Entire-Checkpoint: 81097a6c52a2
2026-03-16 11:30:21 +01:00
e4f3fa9ba0 Wrap SyncJobs in transaction
Entire-Checkpoint: d4f6c79a8dc1
2026-03-16 11:25:49 +01:00
51517f8031 Reduce insert pressure in db. Increase sqlite timeout value
Entire-Checkpoint: a1e2931d4deb
2026-03-16 11:17:47 +01:00
0aad8f01c8 Upgrade cc-lib
Fixes panic in AddNodeScope

Entire-Checkpoint: afef27e07ec9
2026-03-16 08:55:56 +01:00
Jan Eitzinger
8751ae023d Merge pull request #520 from ClusterCockpit/hotfix
Extend known issues in ReleaseNotes
2026-03-15 07:22:20 +01:00
973ca87bd1 Extend known issues in ReleaseNotes 2026-03-15 07:02:54 +01:00
Jan Eitzinger
128c098865 Merge pull request #519 from ClusterCockpit/hotfix
Hotfix
v1.5.1
2026-03-13 17:39:04 +01:00
045311eec0 Prepare release 1.5.1
Entire-Checkpoint: baed7fbee099
2026-03-13 17:30:03 +01:00
e38396a081 Upgrade dependencies. Rebuild GraphQL.
Entire-Checkpoint: f770853c9fa0
2026-03-13 17:22:34 +01:00