Commit Graph

2819 Commits

Author SHA1 Message Date
83d04dff17 feat(auth): replace .env/godotenv secret handling with config-based secrets
Secrets (JWT keys, LDAP sync password, OIDC client id/secret, cross-login
keys) are now configured directly in config.json under the auth section
where they are used. Each secret can still be supplied via its existing
environment variable, which takes precedence over the config value.

The godotenv dependency, the .env file, configs/env-template.txt and the
loadEnvironment() bootstrap step are removed. -init now writes the demo
JWT keys into config.json instead of a .env file.

Closes #283

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Entire-Checkpoint: 3a7cb814c53f
2026-06-17 12:28:17 +02:00
Jan Eitzinger
07b9a57479 Merge pull request #559 from ClusterCockpit/feature/558-replace-gorilla-sessions
Feature/558 replace gorilla sessions
2026-06-17 10:05:07 +02:00
b7f597bb7d Update entire config 2026-06-17 07:58:29 +02:00
2b01b57495 feat: replace gorilla/sessions with alexedwards/scs/v2
Browser sessions are now server-side, stored in the SQLite database via
scs/sqlite3store (new `sessions` table, DB migration to version 12) instead
of gorilla/sessions client-side cookie storage. Only an opaque random token
is kept in the cookie; session data lives server-side and survives restarts.

Session middleware is wired as a hybrid to avoid buffering large responses:
scs.LoadAndSave on the login/logout write paths, and a non-buffering
read-only LoadSession middleware on the secured/config/frontend read paths
so the large GraphQL /query responses stream unbuffered. JWT-only APIs
(/api, /userapi, /api/metricstore) and static files are left unwrapped.

The session cookie Secure flag is now derived from the server config (set
when cc-backend terminates TLS itself); previously it was effectively never
set. The SESSION_KEY env var is removed as server-side tokens need no
signing secret. The dormant Bearer-JWT branch in the frontend urql client
is removed; the web UI authenticates GraphQL via the session cookie.

Closes #558

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Entire-Checkpoint: b51075f43cc7
2026-06-17 07:54:26 +02:00
Jan Eitzinger
3bfd3d06ca Merge pull request #557 from ClusterCockpit/release/v1.5
Release/v1.5
2026-06-07 08:21:14 +02:00
9c6075ebb5 Update README to reflect main branch naming 2026-06-07 08:18:46 +02:00
af7528c8b2 Update CLAUDE.md
Entire-Checkpoint: 306db138cb4c
2026-06-07 08:16:10 +02:00
01fb4d53f1 Fix broken link in README 2026-06-07 08:12:25 +02:00
Jan Eitzinger
c94f5918f3 Merge pull request #556 from ClusterCockpit/release/v1.5
Fix critical/severe issues in init, startup and shutdown
v1.5.4
2026-06-07 07:31:02 +02:00
1b72b0b5ad Fix critical/severe issues in init, startup and shutdown
- auth: do not abort the server when authentication is disabled. auth.Init
  is now always called; with disable-authentication it sets up an ephemeral
  session store (SESSION_KEY not required) and registers no authenticators,
  so the unconditional auth.GetAuthInstance() callers (server init,
  api.New()) always get a valid instance.
- main: run the graceful-shutdown sequence on the startup-error path. runServer
  derives a cancelable context and, on a server-start failure, cancels it and
  waits so the metricstore final checkpoint / WAL rotation, archiver flush and
  taskmanager shutdown actually run before exit.
- server: log the :80 HTTP->HTTPS redirect listener error instead of dropping it.
- archiver: guard Shutdown against being called when Start never ran
  (avoids close(nil) panic / blocking on a nil workerDone).
- nats API: stop worker goroutines on shutdown via a stop channel + idempotent
  Shutdown(); workers and subscription callbacks select on stop and the
  channels are never closed, so no send-on-closed-channel can occur. Wired
  into Server.Shutdown after the NATS client is closed.
- metricstore: make Shutdown idempotent (nil shutdownFunc, early return) and
  release shutdownFuncMu before the checkpoint write.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Entire-Checkpoint: 3c179f9caa8f
2026-06-05 10:16:28 +02:00
Jan Eitzinger
dd8c0e1b9f Merge pull request #555 from ClusterCockpit/release/v1.5
Update ReleaseNotes
2026-06-05 09:05:17 +02:00
56ae1de011 Update ReleaseNotes
Entire-Checkpoint: 051ff3c0d253
2026-06-05 09:01:47 +02:00
Jan Eitzinger
879f730876 Merge pull request #554 from ClusterCockpit/release/v1.5
Fix bug that prevents WAL cleanup on shutdown
2026-06-05 08:42:38 +02:00
8733497223 Fix bug that prevents WAL cleanup on shutdown
Entire-Checkpoint: 5b79628feea4
2026-06-05 08:39:49 +02:00
Jan Eitzinger
d74ae77c8e Merge pull request #553 from ClusterCockpit/release/v1.5
Release/v1.5
2026-06-04 20:33:22 +02:00
6cfa511348 Update release notes
Entire-Checkpoint: 14328c112325
2026-06-04 20:19:09 +02:00
3bef199cbe Regenerate GraphQL 2026-06-04 20:12:27 +02:00
16942f55a0 Fix medium-severity issues from follow-up security audit
Addresses the remaining medium findings from the second-pass audit:

- DoS hardening: bound GraphQL query cost with FixedComplexityLimit, and
  reject non-positive items-per-page / page values so uint64 conversion
  cannot underflow into an unbounded LIMIT/OFFSET. The -1 "load all"
  sentinel stays valid for dashboards; REST now returns 400 for bad input.

- Security headers: add X-Content-Type-Options, X-Frame-Options,
  Referrer-Policy and a conservative CSP (frame-ancestors/object-src/
  base-uri) that hardens against clickjacking and base-tag injection
  without restricting the self-hosted SPA's inline scripts.

- Stored XSS: render job.metaData.message as escaped text instead of
  {@html ...} in Job.root and JobFootprint, preserving line breaks via
  white-space: pre-wrap.

- SQL injection hardening: parameterize the tag-scope IN list and the
  manager project subquery in CountTags instead of interpolating
  user.Username / user.Projects (externally sourced via OIDC/LDAP).

- CSRF defense-in-depth: reject cross-site state-changing requests via
  Sec-Fetch-Site, failing open for non-browser clients, on top of the
  existing SameSite=Lax session cookie.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Entire-Checkpoint: de7d47a85c7c
2026-06-04 20:08:41 +02:00
6d86690c76 Fix critical issues from follow-up security audit
A second-pass audit surfaced three severe issues missed by the previous
review, each a sibling code path of a bug class that was only partially
fixed before:

- auth: JWT session login (jwtSession.go) registered its authenticator
  even when CROSS_LOGIN_JWT_HS512_KEY was unset, leaving an empty HMAC
  key. golang-jwt verifies any HS256/HS512 signature against an empty
  key, allowing unauthenticated admin token forgery. Init() now refuses
  to register without a key, with a defense-in-depth empty-key guard in
  the keyfunc.

- repository: metric names from GraphQL ([String!]) were interpolated
  raw into json_extract(footprint, "$.<name>") SQL. SQLite parses
  double-quoted strings as literals, enabling SQL injection by any
  authenticated user. Validate metric names against ^[a-zA-Z0-9_]+$ in
  jobsMetricStatisticsHistogram and buildFloatJSONCondition.

- metricstore: cluster/host line-protocol tags flowed unvalidated into
  path.Join(RootDir, cluster, host) for checkpoint/WAL files, allowing
  arbitrary file write outside the checkpoint root via NATS
  (unauthenticated) or POST /api/write. Reject path-traversal sequences
  in DecodeLine before the tags become path components.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Entire-Checkpoint: b57246993ec1
2026-06-04 19:07:20 +02:00
6f7e262f3f Fix issues after security audit
Entire-Checkpoint: bc18358a9343
2026-06-04 18:33:30 +02:00
58ead40112 Merge branch 'main' into release/v1.5 2026-06-04 17:56:41 +02:00
0020f63582 Rebuild Swagger 2026-06-04 17:56:32 +02:00
Jan Eitzinger
cdb4f06fea Merge pull request #552 from ClusterCockpit/fix/add-user-edit-api
Reintroduce user update api path
2026-06-04 08:27:23 +02:00
Christoph Kluge
1ebde74774 Adapt swagger definitions of user update endpoint 2026-06-02 17:58:15 +02:00
Christoph Kluge
40722d72f5 fix name in doc comment 2026-06-02 16:48:23 +02:00
Christoph Kluge
f4384668e5 fix name comment 2026-06-02 16:47:29 +02:00
Christoph Kluge
e06982db00 reintroduce user update api path 2026-06-02 16:34:46 +02:00
Jan Eitzinger
0bc0f29f0a Merge pull request #551 from ClusterCockpit/release/v1.5
Release/v1.5
2026-05-31 07:59:06 +02:00
8bf5f67bb2 Update dependencies 2026-05-29 16:44:05 +02:00
Jan Eitzinger
994b89a378 Merge pull request #550 from ClusterCockpit/fix/metricstore-stype
Metricstore: Fix parsing of metric subtypes (key stype)
2026-05-28 15:24:06 +02:00
Jan Eitzinger
d536d3802d Merge pull request #549 from ClusterCockpit/feature/add_subcluster_filter
feat: add subCluster filter to filter component
2026-05-28 15:21:02 +02:00
77fdcddc26 Prepare Release 1.5.4
Entire-Checkpoint: 535c1737ceb2
2026-05-12 12:00:05 +02:00
Thomas Roehl
c76219651e Fix parsing of metric subtypes (key stype) 2026-05-04 18:10:01 +02:00
6397f1eaae Merge branch 'release/v1.5' into feature/add_subcluster_filter 2026-04-24 11:55:28 +02:00
Christoph Kluge
4c59aee304 feat: add subCluster filter to filter component 2026-04-24 11:43:42 +02:00
2ca3285ccd Upgrade dependencies 2026-04-24 09:09:11 +02:00
86bc14b610 Merge branch 'main' into release/v1.5 2026-04-24 09:01:35 +02:00
Jan Eitzinger
196f659a50 Merge pull request #547 from ClusterCockpit/546_fix_roofline_legends
fix: change to fixed legend coordinates instead of dynamic placement
2026-04-24 09:01:06 +02:00
Christoph Kluge
ac7865d597 fix: change to fixed legend coordinates instead of dynamic placement 2026-04-15 11:12:07 +02:00
e45b5f70e3 fix: Changes in frontend ui 2026-04-03 05:13:07 +02:00
Jan Eitzinger
3001086647 Merge pull request #544 from ClusterCockpit/hotfix
Update ReleaseNotes
v1.5.3
2026-04-01 14:02:14 +02:00
573f7d144b Update ReleaseNotes
Entire-Checkpoint: cd686356ea80
2026-04-01 13:58:26 +02:00
Jan Eitzinger
38cbc33fb0 Merge pull request #543 from ClusterCockpit/hotfix
Hotfix
2026-04-01 13:53:50 +02:00
43807ae12a feat: Also submit projects array via oidc token
Entire-Checkpoint: 2064482d97e1
2026-04-01 13:46:21 +02:00
31a8a11f1b fix: Always request oidc roles from token
Entire-Checkpoint: bfdbffd7aae0
2026-04-01 12:36:37 +02:00
84fe61b3e0 fix: allow all role changes on SyncUser and UpdateUser callback
Entire-Checkpoint: 496bace0120e
2026-04-01 11:09:50 +02:00
1f04e0a1ce fix: oidc role extraction
Entire-Checkpoint: bbe9ad3cf817
2026-04-01 11:03:19 +02:00
Jan Eitzinger
a101f215dc Merge pull request #542 from ClusterCockpit/hotfix
Hotfix
2026-03-31 07:23:14 +02:00
641dc0e3b8 Run gofumpt 2026-03-30 16:49:27 +02:00
b734c1a92a And another update
Entire-Checkpoint: 9bb66d18af6d
2026-03-30 16:48:12 +02:00