5 Commits

280 changed files with 11505 additions and 26271 deletions

15
.github/dependabot.yml vendored Normal file

@@ -0,0 +1,15 @@
# To get started with Dependabot version updates, you'll need to specify which
# package ecosystems to update and where the package manifests are located.
# Please see the documentation for all configuration options:
# https://docs.github.com/code-security/dependabot/dependabot-version-updates/configuration-options-for-the-dependabot.yml-file
version: 2
updates:
- package-ecosystem: "gomod"
directory: "/"
schedule:
interval: "weekly"
- package-ecosystem: "npm"
directory: "/web/frontend"
schedule:
interval: "weekly"

2
.gitignore vendored

@@ -13,7 +13,7 @@
/var/checkpoints*
migrateTimestamps.pl
test_ccms_*
test_ccms_write_api*
/web/frontend/public/build
/web/frontend/node_modules

121
CLAUDE.md

@@ -22,7 +22,7 @@ make
make frontend
# Build only the backend (requires frontend to be built first)
go build -ldflags='-s -X main.date=$(date +"%Y-%m-%d:T%H:%M:%S") -X main.version=1.5.0 -X main.commit=$(git rev-parse --short HEAD)' ./cmd/cc-backend
go build -ldflags='-s -X main.date=$(date +"%Y-%m-%d:T%H:%M:%S") -X main.version=1.4.4 -X main.commit=$(git rev-parse --short HEAD)' ./cmd/cc-backend
```
### Testing
@@ -41,7 +41,7 @@ go test ./internal/repository
### Code Generation
```bash
# Regenerate GraphQL schema and resolvers (after modifying api/schema.graphqls)
# Regenerate GraphQL schema and resolvers (after modifying api/*.graphqls)
make graphql
# Regenerate Swagger/OpenAPI docs (after modifying API comments)
@@ -90,26 +90,21 @@ The backend follows a layered architecture with clear separation of concerns:
- Transaction support for batch operations
- **internal/api**: REST API endpoints (Swagger/OpenAPI documented)
- **internal/graph**: GraphQL API (uses gqlgen)
- Schema in `api/schema.graphqls`
- Schema in `api/*.graphqls`
- Generated code in `internal/graph/generated/`
- Resolvers in `internal/graph/schema.resolvers.go`
- **internal/auth**: Authentication layer
- Supports local accounts, LDAP, OIDC, and JWT tokens
- Implements rate limiting for login attempts
- **pkg/metricstore**: Metric store with data loading API
- In-memory metric storage with checkpointing
- Query API for loading job metric data
- **internal/metricdata**: Metric data repository abstraction
- Pluggable backends: cc-metric-store, Prometheus, InfluxDB
- Each cluster can have a different metric data backend
- **internal/archiver**: Job archiving to file-based archive
- **internal/api/nats.go**: NATS-based API for job and node operations
- Subscribes to NATS subjects for job events (start/stop)
- Handles node state updates via NATS
- Uses InfluxDB line protocol message format
- **pkg/archive**: Job archive backend implementations
- File system backend (default)
- S3 backend
- SQLite backend (experimental)
- **parquet** sub-package: Parquet format support (schema, reader, writer, conversion)
- **internal/metricstoreclient**: Client for cc-metric-store queries
- **pkg/nats**: NATS integration for metric ingestion
### Frontend Structure
@@ -139,7 +134,7 @@ recommended). Configuration is per-cluster in `config.json`.
3. The first authenticator that returns true performs the actual `Login`
4. JWT tokens are used for API authentication
**Database Migrations**: SQL migrations in `internal/repository/migrations/sqlite3/` are
**Database Migrations**: SQL migrations in `internal/repository/migrations/` are
applied automatically on startup. Version tracking in `version` table.
**Scopes**: Metrics can be collected at different scopes:
@@ -151,14 +146,6 @@ applied automatically on startup. Version tracking in `version` table.
## Configuration
- **config.json**: Main configuration (clusters, metric repositories, archive settings)
- `main.apiSubjects`: NATS subject configuration (optional)
- `subjectJobEvent`: Subject for job start/stop events (e.g., "cc.job.event")
- `subjectNodeState`: Subject for node state updates (e.g., "cc.node.state")
- `nats`: NATS client connection configuration (optional)
- `address`: NATS server address (e.g., "nats://localhost:4222")
- `username`: Authentication username (optional)
- `password`: Authentication password (optional)
- `creds-file-path`: Path to NATS credentials file (optional)
- **.env**: Environment variables (secrets like JWT keys)
- Copy from `configs/env-template.txt`
- NEVER commit this file
@@ -174,7 +161,7 @@ applied automatically on startup. Version tracking in `version` table.
**GraphQL** (gqlgen):
- Schema: `api/schema.graphqls`
- Schema: `api/*.graphqls`
- Config: `gqlgen.yml`
- Generated code: `internal/graph/generated/`
- Custom resolvers: `internal/graph/schema.resolvers.go`
@@ -183,7 +170,7 @@ applied automatically on startup. Version tracking in `version` table.
**Swagger/OpenAPI**:
- Annotations in `internal/api/*.go`
- Generated docs: `internal/api/docs.go`, `api/swagger.yaml`
- Generated docs: `api/docs.go`, `api/swagger.yaml`
- Run `make swagger` after API changes
## Testing Conventions
@@ -197,7 +184,7 @@ applied automatically on startup. Version tracking in `version` table.
### Adding a new GraphQL field
1. Edit schema in `api/schema.graphqls`
1. Edit schema in `api/*.graphqls`
2. Run `make graphql`
3. Implement resolver in `internal/graph/schema.resolvers.go`
@@ -210,97 +197,19 @@ applied automatically on startup. Version tracking in `version` table.
### Adding a new metric data backend
1. Implement metric loading functions in `pkg/metricstore/query.go`
2. Add cluster configuration to metric store initialization
1. Implement `MetricDataRepository` interface in `internal/metricdata/`
2. Register in `metricdata.Init()` switch statement
3. Update config.json schema documentation
### Modifying database schema
1. Create new migration in `internal/repository/migrations/sqlite3/`
1. Create new migration in `internal/repository/migrations/`
2. Increment `repository.Version`
3. Test with fresh database and existing database
## NATS API
The backend supports a NATS-based API as an alternative to the REST API for job and node operations.
### Setup
1. Configure NATS client connection in `config.json`:
```json
{
"nats": {
"address": "nats://localhost:4222",
"username": "user",
"password": "pass"
}
}
```
2. Configure API subjects in `config.json` under `main`:
```json
{
"main": {
"apiSubjects": {
"subjectJobEvent": "cc.job.event",
"subjectNodeState": "cc.node.state"
}
}
}
```
### Message Format
Messages use **InfluxDB line protocol** format with the following structure:
#### Job Events
**Start Job:**
```
job,function=start_job event="{\"jobId\":123,\"user\":\"alice\",\"cluster\":\"test\", ...}" 1234567890000000000
```
**Stop Job:**
```
job,function=stop_job event="{\"jobId\":123,\"cluster\":\"test\",\"startTime\":1234567890,\"stopTime\":1234571490,\"jobState\":\"completed\"}" 1234571490000000000
```
**Tags:**
- `function`: Either `start_job` or `stop_job`
**Fields:**
- `event`: JSON payload containing job data (see REST API documentation for schema)
#### Node State Updates
```json
{
"cluster": "testcluster",
"nodes": [
{
"hostname": "node001",
"states": ["allocated"],
"cpusAllocated": 8,
"memoryAllocated": 16384,
"gpusAllocated": 0,
"jobsRunning": 1
}
]
}
```
### Implementation Notes
- NATS API mirrors REST API functionality but uses messaging
- Job start/stop events are processed asynchronously
- Duplicate job detection is handled (same as REST API)
- All validation rules from REST API apply
- Messages are logged; no responses are sent back to publishers
- If NATS client is unavailable, API subscriptions are skipped (logged as warning)
## Dependencies
- Go 1.24.0+ (check go.mod for exact version)
- Node.js (for frontend builds)
- SQLite 3 (only supported database)
- Optional: NATS server for NATS API integration
- Optional: NATS server for metric ingestion


@@ -1,6 +1,6 @@
TARGET = ./cc-backend
FRONTEND = ./web/frontend
VERSION = 1.5.0
VERSION = 1.4.4
GIT_HASH := $(shell git rev-parse --short HEAD || echo 'development')
CURRENT_TIME = $(shell date +"%Y-%m-%d:T%H:%M:%S")
LD_FLAGS = '-s -X main.date=${CURRENT_TIME} -X main.version=${VERSION} -X main.commit=${GIT_HASH}'
@@ -46,7 +46,7 @@ $(TARGET): $(SVELTE_TARGETS)
frontend:
$(info ===> BUILD frontend)
cd web/frontend && npm ci && npm run build
cd web/frontend && npm install && npm run build
swagger:
$(info ===> GENERATE swagger)
@@ -84,4 +84,4 @@ $(VAR):
$(SVELTE_TARGETS): $(SVELTE_SRC)
$(info ===> BUILD frontend)
cd web/frontend && npm ci && npm run build
cd web/frontend && npm install && npm run build
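The `-X` entries in `LD_FLAGS` above overwrite package-level string variables in `package main` at link time. A minimal sketch of the receiving side (variable names match the `main.date`, `main.version`, and `main.commit` flags; the defaults are placeholders, not the project's actual values):

```go
package main

import "fmt"

// These are set at build time via
//   go build -ldflags='-X main.version=... -X main.commit=... -X main.date=...'
// and keep their zero-cost defaults when built without those flags.
var (
	version = "development"
	commit  = "unknown"
	date    = "unknown"
)

func main() {
	fmt.Printf("cc-backend %s (%s) built %s\n", version, commit, date)
}
```

Only uninitialized-or-string package variables can be set this way; the linker rejects `-X` targets that are not string variables.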


@@ -22,12 +22,11 @@ switching from PHP Symfony to a Golang based solution are explained
## Overview
This is a Golang web backend for the ClusterCockpit job-specific performance
monitoring framework. It provides a REST API and an optional NATS-based messaging
API for integrating ClusterCockpit with an HPC cluster batch system and external
analysis scripts. Data exchange between the web front-end and the back-end is
based on a GraphQL API. The web frontend is also served by the backend using
[Svelte](https://svelte.dev/) components. Layout and styling are based on
[Bootstrap 5](https://getbootstrap.com/) using
monitoring framework. It provides a REST API for integrating ClusterCockpit with
an HPC cluster batch system and external analysis scripts. Data exchange between
the web front-end and the back-end is based on a GraphQL API. The web frontend
is also served by the backend using [Svelte](https://svelte.dev/) components.
Layout and styling are based on [Bootstrap 5](https://getbootstrap.com/) using
[Bootstrap Icons](https://icons.getbootstrap.com/).
The backend uses [SQLite 3](https://sqlite.org/) as the relational SQL database.
@@ -36,10 +35,6 @@ databases, the only tested and supported setup is to use cc-metric-store as the
metric data backend. Documentation on how to integrate ClusterCockpit with other
time series databases will be added in the future.
For real-time integration with HPC systems, the backend can subscribe to
[NATS](https://nats.io/) subjects to receive job start/stop events and node
state updates, providing an alternative to REST API polling.
Completed batch jobs are stored in a file-based job archive according to
[this specification](https://github.com/ClusterCockpit/cc-specifications/tree/master/job-archive).
The backend supports authentication via local accounts, an external LDAP
@@ -135,59 +130,27 @@ ln -s <your-existing-job-archive> ./var/job-archive
## Project file structure
- [`.github/`](https://github.com/ClusterCockpit/cc-backend/tree/master/.github)
GitHub Actions workflows and dependabot configuration for CI/CD.
- [`api/`](https://github.com/ClusterCockpit/cc-backend/tree/master/api)
contains the API schema files for the REST and GraphQL APIs. The REST API is
documented in the OpenAPI 3.0 format in
[./api/swagger.yaml](./api/swagger.yaml). The GraphQL schema is in
[./api/schema.graphqls](./api/schema.graphqls).
[./api/openapi.yaml](./api/openapi.yaml).
- [`cmd/cc-backend`](https://github.com/ClusterCockpit/cc-backend/tree/master/cmd/cc-backend)
contains the main application entry point and CLI implementation.
contains `main.go` for the main application.
- [`configs/`](https://github.com/ClusterCockpit/cc-backend/tree/master/configs)
contains documentation about configuration and command line options and required
environment variables. Sample configuration files are provided.
environment variables. A sample configuration file is provided.
- [`docs/`](https://github.com/ClusterCockpit/cc-backend/tree/master/docs)
contains more in-depth documentation.
- [`init/`](https://github.com/ClusterCockpit/cc-backend/tree/master/init)
contains an example of setting up systemd for production use.
- [`internal/`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal)
contains library source code that is not intended for use by others.
- [`api`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/api)
REST API handlers and NATS integration
- [`archiver`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/archiver)
Job archiving functionality
- [`auth`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/auth)
Authentication (local, LDAP, OIDC) and JWT token handling
- [`config`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/config)
Configuration management and validation
- [`graph`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/graph)
GraphQL schema and resolvers
- [`importer`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/importer)
Job data import and database initialization
- [`metricdispatch`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/metricdispatch)
Dispatches metric data loading to appropriate backends
- [`repository`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/repository)
Database repository layer for jobs and metadata
- [`routerConfig`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/routerConfig)
HTTP router configuration and middleware
- [`tagger`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/tagger)
Job classification and application detection
- [`taskmanager`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/taskmanager)
Background task management and scheduled jobs
- [`metricstoreclient`](https://github.com/ClusterCockpit/cc-backend/tree/master/internal/metricstoreclient)
Client for cc-metric-store queries
- [`pkg/`](https://github.com/ClusterCockpit/cc-backend/tree/master/pkg)
contains Go packages that can be used by other projects.
- [`archive`](https://github.com/ClusterCockpit/cc-backend/tree/master/pkg/archive)
Job archive backend implementations (filesystem, S3, SQLite)
- [`metricstore`](https://github.com/ClusterCockpit/cc-backend/tree/master/pkg/metricstore)
In-memory metric data store with checkpointing and metric loading
- [`tools/`](https://github.com/ClusterCockpit/cc-backend/tree/master/tools)
Additional command line helper tools.
- [`archive-manager`](https://github.com/ClusterCockpit/cc-backend/tree/master/tools/archive-manager)
Commands for getting info about an existing job archive, importing jobs
between archive backends, and converting archives between JSON and Parquet formats.
- [`archive-migration`](https://github.com/ClusterCockpit/cc-backend/tree/master/tools/archive-migration)
Tool for migrating job archives between formats.
Commands for getting info about an existing job archive.
- [`convert-pem-pubkey`](https://github.com/ClusterCockpit/cc-backend/tree/master/tools/convert-pem-pubkey)
Tool to convert external pubkey for use in `cc-backend`.
- [`gen-keypair`](https://github.com/ClusterCockpit/cc-backend/tree/master/tools/gen-keypair)
@@ -199,7 +162,7 @@ ln -s <your-existing-job-archive> ./var/job-archive
- [`frontend`](https://github.com/ClusterCockpit/cc-backend/tree/master/web/frontend)
Svelte components and static assets for the frontend UI
- [`templates`](https://github.com/ClusterCockpit/cc-backend/tree/master/web/templates)
Server-side Go templates, including monitoring views
Server-side Go templates
- [`gqlgen.yml`](https://github.com/ClusterCockpit/cc-backend/blob/master/gqlgen.yml)
Configures the behaviour and generation of
[gqlgen](https://github.com/99designs/gqlgen).


@@ -1,277 +1,47 @@
# `cc-backend` version 1.5.0
# `cc-backend` version 1.4.4
Supports job archive version 3 and database version 10.
Supports job archive version 2 and database version 8.
This is a feature release of `cc-backend`, the API backend and frontend
This is a bug fix release of `cc-backend`, the API backend and frontend
implementation of ClusterCockpit.
For release-specific notes, visit the [ClusterCockpit Documentation](https://clusterockpit.org/docs/release/).
## Breaking changes
### Configuration changes
The option `apiAllowedIPs` is now a required configuration attribute in
`config.json`. This option restricts access to the admin API.
- **JSON attribute naming**: All JSON configuration attributes now use `kebab-case`
style consistently (e.g., `api-allowed-ips` instead of `apiAllowedIPs`).
Update your `config.json` accordingly.
- **Removed `disable-archive` option**: This obsolete configuration option has been removed.
- **Removed `clusters` config section**: The separate clusters configuration section
has been removed. Cluster information is now derived from the job archive.
- **`apiAllowedIPs` is now optional**: If not specified, defaults to not
restricted.
### Architecture changes
- **Web framework replaced**: Migrated from `gorilla/mux` to `chi` as the HTTP
router. This should be transparent to users but affects how middleware and
routes are composed. A proper 404 handler is now in place.
- **MetricStore moved**: The `metricstore` package has been moved from `internal/`
to `pkg/` as it is now part of the public API.
- **MySQL/MariaDB support removed**: Only SQLite is now supported as the database backend.
- **Archive to Cleanup renaming**: Archive-related functions have been refactored
and renamed to "Cleanup" for clarity.
- **`minRunningFor` filter removed**: This undocumented filter has been removed
from the API and frontend.
### Dependency changes
- **cc-lib v2.5.1**: Switched to cc-lib version 2 with updated APIs (currently at v2.5.1)
- **cclib NATS client**: Now using the cclib NATS client implementation
- Removed obsolete `util.Float` usage from cclib
## Major new features
### NATS API Integration
- **Real-time job events**: Subscribe to job start/stop events via NATS
- **Node state updates**: Receive real-time node state changes via NATS
- **Configurable subjects**: NATS API subjects are now configurable via `api-subjects`
- **Deadlock fixes**: Improved NATS client stability and graceful shutdown
### Public Dashboard
- **Public-facing interface**: New public dashboard route for external users
- **DoubleMetricPlot component**: New visualization component for comparing metrics
- **Improved layout**: Reviewed and optimized dashboard layouts for better readability
### Enhanced Node Management
- **Node state tracking**: New node table in database with timestamp tracking
- **Node state filtering**: Filter jobs by node state in systems view
- **Node list enhancements**: Improved paging, filtering, and continuous scroll support
- **Nodestate retention and archiving**: Node state data is now subject to configurable
retention policies and can be archived to Parquet format for long-term storage
- **Faulty node metric tracking**: Faulty node state metric lists are persisted to the database
### Health Monitoring
- **Health status dashboard**: New dedicated "Health" tab in the status details view
showing per-node metric health across the cluster
- **CCMS health check**: Support for querying health status of external
cc-metric-store (CCMS) instances via the API
- **GraphQL health endpoints**: New GraphQL queries and resolvers for health data
- **Cluster/subcluster filter**: Filter health status view by cluster or subcluster
### Log Viewer
- **Web-based log viewer**: New log viewer page in the admin interface for inspecting
backend log output directly from the browser without shell access
- **Accessible from header**: Quick access link from the navigation header
### MetricStore Improvements
- **Memory tracking worker**: New worker for CCMS memory usage tracking
- **Dynamic retention**: Support for job specific dynamic retention times
- **Improved compression**: Transparent compression for job archive imports
- **Parallel processing**: Parallelized Iter function in all archive backends
### Job Tagging System
- **Job tagger option**: Enable automatic job tagging via configuration flag
- **Application detection**: Automatic detection of applications (MATLAB, GROMACS, etc.)
- **Job classification**: Automatic detection of pathological jobs
- **omit-tagged**: Option to exclude tagged jobs from retention/cleanup operations (`none`, `all`, or `user`)
- **Admin UI trigger**: Taggers can be run on-demand from the admin web interface
without restarting the backend
### Archive Backends
- **Parquet archive format**: New Parquet file format for job archiving, providing
columnar storage with efficient compression for analytical workloads
- **S3 backend**: Full support for S3-compatible object storage
- **SQLite backend**: Full support for SQLite backend using blobs
- **Performance improvements**: Fixed performance bugs in archive backends
- **Better error handling**: Improved error messages and fallback handling
- **Zstd compression**: Parquet writers use zstd compression for better
compression ratios compared to the previous snappy default
- **Optimized sort order**: Job and nodestate Parquet files are sorted by
cluster, subcluster, and start time for efficient range queries
### Unified Archive Retention and Format Conversion
- **Uniform retention policy**: Job archive retention now supports both JSON and
Parquet as target formats under a single, consistent policy configuration
- **Archive manager tool**: The `tools/archive-manager` utility now supports
format conversion between JSON and Parquet job archives
- **Parquet reader**: Full Parquet archive reader implementation for reading back
archived job data
## New features and improvements
### Frontend
- **Loading indicators**: Added loading indicators to status detail and job lists
- **Job info layout**: Reviewed and improved job info row layout
- **Metric selection**: Enhanced metric selection with drag-and-drop fixes
- **Filter presets**: Move list filter preset to URL for easy sharing
- **Job comparison**: Improved job comparison views and plots
- **Subcluster reactivity**: Job list now reacts to subcluster filter changes
- **Short jobs quick selection**: New "Short jobs" quick-filter button in job lists
replaces the removed undocumented `minRunningFor` filter
- **Row plot cursor sync**: Cursor position is now synchronized across all metric
plots in a job list row for easier cross-metric comparison
- **Disabled metrics handling**: Improved handling and display of disabled metrics
across job view, node view, and list rows
- **"Not configured" info cards**: Informational cards shown when optional features
are not yet configured
- **Frontend dependencies**: Bumped frontend dependencies to latest versions
- **Svelte 5 compatibility**: Fixed Svelte state warnings and compatibility issues
### Backend
- **Progress bars**: Import function now shows progress during long operations
- **Better logging**: Improved logging with appropriate log levels throughout
- **Graceful shutdown**: Fixed shutdown timeout bugs and hanging issues
- **Configuration defaults**: Sensible defaults for most configuration options
- **Documentation**: Extensive documentation improvements across packages
- **Server flag in systemd unit**: Example systemd unit now includes the `-server` flag
### Security
- **LDAP security hardening**: Improved input validation, connection handling, and
error reporting in the LDAP authenticator
- **OIDC security hardening**: Stricter token validation and improved error handling
in the OIDC authenticator
- **Auth schema extensions**: Additional schema fields for improved auth configuration
### API improvements
- **Role-based metric visibility**: Metrics can now have role-based access control
- **Job exclusivity filter**: New filter for exclusive vs. shared jobs
- **Improved error messages**: Better error messages and documentation in REST API
- **GraphQL enhancements**: Improved GraphQL queries and resolvers
- **Stop job lookup order**: Reversed lookup order in stop job requests for
more reliable job matching (cluster+jobId first, then jobId alone)
### Performance
- **Database indices**: Optimized SQLite indices for better query performance
- **Job cache**: Introduced caching table for faster job inserts
- **Parallel imports**: Archive imports now run in parallel where possible
- **External tool integration**: Optimized use of external tools (fd) for better performance
- **Node repository queries**: Reviewed and optimized node repository SQL queries
- **Buffer pool**: Resized and pooled internal buffers for better memory reuse
### Developer experience
- **AI agent guidelines**: Added documentation for AI coding agents (AGENTS.md, CLAUDE.md)
- **Example API payloads**: Added example JSON API payloads for testing
- **Unit tests**: Added more unit tests for NATS API, node repository, and other components
- **Test improvements**: Better test coverage; test DB is now copied before unit tests
to avoid state pollution between test runs
- **Parquet writer tests**: Comprehensive tests for Parquet archive writing and conversion
## Bug fixes
- Fixed nodelist paging issues
- Fixed metric select drag and drop functionality
- Fixed render race conditions in nodeList
- Fixed tag count grouping including type
- Fixed wrong metricstore schema (missing comma)
- Fixed configuration issues causing shutdown hangs
- Fixed deadlock when NATS is not configured
- Fixed archive backend performance bugs
- Fixed continuous scroll buildup on refresh
- Improved footprint calculation logic
- Fixed polar plot data query decoupling
- Fixed missing resolution parameter handling
- Fixed node table initialization fallback
- Fixed reactivity key placement in nodeList
- Fixed nodeList resolver data handling and increased nodestate filter cutoff
- Fixed job always being transferred to main job table before archiving
- Fixed AppTagger error handling and logging
- Fixed log endpoint formatting and correctness
- Fixed automatic refresh in metric status tab
- Fixed NULL value handling in `health_state` and `health_metrics` columns
- Fixed bugs related to `job_cache` IDs being used in the main job table
- Fixed SyncJobs bug causing start job hooks to be called with wrong (cache) IDs
- Fixed 404 handler route for sub-routers
## Configuration changes
### New configuration options
To retain the previous behavior, where the API is by default accessible from
everywhere, set:
```json
{
"main": {
"enable-job-taggers": true,
"resampling": {
"minimum-points": 600,
"trigger": 180,
"resolutions": [240, 60]
},
"api-subjects": {
"subject-job-event": "cc.job.event",
"subject-node-state": "cc.node.state"
}
},
"nats": {
"address": "nats://0.0.0.0:4222",
"username": "root",
"password": "root"
},
"cron": {
"commit-job-worker": "1m",
"duration-worker": "5m",
"footprint-worker": "10m"
},
"metric-store": {
"cleanup": {
"mode": "archive",
"interval": "48h",
"directory": "./var/archive"
}
},
"archive": {
"retention": {
"policy": "delete",
"age": "6months",
"target-format": "parquet"
}
},
"nodestate": {
"retention": {
"policy": "archive",
"age": "30d",
"archive-path": "./var/nodestate-archive"
}
}
}
"apiAllowedIPs": [
"*"
]
```
## Migration notes
## Breaking changes for minor release 1.4.x
- Review and update your `config.json` to use kebab-case attribute names
- If using NATS, configure the new `nats` and `api-subjects` sections
- If using S3 archive backend, configure the new `archive` section options
- Test the new public dashboard at `/public` route
- Review cron worker configuration if you need different frequencies
- If using the archive retention feature, configure the `target-format` option
to choose between `json` (default) and `parquet` output formats
- Consider enabling nodestate retention if you track node states over time
- You need to perform a database migration. Depending on your database size, the
migration might take several hours!
- You need to adapt the `cluster.json` configuration files in the job archive,
add the new required attributes to the metric list, and then set
`./job-archive/version.txt` to version 2. Only metrics that have the footprint
attribute set can be filtered and show up in the footprint UI and polar plot.
- Continuous scrolling is now the default in all job lists. You can change this
back to paging globally, and every user can individually configure whether to
use paging or continuous scrolling.
- Tags now have a scope. Existing tags will be given global scope during the
database migration.
## New features
- Enable deleting tags from the web interface
## Known issues
- Energy footprint metrics of type energy are currently ignored when
calculating total energy.
- Resampling for running jobs only works with cc-metric-store.
- For energy footprint metrics of type power, the unit is ignored and the
metric is assumed to be in Watt.


@@ -19,7 +19,6 @@ type Node {
schedulerState: SchedulerState!
healthState: MonitoringState!
metaData: Any
healthData: Any
}
type NodeStates {
@@ -329,7 +328,6 @@ type Query {
## Node Queries New
node(id: ID!): Node
nodes(filter: [NodeFilter!], order: OrderByInput): NodeStateResultList!
nodesWithMeta(filter: [NodeFilter!], order: OrderByInput): NodeStateResultList!
nodeStates(filter: [NodeFilter!]): [NodeStates!]!
nodeStatesTimed(filter: [NodeFilter!], type: String!): [NodeStatesTimed!]!
@@ -431,7 +429,7 @@ type TimeRangeOutput {
input NodeFilter {
hostname: StringInput
cluster: StringInput
subCluster: StringInput
subcluster: StringInput
schedulerState: SchedulerState
healthState: MonitoringState
timeStart: Int
@@ -446,7 +444,6 @@ input JobFilter {
project: StringInput
jobName: StringInput
cluster: StringInput
subCluster: StringInput
partition: StringInput
duration: IntRange
energy: FloatRange
@@ -461,7 +458,6 @@ input JobFilter {
state: [JobState!]
metricStats: [MetricStatItem!]
shared: String
schedule: String
node: StringInput
}

File diff suppressed because it is too large

File diff suppressed because it is too large


@@ -12,10 +12,11 @@ import (
"encoding/json"
"os"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/util"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/util"
)
const envString = `
@@ -34,20 +35,20 @@ const configString = `
"addr": "127.0.0.1:8080",
"short-running-jobs-duration": 300,
"resampling": {
"minimum-points": 600,
"trigger": 300,
"minimumPoints": 600,
"trigger": 180,
"resolutions": [
240,
60
]
},
"api-allowed-ips": [
"apiAllowedIPs": [
"*"
],
"emission-constant": 317
},
"cron": {
"commit-job-worker": "1m",
"commit-job-worker": "2m",
"duration-worker": "5m",
"footprint-worker": "10m"
},
@@ -59,7 +60,31 @@ const configString = `
"jwts": {
"max-age": "2000h"
}
}
},
"clusters": [
{
"name": "name",
"metricDataRepository": {
"kind": "cc-metric-store",
"url": "http://localhost:8082",
"token": ""
},
"filterRanges": {
"numNodes": {
"from": 1,
"to": 64
},
"duration": {
"from": 0,
"to": 86400
},
"startTime": {
"from": "2023-01-01T00:00:00Z",
"to": null
}
}
}
]
}
`
@@ -88,7 +113,7 @@ func initEnv() {
cclog.Abortf("Could not create default ./var/job-archive folder with permissions '0o777'. Application initialization failed, exited.\nError: %s\n", err.Error())
}
archiveCfg := "{\"kind\": \"file\",\"path\": \"./var/job-archive\"}"
if err := archive.Init(json.RawMessage(archiveCfg)); err != nil {
if err := archive.Init(json.RawMessage(archiveCfg), config.Keys.DisableArchive); err != nil {
cclog.Abortf("Could not initialize job-archive, exited.\nError: %s\n", err.Error())
}
}


@@ -24,19 +24,19 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/auth"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/importer"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
"github.com/ClusterCockpit/cc-backend/internal/memorystore"
"github.com/ClusterCockpit/cc-backend/internal/metricdata"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/internal/tagger"
"github.com/ClusterCockpit/cc-backend/internal/taskmanager"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
"github.com/ClusterCockpit/cc-backend/pkg/nats"
"github.com/ClusterCockpit/cc-backend/web"
ccconf "github.com/ClusterCockpit/cc-lib/v2/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/nats"
"github.com/ClusterCockpit/cc-lib/v2/runtime"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/v2/util"
ccconf "github.com/ClusterCockpit/cc-lib/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/runtimeEnv"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/ClusterCockpit/cc-lib/util"
"github.com/google/gops/agent"
"github.com/joho/godotenv"
@@ -103,12 +103,17 @@ func initConfiguration() error {
return fmt.Errorf("main configuration must be present")
}
config.Init(cfg)
clustercfg := ccconf.GetPackageConfig("clusters")
if clustercfg == nil {
return fmt.Errorf("cluster configuration must be present")
}
config.Init(cfg, clustercfg)
return nil
}
func initDatabase() error {
repository.Connect(config.Keys.DB)
repository.Connect(config.Keys.DBDriver, config.Keys.DB)
return nil
}
@@ -248,7 +253,7 @@ func generateJWT(authHandle *auth.Authentication, username string) error {
return fmt.Errorf("getting user '%s': %w", username, err)
}
if !user.HasRole(schema.RoleAPI) {
if !user.HasRole(schema.RoleApi) {
cclog.Warnf("JWT: User '%s' does not have the role 'api'. REST API endpoints will return error!\n", user.Username)
}
@@ -272,13 +277,22 @@ func initSubsystems() error {
// Initialize job archive
archiveCfg := ccconf.GetPackageConfig("archive")
if archiveCfg == nil {
cclog.Debug("Archive configuration not found, using default archive configuration")
archiveCfg = json.RawMessage(defaultArchiveConfig)
}
if err := archive.Init(archiveCfg); err != nil {
if err := archive.Init(archiveCfg, config.Keys.DisableArchive); err != nil {
return fmt.Errorf("initializing archive: %w", err)
}
// Initialize metricdata
// if err := metricdata.Init(); err != nil {
// return fmt.Errorf("initializing metricdata repository: %w", err)
// }
// Initialize upstream metricdata repositories for pull worker
if err := metricdata.InitUpstreamRepos(); err != nil {
return fmt.Errorf("initializing upstream metricdata repositories: %w", err)
}
// Handle database re-initialization
if flagReinitDB {
if err := importer.InitDB(); err != nil {
@@ -302,8 +316,6 @@ func initSubsystems() error {
// Apply tags if requested
if flagApplyTags {
tagger.Init()
if err := tagger.RunTaggers(); err != nil {
return fmt.Errorf("running job taggers: %w", err)
}
@@ -316,37 +328,11 @@ func runServer(ctx context.Context) error {
var wg sync.WaitGroup
// Initialize metric store if configuration is provided
haveMetricstore := false
mscfg := ccconf.GetPackageConfig("metric-store")
if mscfg != nil {
metrics := metricstore.BuildMetricList()
metricstore.Init(mscfg, metrics, &wg)
// Inject repository as NodeProvider to break import cycle
ms := metricstore.GetMemoryStore()
jobRepo := repository.GetJobRepository()
ms.SetNodeProvider(jobRepo)
metricstore.MetricStoreHandle = &metricstore.InternalMetricStore{}
haveMetricstore = true
memorystore.Init(mscfg, &wg)
} else {
metricstore.MetricStoreHandle = nil
cclog.Debug("missing internal metricstore configuration")
}
// Initialize external metric stores if configuration is provided
mscfg = ccconf.GetPackageConfig("metric-store-external")
if mscfg != nil {
err := metricdispatch.Init(mscfg)
if err != nil {
cclog.Debugf("initializing metricdispatch: %v", err)
} else {
haveMetricstore = true
}
}
if !haveMetricstore {
return fmt.Errorf("missing metricstore configuration")
cclog.Debug("Metric store configuration not found, skipping memorystore initialization")
}
// Start archiver and task manager
@@ -369,11 +355,13 @@ func runServer(ctx context.Context) error {
errChan := make(chan error, 1)
// Start HTTP server
wg.Go(func() {
wg.Add(1)
go func() {
defer wg.Done()
if err := srv.Start(ctx); err != nil {
errChan <- err
}
})
}()
// Handle shutdown signals
wg.Add(1)
@@ -387,7 +375,7 @@ func runServer(ctx context.Context) error {
case <-ctx.Done():
}
runtime.SystemdNotify(false, "Shutting down ...")
runtimeEnv.SystemdNotifiy(false, "Shutting down ...")
srv.Shutdown(ctx)
util.FsWatcherShutdown()
taskmanager.Shutdown()
@@ -395,41 +383,26 @@ func runServer(ctx context.Context) error {
// Set GC percent if not configured
if os.Getenv(envGOGC) == "" {
debug.SetGCPercent(15)
debug.SetGCPercent(25)
}
runtime.SystemdNotify(true, "running")
runtimeEnv.SystemdNotifiy(true, "running")
waitDone := make(chan struct{})
// Wait for completion or error
go func() {
wg.Wait()
close(waitDone)
}()
go func() {
<-waitDone
close(errChan)
}()
// Wait for either:
// 1. An error from server startup
// 2. Completion of all goroutines (normal shutdown or crash)
// Check for server startup errors
select {
case err := <-errChan:
// errChan will be closed when waitDone is closed, which happens
// when all goroutines complete (either from normal shutdown or error)
if err != nil {
return err
}
case <-time.After(100 * time.Millisecond):
// Give the server 100ms to start and report any immediate startup errors
// After that, just wait for normal shutdown completion
select {
case err := <-errChan:
if err != nil {
return err
}
case <-waitDone:
// Normal shutdown completed
// Server started successfully, wait for completion
if err := <-errChan; err != nil {
return err
}
}


@@ -2,7 +2,6 @@
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// Package main provides the entry point for the ClusterCockpit backend server.
// This file contains HTTP server setup, routing configuration, and
// authentication middleware integration.
@@ -14,6 +13,7 @@ import (
"encoding/json"
"errors"
"fmt"
"io"
"net"
"net/http"
"os"
@@ -29,15 +29,14 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/graph"
"github.com/ClusterCockpit/cc-backend/internal/graph/generated"
"github.com/ClusterCockpit/cc-backend/internal/memorystore"
"github.com/ClusterCockpit/cc-backend/internal/routerConfig"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
"github.com/ClusterCockpit/cc-backend/pkg/nats"
"github.com/ClusterCockpit/cc-backend/web"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/nats"
"github.com/ClusterCockpit/cc-lib/v2/runtime"
"github.com/go-chi/chi/v5"
"github.com/go-chi/chi/v5/middleware"
"github.com/go-chi/cors"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/runtimeEnv"
"github.com/gorilla/handlers"
"github.com/gorilla/mux"
httpSwagger "github.com/swaggo/http-swagger"
)
@@ -50,7 +49,7 @@ const (
// Server encapsulates the HTTP server state and dependencies
type Server struct {
router chi.Router
router *mux.Router
server *http.Server
restAPIHandle *api.RestAPI
natsAPIHandle *api.NatsAPI
@@ -70,7 +69,7 @@ func NewServer(version, commit, buildDate string) (*Server, error) {
buildInfo = web.Build{Version: version, Hash: commit, Buildtime: buildDate}
s := &Server{
router: chi.NewRouter(),
router: mux.NewRouter(),
}
if err := s.init(); err != nil {
@@ -106,27 +105,6 @@ func (s *Server) init() error {
authHandle := auth.GetAuthInstance()
// Middleware must be defined before routes in chi
s.router.Use(func(next http.Handler) http.Handler {
return http.HandlerFunc(func(rw http.ResponseWriter, r *http.Request) {
start := time.Now()
ww := middleware.NewWrapResponseWriter(rw, r.ProtoMajor)
next.ServeHTTP(ww, r)
cclog.Debugf("%s %s (%d, %.02fkb, %dms)",
r.Method, r.URL.RequestURI(),
ww.Status(), float32(ww.BytesWritten())/1024,
time.Since(start).Milliseconds())
})
})
s.router.Use(middleware.Compress(5))
s.router.Use(middleware.Recoverer)
s.router.Use(cors.Handler(cors.Options{
AllowCredentials: true,
AllowedHeaders: []string{"X-Requested-With", "Content-Type", "Authorization", "Origin"},
AllowedMethods: []string{"GET", "POST", "HEAD", "OPTIONS"},
AllowedOrigins: []string{"*"},
}))
s.restAPIHandle = api.New()
info := map[string]any{}
@@ -138,11 +116,11 @@ func (s *Server) init() error {
info["hasOpenIDConnect"] = true
}
s.router.Get("/login", func(rw http.ResponseWriter, r *http.Request) {
s.router.HandleFunc("/login", func(rw http.ResponseWriter, r *http.Request) {
rw.Header().Add("Content-Type", "text/html; charset=utf-8")
cclog.Debugf("##%v##", info)
web.RenderTemplate(rw, "login.tmpl", &web.Page{Title: "Login", Build: buildInfo, Infos: info})
})
}).Methods(http.MethodGet)
s.router.HandleFunc("/imprint", func(rw http.ResponseWriter, r *http.Request) {
rw.Header().Add("Content-Type", "text/html; charset=utf-8")
web.RenderTemplate(rw, "imprint.tmpl", &web.Page{Title: "Imprint", Build: buildInfo})
@@ -152,6 +130,13 @@ func (s *Server) init() error {
web.RenderTemplate(rw, "privacy.tmpl", &web.Page{Title: "Privacy", Build: buildInfo})
})
secured := s.router.PathPrefix("/").Subrouter()
securedapi := s.router.PathPrefix("/api").Subrouter()
userapi := s.router.PathPrefix("/userapi").Subrouter()
configapi := s.router.PathPrefix("/config").Subrouter()
frontendapi := s.router.PathPrefix("/frontend").Subrouter()
metricstoreapi := s.router.PathPrefix("/metricstore").Subrouter()
if !config.Keys.DisableAuthentication {
// Create login failure handler (used by both /login and /jwt-login)
loginFailureHandler := func(rw http.ResponseWriter, r *http.Request, err error) {
@@ -166,10 +151,10 @@ func (s *Server) init() error {
})
}
s.router.Post("/login", authHandle.Login(loginFailureHandler).ServeHTTP)
s.router.HandleFunc("/jwt-login", authHandle.Login(loginFailureHandler).ServeHTTP)
s.router.Handle("/login", authHandle.Login(loginFailureHandler)).Methods(http.MethodPost)
s.router.Handle("/jwt-login", authHandle.Login(loginFailureHandler))
s.router.Post("/logout", authHandle.Logout(
s.router.Handle("/logout", authHandle.Logout(
http.HandlerFunc(func(rw http.ResponseWriter, r *http.Request) {
rw.Header().Add("Content-Type", "text/html; charset=utf-8")
rw.WriteHeader(http.StatusOK)
@@ -180,97 +165,86 @@ func (s *Server) init() error {
Build: buildInfo,
Infos: info,
})
})).ServeHTTP)
}))).Methods(http.MethodPost)
secured.Use(func(next http.Handler) http.Handler {
return authHandle.Auth(
// On success;
next,
// On failure:
func(rw http.ResponseWriter, r *http.Request, err error) {
rw.WriteHeader(http.StatusUnauthorized)
web.RenderTemplate(rw, "login.tmpl", &web.Page{
Title: "Authentication failed - ClusterCockpit",
MsgType: "alert-danger",
Message: err.Error(),
Build: buildInfo,
Infos: info,
Redirect: r.RequestURI,
})
})
})
securedapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthAPI(
// On success;
next,
// On failure: JSON Response
onFailureResponse)
})
userapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthUserAPI(
// On success;
next,
// On failure: JSON Response
onFailureResponse)
})
metricstoreapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthMetricStoreAPI(
// On success;
next,
// On failure: JSON Response
onFailureResponse)
})
configapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthConfigAPI(
// On success;
next,
// On failure: JSON Response
onFailureResponse)
})
frontendapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthFrontendAPI(
// On success;
next,
// On failure: JSON Response
onFailureResponse)
})
}
if flagDev {
s.router.Handle("/playground", playground.Handler("GraphQL playground", "/query"))
s.router.Get("/swagger/*", httpSwagger.Handler(
httpSwagger.URL("http://"+config.Keys.Addr+"/swagger/doc.json")))
s.router.PathPrefix("/swagger/").Handler(httpSwagger.Handler(
httpSwagger.URL("http://" + config.Keys.Addr + "/swagger/doc.json"))).Methods(http.MethodGet)
}
secured.Handle("/query", graphQLServer)
// Secured routes (require authentication)
s.router.Group(func(secured chi.Router) {
if !config.Keys.DisableAuthentication {
secured.Use(func(next http.Handler) http.Handler {
return authHandle.Auth(
next,
func(rw http.ResponseWriter, r *http.Request, err error) {
rw.WriteHeader(http.StatusUnauthorized)
web.RenderTemplate(rw, "login.tmpl", &web.Page{
Title: "Authentication failed - ClusterCockpit",
MsgType: "alert-danger",
Message: err.Error(),
Build: buildInfo,
Infos: info,
Redirect: r.RequestURI,
})
})
})
}
secured.Handle("/query", graphQLServer)
secured.HandleFunc("/search", func(rw http.ResponseWriter, r *http.Request) {
routerConfig.HandleSearchBar(rw, r, buildInfo)
})
routerConfig.SetupRoutes(secured, buildInfo)
// Send a searchId and then reply with a redirect to a user, or directly send query to job table for jobid and project.
secured.HandleFunc("/search", func(rw http.ResponseWriter, r *http.Request) {
routerConfig.HandleSearchBar(rw, r, buildInfo)
})
// API routes (JWT token auth)
s.router.Route("/api", func(apiRouter chi.Router) {
// Main API routes with API auth
apiRouter.Group(func(securedapi chi.Router) {
if !config.Keys.DisableAuthentication {
securedapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthAPI(next, onFailureResponse)
})
}
s.restAPIHandle.MountAPIRoutes(securedapi)
})
// Metric store API routes with separate auth
apiRouter.Group(func(metricstoreapi chi.Router) {
if !config.Keys.DisableAuthentication {
metricstoreapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthMetricStoreAPI(next, onFailureResponse)
})
}
s.restAPIHandle.MountMetricStoreAPIRoutes(metricstoreapi)
})
})
// User API routes
s.router.Route("/userapi", func(userapi chi.Router) {
if !config.Keys.DisableAuthentication {
userapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthUserAPI(next, onFailureResponse)
})
}
s.restAPIHandle.MountUserAPIRoutes(userapi)
})
// Config API routes (uses Group with full paths to avoid shadowing
// the /config page route that is registered in the secured group)
s.router.Group(func(configapi chi.Router) {
if !config.Keys.DisableAuthentication {
configapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthConfigAPI(next, onFailureResponse)
})
}
s.restAPIHandle.MountConfigAPIRoutes(configapi)
})
// Frontend API routes
s.router.Route("/frontend", func(frontendapi chi.Router) {
if !config.Keys.DisableAuthentication {
frontendapi.Use(func(next http.Handler) http.Handler {
return authHandle.AuthFrontendAPI(next, onFailureResponse)
})
}
s.restAPIHandle.MountFrontendAPIRoutes(frontendapi)
})
// Mount all /monitoring/... and /api/... routes.
routerConfig.SetupRoutes(secured, buildInfo)
s.restAPIHandle.MountAPIRoutes(securedapi)
s.restAPIHandle.MountUserAPIRoutes(userapi)
s.restAPIHandle.MountConfigAPIRoutes(configapi)
s.restAPIHandle.MountFrontendAPIRoutes(frontendapi)
if config.Keys.APISubjects != nil {
s.natsAPIHandle = api.NewNatsAPI()
@@ -279,59 +253,28 @@ func (s *Server) init() error {
}
}
// 404 handler for pages and API routes
notFoundHandler := func(rw http.ResponseWriter, r *http.Request) {
if strings.HasPrefix(r.URL.Path, "/api/") || strings.HasPrefix(r.URL.Path, "/userapi/") ||
strings.HasPrefix(r.URL.Path, "/frontend/") || strings.HasPrefix(r.URL.Path, "/config/") {
rw.Header().Set("Content-Type", "application/json")
rw.WriteHeader(http.StatusNotFound)
json.NewEncoder(rw).Encode(map[string]string{
"status": "Resource not found",
"error": "the requested endpoint does not exist",
})
return
}
rw.Header().Set("Content-Type", "text/html; charset=utf-8")
rw.WriteHeader(http.StatusNotFound)
web.RenderTemplate(rw, "404.tmpl", &web.Page{
Title: "Page Not Found",
Build: buildInfo,
})
}
// Set NotFound on the router so chi uses it for all unmatched routes,
// including those under subrouters like /api, /userapi, /frontend, etc.
s.router.NotFound(notFoundHandler)
s.restAPIHandle.MountMetricStoreAPIRoutes(metricstoreapi)
if config.Keys.EmbedStaticFiles {
if i, err := os.Stat("./var/img"); err == nil {
if i.IsDir() {
cclog.Info("Use local directory for static images")
s.router.Handle("/img/*", http.StripPrefix("/img/", http.FileServer(http.Dir("./var/img"))))
s.router.PathPrefix("/img/").Handler(http.StripPrefix("/img/", http.FileServer(http.Dir("./var/img"))))
}
}
fileServer := http.StripPrefix("/", web.ServeFiles())
s.router.Handle("/*", http.HandlerFunc(func(rw http.ResponseWriter, r *http.Request) {
if web.StaticFileExists(r.URL.Path) {
fileServer.ServeHTTP(rw, r)
return
}
notFoundHandler(rw, r)
}))
s.router.PathPrefix("/").Handler(http.StripPrefix("/", web.ServeFiles()))
} else {
staticDir := http.Dir(config.Keys.StaticFiles)
fileServer := http.FileServer(staticDir)
s.router.Handle("/*", http.HandlerFunc(func(rw http.ResponseWriter, r *http.Request) {
f, err := staticDir.Open(r.URL.Path)
if err == nil {
f.Close()
fileServer.ServeHTTP(rw, r)
return
}
notFoundHandler(rw, r)
}))
s.router.PathPrefix("/").Handler(http.FileServer(http.Dir(config.Keys.StaticFiles)))
}
s.router.Use(handlers.CompressHandler)
s.router.Use(handlers.RecoveryHandler(handlers.PrintRecoveryStack(true)))
s.router.Use(handlers.CORS(
handlers.AllowCredentials(),
handlers.AllowedHeaders([]string{"X-Requested-With", "Content-Type", "Authorization", "Origin"}),
handlers.AllowedMethods([]string{"GET", "POST", "HEAD", "OPTIONS"}),
handlers.AllowedOrigins([]string{"*"})))
return nil
}
@@ -342,6 +285,20 @@ const (
)
func (s *Server) Start(ctx context.Context) error {
handler := handlers.CustomLoggingHandler(io.Discard, s.router, func(_ io.Writer, params handlers.LogFormatterParams) {
cclog.Debugf("%s %s (%d, %.02fkb, %dms)",
params.Request.Method, params.URL.RequestURI(),
params.StatusCode, float32(params.Size)/1024,
time.Since(params.TimeStamp).Milliseconds())
})
// Use configurable timeouts with defaults
readTimeout := time.Duration(defaultReadTimeout) * time.Second
writeTimeout := time.Duration(defaultWriteTimeout) * time.Second
@@ -349,7 +306,7 @@ func (s *Server) Start(ctx context.Context) error {
s.server = &http.Server{
ReadTimeout: readTimeout,
WriteTimeout: writeTimeout,
Handler: s.router,
Handler: handler,
Addr: config.Keys.Addr,
}
@@ -388,7 +345,7 @@ func (s *Server) Start(ctx context.Context) error {
// Because this program will want to bind to a privileged port (like 80), the listener must
// be established first, then the user can be changed, and after that,
// the actual http server can be started.
if err := runtime.DropPrivileges(config.Keys.Group, config.Keys.User); err != nil {
if err := runtimeEnv.DropPrivileges(config.Keys.Group, config.Keys.User); err != nil {
return fmt.Errorf("dropping privileges: %w", err)
}
@@ -424,7 +381,7 @@ func (s *Server) Shutdown(ctx context.Context) {
}
// Archive all the metric store data
metricstore.Shutdown()
memorystore.Shutdown()
// Shutdown archiver with 10 second timeout for fast shutdown
if err := archiver.Shutdown(10 * time.Second); err != nil {


@@ -1,29 +1,96 @@
{
"main": {
"addr": "127.0.0.1:8080"
"addr": "127.0.0.1:8080",
"short-running-jobs-duration": 300,
"resampling": {
"minimumPoints": 600,
"trigger": 180,
"resolutions": [
240,
60
]
},
"apiAllowedIPs": [
"*"
],
"emission-constant": 317
},
"cron": {
"commit-job-worker": "1m",
"duration-worker": "3m",
"footprint-worker": "5m"
"commit-job-worker": "2m",
"duration-worker": "5m",
"footprint-worker": "10m"
},
"archive": {
"kind": "file",
"path": "./var/job-archive"
},
"auth": {
"jwts": {
"max-age": "2000h"
}
},
"metric-store-external": [
"nats": {
"address": "nats://0.0.0.0:4222",
"username": "root",
"password": "root"
},
"clusters": [
{
"scope": "fritz",
"url": "http://0.0.0.0:8082",
"token": "eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJleHAiOjE3NzU3Nzg4NDQsImlhdCI6MTc2ODU3ODg0NCwicm9sZXMiOlsiYWRtaW4iLCJhcGkiXSwic3ViIjoiZGVtbyJ9._SDEW9WaUVXSBFmWqGhyIZXLoqoDU8F1hkfh4cXKIqF4yw7w50IUpfUBtwUFUOnoviFKoi563f6RAMC7XxeLDA"
"name": "fritz",
"filterRanges": {
"numNodes": {
"from": 1,
"to": 64
},
"duration": {
"from": 0,
"to": 86400
},
"startTime": {
"from": "2022-01-01T00:00:00Z",
"to": null
}
}
},
{
"name": "alex",
"filterRanges": {
"numNodes": {
"from": 1,
"to": 64
},
"duration": {
"from": 0,
"to": 86400
},
"startTime": {
"from": "2022-01-01T00:00:00Z",
"to": null
}
}
}
],
"metric-store": {
"checkpoints": {
"interval": "12h"
"file-format": "avro",
"interval": "1h",
"directory": "./var/checkpoints",
"restore": "48h"
},
"archive": {
"interval": "1h",
"directory": "./var/archive"
},
"retention-in-memory": "48h",
"memory-cap": 100
"subscriptions": [
{
"subscribe-to": "hpc-nats",
"cluster-tag": "fritz"
},
{
"subscribe-to": "hpc-nats",
"cluster-tag": "alex"
}
]
}
}
}


@@ -5,95 +5,45 @@
"https-key-file": "/etc/letsencrypt/live/url/privkey.pem",
"user": "clustercockpit",
"group": "clustercockpit",
"api-allowed-ips": ["*"],
"validate": false,
"apiAllowedIPs": ["*"],
"short-running-jobs-duration": 300,
"enable-job-taggers": true,
"nodestate-retention": {
"policy": "move",
"target-kind": "file",
"target-path": "./var/nodestate-archive"
},
"resampling": {
"minimum-points": 600,
"minimumPoints": 600,
"trigger": 180,
"resolutions": [240, 60]
},
"api-subjects": {
"subject-job-event": "cc.job.event",
"subject-node-state": "cc.node.state"
}
},
"nats": {
"address": "nats://0.0.0.0:4222",
"username": "root",
"password": "root"
},
"auth": {
"jwts": {
"max-age": "2000h"
"resolutions": [
240,
60
]
}
},
"cron": {
"commit-job-worker": "1m",
"commit-job-worker": "2m",
"duration-worker": "5m",
"footprint-worker": "10m"
},
"archive": {
"kind": "s3",
"endpoint": "http://x.x.x.x",
"bucket": "jobarchive",
"access-key": "xx",
"secret-key": "xx",
"retention": {
"policy": "move",
"age": 365,
"location": "./var/archive"
}
"kind": "file",
"path": "./var/job-archive"
},
"metric-store-external": [
"clusters": [
{
"scope": "*",
"url": "http://x.x.x.x:8082",
"token": "MySecret"
},
{
"scope": "fritz",
"url": "http://x.x.x.x:8084",
"token": "MySecret"
},
{
"scope": "fritz-spr1tb",
"url": "http://x.x.x.x:8083",
"token": "MySecret"
},
{
"scope": "alex",
"url": "http://x.x.x.x:8084",
"token": "MySecret"
}
],
"metric-store": {
"checkpoints": {
"interval": "12h",
"directory": "./var/checkpoints"
},
"memory-cap": 100,
"retention-in-memory": "48h",
"cleanup": {
"mode": "archive",
"interval": "48h",
"directory": "./var/archive"
},
"nats-subscriptions": [
{
"subscribe-to": "hpc-nats",
"cluster-tag": "fritz"
},
{
"subscribe-to": "hpc-nats",
"cluster-tag": "alex"
"name": "test",
"filterRanges": {
"numNodes": {
"from": 1,
"to": 64
},
"duration": {
"from": 0,
"to": 86400
},
"startTime": {
"from": "2022-01-01T00:00:00Z",
"to": null
}
}
]
},
"ui-file": "ui-config.json"
}
]
}


@@ -1,419 +0,0 @@
# Job Tagging Configuration
ClusterCockpit provides automatic job tagging functionality to classify and
categorize jobs based on configurable rules. The tagging system consists of two
main components:
1. **Application Detection** - Identifies which application a job is running
2. **Job Classification** - Analyzes job performance characteristics and applies classification tags
## Directory Structure
```
configs/tagger/
├── apps/ # Application detection patterns
│ ├── vasp.txt
│ ├── gromacs.txt
│ └── ...
└── jobclasses/ # Job classification rules
├── parameters.json
├── lowUtilization.json
├── highload.json
└── ...
```
## Activating Tagger Rules
### Step 1: Copy Configuration Files
To activate tagging, review, adapt, and copy the configuration files from
`configs/tagger/` to `var/tagger/`:
```bash
# From the cc-backend root directory
mkdir -p var/tagger
cp -r configs/tagger/apps var/tagger/
cp -r configs/tagger/jobclasses var/tagger/
```
### Step 2: Enable Tagging in Configuration
Add or set the following configuration key in the `main` section of your `config.json`:
```json
{
"enable-job-taggers": true
}
```
**Important**: Automatic tagging is disabled by default. You must explicitly
enable it by setting `enable-job-taggers: true` in the main configuration file.
### Step 3: Restart cc-backend
The tagger system automatically loads configuration from `./var/tagger/` at
startup. After copying the files and enabling the feature, restart cc-backend:
```bash
./cc-backend -server
```
### Step 4: Verify Configuration Loaded
Check the logs for messages indicating successful configuration loading:
```
[INFO] Setup file watch for ./var/tagger/apps
[INFO] Setup file watch for ./var/tagger/jobclasses
```
## How Tagging Works
### Automatic Tagging
When `enable-job-taggers` is set to `true` in the configuration, tags are
automatically applied when:
- **Job Start**: Application detection runs immediately when a job starts
- **Job Stop**: Job classification runs when a job completes
The system analyzes job metadata and metrics to determine appropriate tags.
**Note**: Automatic tagging only works for jobs that start or stop after the
feature is enabled. Existing jobs are not automatically retagged.
### Manual Tagging (Retroactive)
To apply tags to existing jobs in the database, use the `-apply-tags` command
line option:
```bash
./cc-backend -apply-tags
```
This processes all jobs in the database and applies current tagging rules. This
is useful when:
- You have existing jobs that were created before tagging was enabled
- You've added new tagging rules and want to apply them to historical data
- You've modified existing rules and want to re-evaluate all jobs
### Hot Reload
The tagger system watches the configuration directories for changes. You can
modify or add rules without restarting `cc-backend`:
- Changes to `var/tagger/apps/*` are detected automatically
- Changes to `var/tagger/jobclasses/*` are detected automatically
## Application Detection
Application detection identifies which software a job is running by matching
patterns in the job script.
### Configuration Format
Application patterns are stored in text files under `var/tagger/apps/`. Each
file contains one or more regular expression patterns (one per line) that match
against the job script.
**Example: `apps/vasp.txt`**
```
vasp
VASP
```
### How It Works
1. When a job starts, the system retrieves the job script from metadata
2. Each line in the app files is treated as a regex pattern
3. Each pattern is matched against the lowercased job script, so matching is effectively case-insensitive
4. If a match is found, a tag of type `app` with the filename (without extension) is applied
5. Only the first matching application is tagged
### Adding New Applications
1. Create a new file in `var/tagger/apps/` (e.g., `tensorflow.txt`)
2. Add regex patterns, one per line:
```
tensorflow
tf\.keras
import tensorflow
```
3. The file is automatically detected and loaded
**Note**: The tag name will be the filename without the `.txt` extension (e.g., `tensorflow`).
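The matching flow described above can be sketched in plain Go. This is an illustrative sketch, not the actual cc-backend tagger code; `detectApp`, `appPatterns`, and the inline pattern table are invented for the example.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// appPatterns pairs a tag name (the file name without .txt) with the
// regex patterns loaded from that file.
type appPatterns struct {
	name     string
	patterns []string
}

// detectApp lowercases the job script and returns the first app whose
// patterns match, mirroring the "first match wins" behavior above.
func detectApp(jobScript string, apps []appPatterns) (string, bool) {
	script := strings.ToLower(jobScript)
	for _, app := range apps {
		for _, p := range app.patterns {
			if matched, err := regexp.MatchString(p, script); err == nil && matched {
				return app.name, true
			}
		}
	}
	return "", false
}

func main() {
	apps := []appPatterns{
		{name: "vasp", patterns: []string{"vasp"}},
		{name: "tensorflow", patterns: []string{"tensorflow", `tf\.keras`, "import tensorflow"}},
	}
	// The lowercased script contains "vasp", so the first entry matches.
	app, ok := detectApp("#!/bin/bash\nsrun ./VASP_std input", apps)
	fmt.Println(app, ok)
}
```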
## Job Classification
Job classification analyzes completed jobs based on their metrics and properties
to identify performance issues or characteristics.
### Configuration Format
Job classification rules are defined in JSON files under
`var/tagger/jobclasses/`. Each rule file defines:
- **Metrics required**: Which job metrics to analyze
- **Requirements**: Pre-conditions that must be met
- **Variables**: Computed values used in the rule
- **Rule expression**: Boolean expression that determines if the rule matches
- **Hint template**: Message displayed when the rule matches
### Parameters File
`jobclasses/parameters.json` defines shared threshold values used across multiple rules:
```json
{
"lowcpuload_threshold_factor": 0.9,
"highmemoryusage_threshold_factor": 0.9,
"job_min_duration_seconds": 600.0,
"sampling_interval_seconds": 30.0
}
```
### Rule File Structure
**Example: `jobclasses/lowUtilization.json`**
```json
{
"name": "Low resource utilization",
"tag": "lowutilization",
"parameters": ["job_min_duration_seconds"],
"metrics": ["flops_any", "mem_bw"],
"requirements": [
"job.shared == \"none\"",
"job.duration > job_min_duration_seconds"
],
"variables": [
{
"name": "mem_bw_perc",
"expr": "1.0 - (mem_bw.avg / mem_bw.limits.peak)"
}
],
"rule": "flops_any.avg < flops_any.limits.alert",
"hint": "Average flop rate {{.flops_any.avg}} falls below threshold {{.flops_any.limits.alert}}"
}
```
#### Field Descriptions
| Field | Description |
| -------------- | ----------------------------------------------------------------------------- |
| `name` | Human-readable description of the rule |
| `tag` | Tag identifier applied when the rule matches |
| `parameters` | List of parameter names from `parameters.json` to include in rule environment |
| `metrics` | List of metrics required for evaluation (must be present in job data) |
| `requirements` | Boolean expressions that must all be true for the rule to be evaluated |
| `variables` | Named expressions computed before evaluating the main rule |
| `rule` | Boolean expression that determines if the job matches this classification |
| `hint` | Go template string for generating a user-visible message |
### Expression Environment
Expressions in `requirements`, `variables`, and `rule` have access to:
**Job Properties:**
- `job.shared` - Shared node allocation type
- `job.duration` - Job runtime in seconds
- `job.numCores` - Number of CPU cores
- `job.numNodes` - Number of nodes
- `job.jobState` - Job completion state
- `job.numAcc` - Number of accelerators
- `job.smt` - SMT setting
**Metric Statistics (for each metric in `metrics`):**
- `<metric>.min` - Minimum value
- `<metric>.max` - Maximum value
- `<metric>.avg` - Average value
- `<metric>.limits.peak` - Peak limit from cluster config
- `<metric>.limits.normal` - Normal threshold
- `<metric>.limits.caution` - Caution threshold
- `<metric>.limits.alert` - Alert threshold
**Parameters:**
- All parameters listed in the `parameters` field
**Variables:**
- All variables defined in the `variables` array
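To make the environment concrete, the `lowUtilization` rule above can be traced by hand in plain Go. The real tagger evaluates expr expressions against this environment; the struct layout, field values, and thresholds below are invented for illustration.

```go
package main

import "fmt"

// metricStats mirrors a metric entry in the rule environment:
// statistics plus limits taken from the cluster configuration.
type metricStats struct {
	Avg   float64
	Peak  float64 // limits.peak
	Alert float64 // limits.alert
}

// lowUtilization hand-codes the rule: check the requirements, compute
// the mem_bw_perc variable, then evaluate the rule expression.
// It returns whether the tag applies and the computed variable.
func lowUtilization(shared string, duration, minDuration float64, flopsAny, memBW metricStats) (bool, float64) {
	// requirements: all must hold, otherwise the rule is skipped
	if shared != "none" || duration <= minDuration {
		return false, 0
	}
	// variables: computed before the main rule expression
	memBWPerc := 1.0 - (memBW.Avg / memBW.Peak)
	// rule: flops_any.avg < flops_any.limits.alert
	return flopsAny.Avg < flopsAny.Alert, memBWPerc
}

func main() {
	flopsAny := metricStats{Avg: 12.5, Alert: 50.0}
	memBW := metricStats{Avg: 30.0, Peak: 200.0}
	matched, memBWPerc := lowUtilization("none", 7200, 600, flopsAny, memBW)
	fmt.Printf("matched=%v mem_bw_perc=%.2f\n", matched, memBWPerc)
}
```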
### Expression Language
Rules use the [expr](https://github.com/expr-lang/expr) language for expressions. Supported operations:
- **Arithmetic**: `+`, `-`, `*`, `/`, `%`, `^`
- **Comparison**: `==`, `!=`, `<`, `<=`, `>`, `>=`
- **Logical**: `&&`, `||`, `!`
- **Functions**: Standard math functions (see expr documentation)
### Hint Templates
Hints use Go's `text/template` syntax. Variables from the evaluation environment are accessible:
```
{{.flops_any.avg}} # Access metric average
{{.job.duration}} # Access job property
{{.my_variable}} # Access computed variable
```
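Since hints are ordinary Go templates, their rendering can be reproduced with the standard library alone. The nested map below stands in for the evaluation environment; its keys and values are illustrative, not taken from a real job.

```go
package main

import (
	"bytes"
	"fmt"
	"text/template"
)

// renderHint executes a hint template against an evaluation
// environment. Each metric is a nested map, so a reference like
// {{.flops_any.avg}} resolves through the map levels.
func renderHint(hint string, env map[string]any) (string, error) {
	tmpl, err := template.New("hint").Parse(hint)
	if err != nil {
		return "", err
	}
	var out bytes.Buffer
	if err := tmpl.Execute(&out, env); err != nil {
		return "", err
	}
	return out.String(), nil
}

func main() {
	env := map[string]any{
		"flops_any": map[string]any{
			"avg":    12.5,
			"limits": map[string]any{"alert": 50.0},
		},
	}
	msg, err := renderHint(
		"Average flop rate {{.flops_any.avg}} falls below threshold {{.flops_any.limits.alert}}",
		env)
	if err != nil {
		panic(err)
	}
	fmt.Println(msg)
}
```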
### Adding New Classification Rules
1. Create a new JSON file in `var/tagger/jobclasses/` (e.g., `memoryLeak.json`)
2. Define the rule structure:
```json
{
"name": "Memory Leak Detection",
"tag": "memory_leak",
"parameters": ["memory_leak_slope_threshold"],
"metrics": ["mem_used"],
"requirements": ["job.duration > 3600"],
"variables": [
{
"name": "mem_growth",
"expr": "(mem_used.max - mem_used.min) / job.duration"
}
],
"rule": "mem_growth > memory_leak_slope_threshold",
"hint": "Memory usage grew by {{.mem_growth}} per second"
}
```
3. Add any new parameters to `parameters.json`
4. The file is automatically detected and loaded
## Configuration Paths
The tagger system reads from these paths (relative to cc-backend working directory):
- **Application patterns**: `./var/tagger/apps/`
- **Job classification rules**: `./var/tagger/jobclasses/`
These paths are defined as constants in the source code and cannot be changed without recompiling.
## Troubleshooting
### Tags Not Applied
1. **Check tagging is enabled**: Verify `enable-job-taggers: true` is set in `config.json`
2. **Check configuration exists**:
```bash
ls -la var/tagger/apps
ls -la var/tagger/jobclasses
```
3. **Check logs for errors**:
```bash
./cc-backend -server -loglevel debug
```
4. **Verify file permissions**: Ensure cc-backend can read the configuration files
5. **For existing jobs**: Use `./cc-backend -apply-tags` to retroactively tag jobs
### Rules Not Matching
1. **Enable debug logging**: Set `loglevel: debug` to see detailed rule evaluation
2. **Check requirements**: Ensure all requirements in the rule are satisfied
3. **Verify metrics exist**: Classification rules require job metrics to be available
4. **Check metric names**: Ensure metric names match those in your cluster configuration
### File Watch Not Working
If changes to configuration files aren't detected:
1. Restart cc-backend to reload all configuration
2. Check filesystem supports file watching (network filesystems may not)
3. Check logs for file watch setup messages
## Best Practices
1. **Start Simple**: Begin with basic rules and refine based on results
2. **Use Requirements**: Filter out irrelevant jobs early with requirements
3. **Test Incrementally**: Add one rule at a time and verify behavior
4. **Document Rules**: Use descriptive names and clear hint messages
5. **Share Parameters**: Define common thresholds in `parameters.json` for consistency
6. **Version Control**: Keep your `var/tagger/` configuration in version control
7. **Backup Before Changes**: Test new rules on a copy before deploying to production
## Examples
### Simple Application Detection
**File: `var/tagger/apps/python.txt`**
```
python
python3
\.py
```
This detects jobs running Python scripts.
### Complex Classification Rule
**File: `var/tagger/jobclasses/cpuImbalance.json`**
```json
{
"name": "CPU Load Imbalance",
"tag": "cpu_imbalance",
"parameters": ["core_load_imbalance_threshold_factor"],
"metrics": ["cpu_load"],
"requirements": ["job.numCores > 1", "job.duration > 600"],
"variables": [
{
"name": "load_variance",
"expr": "(cpu_load.max - cpu_load.min) / cpu_load.avg"
}
],
"rule": "load_variance > core_load_imbalance_threshold_factor",
"hint": "CPU load varies by {{printf \"%.1f%%\" (load_variance * 100)}} across cores"
}
```
This detects jobs where CPU load is unevenly distributed across cores.
## Reference
### Configuration Options
**Main Configuration (`config.json`)**:
- `enable-job-taggers` (boolean, default: `false`) - Enables automatic job tagging system
- Must be set to `true` to activate automatic tagging on job start/stop events
- Does not affect the `-apply-tags` command line option
**Command Line Options**:
- `-apply-tags` - Apply all tagging rules to existing jobs in the database
- Works independently of `enable-job-taggers` configuration
- Useful for retroactively tagging jobs or re-evaluating with updated rules
### Default Configuration Location
The example configurations are provided in:
- `configs/tagger/apps/` - Example application patterns (16 applications)
- `configs/tagger/jobclasses/` - Example classification rules (3 rules)
Copy these to `var/tagger/` and customize for your environment.
### Tag Types
- `app` - Application tags (e.g., "vasp", "gromacs")
- `jobClass` - Classification tags (e.g., "lowutilization", "highload")
Tags can be queried and filtered in the ClusterCockpit UI and API.

View File

@@ -1,3 +0,0 @@
vasp_gam
vasp_ncl
vasp_std

View File

@@ -1,21 +0,0 @@
{
"name": "High memory usage",
"tag": "highmemory",
"parameters": [
"highmemoryusage_threshold_factor",
"job_min_duration_seconds"
],
"metrics": ["mem_used"],
"requirements": [
"job.shared == \"none\"",
"job.duration > job_min_duration_seconds"
],
"variables": [
{
"name": "memory_usage_pct",
"expr": "mem_used.max / mem_used.limits.peak * 100.0"
}
],
"rule": "mem_used.max > memory_used.limits.alert",
"hint": "This job used high memory: peak memory usage {{.mem_used.max}} GB ({{.memory_usage_pct}}% of {{.mem_used.limits.peak}} GB node capacity), exceeding the {{.highmemoryusage_threshold_factor}} utilization threshold. Risk of out-of-memory conditions."
}

View File

@@ -1,22 +0,0 @@
{
"name": "Low resource utilization",
"tag": "lowutilization",
"parameters": ["job_min_duration_seconds"],
"metrics": ["flops_any", "mem_bw"],
"requirements": [
"job.shared == \"none\"",
"job.duration > job_min_duration_seconds"
],
"variables": [
{
"name": "mem_bw_pct",
"expr": "mem_bw.avg / mem_bw.limits.peak * 100.0"
},
{
"name": "flops_any_pct",
"expr": "flops_any.avg / flops_any.limits.peak * 100.0"
}
],
"rule": "flops_any.avg < flops_any.limits.alert && mem_bw.avg < mem_bw.limits.alert",
"hint": "This job shows low resource utilization: FLOP rate {{.flops_any.avg}} GF/s ({{.flops_any_pct}}% of peak) and memory bandwidth {{.mem_bw.avg}} GB/s ({{.mem_bw_pct}}% of peak) are both below their alert thresholds."
}

View File

@@ -1,18 +0,0 @@
{
"name": "Low CPU load",
"tag": "lowload",
"parameters": ["lowcpuload_threshold_factor", "job_min_duration_seconds"],
"metrics": ["cpu_load"],
"requirements": [
"job.shared == \"none\"",
"job.duration > job_min_duration_seconds"
],
"variables": [
{
"name": "load_threshold",
"expr": "cpu_load.limits.peak * lowcpuload_threshold_factor"
}
],
"rule": "cpu_load.avg < load_threshold",
"hint": "This job was detected as low CPU load: average cpu load {{.cpu_load.avg}} is below the threshold {{.load_threshold}} ({{.lowcpuload_threshold_factor}})."
}

View File

@@ -1,22 +0,0 @@
{
"name": "Memory bandwidth bound",
"tag": "memorybound",
"parameters": ["membound_bw_threshold_factor", "job_min_duration_seconds"],
"metrics": ["mem_bw"],
"requirements": [
"job.shared == \"none\"",
"job.duration > job_min_duration_seconds"
],
"variables": [
{
"name": "mem_bw_threshold",
"expr": "mem_bw.limits.peak * membound_bw_threshold_factor"
},
{
"name": "mem_bw_pct",
"expr": "mem_bw.avg / mem_bw.limits.peak * 100.0"
}
],
"rule": "mem_bw.avg > mem_bw_threshold",
"hint": "This job is memory bandwidth bound: memory bandwidth {{.mem_bw.avg}} GB/s ({{.mem_bw_pct}}% of peak) is within {{.membound_bw_threshold_factor}} of peak bandwidth. Consider improving data reuse or compute intensity."
}

View File

@@ -1,38 +1,38 @@
{
"job-list": {
"use-paging": false,
"show-footprint":false
"jobList": {
"usePaging": false,
"showFootprint":false
},
"job-view": {
"show-polar-plot": true,
"show-footprint": true,
"show-roofline": true,
"show-stat-table": true
"jobView": {
"showPolarPlot": true,
"showFootprint": true,
"showRoofline": true,
"showStatTable": true
},
"metric-config": {
"job-list-metrics": ["mem_bw", "flops_dp"],
"job-view-plot-metrics": ["mem_bw", "flops_dp"],
"job-view-table-metrics": ["mem_bw", "flops_dp"],
"metricConfig": {
"jobListMetrics": ["mem_bw", "flops_dp"],
"jobViewPlotMetrics": ["mem_bw", "flops_dp"],
"jobViewTableMetrics": ["mem_bw", "flops_dp"],
"clusters": [
{
"name": "test",
"sub-clusters": [
"subClusters": [
{
"name": "one",
"job-list-metrics": ["mem_used", "flops_sp"]
"jobListMetrics": ["mem_used", "flops_sp"]
}
]
}
]
},
"node-list": {
"use-paging": true
"nodeList": {
"usePaging": true
},
"plot-configuration": {
"plots-per-row": 3,
"color-background": true,
"line-width": 3,
"color-scheme": [
"plotConfiguration": {
"plotsPerRow": 3,
"colorBackground": true,
"lineWidth": 3,
"colorScheme": [
"#00bfff",
"#0000ff",
"#ff00ff",

124
go.mod
View File

@@ -1,6 +1,8 @@
module github.com/ClusterCockpit/cc-backend
go 1.25.0
go 1.24.0
toolchain go1.24.1
tool (
github.com/99designs/gqlgen
@@ -8,68 +10,71 @@ tool (
)
require (
github.com/99designs/gqlgen v0.17.86
github.com/ClusterCockpit/cc-lib/v2 v2.7.0
github.com/ClusterCockpit/cc-line-protocol/v2 v2.4.0
github.com/99designs/gqlgen v0.17.84
github.com/ClusterCockpit/cc-lib v1.0.2
github.com/Masterminds/squirrel v1.5.4
github.com/aws/aws-sdk-go-v2 v1.41.1
github.com/aws/aws-sdk-go-v2/config v1.32.8
github.com/aws/aws-sdk-go-v2/credentials v1.19.8
github.com/aws/aws-sdk-go-v2/service/s3 v1.96.0
github.com/coreos/go-oidc/v3 v3.17.0
github.com/expr-lang/expr v1.17.8
github.com/go-chi/chi/v5 v5.2.5
github.com/go-chi/cors v1.2.2
github.com/go-co-op/gocron/v2 v2.19.1
github.com/aws/aws-sdk-go-v2 v1.41.0
github.com/aws/aws-sdk-go-v2/config v1.31.20
github.com/aws/aws-sdk-go-v2/credentials v1.18.24
github.com/aws/aws-sdk-go-v2/service/s3 v1.90.2
github.com/coreos/go-oidc/v3 v3.16.0
github.com/expr-lang/expr v1.17.6
github.com/go-co-op/gocron/v2 v2.18.2
github.com/go-ldap/ldap/v3 v3.4.12
github.com/golang-jwt/jwt/v5 v5.3.1
github.com/golang-jwt/jwt/v5 v5.3.0
github.com/golang-migrate/migrate/v4 v4.19.1
github.com/google/gops v0.3.29
github.com/google/gops v0.3.28
github.com/gorilla/handlers v1.5.2
github.com/gorilla/mux v1.8.1
github.com/gorilla/sessions v1.4.0
github.com/influxdata/line-protocol/v2 v2.2.1
github.com/jmoiron/sqlx v1.4.0
github.com/joho/godotenv v1.5.1
github.com/mattn/go-sqlite3 v1.14.34
github.com/parquet-go/parquet-go v0.27.0
github.com/linkedin/goavro/v2 v2.14.1
github.com/mattn/go-sqlite3 v1.14.32
github.com/nats-io/nats.go v1.47.0
github.com/prometheus/client_golang v1.23.2
github.com/prometheus/common v0.67.4
github.com/qustavo/sqlhooks/v2 v2.1.0
github.com/santhosh-tekuri/jsonschema/v5 v5.3.1
github.com/stretchr/testify v1.11.1
github.com/swaggo/http-swagger v1.3.4
github.com/swaggo/swag v1.16.6
github.com/vektah/gqlparser/v2 v2.5.31
golang.org/x/crypto v0.48.0
golang.org/x/oauth2 v0.35.0
golang.org/x/crypto v0.45.0
golang.org/x/oauth2 v0.32.0
golang.org/x/time v0.14.0
)
require (
github.com/Azure/go-ntlmssp v0.1.0 // indirect
github.com/Azure/go-ntlmssp v0.0.0-20221128193559-754e69321358 // indirect
github.com/KyleBanks/depth v1.2.1 // indirect
github.com/agnivade/levenshtein v1.2.1 // indirect
github.com/andybalholm/brotli v1.2.0 // indirect
github.com/apapsch/go-jsonmerge/v2 v2.0.0 // indirect
github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream v1.7.4 // indirect
github.com/aws/aws-sdk-go-v2/feature/ec2/imds v1.18.17 // indirect
github.com/aws/aws-sdk-go-v2/internal/configsources v1.4.17 // indirect
github.com/aws/aws-sdk-go-v2/internal/endpoints/v2 v2.7.17 // indirect
github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream v1.7.3 // indirect
github.com/aws/aws-sdk-go-v2/feature/ec2/imds v1.18.13 // indirect
github.com/aws/aws-sdk-go-v2/internal/configsources v1.4.13 // indirect
github.com/aws/aws-sdk-go-v2/internal/endpoints/v2 v2.7.13 // indirect
github.com/aws/aws-sdk-go-v2/internal/ini v1.8.4 // indirect
github.com/aws/aws-sdk-go-v2/internal/v4a v1.4.17 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/accept-encoding v1.13.4 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/checksum v1.9.8 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/presigned-url v1.13.17 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/s3shared v1.19.17 // indirect
github.com/aws/aws-sdk-go-v2/service/signin v1.0.5 // indirect
github.com/aws/aws-sdk-go-v2/service/sso v1.30.9 // indirect
github.com/aws/aws-sdk-go-v2/service/ssooidc v1.35.14 // indirect
github.com/aws/aws-sdk-go-v2/service/sts v1.41.6 // indirect
github.com/aws/aws-sdk-go-v2/internal/v4a v1.4.13 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/accept-encoding v1.13.3 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/checksum v1.9.4 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/presigned-url v1.13.13 // indirect
github.com/aws/aws-sdk-go-v2/service/internal/s3shared v1.19.13 // indirect
github.com/aws/aws-sdk-go-v2/service/sso v1.30.3 // indirect
github.com/aws/aws-sdk-go-v2/service/ssooidc v1.35.7 // indirect
github.com/aws/aws-sdk-go-v2/service/sts v1.40.2 // indirect
github.com/aws/smithy-go v1.24.0 // indirect
github.com/beorn7/perks v1.0.1 // indirect
github.com/cespare/xxhash/v2 v2.3.0 // indirect
github.com/cpuguy83/go-md2man/v2 v2.0.7 // indirect
github.com/davecgh/go-spew v1.1.2-0.20180830191138-d8f796af33cc // indirect
github.com/felixge/httpsnoop v1.0.4 // indirect
github.com/fsnotify/fsnotify v1.9.0 // indirect
github.com/go-asn1-ber/asn1-ber v1.5.8-0.20250403174932-29230038a667 // indirect
github.com/go-jose/go-jose/v4 v4.1.3 // indirect
github.com/go-openapi/jsonpointer v0.22.4 // indirect
github.com/go-openapi/jsonreference v0.21.4 // indirect
github.com/go-openapi/spec v0.22.3 // indirect
github.com/go-openapi/jsonpointer v0.22.3 // indirect
github.com/go-openapi/jsonreference v0.21.3 // indirect
github.com/go-openapi/spec v0.22.1 // indirect
github.com/go-openapi/swag/conv v0.25.4 // indirect
github.com/go-openapi/swag/jsonname v0.25.4 // indirect
github.com/go-openapi/swag/jsonutils v0.25.4 // indirect
@@ -77,48 +82,45 @@ require (
github.com/go-openapi/swag/stringutils v0.25.4 // indirect
github.com/go-openapi/swag/typeutils v0.25.4 // indirect
github.com/go-openapi/swag/yamlutils v0.25.4 // indirect
github.com/go-viper/mapstructure/v2 v2.5.0 // indirect
github.com/goccy/go-yaml v1.19.2 // indirect
github.com/go-viper/mapstructure/v2 v2.4.0 // indirect
github.com/goccy/go-yaml v1.19.0 // indirect
github.com/golang/snappy v0.0.4 // indirect
github.com/google/uuid v1.6.0 // indirect
github.com/gorilla/securecookie v1.1.2 // indirect
github.com/gorilla/websocket v1.5.3 // indirect
github.com/hashicorp/golang-lru/v2 v2.0.7 // indirect
github.com/influxdata/influxdb-client-go/v2 v2.14.0 // indirect
github.com/influxdata/line-protocol v0.0.0-20210922203350-b1ad95c89adf // indirect
github.com/jonboulle/clockwork v0.5.0 // indirect
github.com/klauspost/compress v1.18.4 // indirect
github.com/kr/pretty v0.3.1 // indirect
github.com/jpillora/backoff v1.0.0 // indirect
github.com/json-iterator/go v1.1.12 // indirect
github.com/klauspost/compress v1.18.1 // indirect
github.com/lann/builder v0.0.0-20180802200727-47ae307949d0 // indirect
github.com/lann/ps v0.0.0-20150810152359-62de8c46ede0 // indirect
github.com/nats-io/nats.go v1.49.0 // indirect
github.com/nats-io/nkeys v0.4.15 // indirect
github.com/modern-go/concurrent v0.0.0-20180306012644-bacd9c7ef1dd // indirect
github.com/modern-go/reflect2 v1.0.2 // indirect
github.com/munnerz/goautoneg v0.0.0-20191010083416-a7dc8b61c822 // indirect
github.com/mwitkow/go-conntrack v0.0.0-20190716064945-2f068394615f // indirect
github.com/nats-io/nkeys v0.4.11 // indirect
github.com/nats-io/nuid v1.0.1 // indirect
github.com/oapi-codegen/runtime v1.2.0 // indirect
github.com/parquet-go/bitpack v1.0.0 // indirect
github.com/parquet-go/jsonlite v1.4.0 // indirect
github.com/pierrec/lz4/v4 v4.1.25 // indirect
github.com/pmezard/go-difflib v1.0.1-0.20181226105442-5d4384ee4fb2 // indirect
github.com/prometheus/client_model v0.6.2 // indirect
github.com/prometheus/procfs v0.16.1 // indirect
github.com/robfig/cron/v3 v3.0.1 // indirect
github.com/rogpeppe/go-internal v1.10.0 // indirect
github.com/russross/blackfriday/v2 v2.1.0 // indirect
github.com/sosodev/duration v1.3.1 // indirect
github.com/stmcginnis/gofish v0.21.3 // indirect
github.com/stretchr/objx v0.5.2 // indirect
github.com/swaggo/files v1.0.1 // indirect
github.com/twpayne/go-geom v1.6.1 // indirect
github.com/urfave/cli/v2 v2.27.7 // indirect
github.com/urfave/cli/v3 v3.6.1 // indirect
github.com/xrash/smetrics v0.0.0-20250705151800-55b8f293f342 // indirect
go.yaml.in/yaml/v2 v2.4.3 // indirect
go.yaml.in/yaml/v3 v3.0.4 // indirect
golang.org/x/exp v0.0.0-20260218203240-3dfff04db8fa // indirect
golang.org/x/mod v0.33.0 // indirect
golang.org/x/net v0.51.0 // indirect
golang.org/x/sync v0.19.0 // indirect
golang.org/x/sys v0.41.0 // indirect
golang.org/x/text v0.34.0 // indirect
golang.org/x/tools v0.42.0 // indirect
google.golang.org/protobuf v1.36.11 // indirect
golang.org/x/mod v0.30.0 // indirect
golang.org/x/net v0.47.0 // indirect
golang.org/x/sync v0.18.0 // indirect
golang.org/x/sys v0.38.0 // indirect
golang.org/x/text v0.31.0 // indirect
golang.org/x/tools v0.39.0 // indirect
google.golang.org/protobuf v1.36.10 // indirect
gopkg.in/yaml.v3 v3.0.1 // indirect
sigs.k8s.io/yaml v1.6.0 // indirect
)

274
go.sum
View File

@@ -1,15 +1,11 @@
filippo.io/edwards25519 v1.1.0 h1:FNf4tywRC1HmFuKW5xopWpigGjJKiJSV0Cqo0cJWDaA=
filippo.io/edwards25519 v1.1.0/go.mod h1:BxyFTGdWcka3PhytdK4V28tE5sGfRvvvRV7EaN4VDT4=
github.com/99designs/gqlgen v0.17.86 h1:C8N3UTa5heXX6twl+b0AJyGkTwYL6dNmFrgZNLRcU6w=
github.com/99designs/gqlgen v0.17.86/go.mod h1:KTrPl+vHA1IUzNlh4EYkl7+tcErL3MgKnhHrBcV74Fw=
github.com/Azure/go-ntlmssp v0.1.0 h1:DjFo6YtWzNqNvQdrwEyr/e4nhU3vRiwenz5QX7sFz+A=
github.com/Azure/go-ntlmssp v0.1.0/go.mod h1:NYqdhxd/8aAct/s4qSYZEerdPuH1liG2/X9DiVTbhpk=
github.com/ClusterCockpit/cc-lib/v2 v2.7.0 h1:EMTShk6rMTR1wlfmQ8SVCawH1OdltUbD3kVQmaW+5pE=
github.com/ClusterCockpit/cc-lib/v2 v2.7.0/go.mod h1:0Etx8WMs0lYZ4tiOQizY18CQop+2i3WROvU9rMUxHA4=
github.com/ClusterCockpit/cc-line-protocol/v2 v2.4.0 h1:hIzxgTBWcmCIHtoDKDkSCsKCOCOwUC34sFsbD2wcW0Q=
github.com/ClusterCockpit/cc-line-protocol/v2 v2.4.0/go.mod h1:y42qUu+YFmu5fdNuUAS4VbbIKxVjxCvbVqFdpdh8ahY=
github.com/DATA-DOG/go-sqlmock v1.5.2 h1:OcvFkGmslmlZibjAjaHm3L//6LiuBgolP7OputlJIzU=
github.com/DATA-DOG/go-sqlmock v1.5.2/go.mod h1:88MAG/4G7SMwSE3CeA0ZKzrT5CiOU3OJ+JlNzwDqpNU=
github.com/99designs/gqlgen v0.17.84 h1:iVMdiStgUVx/BFkMb0J5GAXlqfqtQ7bqMCYK6v52kQ0=
github.com/99designs/gqlgen v0.17.84/go.mod h1:qjoUqzTeiejdo+bwUg8unqSpeYG42XrcrQboGIezmFA=
github.com/Azure/go-ntlmssp v0.0.0-20221128193559-754e69321358 h1:mFRzDkZVAjdal+s7s0MwaRv9igoPqLRdzOLzw/8Xvq8=
github.com/Azure/go-ntlmssp v0.0.0-20221128193559-754e69321358/go.mod h1:chxPXzSsl7ZWRAuOIE23GDNzjWuZquvFlgA8xmpunjU=
github.com/ClusterCockpit/cc-lib v1.0.2 h1:ZWn3oZkXgxrr3zSigBdlOOfayZ4Om4xL20DhmritPPg=
github.com/ClusterCockpit/cc-lib v1.0.2/go.mod h1:UGdOvXEnjFqlnPSxtvtFwO6BtXYW6NnXFoud9FtN93k=
github.com/KyleBanks/depth v1.2.1 h1:5h8fQADFrWtarTdtDudMmGsC7GPbOAu6RVB3ffsVFHc=
github.com/KyleBanks/depth v1.2.1/go.mod h1:jzSb9d0L43HxTQfT+oSA1EEp2q+ne2uh6XgeJcm8brE=
github.com/Masterminds/squirrel v1.5.4 h1:uUcX/aBc8O7Fg9kaISIUsHXdKuqehiXAMQTYX8afzqM=
@@ -18,72 +14,60 @@ github.com/NVIDIA/go-nvml v0.13.0-1 h1:OLX8Jq3dONuPOQPC7rndB6+iDmDakw0XTYgzMxObk
github.com/NVIDIA/go-nvml v0.13.0-1/go.mod h1:+KNA7c7gIBH7SKSJ1ntlwkfN80zdx8ovl4hrK3LmPt4=
github.com/PuerkitoBio/goquery v1.11.0 h1:jZ7pwMQXIITcUXNH83LLk+txlaEy6NVOfTuP43xxfqw=
github.com/PuerkitoBio/goquery v1.11.0/go.mod h1:wQHgxUOU3JGuj3oD/QFfxUdlzW6xPHfqyHre6VMY4DQ=
github.com/RaveNoX/go-jsoncommentstrip v1.0.0/go.mod h1:78ihd09MekBnJnxpICcwzCMzGrKSKYe4AqU6PDYYpjk=
github.com/agnivade/levenshtein v1.2.1 h1:EHBY3UOn1gwdy/VbFwgo4cxecRznFk7fKWN1KOX7eoM=
github.com/agnivade/levenshtein v1.2.1/go.mod h1:QVVI16kDrtSuwcpd0p1+xMC6Z/VfhtCyDIjcwga4/DU=
github.com/alecthomas/assert/v2 v2.10.0 h1:jjRCHsj6hBJhkmhznrCzoNpbA3zqy0fYiUcYZP/GkPY=
github.com/alecthomas/assert/v2 v2.10.0/go.mod h1:Bze95FyfUr7x34QZrjL+XP+0qgp/zg8yS+TtBj1WA3k=
github.com/alecthomas/repr v0.4.0 h1:GhI2A8MACjfegCPVq9f1FLvIBS+DrQ2KQBFZP1iFzXc=
github.com/alecthomas/repr v0.4.0/go.mod h1:Fr0507jx4eOXV7AlPV6AVZLYrLIuIeSOWtW57eE/O/4=
github.com/alexbrainman/sspi v0.0.0-20250919150558-7d374ff0d59e h1:4dAU9FXIyQktpoUAgOJK3OTFc/xug0PCXYCqU0FgDKI=
github.com/alexbrainman/sspi v0.0.0-20250919150558-7d374ff0d59e/go.mod h1:cEWa1LVoE5KvSD9ONXsZrj0z6KqySlCCNKHlLzbqAt4=
github.com/andreyvit/diff v0.0.0-20170406064948-c7f18ee00883 h1:bvNMNQO63//z+xNgfBlViaCIJKLlCJ6/fmUseuG0wVQ=
github.com/andreyvit/diff v0.0.0-20170406064948-c7f18ee00883/go.mod h1:rCTlJbsFo29Kk6CurOXKm700vrz8f0KW0JNfpkRJY/8=
github.com/andybalholm/brotli v1.2.0 h1:ukwgCxwYrmACq68yiUqwIWnGY0cTPox/M94sVwToPjQ=
github.com/andybalholm/brotli v1.2.0/go.mod h1:rzTDkvFWvIrjDXZHkuS16NPggd91W3kUSvPlQ1pLaKY=
github.com/andybalholm/cascadia v1.3.3 h1:AG2YHrzJIm4BZ19iwJ/DAua6Btl3IwJX+VI4kktS1LM=
github.com/andybalholm/cascadia v1.3.3/go.mod h1:xNd9bqTn98Ln4DwST8/nG+H0yuB8Hmgu1YHNnWw0GeA=
github.com/antithesishq/antithesis-sdk-go v0.5.0-default-no-op h1:Ucf+QxEKMbPogRO5guBNe5cgd9uZgfoJLOYs8WWhtjM=
github.com/antithesishq/antithesis-sdk-go v0.5.0-default-no-op/go.mod h1:IUpT2DPAKh6i/YhSbt6Gl3v2yvUZjmKncl7U91fup7E=
github.com/apapsch/go-jsonmerge/v2 v2.0.0 h1:axGnT1gRIfimI7gJifB699GoE/oq+F2MU7Dml6nw9rQ=
github.com/apapsch/go-jsonmerge/v2 v2.0.0/go.mod h1:lvDnEdqiQrp0O42VQGgmlKpxL1AP2+08jFMw88y4klk=
github.com/arbovm/levenshtein v0.0.0-20160628152529-48b4e1c0c4d0 h1:jfIu9sQUG6Ig+0+Ap1h4unLjW6YQJpKZVmUzxsD4E/Q=
github.com/arbovm/levenshtein v0.0.0-20160628152529-48b4e1c0c4d0/go.mod h1:t2tdKJDJF9BV14lnkjHmOQgcvEKgtqs5a1N3LNdJhGE=
github.com/aws/aws-sdk-go-v2 v1.41.1 h1:ABlyEARCDLN034NhxlRUSZr4l71mh+T5KAeGh6cerhU=
github.com/aws/aws-sdk-go-v2 v1.41.1/go.mod h1:MayyLB8y+buD9hZqkCW3kX1AKq07Y5pXxtgB+rRFhz0=
github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream v1.7.4 h1:489krEF9xIGkOaaX3CE/Be2uWjiXrkCH6gUX+bZA/BU=
github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream v1.7.4/go.mod h1:IOAPF6oT9KCsceNTvvYMNHy0+kMF8akOjeDvPENWxp4=
github.com/aws/aws-sdk-go-v2/config v1.32.8 h1:iu+64gwDKEoKnyTQskSku72dAwggKI5sV6rNvgSMpMs=
github.com/aws/aws-sdk-go-v2/config v1.32.8/go.mod h1:MI2XvA+qDi3i9AJxX1E2fu730syEBzp/jnXrjxuHwgI=
github.com/aws/aws-sdk-go-v2/credentials v1.19.8 h1:Jp2JYH1lRT3KhX4mshHPvVYsR5qqRec3hGvEarNYoR0=
github.com/aws/aws-sdk-go-v2/credentials v1.19.8/go.mod h1:fZG9tuvyVfxknv1rKibIz3DobRaFw1Poe8IKtXB3XYY=
github.com/aws/aws-sdk-go-v2/feature/ec2/imds v1.18.17 h1:I0GyV8wiYrP8XpA70g1HBcQO1JlQxCMTW9npl5UbDHY=
github.com/aws/aws-sdk-go-v2/feature/ec2/imds v1.18.17/go.mod h1:tyw7BOl5bBe/oqvoIeECFJjMdzXoa/dfVz3QQ5lgHGA=
github.com/aws/aws-sdk-go-v2/internal/configsources v1.4.17 h1:xOLELNKGp2vsiteLsvLPwxC+mYmO6OZ8PYgiuPJzF8U=
github.com/aws/aws-sdk-go-v2/internal/configsources v1.4.17/go.mod h1:5M5CI3D12dNOtH3/mk6minaRwI2/37ifCURZISxA/IQ=
github.com/aws/aws-sdk-go-v2/internal/endpoints/v2 v2.7.17 h1:WWLqlh79iO48yLkj1v3ISRNiv+3KdQoZ6JWyfcsyQik=
github.com/aws/aws-sdk-go-v2/internal/endpoints/v2 v2.7.17/go.mod h1:EhG22vHRrvF8oXSTYStZhJc1aUgKtnJe+aOiFEV90cM=
github.com/aws/aws-sdk-go-v2 v1.41.0 h1:tNvqh1s+v0vFYdA1xq0aOJH+Y5cRyZ5upu6roPgPKd4=
github.com/aws/aws-sdk-go-v2 v1.41.0/go.mod h1:MayyLB8y+buD9hZqkCW3kX1AKq07Y5pXxtgB+rRFhz0=
github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream v1.7.3 h1:DHctwEM8P8iTXFxC/QK0MRjwEpWQeM9yzidCRjldUz0=
github.com/aws/aws-sdk-go-v2/aws/protocol/eventstream v1.7.3/go.mod h1:xdCzcZEtnSTKVDOmUZs4l/j3pSV6rpo1WXl5ugNsL8Y=
github.com/aws/aws-sdk-go-v2/config v1.31.20 h1:/jWF4Wu90EhKCgjTdy1DGxcbcbNrjfBHvksEL79tfQc=
github.com/aws/aws-sdk-go-v2/config v1.31.20/go.mod h1:95Hh1Tc5VYKL9NJ7tAkDcqeKt+MCXQB1hQZaRdJIZE0=
github.com/aws/aws-sdk-go-v2/credentials v1.18.24 h1:iJ2FmPT35EaIB0+kMa6TnQ+PwG5A1prEdAw+PsMzfHg=
github.com/aws/aws-sdk-go-v2/credentials v1.18.24/go.mod h1:U91+DrfjAiXPDEGYhh/x29o4p0qHX5HDqG7y5VViv64=
github.com/aws/aws-sdk-go-v2/feature/ec2/imds v1.18.13 h1:T1brd5dR3/fzNFAQch/iBKeX07/ffu/cLu+q+RuzEWk=
github.com/aws/aws-sdk-go-v2/feature/ec2/imds v1.18.13/go.mod h1:Peg/GBAQ6JDt+RoBf4meB1wylmAipb7Kg2ZFakZTlwk=
github.com/aws/aws-sdk-go-v2/internal/configsources v1.4.13 h1:a+8/MLcWlIxo1lF9xaGt3J/u3yOZx+CdSveSNwjhD40=
github.com/aws/aws-sdk-go-v2/internal/configsources v1.4.13/go.mod h1:oGnKwIYZ4XttyU2JWxFrwvhF6YKiK/9/wmE3v3Iu9K8=
github.com/aws/aws-sdk-go-v2/internal/endpoints/v2 v2.7.13 h1:HBSI2kDkMdWz4ZM7FjwE7e/pWDEZ+nR95x8Ztet1ooY=
github.com/aws/aws-sdk-go-v2/internal/endpoints/v2 v2.7.13/go.mod h1:YE94ZoDArI7awZqJzBAZ3PDD2zSfuP7w6P2knOzIn8M=
github.com/aws/aws-sdk-go-v2/internal/ini v1.8.4 h1:WKuaxf++XKWlHWu9ECbMlha8WOEGm0OUEZqm4K/Gcfk=
github.com/aws/aws-sdk-go-v2/internal/ini v1.8.4/go.mod h1:ZWy7j6v1vWGmPReu0iSGvRiise4YI5SkR3OHKTZ6Wuc=
github.com/aws/aws-sdk-go-v2/internal/v4a v1.4.17 h1:JqcdRG//czea7Ppjb+g/n4o8i/R50aTBHkA7vu0lK+k=
github.com/aws/aws-sdk-go-v2/internal/v4a v1.4.17/go.mod h1:CO+WeGmIdj/MlPel2KwID9Gt7CNq4M65HUfBW97liM0=
github.com/aws/aws-sdk-go-v2/service/internal/accept-encoding v1.13.4 h1:0ryTNEdJbzUCEWkVXEXoqlXV72J5keC1GvILMOuD00E=
github.com/aws/aws-sdk-go-v2/service/internal/accept-encoding v1.13.4/go.mod h1:HQ4qwNZh32C3CBeO6iJLQlgtMzqeG17ziAA/3KDJFow=
github.com/aws/aws-sdk-go-v2/service/internal/checksum v1.9.8 h1:Z5EiPIzXKewUQK0QTMkutjiaPVeVYXX7KIqhXu/0fXs=
github.com/aws/aws-sdk-go-v2/service/internal/checksum v1.9.8/go.mod h1:FsTpJtvC4U1fyDXk7c71XoDv3HlRm8V3NiYLeYLh5YE=
github.com/aws/aws-sdk-go-v2/service/internal/presigned-url v1.13.17 h1:RuNSMoozM8oXlgLG/n6WLaFGoea7/CddrCfIiSA+xdY=
github.com/aws/aws-sdk-go-v2/service/internal/presigned-url v1.13.17/go.mod h1:F2xxQ9TZz5gDWsclCtPQscGpP0VUOc8RqgFM3vDENmU=
github.com/aws/aws-sdk-go-v2/service/internal/s3shared v1.19.17 h1:bGeHBsGZx0Dvu/eJC0Lh9adJa3M1xREcndxLNZlve2U=
github.com/aws/aws-sdk-go-v2/service/internal/s3shared v1.19.17/go.mod h1:dcW24lbU0CzHusTE8LLHhRLI42ejmINN8Lcr22bwh/g=
github.com/aws/aws-sdk-go-v2/service/s3 v1.96.0 h1:oeu8VPlOre74lBA/PMhxa5vewaMIMmILM+RraSyB8KA=
github.com/aws/aws-sdk-go-v2/service/s3 v1.96.0/go.mod h1:5jggDlZ2CLQhwJBiZJb4vfk4f0GxWdEDruWKEJ1xOdo=
github.com/aws/aws-sdk-go-v2/service/signin v1.0.5 h1:VrhDvQib/i0lxvr3zqlUwLwJP4fpmpyD9wYG1vfSu+Y=
github.com/aws/aws-sdk-go-v2/service/signin v1.0.5/go.mod h1:k029+U8SY30/3/ras4G/Fnv/b88N4mAfliNn08Dem4M=
github.com/aws/aws-sdk-go-v2/service/sso v1.30.9 h1:v6EiMvhEYBoHABfbGB4alOYmCIrcgyPPiBE1wZAEbqk=
github.com/aws/aws-sdk-go-v2/service/sso v1.30.9/go.mod h1:yifAsgBxgJWn3ggx70A3urX2AN49Y5sJTD1UQFlfqBw=
github.com/aws/aws-sdk-go-v2/service/ssooidc v1.35.14 h1:0jbJeuEHlwKJ9PfXtpSFc4MF+WIWORdhN1n30ITZGFM=
github.com/aws/aws-sdk-go-v2/service/ssooidc v1.35.14/go.mod h1:sTGThjphYE4Ohw8vJiRStAcu3rbjtXRsdNB0TvZ5wwo=
github.com/aws/aws-sdk-go-v2/service/sts v1.41.6 h1:5fFjR/ToSOzB2OQ/XqWpZBmNvmP/pJ1jOWYlFDJTjRQ=
github.com/aws/aws-sdk-go-v2/service/sts v1.41.6/go.mod h1:qgFDZQSD/Kys7nJnVqYlWKnh0SSdMjAi0uSwON4wgYQ=
github.com/aws/aws-sdk-go-v2/internal/v4a v1.4.13 h1:eg/WYAa12vqTphzIdWMzqYRVKKnCboVPRlvaybNCqPA=
github.com/aws/aws-sdk-go-v2/internal/v4a v1.4.13/go.mod h1:/FDdxWhz1486obGrKKC1HONd7krpk38LBt+dutLcN9k=
github.com/aws/aws-sdk-go-v2/service/internal/accept-encoding v1.13.3 h1:x2Ibm/Af8Fi+BH+Hsn9TXGdT+hKbDd5XOTZxTMxDk7o=
github.com/aws/aws-sdk-go-v2/service/internal/accept-encoding v1.13.3/go.mod h1:IW1jwyrQgMdhisceG8fQLmQIydcT/jWY21rFhzgaKwo=
github.com/aws/aws-sdk-go-v2/service/internal/checksum v1.9.4 h1:NvMjwvv8hpGUILarKw7Z4Q0w1H9anXKsesMxtw++MA4=
github.com/aws/aws-sdk-go-v2/service/internal/checksum v1.9.4/go.mod h1:455WPHSwaGj2waRSpQp7TsnpOnBfw8iDfPfbwl7KPJE=
github.com/aws/aws-sdk-go-v2/service/internal/presigned-url v1.13.13 h1:kDqdFvMY4AtKoACfzIGD8A0+hbT41KTKF//gq7jITfM=
github.com/aws/aws-sdk-go-v2/service/internal/presigned-url v1.13.13/go.mod h1:lmKuogqSU3HzQCwZ9ZtcqOc5XGMqtDK7OIc2+DxiUEg=
github.com/aws/aws-sdk-go-v2/service/internal/s3shared v1.19.13 h1:zhBJXdhWIFZ1acfDYIhu4+LCzdUS2Vbcum7D01dXlHQ=
github.com/aws/aws-sdk-go-v2/service/internal/s3shared v1.19.13/go.mod h1:JaaOeCE368qn2Hzi3sEzY6FgAZVCIYcC2nwbro2QCh8=
github.com/aws/aws-sdk-go-v2/service/s3 v1.90.2 h1:DhdbtDl4FdNlj31+xiRXANxEE+eC7n8JQz+/ilwQ8Uc=
github.com/aws/aws-sdk-go-v2/service/s3 v1.90.2/go.mod h1:+wArOOrcHUevqdto9k1tKOF5++YTe9JEcPSc9Tx2ZSw=
github.com/aws/aws-sdk-go-v2/service/sso v1.30.3 h1:NjShtS1t8r5LUfFVtFeI8xLAHQNTa7UI0VawXlrBMFQ=
github.com/aws/aws-sdk-go-v2/service/sso v1.30.3/go.mod h1:fKvyjJcz63iL/ftA6RaM8sRCtN4r4zl4tjL3qw5ec7k=
github.com/aws/aws-sdk-go-v2/service/ssooidc v1.35.7 h1:gTsnx0xXNQ6SBbymoDvcoRHL+q4l/dAFsQuKfDWSaGc=
github.com/aws/aws-sdk-go-v2/service/ssooidc v1.35.7/go.mod h1:klO+ejMvYsB4QATfEOIXk8WAEwN4N0aBfJpvC+5SZBo=
github.com/aws/aws-sdk-go-v2/service/sts v1.40.2 h1:HK5ON3KmQV2HcAunnx4sKLB9aPf3gKGwVAf7xnx0QT0=
github.com/aws/aws-sdk-go-v2/service/sts v1.40.2/go.mod h1:E19xDjpzPZC7LS2knI9E6BaRFDK43Eul7vd6rSq2HWk=
github.com/aws/smithy-go v1.24.0 h1:LpilSUItNPFr1eY85RYgTIg5eIEPtvFbskaFcmmIUnk=
github.com/aws/smithy-go v1.24.0/go.mod h1:LEj2LM3rBRQJxPZTB4KuzZkaZYnZPnvgIhb4pu07mx0=
github.com/beorn7/perks v1.0.1 h1:VlbKKnNfV8bJzeqoa4cOKqO6bYr3WgKZxO8Z16+hsOM=
github.com/beorn7/perks v1.0.1/go.mod h1:G2ZrVWU2WbWT9wwq4/hrbKbnv/1ERSJQ0ibhJ6rlkpw=
github.com/bmatcuk/doublestar v1.1.1/go.mod h1:UD6OnuiIn0yFxxA2le/rnRU1G4RaI4UvFv1sNto9p6w=
github.com/cespare/xxhash/v2 v2.3.0 h1:UL815xU9SqsFlibzuggzjXhog7bL6oX9BbNZnL2UFvs=
github.com/cespare/xxhash/v2 v2.3.0/go.mod h1:VGX0DQ3Q6kWi7AoAeZDth3/j3BFtOZR5XLFGgcrjCOs=
github.com/coreos/go-oidc/v3 v3.17.0 h1:hWBGaQfbi0iVviX4ibC7bk8OKT5qNr4klBaCHVNvehc=
github.com/coreos/go-oidc/v3 v3.17.0/go.mod h1:wqPbKFrVnE90vty060SB40FCJ8fTHTxSwyXJqZH+sI8=
github.com/coreos/go-oidc/v3 v3.16.0 h1:qRQUCFstKpXwmEjDQTIbyY/5jF00+asXzSkmkoa/mow=
github.com/coreos/go-oidc/v3 v3.16.0/go.mod h1:wqPbKFrVnE90vty060SB40FCJ8fTHTxSwyXJqZH+sI8=
github.com/cpuguy83/go-md2man/v2 v2.0.7 h1:zbFlGlXEAKlwXpmvle3d8Oe3YnkKIK4xSRTd3sHPnBo=
github.com/cpuguy83/go-md2man/v2 v2.0.7/go.mod h1:oOW0eioCTA6cOiMLiUPZOpcVxMig6NIQQ7OS05n1F4g=
github.com/creack/pty v1.1.9/go.mod h1:oKZEueFk5CKHvIhNR5MUki03XCEU+Q6VDXinZuGJ33E=
@@ -93,30 +77,30 @@ github.com/davecgh/go-spew v1.1.2-0.20180830191138-d8f796af33cc h1:U9qPSI2PIWSS1
github.com/davecgh/go-spew v1.1.2-0.20180830191138-d8f796af33cc/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
github.com/dgryski/trifles v0.0.0-20230903005119-f50d829f2e54 h1:SG7nF6SRlWhcT7cNTs5R6Hk4V2lcmLz2NsG2VnInyNo=
github.com/dgryski/trifles v0.0.0-20230903005119-f50d829f2e54/go.mod h1:if7Fbed8SFyPtHLHbg49SI7NAdJiC5WIA09pe59rfAA=
github.com/expr-lang/expr v1.17.8 h1:W1loDTT+0PQf5YteHSTpju2qfUfNoBt4yw9+wOEU9VM=
github.com/expr-lang/expr v1.17.8/go.mod h1:8/vRC7+7HBzESEqt5kKpYXxrxkr31SaO8r40VO/1IT4=
github.com/expr-lang/expr v1.17.6 h1:1h6i8ONk9cexhDmowO/A64VPxHScu7qfSl2k8OlINec=
github.com/expr-lang/expr v1.17.6/go.mod h1:8/vRC7+7HBzESEqt5kKpYXxrxkr31SaO8r40VO/1IT4=
github.com/felixge/httpsnoop v1.0.4 h1:NFTV2Zj1bL4mc9sqWACXbQFVBBg2W3GPvqp8/ESS2Wg=
github.com/felixge/httpsnoop v1.0.4/go.mod h1:m8KPJKqk1gH5J9DgRY2ASl2lWCfGKXixSwevea8zH2U=
github.com/frankban/quicktest v1.11.0/go.mod h1:K+q6oSqb0W0Ininfk863uOk1lMy69l/P6txr3mVT54s=
github.com/frankban/quicktest v1.11.2/go.mod h1:K+q6oSqb0W0Ininfk863uOk1lMy69l/P6txr3mVT54s=
github.com/frankban/quicktest v1.13.0 h1:yNZif1OkDfNoDfb9zZa9aXIpejNR4F23Wely0c+Qdqk=
github.com/frankban/quicktest v1.13.0/go.mod h1:qLE0fzW0VuyUAJgPU19zByoIr0HtCHN/r/VLSOOIySU=
github.com/fsnotify/fsnotify v1.9.0 h1:2Ml+OJNzbYCTzsxtv8vKSFD9PbJjmhYF14k/jKC7S9k=
github.com/fsnotify/fsnotify v1.9.0/go.mod h1:8jBTzvmWwFyi3Pb8djgCCO5IBqzKJ/Jwo8TRcHyHii0=
github.com/go-asn1-ber/asn1-ber v1.5.8-0.20250403174932-29230038a667 h1:BP4M0CvQ4S3TGls2FvczZtj5Re/2ZzkV9VwqPHH/3Bo=
github.com/go-asn1-ber/asn1-ber v1.5.8-0.20250403174932-29230038a667/go.mod h1:hEBeB/ic+5LoWskz+yKT7vGhhPYkProFKoKdwZRWMe0=
github.com/go-chi/chi/v5 v5.2.5 h1:Eg4myHZBjyvJmAFjFvWgrqDTXFyOzjj7YIm3L3mu6Ug=
github.com/go-chi/chi/v5 v5.2.5/go.mod h1:X7Gx4mteadT3eDOMTsXzmI4/rwUpOwBHLpAfupzFJP0=
github.com/go-chi/cors v1.2.2 h1:Jmey33TE+b+rB7fT8MUy1u0I4L+NARQlK6LhzKPSyQE=
github.com/go-chi/cors v1.2.2/go.mod h1:sSbTewc+6wYHBBCW7ytsFSn836hqM7JxpglAy2Vzc58=
github.com/go-co-op/gocron/v2 v2.19.1 h1:B4iLeA0NB/2iO3EKQ7NfKn5KsQgZfjb2fkvoZJU3yBI=
github.com/go-co-op/gocron/v2 v2.19.1/go.mod h1:5lEiCKk1oVJV39Zg7/YG10OnaVrDAV5GGR6O0663k6U=
github.com/go-co-op/gocron/v2 v2.18.2 h1:+5VU41FUXPWSPKLXZQ/77SGzUiPCcakU0v7ENc2H20Q=
github.com/go-co-op/gocron/v2 v2.18.2/go.mod h1:Zii6he+Zfgy5W9B+JKk/KwejFOW0kZTFvHtwIpR4aBI=
github.com/go-jose/go-jose/v4 v4.1.3 h1:CVLmWDhDVRa6Mi/IgCgaopNosCaHz7zrMeF9MlZRkrs=
github.com/go-jose/go-jose/v4 v4.1.3/go.mod h1:x4oUasVrzR7071A4TnHLGSPpNOm2a21K9Kf04k1rs08=
github.com/go-ldap/ldap/v3 v3.4.12 h1:1b81mv7MagXZ7+1r7cLTWmyuTqVqdwbtJSjC0DAp9s4=
github.com/go-ldap/ldap/v3 v3.4.12/go.mod h1:+SPAGcTtOfmGsCb3h1RFiq4xpp4N636G75OEace8lNo=
github.com/go-openapi/jsonpointer v0.22.4 h1:dZtK82WlNpVLDW2jlA1YCiVJFVqkED1MegOUy9kR5T4=
github.com/go-openapi/jsonpointer v0.22.4/go.mod h1:elX9+UgznpFhgBuaMQ7iu4lvvX1nvNsesQ3oxmYTw80=
github.com/go-openapi/jsonreference v0.21.4 h1:24qaE2y9bx/q3uRK/qN+TDwbok1NhbSmGjjySRCHtC8=
github.com/go-openapi/jsonreference v0.21.4/go.mod h1:rIENPTjDbLpzQmQWCj5kKj3ZlmEh+EFVbz3RTUh30/4=
github.com/go-openapi/spec v0.22.3 h1:qRSmj6Smz2rEBxMnLRBMeBWxbbOvuOoElvSvObIgwQc=
github.com/go-openapi/spec v0.22.3/go.mod h1:iIImLODL2loCh3Vnox8TY2YWYJZjMAKYyLH2Mu8lOZs=
github.com/go-openapi/jsonpointer v0.22.3 h1:dKMwfV4fmt6Ah90zloTbUKWMD+0he+12XYAsPotrkn8=
github.com/go-openapi/jsonpointer v0.22.3/go.mod h1:0lBbqeRsQ5lIanv3LHZBrmRGHLHcQoOXQnf88fHlGWo=
github.com/go-openapi/jsonreference v0.21.3 h1:96Dn+MRPa0nYAR8DR1E03SblB5FJvh7W6krPI0Z7qMc=
github.com/go-openapi/jsonreference v0.21.3/go.mod h1:RqkUP0MrLf37HqxZxrIAtTWW4ZJIK1VzduhXYBEeGc4=
github.com/go-openapi/spec v0.22.1 h1:beZMa5AVQzRspNjvhe5aG1/XyBSMeX1eEOs7dMoXh/k=
github.com/go-openapi/spec v0.22.1/go.mod h1:c7aeIQT175dVowfp7FeCvXXnjN/MrpaONStibD2WtDA=
github.com/go-openapi/swag v0.19.15 h1:D2NRCBzS9/pEY3gP9Nl8aDqGUcPFrwG2p+CNFrLyrCM=
github.com/go-openapi/swag/conv v0.25.4 h1:/Dd7p0LZXczgUcC/Ikm1+YqVzkEeCc9LnOWjfkpkfe4=
github.com/go-openapi/swag/conv v0.25.4/go.mod h1:3LXfie/lwoAv0NHoEuY1hjoFAYkvlqI/Bn5EQDD3PPU=
@@ -141,26 +125,30 @@ github.com/go-openapi/testify/v2 v2.0.2/go.mod h1:HCPmvFFnheKK2BuwSA0TbbdxJ3I16p
github.com/go-sql-driver/mysql v1.4.1/go.mod h1:zAC/RDZ24gD3HViQzih4MyKcchzm+sOG5ZlKdlhCg5w=
github.com/go-sql-driver/mysql v1.8.1 h1:LedoTUt/eveggdHS9qUFC1EFSa8bU2+1pZjSRpvNJ1Y=
github.com/go-sql-driver/mysql v1.8.1/go.mod h1:wEBSXgmK//2ZFJyE+qWnIsVGmvmEKlqwuVSjsCm7DZg=
github.com/go-viper/mapstructure/v2 v2.5.0 h1:vM5IJoUAy3d7zRSVtIwQgBj7BiWtMPfmPEgAXnvj1Ro=
github.com/go-viper/mapstructure/v2 v2.5.0/go.mod h1:oJDH3BJKyqBA2TXFhDsKDGDTlndYOZ6rGS0BRZIxGhM=
github.com/goccy/go-yaml v1.19.2 h1:PmFC1S6h8ljIz6gMRBopkjP1TVT7xuwrButHID66PoM=
github.com/goccy/go-yaml v1.19.2/go.mod h1:XBurs7gK8ATbW4ZPGKgcbrY1Br56PdM69F7LkFRi1kA=
github.com/golang-jwt/jwt/v5 v5.3.1 h1:kYf81DTWFe7t+1VvL7eS+jKFVWaUnK9cB1qbwn63YCY=
github.com/golang-jwt/jwt/v5 v5.3.1/go.mod h1:fxCRLWMO43lRc8nhHWY6LGqRcf+1gQWArsqaEUEa5bE=
github.com/go-viper/mapstructure/v2 v2.4.0 h1:EBsztssimR/CONLSZZ04E8qAkxNYq4Qp9LvH92wZUgs=
github.com/go-viper/mapstructure/v2 v2.4.0/go.mod h1:oJDH3BJKyqBA2TXFhDsKDGDTlndYOZ6rGS0BRZIxGhM=
github.com/goccy/go-yaml v1.19.0 h1:EmkZ9RIsX+Uq4DYFowegAuJo8+xdX3T/2dwNPXbxEYE=
github.com/goccy/go-yaml v1.19.0/go.mod h1:XBurs7gK8ATbW4ZPGKgcbrY1Br56PdM69F7LkFRi1kA=
github.com/golang-jwt/jwt/v5 v5.3.0 h1:pv4AsKCKKZuqlgs5sUmn4x8UlGa0kEVt/puTpKx9vvo=
github.com/golang-jwt/jwt/v5 v5.3.0/go.mod h1:fxCRLWMO43lRc8nhHWY6LGqRcf+1gQWArsqaEUEa5bE=
github.com/golang-migrate/migrate/v4 v4.19.1 h1:OCyb44lFuQfYXYLx1SCxPZQGU7mcaZ7gH9yH4jSFbBA=
github.com/golang-migrate/migrate/v4 v4.19.1/go.mod h1:CTcgfjxhaUtsLipnLoQRWCrjYXycRz/g5+RWDuYgPrE=
github.com/golang/snappy v0.0.1/go.mod h1:/XxbfmMg8lxefKM7IXC3fBNl/7bRcc72aCRzEWrmP2Q=
github.com/golang/snappy v0.0.4 h1:yAGX7huGHXlcLOEtBnF4w7FQwA26wojNCwOYAEhLjQM=
github.com/golang/snappy v0.0.4/go.mod h1:/XxbfmMg8lxefKM7IXC3fBNl/7bRcc72aCRzEWrmP2Q=
github.com/google/go-cmp v0.5.2/go.mod h1:v8dTdLbMG2kIc/vJvl+f65V22dbkXbowE6jgT/gNBxE=
github.com/google/go-cmp v0.5.5/go.mod h1:v8dTdLbMG2kIc/vJvl+f65V22dbkXbowE6jgT/gNBxE=
github.com/google/go-cmp v0.7.0 h1:wk8382ETsv4JYUZwIsn6YpYiWiBsYLSJiTsyBybVuN8=
github.com/google/go-cmp v0.7.0/go.mod h1:pXiqmnSA92OHEEa9HXL2W4E7lf9JzCmGVUdgjX3N/iU=
github.com/google/go-tpm v0.9.7 h1:u89J4tUUeDTlH8xxC3CTW7OHZjbjKoHdQ9W7gCUhtxA=
github.com/google/go-tpm v0.9.7/go.mod h1:h9jEsEECg7gtLis0upRBQU+GhYVH6jMjrFxI8u6bVUY=
github.com/google/gofuzz v1.0.0/go.mod h1:dBl0BpW6vV/+mYPU4Po3pmUjxk6FQPldtuIdl/M65Eg=
github.com/google/gofuzz v1.2.0 h1:xRy4A+RhZaiKjJ1bPfwQ8sedCA+YS2YcCHW6ec7JMi0=
github.com/google/gofuzz v1.2.0/go.mod h1:dBl0BpW6vV/+mYPU4Po3pmUjxk6FQPldtuIdl/M65Eg=
github.com/google/gops v0.3.29 h1:n98J2qSOK1NJvRjdLDcjgDryjpIBGhbaqph1mXKL0rY=
github.com/google/gops v0.3.29/go.mod h1:8N3jZftuPazvUwtYY/ncG4iPrjp15ysNKLfq+QQPiwc=
github.com/google/gops v0.3.28 h1:2Xr57tqKAmQYRAfG12E+yLcoa2Y42UJo2lOrUFL9ark=
github.com/google/gops v0.3.28/go.mod h1:6f6+Nl8LcHrzJwi8+p0ii+vmBFSlB4f8cOOkTJ7sk4c=
github.com/google/uuid v1.6.0 h1:NIvaJDMOsjHA8n1jAhLSgzrAzy1Hgr+hNrb57e+94F0=
github.com/google/uuid v1.6.0/go.mod h1:TIyPZe4MgqvfeYDBFedMoGGpEw/LqOeaOT+nhxU+yHo=
github.com/gorilla/handlers v1.5.2 h1:cLTUSsNkgcwhgRqvCNmdbRWG0A3N4F+M2nWKdScwyEE=
github.com/gorilla/handlers v1.5.2/go.mod h1:dX+xVpaxdSw+q0Qek8SSsl3dfMk3jNddUkMzo0GtH0w=
github.com/gorilla/mux v1.8.1 h1:TuBL49tXwgrFYWhqrNgrUNEY92u81SPhu7sTdzQEiWY=
github.com/gorilla/mux v1.8.1/go.mod h1:AKf9I4AEqPTmMytcMc0KkNouC66V3BtZ4qD5fmWSiMQ=
github.com/gorilla/securecookie v1.1.2 h1:YCIWL56dvtr73r6715mJs5ZvhtnY73hBvEF8kXD8ePA=
@@ -173,14 +161,17 @@ github.com/hashicorp/go-uuid v1.0.3 h1:2gKiV6YVmrJ1i2CKKa9obLvRieoRGviZFL26PcT/C
github.com/hashicorp/go-uuid v1.0.3/go.mod h1:6SBZvOh/SIDV7/2o3Jml5SYk/TvGqwFJ/bN7x4byOro=
github.com/hashicorp/golang-lru/v2 v2.0.7 h1:a+bsQ5rvGLjzHuww6tVxozPZFVghXaHOwFs4luLUK2k=
github.com/hashicorp/golang-lru/v2 v2.0.7/go.mod h1:QeFd9opnmA6QUJc5vARoKUSoFhyfM2/ZepoAG6RGpeM=
github.com/hexops/gotextdiff v1.0.3 h1:gitA9+qJrrTCsiCl7+kh75nPqQt1cx4ZkudSTLoUqJM=
github.com/hexops/gotextdiff v1.0.3/go.mod h1:pSWU5MAI3yDq+fZBTazCSJysOMbxWL1BSow5/V2vxeg=
github.com/influxdata/influxdb-client-go/v2 v2.14.0 h1:AjbBfJuq+QoaXNcrova8smSjwJdUHnwvfjMF71M1iI4=
github.com/influxdata/influxdb-client-go/v2 v2.14.0/go.mod h1:Ahpm3QXKMJslpXl3IftVLVezreAUtBOTZssDrjZEFHI=
github.com/influxdata/line-protocol v0.0.0-20210922203350-b1ad95c89adf h1:7JTmneyiNEwVBOHSjoMxiWAqB992atOeepeFYegn5RU=
github.com/influxdata/line-protocol v0.0.0-20210922203350-b1ad95c89adf/go.mod h1:xaLFMmpvUxqXtVkUJfg9QmT88cDaCJ3ZKgdZ78oO8Qo=
github.com/influxdata/line-protocol-corpus v0.0.0-20210519164801-ca6fa5da0184/go.mod h1:03nmhxzZ7Xk2pdG+lmMd7mHDfeVOYFyhOgwO61qWU98=
github.com/influxdata/line-protocol-corpus v0.0.0-20210922080147-aa28ccfb8937 h1:MHJNQ+p99hFATQm6ORoLmpUCF7ovjwEFshs/NHzAbig=
github.com/influxdata/line-protocol-corpus v0.0.0-20210922080147-aa28ccfb8937/go.mod h1:BKR9c0uHSmRgM/se9JhFHtTT7JTO67X23MtKMHtZcpo=
github.com/influxdata/line-protocol/v2 v2.0.0-20210312151457-c52fdecb625a/go.mod h1:6+9Xt5Sq1rWx+glMgxhcg2c0DUaehK+5TDcPZ76GypY=
github.com/influxdata/line-protocol/v2 v2.1.0/go.mod h1:QKw43hdUBg3GTk2iC3iyCxksNj7PX9aUSeYOYE/ceHY=
github.com/influxdata/line-protocol/v2 v2.2.1 h1:EAPkqJ9Km4uAxtMRgUubJyqAr6zgWM0dznKMLRauQRE=
github.com/influxdata/line-protocol/v2 v2.2.1/go.mod h1:DmB3Cnh+3oxmG6LOBIxce4oaL4CPj3OmMPgvauXh+tM=
github.com/jcmturner/aescts/v2 v2.0.0 h1:9YKLH6ey7H4eDBXW8khjYslgyqG2xZikXP0EQFKrle8=
github.com/jcmturner/aescts/v2 v2.0.0/go.mod h1:AiaICIRyfYg35RUkr8yESTqvSy7csK90qZ5xfvvsoNs=
github.com/jcmturner/dnsutils/v2 v2.0.0 h1:lltnkeZGL0wILNvrNiVCR6Ro5PGU/SeBvVO/8c/iPbo=
@@ -199,11 +190,17 @@ github.com/joho/godotenv v1.5.1 h1:7eLL/+HRGLY0ldzfGMeQkb7vMd0as4CfYvUVzLqw0N0=
github.com/joho/godotenv v1.5.1/go.mod h1:f4LDr5Voq0i2e/R5DDNOoa2zzDfwtkZa6DnEwAbqwq4=
github.com/jonboulle/clockwork v0.5.0 h1:Hyh9A8u51kptdkR+cqRpT1EebBwTn1oK9YfGYbdFz6I=
github.com/jonboulle/clockwork v0.5.0/go.mod h1:3mZlmanh0g2NDKO5TWZVJAfofYk64M7XN3SzBPjZF60=
github.com/juju/gnuflag v0.0.0-20171113085948-2ce1bb71843d/go.mod h1:2PavIy+JPciBPrBUjwbNvtwB6RQlve+hkpll6QSNmOE=
github.com/klauspost/compress v1.18.4 h1:RPhnKRAQ4Fh8zU2FY/6ZFDwTVTxgJ/EMydqSTzE9a2c=
github.com/klauspost/compress v1.18.4/go.mod h1:R0h/fSBs8DE4ENlcrlib3PsXS61voFxhIs2DeRhCvJ4=
github.com/jpillora/backoff v1.0.0 h1:uvFg412JmmHBHw7iwprIxkPMI+sGQ4kzOWsMeHnm2EA=
github.com/jpillora/backoff v1.0.0/go.mod h1:J/6gKK9jxlEcS3zixgDgUAsiuZ7yrSoa/FX5e0EB2j4=
github.com/json-iterator/go v1.1.12 h1:PV8peI4a0ysnczrg+LtxykD8LfKY9ML6u2jnxaEnrnM=
github.com/json-iterator/go v1.1.12/go.mod h1:e30LSqwooZae/UwlEbR2852Gd8hjQvJoHmT4TnhNGBo=
github.com/klauspost/compress v1.18.1 h1:bcSGx7UbpBqMChDtsF28Lw6v/G94LPrrbMbdC3JH2co=
github.com/klauspost/compress v1.18.1/go.mod h1:ZQFFVG+MdnR0P+l6wpXgIL4NTtwiKIdBnrBd8Nrxr+0=
github.com/kr/pretty v0.2.1/go.mod h1:ipq/a2n7PKx3OHsz4KJII5eveXtPO4qwEXGdVfWzfnI=
github.com/kr/pretty v0.3.1 h1:flRD4NNwYAUpkphVc1HcthR4KEIFJ65n8Mw5qdRn3LE=
github.com/kr/pretty v0.3.1/go.mod h1:hoEshYVHaxMs3cyo3Yncou5ZscifuDolrwPKZanG3xk=
github.com/kr/pty v1.1.1/go.mod h1:pFQYn66WHrOpPYNljwOMqo10TkYh1fy3cYio2l3bCsQ=
github.com/kr/text v0.1.0/go.mod h1:4Jbv+DJW3UT/LiOwJeYQe1efqtUx/iVham/4vfdArNI=
github.com/kr/text v0.2.0 h1:5Nx0Ya0ZqY2ygV366QzturHI13Jq95ApcVaJBhpS+AY=
github.com/kr/text v0.2.0/go.mod h1:eLer722TekiGuMkidMxC/pM04lWEeraHUUmBw8l2grE=
github.com/lann/builder v0.0.0-20180802200727-47ae307949d0 h1:SOEGU9fKiNWd/HOJuq6+3iTQz8KNCLtVX6idSoTLdUw=
@@ -213,36 +210,31 @@ github.com/lann/ps v0.0.0-20150810152359-62de8c46ede0/go.mod h1:vmVJ0l/dxyfGW6Fm
github.com/lib/pq v1.2.0/go.mod h1:5WUZQaWbwv1U+lTReE5YruASi9Al49XbQIvNi/34Woo=
github.com/lib/pq v1.10.9 h1:YXG7RB+JIjhP29X+OtkiDnYaXQwpS4JEWq7dtCCRUEw=
github.com/lib/pq v1.10.9/go.mod h1:AlVN5x4E4T544tWzH6hKfbfQvm3HdbOxrmggDNAPY9o=
github.com/linkedin/goavro/v2 v2.14.1 h1:/8VjDpd38PRsy02JS0jflAu7JZPfJcGTwqWgMkFS2iI=
github.com/linkedin/goavro/v2 v2.14.1/go.mod h1:KXx+erlq+RPlGSPmLF7xGo6SAbh8sCQ53x064+ioxhk=
github.com/mattn/go-sqlite3 v1.10.0/go.mod h1:FPy6KqzDD04eiIsT53CuJW3U88zkxoIYsOqkbpncsNc=
github.com/mattn/go-sqlite3 v1.14.22/go.mod h1:Uh1q+B4BYcTPb+yiD3kU8Ct7aC0hY9fxUwlHK0RXw+Y=
github.com/mattn/go-sqlite3 v1.14.34 h1:3NtcvcUnFBPsuRcno8pUtupspG/GM+9nZ88zgJcp6Zk=
github.com/mattn/go-sqlite3 v1.14.34/go.mod h1:Uh1q+B4BYcTPb+yiD3kU8Ct7aC0hY9fxUwlHK0RXw+Y=
github.com/minio/highwayhash v1.0.4-0.20251030100505-070ab1a87a76 h1:KGuD/pM2JpL9FAYvBrnBBeENKZNh6eNtjqytV6TYjnk=
github.com/minio/highwayhash v1.0.4-0.20251030100505-070ab1a87a76/go.mod h1:GGYsuwP/fPD6Y9hMiXuapVvlIUEhFhMTh0rxU3ik1LQ=
github.com/mattn/go-sqlite3 v1.14.32 h1:JD12Ag3oLy1zQA+BNn74xRgaBbdhbNIDYvQUEuuErjs=
github.com/mattn/go-sqlite3 v1.14.32/go.mod h1:Uh1q+B4BYcTPb+yiD3kU8Ct7aC0hY9fxUwlHK0RXw+Y=
github.com/modern-go/concurrent v0.0.0-20180228061459-e0a39a4cb421/go.mod h1:6dJC0mAP4ikYIbvyc7fijjWJddQyLn8Ig3JB5CqoB9Q=
github.com/modern-go/concurrent v0.0.0-20180306012644-bacd9c7ef1dd h1:TRLaZ9cD/w8PVh93nsPXa1VrQ6jlwL5oN8l14QlcNfg=
github.com/modern-go/concurrent v0.0.0-20180306012644-bacd9c7ef1dd/go.mod h1:6dJC0mAP4ikYIbvyc7fijjWJddQyLn8Ig3JB5CqoB9Q=
github.com/modern-go/reflect2 v1.0.2 h1:xBagoLtFs94CBntxluKeaWgTMpvLxC4ur3nMaC9Gz0M=
github.com/modern-go/reflect2 v1.0.2/go.mod h1:yWuevngMOJpCy52FWWMvUC8ws7m/LJsjYzDa0/r8luk=
github.com/munnerz/goautoneg v0.0.0-20191010083416-a7dc8b61c822 h1:C3w9PqII01/Oq1c1nUAm88MOHcQC9l5mIlSMApZMrHA=
github.com/munnerz/goautoneg v0.0.0-20191010083416-a7dc8b61c822/go.mod h1:+n7T8mK8HuQTcFwEeznm/DIxMOiR9yIdICNftLE1DvQ=
github.com/nats-io/jwt/v2 v2.8.0 h1:K7uzyz50+yGZDO5o772eRE7atlcSEENpL7P+b74JV1g=
github.com/nats-io/jwt/v2 v2.8.0/go.mod h1:me11pOkwObtcBNR8AiMrUbtVOUGkqYjMQZ6jnSdVUIA=
github.com/nats-io/nats-server/v2 v2.12.3 h1:KRv+1n7lddMVgkJPQer+pt36TcO0ENxjilBmeWdjcHs=
github.com/nats-io/nats-server/v2 v2.12.3/go.mod h1:MQXjG9WjyXKz9koWzUc3jYUMKD8x3CLmTNy91IQQz3Y=
github.com/nats-io/nats.go v1.49.0 h1:yh/WvY59gXqYpgl33ZI+XoVPKyut/IcEaqtsiuTJpoE=
github.com/nats-io/nats.go v1.49.0/go.mod h1:fDCn3mN5cY8HooHwE2ukiLb4p4G4ImmzvXyJt+tGwdw=
github.com/nats-io/nkeys v0.4.15 h1:JACV5jRVO9V856KOapQ7x+EY8Jo3qw1vJt/9Jpwzkk4=
github.com/nats-io/nkeys v0.4.15/go.mod h1:CpMchTXC9fxA5zrMo4KpySxNjiDVvr8ANOSZdiNfUrs=
github.com/mwitkow/go-conntrack v0.0.0-20190716064945-2f068394615f h1:KUppIJq7/+SVif2QVs3tOP0zanoHgBEVAwHxUSIzRqU=
github.com/mwitkow/go-conntrack v0.0.0-20190716064945-2f068394615f/go.mod h1:qRWi+5nqEBWmkhHvq77mSJWrCKwh8bxhgT7d/eI7P4U=
github.com/nats-io/nats.go v1.47.0 h1:YQdADw6J/UfGUd2Oy6tn4Hq6YHxCaJrVKayxxFqYrgM=
github.com/nats-io/nats.go v1.47.0/go.mod h1:iRWIPokVIFbVijxuMQq4y9ttaBTMe0SFdlZfMDd+33g=
github.com/nats-io/nkeys v0.4.11 h1:q44qGV008kYd9W1b1nEBkNzvnWxtRSQ7A8BoqRrcfa0=
github.com/nats-io/nkeys v0.4.11/go.mod h1:szDimtgmfOi9n25JpfIdGw12tZFYXqhGxjhVxsatHVE=
github.com/nats-io/nuid v1.0.1 h1:5iA8DT8V7q8WK2EScv2padNa/rTESc1KdnPw4TC2paw=
github.com/nats-io/nuid v1.0.1/go.mod h1:19wcPz3Ph3q0Jbyiqsd0kePYG7A95tJPxeL+1OSON2c=
github.com/oapi-codegen/runtime v1.2.0 h1:RvKc1CVS1QeKSNzO97FBQbSMZyQ8s6rZd+LpmzwHMP4=
github.com/oapi-codegen/runtime v1.2.0/go.mod h1:Y7ZhmmlE8ikZOmuHRRndiIm7nf3xcVv+YMweKgG1DT0=
github.com/niemeyer/pretty v0.0.0-20200227124842-a10e7caefd8e/go.mod h1:zD1mROLANZcx1PVRCS0qkT7pwLkGfwJo4zjcN/Tysno=
github.com/oapi-codegen/runtime v1.1.1 h1:EXLHh0DXIJnWhdRPN2w4MXAzFyE4CskzhNLUmtpMYro=
github.com/oapi-codegen/runtime v1.1.1/go.mod h1:SK9X900oXmPWilYR5/WKPzt3Kqxn/uS/+lbpREv+eCg=
github.com/opentracing/opentracing-go v1.1.0/go.mod h1:UkNAQd3GIcIGf0SeVgPpRdFStlNbqXla1AfSYxPUl2o=
github.com/parquet-go/bitpack v1.0.0 h1:AUqzlKzPPXf2bCdjfj4sTeacrUwsT7NlcYDMUQxPcQA=
github.com/parquet-go/bitpack v1.0.0/go.mod h1:XnVk9TH+O40eOOmvpAVZ7K2ocQFrQwysLMnc6M/8lgs=
github.com/parquet-go/jsonlite v1.4.0 h1:RTG7prqfO0HD5egejU8MUDBN8oToMj55cgSV1I0zNW4=
github.com/parquet-go/jsonlite v1.4.0/go.mod h1:nDjpkpL4EOtqs6NQugUsi0Rleq9sW/OtC1NnZEnxzF0=
github.com/parquet-go/parquet-go v0.27.0 h1:vHWK2xaHbj+v1DYps03yDRpEsdtOeKbhiXUaixoPb3g=
github.com/parquet-go/parquet-go v0.27.0/go.mod h1:navtkAYr2LGoJVp141oXPlO/sxLvaOe3la2JEoD8+rg=
github.com/pierrec/lz4/v4 v4.1.25 h1:kocOqRffaIbU5djlIBr7Wh+cx82C0vtFb0fOurZHqD0=
github.com/pierrec/lz4/v4 v4.1.25/go.mod h1:EoQMVJgeeEOMsCqCzqFm2O0cJvljX2nGZjcRIPL34O4=
github.com/pkg/diff v0.0.0-20210226163009-20ebb0f2a09e/go.mod h1:pJLUxLENpZxwdsKMEsNbx1VGcRFpLqf3715MtcvvzbA=
github.com/pmezard/go-difflib v1.0.0/go.mod h1:iKH77koFhYxTK1pcRnkKkqfTogsbg7gZNVY4sRDYZ/4=
github.com/pmezard/go-difflib v1.0.1-0.20181226105442-5d4384ee4fb2 h1:Jamvg5psRIccs7FGNTlIRMkT8wgtp5eCXdBlqhYGL6U=
github.com/pmezard/go-difflib v1.0.1-0.20181226105442-5d4384ee4fb2/go.mod h1:iKH77koFhYxTK1pcRnkKkqfTogsbg7gZNVY4sRDYZ/4=
@@ -250,17 +242,16 @@ github.com/prometheus/client_golang v1.23.2 h1:Je96obch5RDVy3FDMndoUsjAhG5Edi49h
github.com/prometheus/client_golang v1.23.2/go.mod h1:Tb1a6LWHB3/SPIzCoaDXI4I8UHKeFTEQ1YCr+0Gyqmg=
github.com/prometheus/client_model v0.6.2 h1:oBsgwpGs7iVziMvrGhE53c/GrLUsZdHnqNwqPLxwZyk=
github.com/prometheus/client_model v0.6.2/go.mod h1:y3m2F6Gdpfy6Ut/GBsUqTWZqCUvMVzSfMLjcu6wAwpE=
github.com/prometheus/common v0.67.5 h1:pIgK94WWlQt1WLwAC5j2ynLaBRDiinoAb86HZHTUGI4=
github.com/prometheus/common v0.67.5/go.mod h1:SjE/0MzDEEAyrdr5Gqc6G+sXI67maCxzaT3A2+HqjUw=
github.com/prometheus/procfs v0.20.0 h1:AA7aCvjxwAquZAlonN7888f2u4IN8WVeFgBi4k82M4Q=
github.com/prometheus/procfs v0.20.0/go.mod h1:o9EMBZGRyvDrSPH1RqdxhojkuXstoe4UlK79eF5TGGo=
github.com/prometheus/common v0.67.4 h1:yR3NqWO1/UyO1w2PhUvXlGQs/PtFmoveVO0KZ4+Lvsc=
github.com/prometheus/common v0.67.4/go.mod h1:gP0fq6YjjNCLssJCQp0yk4M8W6ikLURwkdd/YKtTbyI=
github.com/prometheus/procfs v0.16.1 h1:hZ15bTNuirocR6u0JZ6BAHHmwS1p8B4P6MRqxtzMyRg=
github.com/prometheus/procfs v0.16.1/go.mod h1:teAbpZRB1iIAJYREa1LsoWUXykVXA1KlTmWl8x/U+Is=
github.com/qustavo/sqlhooks/v2 v2.1.0 h1:54yBemHnGHp/7xgT+pxwmIlMSDNYKx5JW5dfRAiCZi0=
github.com/qustavo/sqlhooks/v2 v2.1.0/go.mod h1:aMREyKo7fOKTwiLuWPsaHRXEmtqG4yREztO0idF83AU=
github.com/robfig/cron/v3 v3.0.1 h1:WdRxkvbJztn8LMz/QEvLN5sBU+xKpSqwwUO1Pjr4qDs=
github.com/robfig/cron/v3 v3.0.1/go.mod h1:eQICP3HwyT7UooqI/z+Ov+PtYAWygg1TEWWzGIFLtro=
github.com/rogpeppe/go-internal v1.9.0/go.mod h1:WtVeX8xhTBvf0smdhujwtBcq4Qrzq/fJaraNFVN+nFs=
github.com/rogpeppe/go-internal v1.10.0 h1:TMyTOH3F/DB16zRVcYyreMH6GnZZrwQVAoYjRBZyWFQ=
github.com/rogpeppe/go-internal v1.10.0/go.mod h1:UQnix2H7Ngw/k4C5ijL5+65zddjncjaFoBhdsK/akog=
github.com/rogpeppe/go-internal v1.14.1 h1:UQB4HGPB6osV0SQTLymcB4TgvyWu6ZyliaW0tI/otEQ=
github.com/rogpeppe/go-internal v1.14.1/go.mod h1:MaRKkUm5W0goXpeCfT7UZI6fk/L7L7so1lCWt35ZSgc=
github.com/russross/blackfriday/v2 v2.1.0 h1:JIOH55/0cWyOuilr9/qlrm0BSXldqnqwMsf35Ld67mk=
github.com/russross/blackfriday/v2 v2.1.0/go.mod h1:+Rmxgy9KzJVeS9/2gXHxylqXiyQDYRxCVz55jmeOWTM=
github.com/santhosh-tekuri/jsonschema/v5 v5.3.1 h1:lZUw3E0/J3roVtGQ+SCrUrg3ON6NgVqpn3+iol9aGu4=
@@ -269,15 +260,15 @@ github.com/sergi/go-diff v1.3.1 h1:xkr+Oxo4BOQKmkn/B9eMK0g5Kg/983T9DqqPHwYqD+8=
github.com/sergi/go-diff v1.3.1/go.mod h1:aMJSSKb2lpPvRNec0+w3fl7LP9IOFzdc9Pa4NFbPK1I=
github.com/sosodev/duration v1.3.1 h1:qtHBDMQ6lvMQsL15g4aopM4HEfOaYuhWBw3NPTtlqq4=
github.com/sosodev/duration v1.3.1/go.mod h1:RQIBBX0+fMLc/D9+Jb/fwvVmo0eZvDDEERAikUR6SDg=
github.com/spkg/bom v0.0.0-20160624110644-59b7046e48ad/go.mod h1:qLr4V1qq6nMqFKkMo8ZTx3f+BZEkzsRUY10Xsm2mwU0=
github.com/stmcginnis/gofish v0.21.3 h1:EBLCHfORnbx7MPw7lplOOVe9QAD1T3XRVz6+a1Z4z5Q=
github.com/stmcginnis/gofish v0.21.3/go.mod h1:PzF5i8ecRG9A2ol8XT64npKUunyraJ+7t0kYMpQAtqU=
github.com/stretchr/objx v0.1.0/go.mod h1:HFkY916IF+rwdDfMAkV7OtwuqBVzrE8GR6GFx+wExME=
github.com/stretchr/objx v0.4.0/go.mod h1:YvHI0jy2hoMjB+UWwv71VJQ9isScKT/TqJzVSSt89Yw=
github.com/stretchr/objx v0.5.2 h1:xuMeJ0Sdp5ZMRXx/aWO6RZxdr3beISkG5/G/aIRr3pY=
github.com/stretchr/objx v0.5.2/go.mod h1:FRsXN1f5AsAjCGJKqEizvkpNtU+EGNCLh3NxZ/8L+MA=
github.com/stretchr/testify v1.2.2/go.mod h1:a8OnRcib4nhh0OaRAV+Yts87kKdq0PP7pXfy6kDkUVs=
github.com/stretchr/testify v1.3.0/go.mod h1:M5WIy9Dh21IEIfnGCwXGc5bZfKNJtfHm1UVUgZn+9EI=
github.com/stretchr/testify v1.4.0/go.mod h1:j7eGeouHqKxXV5pUuKE4zz7dFj8WfuZ+81PSLYec5m4=
github.com/stretchr/testify v1.7.1/go.mod h1:6Fq8oRcR53rry900zMqJjRRixrwX3KX962/h/Wwjteg=
github.com/stretchr/testify v1.7.5/go.mod h1:yNjHg4UonilssWZ8iaSj1OCr/vHnekPRkoO+kdMU+MU=
github.com/stretchr/testify v1.11.1 h1:7s2iGBzp5EwR7/aIZr8ao5+dra3wiQyKjjFuvgVKu7U=
github.com/stretchr/testify v1.11.1/go.mod h1:wZwfW3scLgRK+23gO65QZefKpKQRnfz6sD981Nm4B6U=
github.com/swaggo/files v1.0.1 h1:J1bVJ4XHZNq0I46UU90611i9/YzdrF7x92oX1ig5IdE=
@@ -286,8 +277,6 @@ github.com/swaggo/http-swagger v1.3.4 h1:q7t/XLx0n15H1Q9/tk3Y9L4n210XzJF5WtnDX64
github.com/swaggo/http-swagger v1.3.4/go.mod h1:9dAh0unqMBAlbp1uE2Uc2mQTxNMU/ha4UbucIg1MFkQ=
github.com/swaggo/swag v1.16.6 h1:qBNcx53ZaX+M5dxVyTrgQ0PJ/ACK+NzhwcbieTt+9yI=
github.com/swaggo/swag v1.16.6/go.mod h1:ngP2etMK5a0P3QBizic5MEwpRmluJZPHjXcMoj4Xesg=
github.com/twpayne/go-geom v1.6.1 h1:iLE+Opv0Ihm/ABIcvQFGIiFBXd76oBIar9drAwHFhR4=
github.com/twpayne/go-geom v1.6.1/go.mod h1:Kr+Nly6BswFsKM5sd31YaoWS5PeDDH2NftJTK7Gd028=
github.com/urfave/cli/v2 v2.27.7 h1:bH59vdhbjLv3LAvIu6gd0usJHgoTTPhCFib8qqOwXYU=
github.com/urfave/cli/v2 v2.27.7/go.mod h1:CyNAG/xg+iAOg0N4MPGZqVmv2rCoP267496AOXUZjA4=
github.com/urfave/cli/v3 v3.6.1 h1:j8Qq8NyUawj/7rTYdBGrxcH7A/j7/G8Q5LhWEW4G3Mo=
@@ -296,8 +285,6 @@ github.com/vektah/gqlparser/v2 v2.5.31 h1:YhWGA1mfTjID7qJhd1+Vxhpk5HTgydrGU9IgkW
github.com/vektah/gqlparser/v2 v2.5.31/go.mod h1:c1I28gSOVNzlfc4WuDlqU7voQnsqI6OG2amkBAFmgts=
github.com/xrash/smetrics v0.0.0-20250705151800-55b8f293f342 h1:FnBeRrxr7OU4VvAzt5X7s6266i6cSVkkFPS0TuXWbIg=
github.com/xrash/smetrics v0.0.0-20250705151800-55b8f293f342/go.mod h1:Ohn+xnUBiLI6FVj/9LpzZWtj1/D6lUovWYBkxHVV3aM=
github.com/xyproto/randomstring v1.0.5 h1:YtlWPoRdgMu3NZtP45drfy1GKoojuR7hmRcnhZqKjWU=
github.com/xyproto/randomstring v1.0.5/go.mod h1:rgmS5DeNXLivK7YprL0pY+lTuhNQW3iGxZ18UQApw/E=
github.com/yuin/goldmark v1.4.13/go.mod h1:6yULJ656Px+3vBD8DxQVa3kxgyrAnzto9xy5taEt/CY=
go.uber.org/goleak v1.3.0 h1:2K3zAYmnTNqV73imy9J1T3WC+gmCePx2hEGkimedGto=
go.uber.org/goleak v1.3.0/go.mod h1:CoHD4mav9JJNrW/WLlf7HGZPjdw8EucARQHekz1X6bE=
@@ -307,33 +294,33 @@ go.yaml.in/yaml/v3 v3.0.4 h1:tfq32ie2Jv2UxXFdLJdh3jXuOzWiL1fo0bu/FbuKpbc=
go.yaml.in/yaml/v3 v3.0.4/go.mod h1:DhzuOOF2ATzADvBadXxruRBLzYTpT36CKvDb3+aBEFg=
golang.org/x/crypto v0.0.0-20190308221718-c2843e01d9a2/go.mod h1:djNgcEr1/C05ACkg1iLfiJU5Ep61QUkGW8qpdssI0+w=
golang.org/x/crypto v0.0.0-20210921155107-089bfa567519/go.mod h1:GvvjBRRGRdwPK5ydBHafDWAxML/pGHZbMvKqRZ5+Abc=
golang.org/x/crypto v0.48.0 h1:/VRzVqiRSggnhY7gNRxPauEQ5Drw9haKdM0jqfcCFts=
golang.org/x/crypto v0.48.0/go.mod h1:r0kV5h3qnFPlQnBSrULhlsRfryS2pmewsg+XfMgkVos=
golang.org/x/exp v0.0.0-20260218203240-3dfff04db8fa h1:Zt3DZoOFFYkKhDT3v7Lm9FDMEV06GpzjG2jrqW+QTE0=
golang.org/x/exp v0.0.0-20260218203240-3dfff04db8fa/go.mod h1:K79w1Vqn7PoiZn+TkNpx3BUWUQksGO3JcVX6qIjytmA=
golang.org/x/crypto v0.45.0 h1:jMBrvKuj23MTlT0bQEOBcAE0mjg8mK9RXFhRH6nyF3Q=
golang.org/x/crypto v0.45.0/go.mod h1:XTGrrkGJve7CYK7J8PEww4aY7gM3qMCElcJQ8n8JdX4=
golang.org/x/exp v0.0.0-20250620022241-b7579e27df2b h1:M2rDM6z3Fhozi9O7NWsxAkg/yqS/lQJ6PmkyIV3YP+o=
golang.org/x/exp v0.0.0-20250620022241-b7579e27df2b/go.mod h1:3//PLf8L/X+8b4vuAfHzxeRUl04Adcb341+IGKfnqS8=
golang.org/x/mod v0.6.0-dev.0.20220419223038-86c51ed26bb4/go.mod h1:jJ57K6gSWd91VN4djpZkiMVwK6gcyfeH4XE8wZrZaV4=
golang.org/x/mod v0.33.0 h1:tHFzIWbBifEmbwtGz65eaWyGiGZatSrT9prnU8DbVL8=
golang.org/x/mod v0.33.0/go.mod h1:swjeQEj+6r7fODbD2cqrnje9PnziFuw4bmLbBZFrQ5w=
golang.org/x/mod v0.30.0 h1:fDEXFVZ/fmCKProc/yAXXUijritrDzahmwwefnjoPFk=
golang.org/x/mod v0.30.0/go.mod h1:lAsf5O2EvJeSFMiBxXDki7sCgAxEUcZHXoXMKT4GJKc=
golang.org/x/net v0.0.0-20190620200207-3b0461eec859/go.mod h1:z5CRVTTTmAJ677TzLLGU+0bjPO0LkuOLi4/5GtJWs/s=
golang.org/x/net v0.0.0-20210226172049-e18ecbb05110/go.mod h1:m0MpNAwzfU5UDzcl9v0D8zg8gWTRqZa9RBIspLL5mdg=
golang.org/x/net v0.0.0-20220722155237-a158d28d115b/go.mod h1:XRhObCWvk6IyKnWLug+ECip1KBveYUHfp+8e9klMJ9c=
golang.org/x/net v0.7.0/go.mod h1:2Tu9+aMcznHK/AK1HMvgo6xiTLG5rD5rZLDS+rp2Bjs=
golang.org/x/net v0.51.0 h1:94R/GTO7mt3/4wIKpcR5gkGmRLOuE/2hNGeWq/GBIFo=
golang.org/x/net v0.51.0/go.mod h1:aamm+2QF5ogm02fjy5Bb7CQ0WMt1/WVM7FtyaTLlA9Y=
golang.org/x/oauth2 v0.35.0 h1:Mv2mzuHuZuY2+bkyWXIHMfhNdJAdwW3FuWeCPYN5GVQ=
golang.org/x/oauth2 v0.35.0/go.mod h1:lzm5WQJQwKZ3nwavOZ3IS5Aulzxi68dUSgRHujetwEA=
golang.org/x/net v0.47.0 h1:Mx+4dIFzqraBXUugkia1OOvlD6LemFo1ALMHjrXDOhY=
golang.org/x/net v0.47.0/go.mod h1:/jNxtkgq5yWUGYkaZGqo27cfGZ1c5Nen03aYrrKpVRU=
golang.org/x/oauth2 v0.32.0 h1:jsCblLleRMDrxMN29H3z/k1KliIvpLgCkE6R8FXXNgY=
golang.org/x/oauth2 v0.32.0/go.mod h1:lzm5WQJQwKZ3nwavOZ3IS5Aulzxi68dUSgRHujetwEA=
golang.org/x/sync v0.0.0-20190423024810-112230192c58/go.mod h1:RxMgew5VJxzue5/jJTE5uejpjVlOe/izrB70Jof72aM=
golang.org/x/sync v0.0.0-20220722155255-886fb9371eb4/go.mod h1:RxMgew5VJxzue5/jJTE5uejpjVlOe/izrB70Jof72aM=
golang.org/x/sync v0.19.0 h1:vV+1eWNmZ5geRlYjzm2adRgW2/mcpevXNg50YZtPCE4=
golang.org/x/sync v0.19.0/go.mod h1:9KTHXmSnoGruLpwFjVSX0lNNA75CykiMECbovNTZqGI=
golang.org/x/sync v0.18.0 h1:kr88TuHDroi+UVf+0hZnirlk8o8T+4MrK6mr60WkH/I=
golang.org/x/sync v0.18.0/go.mod h1:9KTHXmSnoGruLpwFjVSX0lNNA75CykiMECbovNTZqGI=
golang.org/x/sys v0.0.0-20190215142949-d0b11bdaac8a/go.mod h1:STP8DvDyc/dI5b8T5hshtkjS+E42TnysNCUPdjciGhY=
golang.org/x/sys v0.0.0-20201119102817-f84b799fce68/go.mod h1:h1NjWce9XRLGQEsW7wpKNCjG9DtNlClVuFLEZdDNbEs=
golang.org/x/sys v0.0.0-20210615035016-665e8c7367d1/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
golang.org/x/sys v0.0.0-20220520151302-bc2c85ada10a/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
golang.org/x/sys v0.0.0-20220722155257-8c9f86f7a55f/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
golang.org/x/sys v0.5.0/go.mod h1:oPkhp1MJrh7nUepCBck5+mAzfO9JrbApNNgaTdGDITg=
golang.org/x/sys v0.41.0 h1:Ivj+2Cp/ylzLiEU89QhWblYnOE9zerudt9Ftecq2C6k=
golang.org/x/sys v0.41.0/go.mod h1:OgkHotnGiDImocRcuBABYBEXf8A9a87e/uXjp9XT3ks=
golang.org/x/sys v0.38.0 h1:3yZWxaJjBmCWXqhN1qh02AkOnCQ1poK6oF+a7xWL6Gc=
golang.org/x/sys v0.38.0/go.mod h1:OgkHotnGiDImocRcuBABYBEXf8A9a87e/uXjp9XT3ks=
golang.org/x/term v0.0.0-20201126162022-7de9c90e9dd1/go.mod h1:bj7SfCRtBDWHUb9snDiAeCFNEtKQo2Wmx5Cou7ajbmo=
golang.org/x/term v0.0.0-20210927222741-03fcf44c2211/go.mod h1:jbD1KX2456YbFQfuXm/mYQcufACuNUgVhRMnK/tPxf8=
golang.org/x/term v0.5.0/go.mod h1:jMB1sMXY+tzblOD4FWmEbocvup2/aLOaQEp7JmGp78k=
@@ -341,23 +328,26 @@ golang.org/x/text v0.3.0/go.mod h1:NqM8EUOU14njkJ3fqMW+pc6Ldnwhi/IjpwHt7yyuwOQ=
golang.org/x/text v0.3.3/go.mod h1:5Zoc/QRtKVWzQhOtBMvqHzDpF6irO9z98xDceosuGiQ=
golang.org/x/text v0.3.7/go.mod h1:u+2+/6zg+i71rQMx5EYifcz6MCKuco9NR6JIITiCfzQ=
golang.org/x/text v0.7.0/go.mod h1:mrYo+phRRbMaCq/xk9113O4dZlRixOauAjOtrjsXDZ8=
golang.org/x/text v0.34.0 h1:oL/Qq0Kdaqxa1KbNeMKwQq0reLCCaFtqu2eNuSeNHbk=
golang.org/x/text v0.34.0/go.mod h1:homfLqTYRFyVYemLBFl5GgL/DWEiH5wcsQ5gSh1yziA=
golang.org/x/text v0.31.0 h1:aC8ghyu4JhP8VojJ2lEHBnochRno1sgL6nEi9WGFGMM=
golang.org/x/text v0.31.0/go.mod h1:tKRAlv61yKIjGGHX/4tP1LTbc13YSec1pxVEWXzfoeM=
golang.org/x/time v0.14.0 h1:MRx4UaLrDotUKUdCIqzPC48t1Y9hANFKIRpNx+Te8PI=
golang.org/x/time v0.14.0/go.mod h1:eL/Oa2bBBK0TkX57Fyni+NgnyQQN4LitPmob2Hjnqw4=
golang.org/x/tools v0.0.0-20180917221912-90fa682c2a6e/go.mod h1:n7NCudcB/nEzxVGmLbDWY5pfWTLqBcC2KZ6jyYvM4mQ=
golang.org/x/tools v0.0.0-20191119224855-298f0cb1881e/go.mod h1:b+2E5dAYhXwXZwtnZ6UAqBI28+e2cm9otk0dWdXHAEo=
golang.org/x/tools v0.1.12/go.mod h1:hNGJHUnrk76NpqgfD5Aqm5Crs+Hm0VOH/i9J2+nxYbc=
golang.org/x/tools v0.42.0 h1:uNgphsn75Tdz5Ji2q36v/nsFSfR/9BRFvqhGBaJGd5k=
golang.org/x/tools v0.42.0/go.mod h1:Ma6lCIwGZvHK6XtgbswSoWroEkhugApmsXyrUmBhfr0=
golang.org/x/tools v0.39.0 h1:ik4ho21kwuQln40uelmciQPp9SipgNDdrafrYA4TmQQ=
golang.org/x/tools v0.39.0/go.mod h1:JnefbkDPyD8UU2kI5fuf8ZX4/yUeh9W877ZeBONxUqQ=
golang.org/x/xerrors v0.0.0-20190717185122-a985d3407aa7/go.mod h1:I/5z698sn9Ka8TeJc9MKroUUfqBBauWjQqLJ2OPfmY0=
google.golang.org/protobuf v1.36.11 h1:fV6ZwhNocDyBLK0dj+fg8ektcVegBBuEolpbTQyBNVE=
google.golang.org/protobuf v1.36.11/go.mod h1:HTf+CrKn2C3g5S8VImy6tdcUvCska2kB7j23XfzDpco=
golang.org/x/xerrors v0.0.0-20191204190536-9bdfabe68543/go.mod h1:I/5z698sn9Ka8TeJc9MKroUUfqBBauWjQqLJ2OPfmY0=
google.golang.org/protobuf v1.36.10 h1:AYd7cD/uASjIL6Q9LiTjz8JLcrh/88q5UObnmY3aOOE=
google.golang.org/protobuf v1.36.10/go.mod h1:HTf+CrKn2C3g5S8VImy6tdcUvCska2kB7j23XfzDpco=
gopkg.in/check.v1 v0.0.0-20161208181325-20d25e280405/go.mod h1:Co6ibVJAznAaIkqp8huTwlJQCZ016jof/cbN4VW5Yz0=
gopkg.in/check.v1 v1.0.0-20200227125254-8fa46927fb4f/go.mod h1:Co6ibVJAznAaIkqp8huTwlJQCZ016jof/cbN4VW5Yz0=
gopkg.in/check.v1 v1.0.0-20201130134442-10cb98267c6c h1:Hei/4ADfdWqJk1ZMxUNpqntNwaWcugrBjAiHlqqRiVk=
gopkg.in/check.v1 v1.0.0-20201130134442-10cb98267c6c/go.mod h1:JHkPIbrfpd72SG/EVd6muEfDQjcINNoR0C8j2r3qZ4Q=
gopkg.in/yaml.v2 v2.2.2/go.mod h1:hI93XBmqTisBFMUTm0b8Fm+jr3Dg1NNxqwp+5A1VGuI=
gopkg.in/yaml.v3 v3.0.0-20200313102051-9f266ea9e77c/go.mod h1:K4uyk7z7BCEPqu6E+C64Yfv1cQ7kz7rIZviUmN+EgEM=
gopkg.in/yaml.v3 v3.0.0-20200615113413-eeeca48fe776/go.mod h1:K4uyk7z7BCEPqu6E+C64Yfv1cQ7kz7rIZviUmN+EgEM=
gopkg.in/yaml.v3 v3.0.1 h1:fxVm/GzAzEWqLHuvctI91KS9hhNmmWOoWu0XTYJS7CA=
gopkg.in/yaml.v3 v3.0.1/go.mod h1:K4uyk7z7BCEPqu6E+C64Yfv1cQ7kz7rIZviUmN+EgEM=
sigs.k8s.io/yaml v1.6.0 h1:G8fkbMSAFqgEFgh4b1wmtzDnioxFCUgTZhlbj5P9QYs=


@@ -52,51 +52,51 @@ models:
- github.com/99designs/gqlgen/graphql.Int64
- github.com/99designs/gqlgen/graphql.Int32
Job:
model: "github.com/ClusterCockpit/cc-lib/v2/schema.Job"
model: "github.com/ClusterCockpit/cc-lib/schema.Job"
fields:
tags:
resolver: true
metaData:
resolver: true
Cluster:
model: "github.com/ClusterCockpit/cc-lib/v2/schema.Cluster"
model: "github.com/ClusterCockpit/cc-lib/schema.Cluster"
fields:
partitions:
resolver: true
# Node:
# model: "github.com/ClusterCockpit/cc-lib/v2/schema.Node"
# model: "github.com/ClusterCockpit/cc-lib/schema.Node"
# fields:
# metaData:
# resolver: true
NullableFloat: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Float" }
MetricScope: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.MetricScope" }
MetricValue: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.MetricValue" }
NullableFloat: { model: "github.com/ClusterCockpit/cc-lib/schema.Float" }
MetricScope: { model: "github.com/ClusterCockpit/cc-lib/schema.MetricScope" }
MetricValue: { model: "github.com/ClusterCockpit/cc-lib/schema.MetricValue" }
JobStatistics:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.JobStatistics" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.JobStatistics" }
GlobalMetricListItem:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.GlobalMetricListItem" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.GlobalMetricListItem" }
ClusterSupport:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.ClusterSupport" }
Tag: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Tag" }
Resource: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Resource" }
JobState: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.JobState" }
Node: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Node" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.ClusterSupport" }
Tag: { model: "github.com/ClusterCockpit/cc-lib/schema.Tag" }
Resource: { model: "github.com/ClusterCockpit/cc-lib/schema.Resource" }
JobState: { model: "github.com/ClusterCockpit/cc-lib/schema.JobState" }
Node: { model: "github.com/ClusterCockpit/cc-lib/schema.Node" }
SchedulerState:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.SchedulerState" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.SchedulerState" }
HealthState:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.MonitoringState" }
JobMetric: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.JobMetric" }
Series: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Series" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.MonitoringState" }
JobMetric: { model: "github.com/ClusterCockpit/cc-lib/schema.JobMetric" }
Series: { model: "github.com/ClusterCockpit/cc-lib/schema.Series" }
MetricStatistics:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.MetricStatistics" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.MetricStatistics" }
MetricConfig:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.MetricConfig" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.MetricConfig" }
SubClusterConfig:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.SubClusterConfig" }
Accelerator: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Accelerator" }
Topology: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Topology" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.SubClusterConfig" }
Accelerator: { model: "github.com/ClusterCockpit/cc-lib/schema.Accelerator" }
Topology: { model: "github.com/ClusterCockpit/cc-lib/schema.Topology" }
FilterRanges:
{ model: "github.com/ClusterCockpit/cc-lib/v2/schema.FilterRanges" }
SubCluster: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.SubCluster" }
StatsSeries: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.StatsSeries" }
Unit: { model: "github.com/ClusterCockpit/cc-lib/v2/schema.Unit" }
{ model: "github.com/ClusterCockpit/cc-lib/schema.FilterRanges" }
SubCluster: { model: "github.com/ClusterCockpit/cc-lib/schema.SubCluster" }
StatsSeries: { model: "github.com/ClusterCockpit/cc-lib/schema.StatsSeries" }
Unit: { model: "github.com/ClusterCockpit/cc-lib/schema.Unit" }


@@ -12,7 +12,7 @@ NotifyAccess=all
Restart=on-failure
RestartSec=30
TimeoutStopSec=100
ExecStart=/opt/monitoring/cc-backend/cc-backend --config ./config.json --server
ExecStart=/opt/monitoring/cc-backend/cc-backend --config ./config.json
[Install]
WantedBy=multi-user.target


@@ -17,51 +17,53 @@ import (
"strings"
"testing"
"time"
"sync"
"github.com/ClusterCockpit/cc-backend/internal/api"
"github.com/ClusterCockpit/cc-backend/internal/archiver"
"github.com/ClusterCockpit/cc-backend/internal/auth"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/graph"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
"github.com/ClusterCockpit/cc-backend/internal/memorystore"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
ccconf "github.com/ClusterCockpit/cc-lib/v2/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/go-chi/chi/v5"
ccconf "github.com/ClusterCockpit/cc-lib/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/gorilla/mux"
_ "github.com/mattn/go-sqlite3"
)
func setup(t *testing.T) *api.RestAPI {
repository.ResetConnection()
const testconfig = `{
"main": {
"addr": "0.0.0.0:8080",
"validate": false,
"api-allowed-ips": [
"*"
]
},
"metric-store": {
"checkpoints": {
"interval": "12h"
},
"retention-in-memory": "48h",
"memory-cap": 100
},
"main": {
"addr": "0.0.0.0:8080",
"validate": false,
"apiAllowedIPs": [
"*"
]
},
"archive": {
"kind": "file",
"path": "./var/job-archive"
"kind": "file",
"path": "./var/job-archive"
},
"auth": {
"jwts": {
"max-age": "2m"
}
"jwts": {
"max-age": "2m"
}
},
"clusters": [
{
"name": "testcluster",
"filterRanges": {
"numNodes": { "from": 1, "to": 64 },
"duration": { "from": 0, "to": 86400 },
"startTime": { "from": "2022-01-01T00:00:00Z", "to": null }
}
}
]
}`
const testclusterJSON = `{
"name": "testcluster",
@@ -150,23 +152,32 @@ func setup(t *testing.T) *api.RestAPI {
}
ccconf.Init(cfgFilePath)
metricstore.MetricStoreHandle = &metricstore.InternalMetricStore{}
// Load and check main configuration
if cfg := ccconf.GetPackageConfig("main"); cfg != nil {
config.Init(cfg)
if clustercfg := ccconf.GetPackageConfig("clusters"); clustercfg != nil {
config.Init(cfg, clustercfg)
} else {
cclog.Abort("Cluster configuration must be present")
}
} else {
cclog.Abort("Main configuration must be present")
}
archiveCfg := fmt.Sprintf("{\"kind\": \"file\",\"path\": \"%s\"}", jobarchive)
repository.Connect(dbfilepath)
repository.Connect("sqlite3", dbfilepath)
if err := archive.Init(json.RawMessage(archiveCfg)); err != nil {
if err := archive.Init(json.RawMessage(archiveCfg), config.Keys.DisableArchive); err != nil {
t.Fatal(err)
}
// metricstore initialization removed - it's initialized via callback in tests
// Initialize memorystore (optional - will return nil if not configured)
// For this test, we don't initialize it to test the nil handling
mscfg := ccconf.GetPackageConfig("metric-store")
if mscfg != nil {
var wg sync.WaitGroup
memorystore.Init(mscfg, &wg)
}
archiver.Start(repository.GetJobRepository(), context.Background())
@@ -183,43 +194,30 @@ func setup(t *testing.T) *api.RestAPI {
}
func cleanup() {
// Gracefully shutdown archiver with timeout
if err := archiver.Shutdown(5 * time.Second); err != nil {
cclog.Warnf("Archiver shutdown timeout in tests: %v", err)
}
// Shutdown memorystore if it was initialized
memorystore.Shutdown()
}
/*
* This function starts a job, stops it, and then reads its data from the job-archive.
* Do not run sub-tests in parallel! Tests should not be run in parallel at all, because
* at least `setup` modifies global state.
* This function starts a job, stops it, and tests the REST API.
* Do not run sub-tests in parallel! Tests should not be run in parallel at all, because
* at least `setup` modifies global state.
*/
func TestRestApi(t *testing.T) {
restapi := setup(t)
t.Cleanup(cleanup)
testData := schema.JobData{
"load_one": map[schema.MetricScope]*schema.JobMetric{
schema.MetricScopeNode: {
Unit: schema.Unit{Base: "load"},
Timestep: 60,
Series: []schema.Series{
{
Hostname: "host123",
Statistics: schema.MetricStatistics{Min: 0.1, Avg: 0.2, Max: 0.3},
Data: []schema.Float{0.1, 0.1, 0.1, 0.2, 0.2, 0.2, 0.3, 0.3, 0.3},
},
},
},
},
}
metricstore.TestLoadDataCallback = func(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error) {
return testData, nil
}
r := chi.NewRouter()
r := mux.NewRouter()
r.PathPrefix("/api").Subrouter()
r.StrictSlash(true)
restapi.MountAPIRoutes(r)
var TestJobID int64 = 123
var TestJobId int64 = 123
TestClusterName := "testcluster"
var TestStartTime int64 = 123456789
@@ -267,18 +265,12 @@ func TestRestApi(t *testing.T) {
if response.StatusCode != http.StatusCreated {
t.Fatal(response.Status, recorder.Body.String())
}
// resolver := graph.GetResolverInstance()
restapi.JobRepository.SyncJobs()
job, err := restapi.JobRepository.Find(&TestJobID, &TestClusterName, &TestStartTime)
job, err := restapi.JobRepository.Find(&TestJobId, &TestClusterName, &TestStartTime)
if err != nil {
t.Fatal(err)
}
// job.Tags, err = resolver.Job().Tags(ctx, job)
// if err != nil {
// t.Fatal(err)
// }
if job.JobID != 123 ||
job.User != "testuser" ||
job.Project != "testproj" ||
@@ -296,10 +288,6 @@ func TestRestApi(t *testing.T) {
job.StartTime != 123456789 {
t.Fatalf("unexpected job properties: %#v", job)
}
// if len(job.Tags) != 1 || job.Tags[0].Type != "testTagType" || job.Tags[0].Name != "testTagName" || job.Tags[0].Scope != "testuser" {
// t.Fatalf("unexpected tags: %#v", job.Tags)
// }
}); !ok {
return
}
@@ -313,7 +301,6 @@ func TestRestApi(t *testing.T) {
"stopTime": 123457789
}`
var stoppedJob *schema.Job
if ok := t.Run("StopJob", func(t *testing.T) {
req := httptest.NewRequest(http.MethodPost, "/jobs/stop_job/", bytes.NewBuffer([]byte(stopJobBody)))
recorder := httptest.NewRecorder()
@@ -327,7 +314,7 @@ func TestRestApi(t *testing.T) {
}
// Archiving happens asynchronously, will be completed in cleanup
job, err := restapi.JobRepository.Find(&TestJobID, &TestClusterName, &TestStartTime)
job, err := restapi.JobRepository.Find(&TestJobId, &TestClusterName, &TestStartTime)
if err != nil {
t.Fatal(err)
}
@@ -349,21 +336,12 @@ func TestRestApi(t *testing.T) {
t.Fatalf("unexpected job.metaData: %#v", job.MetaData)
}
stoppedJob = job
}); !ok {
return
}
t.Run("CheckArchive", func(t *testing.T) {
data, err := metricdispatch.LoadData(stoppedJob, []string{"load_one"}, []schema.MetricScope{schema.MetricScopeNode}, context.Background(), 60)
if err != nil {
t.Fatal(err)
}
if !reflect.DeepEqual(data, testData) {
t.Fatal("unexpected data fetched from archive")
}
})
// Note: We skip the CheckArchive test because without memorystore initialized,
// archiving will fail gracefully. This test now focuses on the REST API itself.
t.Run("CheckDoubleStart", func(t *testing.T) {
// Starting a job with the same jobId and cluster should only be allowed if the startTime is far apart!
@@ -453,198 +431,4 @@ func TestRestApi(t *testing.T) {
if !ok {
t.Fatal("subtest failed")
}
t.Run("GetUsedNodesNoRunning", func(t *testing.T) {
contextUserValue := &schema.User{
Username: "testuser",
Projects: make([]string, 0),
Roles: []string{"api"},
AuthType: 0,
AuthSource: 2,
}
req := httptest.NewRequest(http.MethodGet, "/jobs/used_nodes?ts=123456790", nil)
recorder := httptest.NewRecorder()
ctx := context.WithValue(req.Context(), contextUserKey, contextUserValue)
r.ServeHTTP(recorder, req.WithContext(ctx))
response := recorder.Result()
if response.StatusCode != http.StatusOK {
t.Fatal(response.Status, recorder.Body.String())
}
var result api.GetUsedNodesAPIResponse
if err := json.NewDecoder(response.Body).Decode(&result); err != nil {
t.Fatal(err)
}
if result.UsedNodes == nil {
t.Fatal("expected usedNodes to be non-nil")
}
if len(result.UsedNodes) != 0 {
t.Fatalf("expected no used nodes for stopped jobs, got: %v", result.UsedNodes)
}
})
}
// TestStopJobWithReusedJobId verifies that stopping a recently started job works
// even when an older job with the same jobId exists in the job table (e.g. with
// state "failed"). This is a regression test for the bug where Find() on the job
// table would match the old job instead of the new one still in job_cache.
func TestStopJobWithReusedJobId(t *testing.T) {
restapi := setup(t)
t.Cleanup(cleanup)
testData := schema.JobData{
"load_one": map[schema.MetricScope]*schema.JobMetric{
schema.MetricScopeNode: {
Unit: schema.Unit{Base: "load"},
Timestep: 60,
Series: []schema.Series{
{
Hostname: "host123",
Statistics: schema.MetricStatistics{Min: 0.1, Avg: 0.2, Max: 0.3},
Data: []schema.Float{0.1, 0.1, 0.1, 0.2, 0.2, 0.2, 0.3, 0.3, 0.3},
},
},
},
},
}
metricstore.TestLoadDataCallback = func(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error) {
return testData, nil
}
r := chi.NewRouter()
restapi.MountAPIRoutes(r)
const contextUserKey repository.ContextKey = "user"
contextUserValue := &schema.User{
Username: "testuser",
Projects: make([]string, 0),
Roles: []string{"user"},
AuthType: 0,
AuthSource: 2,
}
// Step 1: Start the first job (jobId=999)
const startJobBody1 string = `{
"jobId": 999,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "default",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [{"hostname": "host123", "hwthreads": [0, 1, 2, 3, 4, 5, 6, 7]}],
"startTime": 200000000
}`
if ok := t.Run("StartFirstJob", func(t *testing.T) {
req := httptest.NewRequest(http.MethodPost, "/jobs/start_job/", bytes.NewBuffer([]byte(startJobBody1)))
recorder := httptest.NewRecorder()
ctx := context.WithValue(req.Context(), contextUserKey, contextUserValue)
r.ServeHTTP(recorder, req.WithContext(ctx))
if recorder.Result().StatusCode != http.StatusCreated {
t.Fatal(recorder.Result().Status, recorder.Body.String())
}
}); !ok {
return
}
// Step 2: Sync to move job from cache to job table, then stop it as "failed"
time.Sleep(1 * time.Second)
restapi.JobRepository.SyncJobs()
const stopJobBody1 string = `{
"jobId": 999,
"startTime": 200000000,
"cluster": "testcluster",
"jobState": "failed",
"stopTime": 200001000
}`
if ok := t.Run("StopFirstJobAsFailed", func(t *testing.T) {
req := httptest.NewRequest(http.MethodPost, "/jobs/stop_job/", bytes.NewBuffer([]byte(stopJobBody1)))
recorder := httptest.NewRecorder()
ctx := context.WithValue(req.Context(), contextUserKey, contextUserValue)
r.ServeHTTP(recorder, req.WithContext(ctx))
if recorder.Result().StatusCode != http.StatusOK {
t.Fatal(recorder.Result().Status, recorder.Body.String())
}
jobid, cluster := int64(999), "testcluster"
job, err := restapi.JobRepository.Find(&jobid, &cluster, nil)
if err != nil {
t.Fatal(err)
}
if job.State != schema.JobStateFailed {
t.Fatalf("expected first job to be failed, got: %s", job.State)
}
}); !ok {
return
}
// Wait for archiving to complete
time.Sleep(1 * time.Second)
// Step 3: Start a NEW job with the same jobId=999 but different startTime.
// This job will sit in job_cache (not yet synced).
const startJobBody2 string = `{
"jobId": 999,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "default",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [{"hostname": "host123", "hwthreads": [0, 1, 2, 3, 4, 5, 6, 7]}],
"startTime": 300000000
}`
if ok := t.Run("StartSecondJob", func(t *testing.T) {
req := httptest.NewRequest(http.MethodPost, "/jobs/start_job/", bytes.NewBuffer([]byte(startJobBody2)))
recorder := httptest.NewRecorder()
ctx := context.WithValue(req.Context(), contextUserKey, contextUserValue)
r.ServeHTTP(recorder, req.WithContext(ctx))
if recorder.Result().StatusCode != http.StatusCreated {
t.Fatal(recorder.Result().Status, recorder.Body.String())
}
}); !ok {
return
}
// Step 4: Stop the second job WITHOUT syncing first.
// Before the fix, this would fail because Find() on the job table would
// match the old failed job (jobId=999) and reject with "already stopped".
const stopJobBody2 string = `{
"jobId": 999,
"startTime": 300000000,
"cluster": "testcluster",
"jobState": "completed",
"stopTime": 300001000
}`
t.Run("StopSecondJobBeforeSync", func(t *testing.T) {
req := httptest.NewRequest(http.MethodPost, "/jobs/stop_job/", bytes.NewBuffer([]byte(stopJobBody2)))
recorder := httptest.NewRecorder()
ctx := context.WithValue(req.Context(), contextUserKey, contextUserValue)
r.ServeHTTP(recorder, req.WithContext(ctx))
if recorder.Result().StatusCode != http.StatusOK {
t.Fatalf("expected stop to succeed for cached job, got: %s %s",
recorder.Result().Status, recorder.Body.String())
}
})
}
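The regression test above hinges on lookup order: a freshly started job that reuses a jobId lives in `job_cache` and must be matched before an older, already-stopped row in the main job table. A minimal sketch of that cache-first lookup, with plain slices standing in for the real `job_cache` and `job` tables (names and types here are illustrative, not cc-backend's):

```go
package main

import "fmt"

type job struct {
	ID        int64
	JobID     int64
	StartTime int64
	State     string
}

// findForStop consults the cache before the main table, so a new job that
// reuses a jobId is found ahead of an old job with the same jobId.
func findForStop(cache, table []*job, jobID, startTime int64) (j *job, isCached bool) {
	for _, c := range cache {
		if c.JobID == jobID && c.StartTime == startTime {
			return c, true
		}
	}
	for _, t := range table {
		if t.JobID == jobID && t.StartTime == startTime {
			return t, false
		}
	}
	return nil, false
}

func main() {
	table := []*job{{ID: 1, JobID: 999, StartTime: 200000000, State: "failed"}}
	cache := []*job{{ID: 2, JobID: 999, StartTime: 300000000, State: "running"}}
	j, cached := findForStop(cache, table, 999, 300000000)
	fmt.Println(j.State, cached) // running true
}
```

Because the startTime is part of the match key, the old failed job never shadows the new cached one.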


@@ -13,7 +13,7 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/schema"
)
// GetClustersAPIResponse model
@@ -27,7 +27,7 @@ type GetClustersAPIResponse struct {
// @description Get a list of all cluster configs. Specific cluster can be requested using query parameter.
// @produce json
// @param cluster query string false "Job Cluster"
// @success 200 {object} api.GetClustersAPIResponse "Array of clusters"
// @success 200 {object} api.GetClustersApiResponse "Array of clusters"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -36,9 +36,9 @@ type GetClustersAPIResponse struct {
// @router /api/clusters/ [get]
func (api *RestAPI) getClusters(rw http.ResponseWriter, r *http.Request) {
if user := repository.GetUserFromContext(r.Context()); user != nil &&
!user.HasRole(schema.RoleAPI) {
!user.HasRole(schema.RoleApi) {
handleError(fmt.Errorf("missing role: %v", schema.GetRoleString(schema.RoleAPI)), http.StatusForbidden, rw)
handleError(fmt.Errorf("missing role: %v", schema.GetRoleString(schema.RoleApi)), http.StatusForbidden, rw)
return
}

File diff suppressed because it is too large


@@ -22,12 +22,12 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/graph"
"github.com/ClusterCockpit/cc-backend/internal/graph/model"
"github.com/ClusterCockpit/cc-backend/internal/importer"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatcher"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/go-chi/chi/v5"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/gorilla/mux"
)
const (
@@ -72,14 +72,6 @@ type EditMetaRequest struct {
Value string `json:"value" example:"bash script"`
}
// JobMetaRequest model
type JobMetaRequest struct {
JobId *int64 `json:"jobId" validate:"required" example:"123000"` // Cluster Job ID of job
Cluster *string `json:"cluster" example:"fritz"` // Cluster of job
StartTime *int64 `json:"startTime" example:"1649723812"` // Start Time of job as epoch
Payload EditMetaRequest `json:"payload"` // Content to Add to Job Meta_Data
}
type TagJobAPIRequest []*APITag
type GetJobAPIRequest []string
@@ -112,7 +104,7 @@ type JobMetricWithName struct {
// @param items-per-page query int false "Items per page (Default: 25)"
// @param page query int false "Page Number (Default: 1)"
// @param with-metadata query bool false "Include metadata (e.g. jobScript) in response"
// @success 200 {object} api.GetJobsAPIResponse "Job array and page info"
// @success 200 {object} api.GetJobsApiResponse "Job array and page info"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -240,7 +232,7 @@ func (api *RestAPI) getJobs(rw http.ResponseWriter, r *http.Request) {
// @produce json
// @param id path int true "Database ID of Job"
// @param all-metrics query bool false "Include all available metrics"
// @success 200 {object} api.GetJobAPIResponse "Job resource"
// @success 200 {object} api.GetJobApiResponse "Job resource"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -251,10 +243,10 @@ func (api *RestAPI) getJobs(rw http.ResponseWriter, r *http.Request) {
// @router /api/jobs/{id} [get]
func (api *RestAPI) getCompleteJobByID(rw http.ResponseWriter, r *http.Request) {
// Fetch job from db
id := chi.URLParam(r, "id")
id, ok := mux.Vars(r)["id"]
var job *schema.Job
var err error
if id != "" {
if ok {
id, e := strconv.ParseInt(id, 10, 64)
if e != nil {
handleError(fmt.Errorf("integer expected in path for id: %w", e), http.StatusBadRequest, rw)
@@ -301,7 +293,7 @@ func (api *RestAPI) getCompleteJobByID(rw http.ResponseWriter, r *http.Request)
}
if r.URL.Query().Get("all-metrics") == "true" {
data, err = metricdispatch.LoadData(job, nil, scopes, r.Context(), resolution)
data, err = metricdispatcher.LoadData(job, nil, scopes, r.Context(), resolution)
if err != nil {
cclog.Warnf("REST: error while loading all-metrics job data for JobID %d on %s", job.JobID, job.Cluster)
return
@@ -332,8 +324,8 @@ func (api *RestAPI) getCompleteJobByID(rw http.ResponseWriter, r *http.Request)
// @accept json
// @produce json
// @param id path int true "Database ID of Job"
// @param request body api.GetJobAPIRequest true "Array of metric names"
// @success 200 {object} api.GetJobAPIResponse "Job resource"
// @param request body api.GetJobApiRequest true "Array of metric names"
// @success 200 {object} api.GetJobApiResponse "Job resource"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -344,10 +336,10 @@ func (api *RestAPI) getCompleteJobByID(rw http.ResponseWriter, r *http.Request)
// @router /api/jobs/{id} [post]
func (api *RestAPI) getJobByID(rw http.ResponseWriter, r *http.Request) {
// Fetch job from db
id := chi.URLParam(r, "id")
id, ok := mux.Vars(r)["id"]
var job *schema.Job
var err error
if id != "" {
if ok {
id, e := strconv.ParseInt(id, 10, 64)
if e != nil {
handleError(fmt.Errorf("integer expected in path for id: %w", e), http.StatusBadRequest, rw)
@@ -397,7 +389,7 @@ func (api *RestAPI) getJobByID(rw http.ResponseWriter, r *http.Request) {
resolution = max(resolution, mc.Timestep)
}
data, err := metricdispatch.LoadData(job, metrics, scopes, r.Context(), resolution)
data, err := metricdispatcher.LoadData(job, metrics, scopes, r.Context(), resolution)
if err != nil {
cclog.Warnf("REST: error while loading job data for JobID %d on %s", job.JobID, job.Cluster)
return
@@ -431,23 +423,23 @@ func (api *RestAPI) getJobByID(rw http.ResponseWriter, r *http.Request) {
}
// editMeta godoc
// @summary Edit meta-data json of job identified by database id
// @summary Edit meta-data json
// @tags Job add and modify
// @description Edit key value pairs in job metadata json of job specified by database id
// @description Edit key value pairs in job metadata json
// @description If a key already exists its content will be overwritten
// @accept json
// @produce json
// @param id path int true "Job Database ID"
// @param request body api.EditMetaRequest true "Metadata Key value pair to add or update"
// @param request body api.EditMetaRequest true "Key value pair to add"
// @success 200 {object} schema.Job "Updated job resource"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 404 {object} api.ErrorResponse "Job does not exist"
// @failure 500 {object} api.ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /api/jobs/edit_meta/{id} [patch]
// @router /api/jobs/edit_meta/{id} [post]
func (api *RestAPI) editMeta(rw http.ResponseWriter, r *http.Request) {
id, err := strconv.ParseInt(chi.URLParam(r, "id"), 10, 64)
id, err := strconv.ParseInt(mux.Vars(r)["id"], 10, 64)
if err != nil {
handleError(fmt.Errorf("parsing job ID failed: %w", err), http.StatusBadRequest, rw)
return
@@ -477,54 +469,6 @@ func (api *RestAPI) editMeta(rw http.ResponseWriter, r *http.Request) {
}
}
// editMetaByRequest godoc
// @summary Edit meta-data json of job identified by request
// @tags Job add and modify
// @description Edit key value pairs in metadata json of job specified by jobID, StartTime and Cluster
// @description If a key already exists its content will be overwritten
// @accept json
// @produce json
// @param request body api.JobMetaRequest true "Specifies job and payload to add or update"
// @success 200 {object} schema.Job "Updated job resource"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 404 {object} api.ErrorResponse "Job does not exist"
// @failure 500 {object} api.ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /api/jobs/edit_meta/ [patch]
func (api *RestAPI) editMetaByRequest(rw http.ResponseWriter, r *http.Request) {
// Parse request body
req := JobMetaRequest{}
if err := decode(r.Body, &req); err != nil {
handleError(fmt.Errorf("parsing request body failed: %w", err), http.StatusBadRequest, rw)
return
}
// Fetch job (that will have its meta_data edited) from db
var job *schema.Job
var err error
if req.JobId == nil {
handleError(errors.New("the field 'jobId' is required"), http.StatusBadRequest, rw)
return
}
// log.Printf("loading db job for editMetaByRequest... : JobMetaRequest=%v", req)
job, err = api.JobRepository.Find(req.JobId, req.Cluster, req.StartTime)
if err != nil {
handleError(fmt.Errorf("finding job failed: %w", err), http.StatusUnprocessableEntity, rw)
return
}
if err := api.JobRepository.UpdateMetadata(job, req.Payload.Key, req.Payload.Value); err != nil {
http.Error(rw, err.Error(), http.StatusInternalServerError)
return
}
rw.Header().Add("Content-Type", "application/json")
rw.WriteHeader(http.StatusOK)
json.NewEncoder(rw).Encode(job)
}
// tagJob godoc
// @summary Adds one or more tags to a job
// @tags Job add and modify
@@ -534,7 +478,7 @@ func (api *RestAPI) editMetaByRequest(rw http.ResponseWriter, r *http.Request) {
// @accept json
// @produce json
// @param id path int true "Job Database ID"
// @param request body api.TagJobAPIRequest true "Array of tag-objects to add"
// @param request body api.TagJobApiRequest true "Array of tag-objects to add"
// @success 200 {object} schema.Job "Updated job resource"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
@@ -543,7 +487,7 @@ func (api *RestAPI) editMetaByRequest(rw http.ResponseWriter, r *http.Request) {
// @security ApiKeyAuth
// @router /api/jobs/tag_job/{id} [post]
func (api *RestAPI) tagJob(rw http.ResponseWriter, r *http.Request) {
id, err := strconv.ParseInt(chi.URLParam(r, "id"), 10, 64)
id, err := strconv.ParseInt(mux.Vars(r)["id"], 10, 64)
if err != nil {
handleError(fmt.Errorf("parsing job ID failed: %w", err), http.StatusBadRequest, rw)
return
@@ -598,7 +542,7 @@ func (api *RestAPI) tagJob(rw http.ResponseWriter, r *http.Request) {
// @accept json
// @produce json
// @param id path int true "Job Database ID"
// @param request body api.TagJobAPIRequest true "Array of tag-objects to remove"
// @param request body api.TagJobApiRequest true "Array of tag-objects to remove"
// @success 200 {object} schema.Job "Updated job resource"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
@@ -607,7 +551,7 @@ func (api *RestAPI) tagJob(rw http.ResponseWriter, r *http.Request) {
// @security ApiKeyAuth
// @router /jobs/tag_job/{id} [delete]
func (api *RestAPI) removeTagJob(rw http.ResponseWriter, r *http.Request) {
id, err := strconv.ParseInt(chi.URLParam(r, "id"), 10, 64)
id, err := strconv.ParseInt(mux.Vars(r)["id"], 10, 64)
if err != nil {
handleError(fmt.Errorf("parsing job ID failed: %w", err), http.StatusBadRequest, rw)
return
@@ -662,7 +606,7 @@ func (api *RestAPI) removeTagJob(rw http.ResponseWriter, r *http.Request) {
// @description Tags will be removed from respective archive files.
// @accept json
// @produce plain
// @param request body api.TagJobAPIRequest true "Array of tag-objects to remove"
// @param request body api.TagJobApiRequest true "Array of tag-objects to remove"
// @success 200 {string} string "Success Response"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
@@ -706,7 +650,7 @@ func (api *RestAPI) removeTags(rw http.ResponseWriter, r *http.Request) {
// @accept json
// @produce json
// @param request body schema.Job true "Job to add"
// @success 201 {object} api.DefaultAPIResponse "Job added successfully"
// @success 201 {object} api.DefaultApiResponse "Job added successfully"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -747,21 +691,13 @@ func (api *RestAPI) startJob(rw http.ResponseWriter, r *http.Request) {
for _, job := range jobs {
// Check if jobs are within the same day (prevent duplicates)
if (req.StartTime - job.StartTime) < secondsPerDay {
handleError(fmt.Errorf("a job with that jobId, cluster and startTime already exists: dbid: %d, jobid: %d", *job.ID, job.JobID), http.StatusUnprocessableEntity, rw)
handleError(fmt.Errorf("a job with that jobId, cluster and startTime already exists: dbid: %d, jobid: %d", job.ID, job.JobID), http.StatusUnprocessableEntity, rw)
return
}
}
}
// When tags are present, insert directly into the job table so that the
// returned ID can be used with AddTagOrCreate (which queries the job table).
// Jobs without tags use the cache path as before.
var id int64
if len(req.Tags) > 0 {
id, err = api.JobRepository.StartDirect(&req)
} else {
id, err = api.JobRepository.Start(&req)
}
id, err := api.JobRepository.Start(&req)
if err != nil {
handleError(fmt.Errorf("insert into database failed: %w", err), http.StatusInternalServerError, rw)
return
@@ -792,7 +728,7 @@ func (api *RestAPI) startJob(rw http.ResponseWriter, r *http.Request) {
// @description Job to stop is specified by request body. All fields are required in this case.
// @description Returns full job resource information according to 'Job' scheme.
// @produce json
// @param request body api.StopJobAPIRequest true "All fields required"
// @param request body api.StopJobApiRequest true "All fields required"
// @success 200 {object} schema.Job "Success message"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
@@ -818,20 +754,20 @@ func (api *RestAPI) stopJobByRequest(rw http.ResponseWriter, r *http.Request) {
return
}
isCached := false
job, err = api.JobRepository.FindCached(req.JobID, req.Cluster, req.StartTime)
// cclog.Printf("loading db job for stopJobByRequest... : stopJobApiRequest=%v", req)
job, err = api.JobRepository.Find(req.JobID, req.Cluster, req.StartTime)
if err != nil {
// Not in cache, try main job table
job, err = api.JobRepository.Find(req.JobID, req.Cluster, req.StartTime)
if err != nil {
handleError(fmt.Errorf("finding job failed: %w", err), http.StatusNotFound, rw)
// Try cached jobs if not found in main repository
cachedJob, cachedErr := api.JobRepository.FindCached(req.JobID, req.Cluster, req.StartTime)
if cachedErr != nil {
// Combine both errors for better debugging
handleError(fmt.Errorf("finding job failed: %w (cached lookup also failed: %v)", err, cachedErr), http.StatusNotFound, rw)
return
}
} else {
isCached = true
job = cachedJob
}
api.checkAndHandleStopJob(rw, job, req, isCached)
api.checkAndHandleStopJob(rw, job, req)
}
// deleteJobByID godoc
@@ -840,7 +776,7 @@ func (api *RestAPI) stopJobByRequest(rw http.ResponseWriter, r *http.Request) {
// @description Job to remove is specified by database ID. This will not remove the job from the job archive.
// @produce json
// @param id path int true "Database ID of Job"
// @success 200 {object} api.DefaultAPIResponse "Success message"
// @success 200 {object} api.DefaultApiResponse "Success message"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -851,9 +787,9 @@ func (api *RestAPI) stopJobByRequest(rw http.ResponseWriter, r *http.Request) {
// @router /api/jobs/delete_job/{id} [delete]
func (api *RestAPI) deleteJobByID(rw http.ResponseWriter, r *http.Request) {
// Fetch job (that will be stopped) from db
id := chi.URLParam(r, "id")
id, ok := mux.Vars(r)["id"]
var err error
if id != "" {
if ok {
id, e := strconv.ParseInt(id, 10, 64)
if e != nil {
handleError(fmt.Errorf("integer expected in path for id: %w", e), http.StatusBadRequest, rw)
@@ -884,8 +820,8 @@ func (api *RestAPI) deleteJobByID(rw http.ResponseWriter, r *http.Request) {
// @description Job to delete is specified by request body. All fields are required in this case.
// @accept json
// @produce json
// @param request body api.DeleteJobAPIRequest true "All fields required"
// @success 200 {object} api.DefaultAPIResponse "Success message"
// @param request body api.DeleteJobApiRequest true "All fields required"
// @success 200 {object} api.DefaultApiResponse "Success message"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -925,7 +861,7 @@ func (api *RestAPI) deleteJobByRequest(rw http.ResponseWriter, r *http.Request)
rw.Header().Add("Content-Type", "application/json")
rw.WriteHeader(http.StatusOK)
if err := json.NewEncoder(rw).Encode(DefaultAPIResponse{
Message: fmt.Sprintf("Successfully deleted job %d", *job.ID),
Message: fmt.Sprintf("Successfully deleted job %d", job.ID),
}); err != nil {
cclog.Errorf("Failed to encode response: %v", err)
}
@@ -937,7 +873,7 @@ func (api *RestAPI) deleteJobByRequest(rw http.ResponseWriter, r *http.Request)
// @description Remove all jobs with start time before timestamp. The jobs will not be removed from the job archive.
// @produce json
// @param ts path int true "Unix epoch timestamp"
// @success 200 {object} api.DefaultAPIResponse "Success message"
// @success 200 {object} api.DefaultApiResponse "Success message"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -950,9 +886,9 @@ func (api *RestAPI) deleteJobByRequest(rw http.ResponseWriter, r *http.Request)
func (api *RestAPI) deleteJobBefore(rw http.ResponseWriter, r *http.Request) {
var cnt int
// Fetch job (that will be stopped) from db
id := chi.URLParam(r, "ts")
id, ok := mux.Vars(r)["ts"]
var err error
if id != "" {
if ok {
ts, e := strconv.ParseInt(id, 10, 64)
if e != nil {
handleError(fmt.Errorf("integer expected in path for ts: %w", e), http.StatusBadRequest, rw)
@@ -960,13 +896,11 @@ func (api *RestAPI) deleteJobBefore(rw http.ResponseWriter, r *http.Request) {
}
// Check for omit-tagged query parameter
omitTagged := "none"
omitTagged := false
if omitTaggedStr := r.URL.Query().Get("omit-tagged"); omitTaggedStr != "" {
switch omitTaggedStr {
case "none", "all", "user":
omitTagged = omitTaggedStr
default:
handleError(fmt.Errorf("omit-tagged must be one of: none, all, user"), http.StatusBadRequest, rw)
omitTagged, e = strconv.ParseBool(omitTaggedStr)
if e != nil {
handleError(fmt.Errorf("boolean expected for omit-tagged parameter: %w", e), http.StatusBadRequest, rw)
return
}
}
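The hunk above replaces the boolean `omit-tagged` flag with an enum-style value. A minimal sketch of that parameter handling, assuming the semantics shown in the diff (an absent value defaults to "none", the three known values pass through, anything else is rejected; the helper name is illustrative, not part of cc-backend):

```go
package main

import "fmt"

// parseOmitTagged validates the omit-tagged query parameter against the
// allowed enum values and applies the "none" default for an empty value.
func parseOmitTagged(raw string) (string, error) {
	if raw == "" {
		return "none", nil // parameter not supplied
	}
	switch raw {
	case "none", "all", "user":
		return raw, nil
	default:
		return "", fmt.Errorf("omit-tagged must be one of: none, all, user")
	}
}

func main() {
	v, _ := parseOmitTagged("user")
	fmt.Println(v) // user
	_, err := parseOmitTagged("yes")
	fmt.Println(err != nil) // true
}
```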
@@ -990,20 +924,20 @@ func (api *RestAPI) deleteJobBefore(rw http.ResponseWriter, r *http.Request) {
}
}
func (api *RestAPI) checkAndHandleStopJob(rw http.ResponseWriter, job *schema.Job, req StopJobAPIRequest, isCached bool) {
func (api *RestAPI) checkAndHandleStopJob(rw http.ResponseWriter, job *schema.Job, req StopJobAPIRequest) {
// Sanity checks
if job.State != schema.JobStateRunning {
handleError(fmt.Errorf("jobId %d (id %d) on %s : job has already been stopped (state is: %s)", job.JobID, *job.ID, job.Cluster, job.State), http.StatusUnprocessableEntity, rw)
handleError(fmt.Errorf("jobId %d (id %d) on %s : job has already been stopped (state is: %s)", job.JobID, job.ID, job.Cluster, job.State), http.StatusUnprocessableEntity, rw)
return
}
if job.StartTime > req.StopTime {
handleError(fmt.Errorf("jobId %d (id %d) on %s : stopTime %d must be larger/equal than startTime %d", job.JobID, *job.ID, job.Cluster, req.StopTime, job.StartTime), http.StatusBadRequest, rw)
handleError(fmt.Errorf("jobId %d (id %d) on %s : stopTime %d must be larger/equal than startTime %d", job.JobID, job.ID, job.Cluster, req.StopTime, job.StartTime), http.StatusBadRequest, rw)
return
}
if req.State != "" && !req.State.Valid() {
handleError(fmt.Errorf("jobId %d (id %d) on %s : invalid requested job state: %#v", job.JobID, *job.ID, job.Cluster, req.State), http.StatusBadRequest, rw)
handleError(fmt.Errorf("jobId %d (id %d) on %s : invalid requested job state: %#v", job.JobID, job.ID, job.Cluster, req.State), http.StatusBadRequest, rw)
return
} else if req.State == "" {
req.State = schema.JobStateCompleted
@@ -1015,24 +949,14 @@ func (api *RestAPI) checkAndHandleStopJob(rw http.ResponseWriter, job *schema.Jo
api.JobRepository.Mutex.Lock()
defer api.JobRepository.Mutex.Unlock()
// If the job is still in job_cache, transfer it to the job table first
// so that job.ID always points to the job table for downstream code
if isCached {
newID, err := api.JobRepository.TransferCachedJobToMain(*job.ID)
if err != nil {
handleError(fmt.Errorf("jobId %d (id %d) on %s : transferring cached job failed: %w", job.JobID, *job.ID, job.Cluster, err), http.StatusInternalServerError, rw)
if err := api.JobRepository.Stop(*job.ID, job.Duration, job.State, job.MonitoringStatus); err != nil {
if err := api.JobRepository.StopCached(*job.ID, job.Duration, job.State, job.MonitoringStatus); err != nil {
handleError(fmt.Errorf("jobId %d (id %d) on %s : marking job as '%s' (duration: %d) in DB failed: %w", job.JobID, job.ID, job.Cluster, job.State, job.Duration, err), http.StatusInternalServerError, rw)
return
}
cclog.Infof("transferred cached job to main table: old id %d -> new id %d (jobId=%d)", *job.ID, newID, job.JobID)
job.ID = &newID
}
if err := api.JobRepository.Stop(*job.ID, job.Duration, job.State, job.MonitoringStatus); err != nil {
handleError(fmt.Errorf("jobId %d (id %d) on %s : marking job as '%s' (duration: %d) in DB failed: %w", job.JobID, *job.ID, job.Cluster, job.State, job.Duration, err), http.StatusInternalServerError, rw)
return
}
cclog.Infof("archiving job... (dbid: %d): cluster=%s, jobId=%d, user=%s, startTime=%d, duration=%d, state=%s", *job.ID, job.Cluster, job.JobID, job.User, job.StartTime, job.Duration, job.State)
cclog.Infof("archiving job... (dbid: %d): cluster=%s, jobId=%d, user=%s, startTime=%d, duration=%d, state=%s", job.ID, job.Cluster, job.JobID, job.User, job.StartTime, job.Duration, job.State)
// Send a response (with status OK). This means that errors that happen from here on forward
// can *NOT* be communicated to the client. If reading from a MetricDataRepository or
@@ -1053,7 +977,7 @@ func (api *RestAPI) checkAndHandleStopJob(rw http.ResponseWriter, job *schema.Jo
}
func (api *RestAPI) getJobMetrics(rw http.ResponseWriter, r *http.Request) {
id := chi.URLParam(r, "id")
id := mux.Vars(r)["id"]
metrics := r.URL.Query()["metric"]
var scopes []schema.MetricScope
for _, scope := range r.URL.Query()["scope"] {
@@ -1098,57 +1022,3 @@ func (api *RestAPI) getJobMetrics(rw http.ResponseWriter, r *http.Request) {
cclog.Errorf("Failed to encode response: %v", err)
}
}
// GetUsedNodesAPIResponse model
type GetUsedNodesAPIResponse struct {
UsedNodes map[string][]string `json:"usedNodes"` // Map of cluster names to lists of used node hostnames
}
// getUsedNodes godoc
// @summary Lists used nodes by cluster
// @tags Job query
// @description Get a map of cluster names to lists of unique hostnames that are currently in use by running jobs that started before the specified timestamp.
// @produce json
// @param ts query int true "Unix timestamp to filter jobs (jobs with start_time < ts)"
// @success 200 {object} api.GetUsedNodesAPIResponse "Map of cluster names to hostname lists"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
// @failure 500 {object} api.ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /api/jobs/used_nodes [get]
func (api *RestAPI) getUsedNodes(rw http.ResponseWriter, r *http.Request) {
if user := repository.GetUserFromContext(r.Context()); user != nil &&
!user.HasRole(schema.RoleAPI) {
handleError(fmt.Errorf("missing role: %v", schema.GetRoleString(schema.RoleAPI)), http.StatusForbidden, rw)
return
}
tsStr := r.URL.Query().Get("ts")
if tsStr == "" {
handleError(fmt.Errorf("missing required query parameter: ts"), http.StatusBadRequest, rw)
return
}
ts, err := strconv.ParseInt(tsStr, 10, 64)
if err != nil {
handleError(fmt.Errorf("invalid timestamp format: %w", err), http.StatusBadRequest, rw)
return
}
usedNodes, err := api.JobRepository.GetUsedNodes(ts)
if err != nil {
handleError(fmt.Errorf("failed to get used nodes: %w", err), http.StatusInternalServerError, rw)
return
}
rw.Header().Add("Content-Type", "application/json")
payload := GetUsedNodesAPIResponse{
UsedNodes: usedNodes,
}
if err := json.NewEncoder(rw).Encode(payload); err != nil {
handleError(err, http.StatusInternalServerError, rw)
return
}
}
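The response is a single JSON object keyed by cluster name. A minimal client-side sketch of decoding it (the cluster and node names below are made up for illustration):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// GetUsedNodesAPIResponse mirrors the response shape of /api/jobs/used_nodes.
type GetUsedNodesAPIResponse struct {
	UsedNodes map[string][]string `json:"usedNodes"`
}

// decodeUsedNodes unmarshals a used_nodes response body into the
// cluster -> hostnames map.
func decodeUsedNodes(body []byte) (map[string][]string, error) {
	var resp GetUsedNodesAPIResponse
	if err := json.Unmarshal(body, &resp); err != nil {
		return nil, err
	}
	return resp.UsedNodes, nil
}

func main() {
	body := []byte(`{"usedNodes":{"fritz":["f0101","f0102"],"alex":["a0201"]}}`)
	nodes, err := decodeUsedNodes(body)
	if err != nil {
		panic(err)
	}
	fmt.Println(len(nodes["fritz"]), len(nodes["alex"]))
}
```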


@@ -1,165 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package api
import (
"bufio"
"encoding/json"
"fmt"
"net/http"
"os/exec"
"regexp"
"strconv"
"strings"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
)
type LogEntry struct {
Timestamp string `json:"timestamp"`
Priority int `json:"priority"`
Message string `json:"message"`
Unit string `json:"unit"`
}
// safePattern allow-lists the characters accepted in user-supplied
// journalctl arguments ('since', 'search') to prevent argument injection.
var safePattern = regexp.MustCompile(`^[a-zA-Z0-9 :\-\.]+$`)
func (api *RestAPI) getJournalLog(rw http.ResponseWriter, r *http.Request) {
user := repository.GetUserFromContext(r.Context())
if !user.HasRole(schema.RoleAdmin) {
handleError(fmt.Errorf("only admins are allowed to view logs"), http.StatusForbidden, rw)
return
}
since := r.URL.Query().Get("since")
if since == "" {
since = "1 hour ago"
}
if !safePattern.MatchString(since) {
handleError(fmt.Errorf("invalid 'since' parameter"), http.StatusBadRequest, rw)
return
}
lines := 200
if l := r.URL.Query().Get("lines"); l != "" {
n, err := strconv.Atoi(l)
if err != nil || n < 1 {
handleError(fmt.Errorf("invalid 'lines' parameter"), http.StatusBadRequest, rw)
return
}
if n > 1000 {
n = 1000
}
lines = n
}
unit := config.Keys.SystemdUnit
if unit == "" {
unit = "clustercockpit.service"
}
args := []string{
"--output=json",
"--no-pager",
"-n", fmt.Sprintf("%d", lines),
"--since", since,
"-u", unit,
}
if level := r.URL.Query().Get("level"); level != "" {
n, err := strconv.Atoi(level)
if err != nil || n < 0 || n > 7 {
handleError(fmt.Errorf("invalid 'level' parameter (must be 0-7)"), http.StatusBadRequest, rw)
return
}
args = append(args, "--priority", fmt.Sprintf("%d", n))
}
if search := r.URL.Query().Get("search"); search != "" {
if !safePattern.MatchString(search) {
handleError(fmt.Errorf("invalid 'search' parameter"), http.StatusBadRequest, rw)
return
}
args = append(args, "--grep", search)
}
cclog.Debugf("calling journalctl with %s", strings.Join(args, " "))
cmd := exec.CommandContext(r.Context(), "journalctl", args...)
stdout, err := cmd.StdoutPipe()
if err != nil {
handleError(fmt.Errorf("failed to create pipe: %w", err), http.StatusInternalServerError, rw)
return
}
if err := cmd.Start(); err != nil {
handleError(fmt.Errorf("failed to start journalctl: %w", err), http.StatusInternalServerError, rw)
return
}
entries := make([]LogEntry, 0, lines)
scanner := bufio.NewScanner(stdout)
for scanner.Scan() {
var raw map[string]any
if err := json.Unmarshal(scanner.Bytes(), &raw); err != nil {
cclog.Debugf("error unmarshal log output: %v", err)
continue
}
priority := 6 // default info
if p, ok := raw["PRIORITY"]; ok {
switch v := p.(type) {
case string:
if n, err := strconv.Atoi(v); err == nil {
priority = n
}
case float64:
priority = int(v)
}
}
msg := ""
if m, ok := raw["MESSAGE"]; ok {
if s, ok := m.(string); ok {
msg = s
}
}
ts := ""
if t, ok := raw["__REALTIME_TIMESTAMP"]; ok {
if s, ok := t.(string); ok {
ts = s
}
}
unitName := ""
if u, ok := raw["_SYSTEMD_UNIT"]; ok {
if s, ok := u.(string); ok {
unitName = s
}
}
entries = append(entries, LogEntry{
Timestamp: ts,
Priority: priority,
Message: msg,
Unit: unitName,
})
}
if err := cmd.Wait(); err != nil {
// journalctl returns exit code 1 when --grep matches nothing
if len(entries) == 0 {
cclog.Debugf("journalctl exited with: %v", err)
}
}
rw.Header().Set("Content-Type", "application/json")
if err := json.NewEncoder(rw).Encode(entries); err != nil {
cclog.Errorf("Failed to encode log entries: %v", err)
}
}
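The PRIORITY handling above is subtly load-bearing: journald emits the field as a string, but after a generic unmarshal into `map[string]any` a numeric JSON value surfaces as `float64`, so both representations must be accepted. A standalone sketch of the same normalization:

```go
package main

import (
	"encoding/json"
	"fmt"
	"strconv"
)

// priorityOf replicates the handler's tolerant PRIORITY parsing: string
// and numeric values are both accepted; anything missing or invalid
// falls back to 6 (syslog "info").
func priorityOf(raw map[string]any) int {
	priority := 6
	if p, ok := raw["PRIORITY"]; ok {
		switch v := p.(type) {
		case string:
			if n, err := strconv.Atoi(v); err == nil {
				priority = n
			}
		case float64:
			priority = int(v)
		}
	}
	return priority
}

func main() {
	var a, b, c map[string]any
	json.Unmarshal([]byte(`{"PRIORITY":"3"}`), &a)
	json.Unmarshal([]byte(`{"PRIORITY":4}`), &b)
	json.Unmarshal([]byte(`{"MESSAGE":"no priority"}`), &c)
	fmt.Println(priorityOf(a), priorityOf(b), priorityOf(c))
}
```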


@@ -10,14 +10,15 @@ import (
"encoding/json"
"errors"
"fmt"
"io"
"net/http"
"strconv"
"strings"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-backend/internal/memorystore"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-line-protocol/v2/lineprotocol"
"github.com/influxdata/line-protocol/v2/lineprotocol"
)
// handleFree godoc
@@ -57,7 +58,7 @@ func freeMetrics(rw http.ResponseWriter, r *http.Request) {
return
}
ms := metricstore.GetMemoryStore()
ms := memorystore.GetMemoryStore()
n := 0
for _, sel := range selectors {
bn, err := ms.Free(sel, to)
@@ -89,17 +90,16 @@ func freeMetrics(rw http.ResponseWriter, r *http.Request) {
// @security ApiKeyAuth
// @router /write/ [post]
func writeMetrics(rw http.ResponseWriter, r *http.Request) {
bytes, err := io.ReadAll(r.Body)
rw.Header().Add("Content-Type", "application/json")
if err != nil {
handleError(err, http.StatusInternalServerError, rw)
return
}
// Extract the "cluster" query parameter without allocating a url.Values map.
cluster := queryParam(r.URL.RawQuery, "cluster")
// Stream directly from the request body instead of copying it into a
// temporary buffer via io.ReadAll. The line-protocol decoder supports
// io.Reader natively, so this avoids the largest heap allocation.
ms := metricstore.GetMemoryStore()
dec := lineprotocol.NewDecoder(r.Body)
if err := metricstore.DecodeLine(dec, ms, cluster); err != nil {
ms := memorystore.GetMemoryStore()
dec := lineprotocol.NewDecoderWithBytes(bytes)
if err := memorystore.DecodeLine(dec, ms, r.URL.Query().Get("cluster")); err != nil {
cclog.Errorf("/api/write error: %s", err.Error())
handleError(err, http.StatusBadRequest, rw)
return
@@ -107,20 +107,6 @@ func writeMetrics(rw http.ResponseWriter, r *http.Request) {
rw.WriteHeader(http.StatusOK)
}
// queryParam extracts a single query-parameter value from a raw query string
// without allocating a url.Values map. Returns "" if the key is not present.
func queryParam(raw, key string) string {
for raw != "" {
var kv string
kv, raw, _ = strings.Cut(raw, "&")
k, v, _ := strings.Cut(kv, "=")
if k == key {
return v
}
}
return ""
}
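The helper is easy to exercise in isolation. Note that it deliberately skips percent-decoding, which is fine for plain cluster names but would matter for escaped values. A self-contained sketch reproducing it:

```go
package main

import (
	"fmt"
	"strings"
)

// queryParam scans the raw query string with strings.Cut instead of
// url.ParseQuery, so no url.Values map is allocated on the hot write
// path. Values are returned as-is, without percent-decoding.
func queryParam(raw, key string) string {
	for raw != "" {
		var kv string
		kv, raw, _ = strings.Cut(raw, "&")
		k, v, _ := strings.Cut(kv, "=")
		if k == key {
			return v
		}
	}
	return ""
}

func main() {
	fmt.Println(queryParam("cluster=fritz&node=f0101", "cluster"))
	fmt.Println(queryParam("node=f0101", "cluster") == "")
}
```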
// handleDebug godoc
// @summary Debug endpoint
// @tags debug
@@ -143,9 +129,42 @@ func debugMetrics(rw http.ResponseWriter, r *http.Request) {
selector = strings.Split(raw, ":")
}
ms := metricstore.GetMemoryStore()
ms := memorystore.GetMemoryStore()
if err := ms.DebugDump(bufio.NewWriter(rw), selector); err != nil {
handleError(err, http.StatusBadRequest, rw)
return
}
}
// handleHealthCheck godoc
// @summary HealthCheck endpoint
// @tags healthcheck
// @description Allows users to check whether a node is healthy
// @produce json
// @param cluster query string true "Cluster name"
// @param node query string true "Node hostname"
// @success 200 {string} string "Health check status"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
// @failure 500 {object} api.ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /healthcheck/ [get]
func metricsHealth(rw http.ResponseWriter, r *http.Request) {
rawCluster := r.URL.Query().Get("cluster")
rawNode := r.URL.Query().Get("node")
if rawCluster == "" || rawNode == "" {
		handleError(errors.New("'cluster' and 'node' are required query parameters"), http.StatusBadRequest, rw)
return
}
rw.Header().Add("Content-Type", "application/json")
selector := []string{rawCluster, rawNode}
ms := memorystore.GetMemoryStore()
if err := ms.HealthCheck(bufio.NewWriter(rw), selector); err != nil {
handleError(err, http.StatusBadRequest, rw)
return
}
}


@@ -6,9 +6,9 @@
package api
import (
"bytes"
"database/sql"
"encoding/json"
"strings"
"sync"
"time"
@@ -16,49 +16,13 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/importer"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
lp "github.com/ClusterCockpit/cc-lib/v2/ccMessage"
"github.com/ClusterCockpit/cc-lib/v2/nats"
"github.com/ClusterCockpit/cc-lib/v2/receivers"
"github.com/ClusterCockpit/cc-lib/v2/schema"
influx "github.com/ClusterCockpit/cc-line-protocol/v2/lineprotocol"
"github.com/ClusterCockpit/cc-backend/pkg/nats"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
// NatsAPI provides NATS subscription-based handlers for Job and Node operations.
// It mirrors the functionality of the REST API but uses NATS messaging with
// InfluxDB line protocol as the message format.
//
// # Message Format
//
// All NATS messages use InfluxDB line protocol format (https://docs.influxdata.com/influxdb/v2.0/reference/syntax/line-protocol/)
// with the following structure:
//
// measurement,tag1=value1,tag2=value2 field1=value1,field2=value2 timestamp
//
// # Job Events
//
// Job start/stop events use the "job" measurement with a "function" tag to distinguish operations:
//
// job,function=start_job event="{...JSON payload...}" <timestamp>
// job,function=stop_job event="{...JSON payload...}" <timestamp>
//
// The JSON payload in the "event" field follows the schema.Job or StopJobAPIRequest structure.
//
// Example job start message:
//
// job,function=start_job event="{\"jobId\":1001,\"user\":\"testuser\",\"cluster\":\"testcluster\",...}" 1234567890000000000
//
// # Node State Events
//
// Node state updates use the "nodestate" measurement with cluster information:
//
// nodestate event="{...JSON payload...}" <timestamp>
//
// The JSON payload follows the UpdateNodeStatesRequest structure.
//
// Example node state message:
//
// nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[{\"hostname\":\"node01\",\"states\":[\"idle\"]}]}" 1234567890000000000
// It mirrors the functionality of the REST API but uses NATS messaging.
type NatsAPI struct {
JobRepository *repository.JobRepository
// RepositoryMutex protects job creation operations from race conditions
@@ -86,7 +50,11 @@ func (api *NatsAPI) StartSubscriptions() error {
s := config.Keys.APISubjects
if err := client.Subscribe(s.SubjectJobEvent, api.handleJobEvent); err != nil {
if err := client.Subscribe(s.SubjectJobStart, api.handleStartJob); err != nil {
return err
}
if err := client.Subscribe(s.SubjectJobStop, api.handleStopJob); err != nil {
return err
}
@@ -99,96 +67,26 @@ func (api *NatsAPI) StartSubscriptions() error {
return nil
}
// processJobEvent routes job event messages to the appropriate handler based on the "function" tag.
// Validates that required tags and fields are present before processing.
func (api *NatsAPI) processJobEvent(msg lp.CCMessage) {
function, ok := msg.GetTag("function")
if !ok {
cclog.Errorf("Job event is missing required tag 'function': measurement=%s", msg.Name())
return
}
switch function {
case "start_job":
v, ok := msg.GetEventValue()
if !ok {
cclog.Errorf("Job start event is missing event field with JSON payload")
return
}
api.handleStartJob(v)
case "stop_job":
v, ok := msg.GetEventValue()
if !ok {
cclog.Errorf("Job stop event is missing event field with JSON payload")
return
}
api.handleStopJob(v)
default:
cclog.Warnf("Unknown job event function '%s', expected 'start_job' or 'stop_job'", function)
}
}
// handleJobEvent processes job-related messages received via NATS using InfluxDB line protocol.
// The message must be in line protocol format with measurement="job" and include:
// - tag "function" with value "start_job" or "stop_job"
// - field "event" containing JSON payload (schema.Job or StopJobAPIRequest)
//
// Example: job,function=start_job event="{\"jobId\":1001,...}" 1234567890000000000
func (api *NatsAPI) handleJobEvent(subject string, data []byte) {
if len(data) == 0 {
cclog.Warnf("NATS %s: received empty message", subject)
return
}
d := influx.NewDecoderWithBytes(data)
for d.Next() {
m, err := receivers.DecodeInfluxMessage(d)
if err != nil {
cclog.Errorf("NATS %s: failed to decode InfluxDB line protocol message: %v", subject, err)
return
}
if !m.IsEvent() {
cclog.Debugf("NATS %s: received non-event message, skipping", subject)
continue
}
if m.Name() == "job" {
api.processJobEvent(m)
} else {
cclog.Debugf("NATS %s: unexpected measurement name '%s', expected 'job'", subject, m.Name())
}
}
}
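For illustration, the documented wire format can be composed by hand: Go's `strconv.Quote` escapes embedded quotes and backslashes the same way a line-protocol string field does (for payloads without control characters). This is only a sketch; the production code encodes and decodes these messages with the cc-line-protocol library instead.

```go
package main

import (
	"fmt"
	"strconv"
)

// buildJobEvent composes a NATS job-event message in the documented
// InfluxDB line-protocol shape:
//
//	job,function=<fn> event="<escaped JSON>" <timestamp>
func buildJobEvent(function, jsonPayload string, ts int64) string {
	return fmt.Sprintf("job,function=%s event=%s %d",
		function, strconv.Quote(jsonPayload), ts)
}

func main() {
	msg := buildJobEvent("stop_job",
		`{"jobId":1001,"cluster":"testcluster","stopTime":1234571490}`,
		1234567890000000000)
	fmt.Println(msg)
}
```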
// handleStartJob processes job start messages received via NATS.
// The payload parameter contains JSON following the schema.Job structure.
// Jobs are validated, checked for duplicates, and inserted into the database.
func (api *NatsAPI) handleStartJob(payload string) {
if payload == "" {
cclog.Error("NATS start job: payload is empty")
return
}
// Expected JSON payload follows the schema.Job structure.
func (api *NatsAPI) handleStartJob(subject string, data []byte) {
req := schema.Job{
Shared: "none",
MonitoringStatus: schema.MonitoringStatusRunningOrArchiving,
}
dec := json.NewDecoder(strings.NewReader(payload))
dec := json.NewDecoder(bytes.NewReader(data))
dec.DisallowUnknownFields()
if err := dec.Decode(&req); err != nil {
cclog.Errorf("NATS start job: parsing request failed: %v", err)
cclog.Errorf("NATS %s: parsing request failed: %v", subject, err)
return
}
cclog.Debugf("NATS start job: %s", req.GoString())
cclog.Debugf("NATS %s: %s", subject, req.GoString())
req.State = schema.JobStateRunning
if err := importer.SanityChecks(&req); err != nil {
cclog.Errorf("NATS start job: sanity check failed: %v", err)
cclog.Errorf("NATS %s: sanity check failed: %v", subject, err)
return
}
@@ -198,36 +96,29 @@ func (api *NatsAPI) handleStartJob(payload string) {
jobs, err := api.JobRepository.FindAll(&req.JobID, &req.Cluster, nil)
if err != nil && err != sql.ErrNoRows {
cclog.Errorf("NATS start job: checking for duplicate failed: %v", err)
cclog.Errorf("NATS %s: checking for duplicate failed: %v", subject, err)
return
}
if err == nil {
for _, job := range jobs {
if (req.StartTime - job.StartTime) < secondsPerDay {
cclog.Errorf("NATS start job: job with jobId %d, cluster %s already exists (dbid: %d)",
req.JobID, req.Cluster, job.ID)
cclog.Errorf("NATS %s: job with jobId %d, cluster %s already exists (dbid: %d)",
subject, req.JobID, req.Cluster, job.ID)
return
}
}
}
// When tags are present, insert directly into the job table so that the
// returned ID can be used with AddTagOrCreate (which queries the job table).
var id int64
if len(req.Tags) > 0 {
id, err = api.JobRepository.StartDirect(&req)
} else {
id, err = api.JobRepository.Start(&req)
}
id, err := api.JobRepository.Start(&req)
if err != nil {
cclog.Errorf("NATS start job: insert into database failed: %v", err)
cclog.Errorf("NATS %s: insert into database failed: %v", subject, err)
return
}
unlockOnce.Do(api.RepositoryMutex.Unlock)
for _, tag := range req.Tags {
if _, err := api.JobRepository.AddTagOrCreate(nil, id, tag.Type, tag.Name, tag.Scope); err != nil {
cclog.Errorf("NATS start job: adding tag to new job %d failed: %v", id, err)
cclog.Errorf("NATS %s: adding tag to new job %d failed: %v", subject, id, err)
return
}
}
@@ -237,55 +128,48 @@ func (api *NatsAPI) handleStartJob(payload string) {
}
// handleStopJob processes job stop messages received via NATS.
// The payload parameter contains JSON following the StopJobAPIRequest structure.
// The job is marked as stopped in the database and archiving is triggered if monitoring is enabled.
func (api *NatsAPI) handleStopJob(payload string) {
if payload == "" {
cclog.Error("NATS stop job: payload is empty")
return
}
// Expected JSON payload follows the StopJobAPIRequest structure.
func (api *NatsAPI) handleStopJob(subject string, data []byte) {
var req StopJobAPIRequest
dec := json.NewDecoder(strings.NewReader(payload))
dec := json.NewDecoder(bytes.NewReader(data))
dec.DisallowUnknownFields()
if err := dec.Decode(&req); err != nil {
cclog.Errorf("NATS job stop: parsing request failed: %v", err)
cclog.Errorf("NATS %s: parsing request failed: %v", subject, err)
return
}
if req.JobID == nil {
cclog.Errorf("NATS job stop: the field 'jobId' is required")
cclog.Errorf("NATS %s: the field 'jobId' is required", subject)
return
}
isCached := false
job, err := api.JobRepository.FindCached(req.JobID, req.Cluster, req.StartTime)
job, err := api.JobRepository.Find(req.JobID, req.Cluster, req.StartTime)
if err != nil {
// Not in cache, try main job table
job, err = api.JobRepository.Find(req.JobID, req.Cluster, req.StartTime)
if err != nil {
cclog.Errorf("NATS job stop: finding job failed: %v", err)
cachedJob, cachedErr := api.JobRepository.FindCached(req.JobID, req.Cluster, req.StartTime)
if cachedErr != nil {
cclog.Errorf("NATS %s: finding job failed: %v (cached lookup also failed: %v)",
subject, err, cachedErr)
return
}
} else {
isCached = true
job = cachedJob
}
if job.State != schema.JobStateRunning {
cclog.Errorf("NATS job stop: jobId %d (id %d) on %s: job has already been stopped (state is: %s)",
job.JobID, job.ID, job.Cluster, job.State)
cclog.Errorf("NATS %s: jobId %d (id %d) on %s: job has already been stopped (state is: %s)",
subject, job.JobID, job.ID, job.Cluster, job.State)
return
}
if job.StartTime > req.StopTime {
cclog.Errorf("NATS job stop: jobId %d (id %d) on %s: stopTime %d must be >= startTime %d",
job.JobID, job.ID, job.Cluster, req.StopTime, job.StartTime)
cclog.Errorf("NATS %s: jobId %d (id %d) on %s: stopTime %d must be >= startTime %d",
subject, job.JobID, job.ID, job.Cluster, req.StopTime, job.StartTime)
return
}
if req.State != "" && !req.State.Valid() {
cclog.Errorf("NATS job stop: jobId %d (id %d) on %s: invalid job state: %#v",
job.JobID, job.ID, job.Cluster, req.State)
cclog.Errorf("NATS %s: jobId %d (id %d) on %s: invalid job state: %#v",
subject, job.JobID, job.ID, job.Cluster, req.State)
return
} else if req.State == "" {
req.State = schema.JobStateCompleted
@@ -296,26 +180,16 @@ func (api *NatsAPI) handleStopJob(payload string) {
api.JobRepository.Mutex.Lock()
defer api.JobRepository.Mutex.Unlock()
// If the job is still in job_cache, transfer it to the job table first
if isCached {
newID, err := api.JobRepository.TransferCachedJobToMain(*job.ID)
if err != nil {
cclog.Errorf("NATS job stop: jobId %d (id %d) on %s: transferring cached job failed: %v",
job.JobID, *job.ID, job.Cluster, err)
if err := api.JobRepository.Stop(*job.ID, job.Duration, job.State, job.MonitoringStatus); err != nil {
if err := api.JobRepository.StopCached(*job.ID, job.Duration, job.State, job.MonitoringStatus); err != nil {
cclog.Errorf("NATS %s: jobId %d (id %d) on %s: marking job as '%s' failed: %v",
subject, job.JobID, job.ID, job.Cluster, job.State, err)
return
}
cclog.Infof("NATS: transferred cached job to main table: old id %d -> new id %d (jobId=%d)", *job.ID, newID, job.JobID)
job.ID = &newID
}
if err := api.JobRepository.Stop(*job.ID, job.Duration, job.State, job.MonitoringStatus); err != nil {
cclog.Errorf("NATS job stop: jobId %d (id %d) on %s: marking job as '%s' failed: %v",
job.JobID, *job.ID, job.Cluster, job.State, err)
return
}
cclog.Infof("NATS: archiving job (dbid: %d): cluster=%s, jobId=%d, user=%s, startTime=%d, duration=%d, state=%s",
*job.ID, job.Cluster, job.JobID, job.User, job.StartTime, job.Duration, job.State)
job.ID, job.Cluster, job.JobID, job.User, job.StartTime, job.Duration, job.State)
if job.MonitoringStatus == schema.MonitoringStatusDisabled {
return
@@ -324,31 +198,24 @@ func (api *NatsAPI) handleStopJob(payload string) {
archiver.TriggerArchiving(job)
}
// processNodestateEvent extracts and processes node state data from the InfluxDB message.
// Updates node states in the repository for all nodes in the payload.
func (api *NatsAPI) processNodestateEvent(msg lp.CCMessage) {
v, ok := msg.GetEventValue()
if !ok {
cclog.Errorf("Nodestate event is missing event field with JSON payload")
return
}
// handleNodeState processes node state update messages received via NATS.
// Expected JSON payload follows the UpdateNodeStatesRequest structure.
func (api *NatsAPI) handleNodeState(subject string, data []byte) {
var req UpdateNodeStatesRequest
dec := json.NewDecoder(strings.NewReader(v))
dec := json.NewDecoder(bytes.NewReader(data))
dec.DisallowUnknownFields()
if err := dec.Decode(&req); err != nil {
cclog.Errorf("NATS nodestate: parsing request failed: %v", err)
cclog.Errorf("NATS %s: parsing request failed: %v", subject, err)
return
}
repo := repository.GetNodeRepository()
requestReceived := time.Now().Unix()
for _, node := range req.Nodes {
state := determineState(node.States)
nodeState := schema.NodeStateDB{
TimeStamp: requestReceived,
TimeStamp: time.Now().Unix(),
NodeState: state,
CpusAllocated: node.CpusAllocated,
MemoryAllocated: node.MemoryAllocated,
@@ -357,44 +224,8 @@ func (api *NatsAPI) processNodestateEvent(msg lp.CCMessage) {
JobsRunning: node.JobsRunning,
}
if err := repo.UpdateNodeState(node.Hostname, req.Cluster, &nodeState); err != nil {
cclog.Errorf("NATS nodestate: updating node state for %s on %s failed: %v",
node.Hostname, req.Cluster, err)
}
repo.UpdateNodeState(node.Hostname, req.Cluster, &nodeState)
}
cclog.Debugf("NATS nodestate: updated %d node states for cluster %s", len(req.Nodes), req.Cluster)
}
// handleNodeState processes node state update messages received via NATS using InfluxDB line protocol.
// The message must be in line protocol format with measurement="nodestate" and include:
// - field "event" containing JSON payload (UpdateNodeStatesRequest)
//
// Example: nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[...]}" 1234567890000000000
func (api *NatsAPI) handleNodeState(subject string, data []byte) {
if len(data) == 0 {
cclog.Warnf("NATS %s: received empty message", subject)
return
}
d := influx.NewDecoderWithBytes(data)
for d.Next() {
m, err := receivers.DecodeInfluxMessage(d)
if err != nil {
cclog.Errorf("NATS %s: failed to decode InfluxDB line protocol message: %v", subject, err)
return
}
if !m.IsEvent() {
cclog.Warnf("NATS %s: received non-event message, skipping", subject)
continue
}
if m.Name() == "nodestate" {
api.processNodestateEvent(m)
} else {
cclog.Warnf("NATS %s: unexpected measurement name '%s', expected 'nodestate'", subject, m.Name())
}
}
cclog.Debugf("NATS %s: updated %d node states for cluster %s", subject, len(req.Nodes), req.Cluster)
}


@@ -1,947 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package api
import (
"context"
"database/sql"
"encoding/json"
"fmt"
"os"
"path/filepath"
"testing"
"time"
"github.com/ClusterCockpit/cc-backend/internal/archiver"
"github.com/ClusterCockpit/cc-backend/internal/auth"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/graph"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
ccconf "github.com/ClusterCockpit/cc-lib/v2/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
lp "github.com/ClusterCockpit/cc-lib/v2/ccMessage"
"github.com/ClusterCockpit/cc-lib/v2/schema"
_ "github.com/mattn/go-sqlite3"
)
func setupNatsTest(t *testing.T) *NatsAPI {
repository.ResetConnection()
const testconfig = `{
"main": {
"addr": "0.0.0.0:8080",
"validate": false,
"api-allowed-ips": [
"*"
]
},
"archive": {
"kind": "file",
"path": "./var/job-archive"
},
"auth": {
"jwts": {
"max-age": "2m"
}
}
}`
const testclusterJSON = `{
"name": "testcluster",
"subClusters": [
{
"name": "sc1",
"nodes": "host123,host124,host125",
"processorType": "Intel Core i7-4770",
"socketsPerNode": 1,
"coresPerSocket": 4,
"threadsPerCore": 2,
"flopRateScalar": {
"unit": {
"prefix": "G",
"base": "F/s"
},
"value": 14
},
"flopRateSimd": {
"unit": {
"prefix": "G",
"base": "F/s"
},
"value": 112
},
"memoryBandwidth": {
"unit": {
"prefix": "G",
"base": "B/s"
},
"value": 24
},
"numberOfNodes": 70,
"topology": {
"node": [0, 1, 2, 3, 4, 5, 6, 7],
"socket": [[0, 1, 2, 3, 4, 5, 6, 7]],
"memoryDomain": [[0, 1, 2, 3, 4, 5, 6, 7]],
"die": [[0, 1, 2, 3, 4, 5, 6, 7]],
"core": [[0], [1], [2], [3], [4], [5], [6], [7]]
}
}
],
"metricConfig": [
{
"name": "load_one",
"unit": { "base": ""},
"scope": "node",
"timestep": 60,
"aggregation": "avg",
"peak": 8,
"normal": 0,
"caution": 0,
"alert": 0
}
]
}`
cclog.Init("info", true)
tmpdir := t.TempDir()
jobarchive := filepath.Join(tmpdir, "job-archive")
if err := os.Mkdir(jobarchive, 0o777); err != nil {
t.Fatal(err)
}
if err := os.WriteFile(filepath.Join(jobarchive, "version.txt"), fmt.Appendf(nil, "%d", 3), 0o666); err != nil {
t.Fatal(err)
}
if err := os.Mkdir(filepath.Join(jobarchive, "testcluster"), 0o777); err != nil {
t.Fatal(err)
}
if err := os.WriteFile(filepath.Join(jobarchive, "testcluster", "cluster.json"), []byte(testclusterJSON), 0o666); err != nil {
t.Fatal(err)
}
dbfilepath := filepath.Join(tmpdir, "test.db")
err := repository.MigrateDB(dbfilepath)
if err != nil {
t.Fatal(err)
}
cfgFilePath := filepath.Join(tmpdir, "config.json")
if err := os.WriteFile(cfgFilePath, []byte(testconfig), 0o666); err != nil {
t.Fatal(err)
}
ccconf.Init(cfgFilePath)
// Load and check main configuration
if cfg := ccconf.GetPackageConfig("main"); cfg != nil {
config.Init(cfg)
} else {
cclog.Abort("Main configuration must be present")
}
archiveCfg := fmt.Sprintf("{\"kind\": \"file\",\"path\": \"%s\"}", jobarchive)
repository.Connect(dbfilepath)
if err := archive.Init(json.RawMessage(archiveCfg)); err != nil {
t.Fatal(err)
}
// metricstore initialization removed - it's initialized via callback in tests
archiver.Start(repository.GetJobRepository(), context.Background())
if cfg := ccconf.GetPackageConfig("auth"); cfg != nil {
auth.Init(&cfg)
} else {
cclog.Warn("Authentication disabled due to missing configuration")
auth.Init(nil)
}
graph.Init()
return NewNatsAPI()
}
func cleanupNatsTest() {
if err := archiver.Shutdown(5 * time.Second); err != nil {
cclog.Warnf("Archiver shutdown timeout in tests: %v", err)
}
}
func TestNatsHandleStartJob(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
tests := []struct {
name string
payload string
expectError bool
validateJob func(t *testing.T, job *schema.Job)
shouldFindJob bool
}{
{
name: "valid job start",
payload: `{
"jobId": 1001,
"user": "testuser1",
"project": "testproj1",
"cluster": "testcluster",
"partition": "main",
"walltime": 7200,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3, 4, 5, 6, 7]
}
],
"startTime": 1234567890
}`,
expectError: false,
shouldFindJob: true,
validateJob: func(t *testing.T, job *schema.Job) {
if job.JobID != 1001 {
t.Errorf("expected JobID 1001, got %d", job.JobID)
}
if job.User != "testuser1" {
t.Errorf("expected user testuser1, got %s", job.User)
}
if job.State != schema.JobStateRunning {
t.Errorf("expected state running, got %s", job.State)
}
},
},
{
name: "invalid JSON",
payload: `{
"jobId": "not a number",
"user": "testuser2"
}`,
expectError: true,
shouldFindJob: false,
},
{
name: "missing required fields",
payload: `{
"jobId": 1002
}`,
expectError: true,
shouldFindJob: false,
},
{
name: "job with unknown fields (should fail due to DisallowUnknownFields)",
payload: `{
"jobId": 1003,
"user": "testuser3",
"project": "testproj3",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"unknownField": "should cause error",
"startTime": 1234567900
}`,
expectError: true,
shouldFindJob: false,
},
{
name: "job with tags",
payload: `{
"jobId": 1004,
"user": "testuser4",
"project": "testproj4",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3]
}
],
"tags": [
{
"type": "test",
"name": "testtag",
"scope": "testuser4"
}
],
"startTime": 1234567910
}`,
expectError: false,
shouldFindJob: true,
validateJob: func(t *testing.T, job *schema.Job) {
if job.JobID != 1004 {
t.Errorf("expected JobID 1004, got %d", job.JobID)
}
},
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
natsAPI.handleStartJob(tt.payload)
natsAPI.JobRepository.SyncJobs()
// Allow some time for async operations
time.Sleep(100 * time.Millisecond)
if tt.shouldFindJob {
// Extract jobId from payload
var payloadMap map[string]any
if err := json.Unmarshal([]byte(tt.payload), &payloadMap); err != nil {
t.Fatalf("failed to parse test payload: %v", err)
}
jobID := int64(payloadMap["jobId"].(float64))
cluster := payloadMap["cluster"].(string)
startTime := int64(payloadMap["startTime"].(float64))
job, err := natsAPI.JobRepository.Find(&jobID, &cluster, &startTime)
if err != nil {
if !tt.expectError {
t.Fatalf("expected to find job, but got error: %v", err)
}
return
}
if tt.validateJob != nil {
tt.validateJob(t, job)
}
}
})
}
}
func TestNatsHandleStopJob(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
// First, create a running job
startPayload := `{
"jobId": 2001,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3, 4, 5, 6, 7]
}
],
"startTime": 1234567890
}`
natsAPI.handleStartJob(startPayload)
natsAPI.JobRepository.SyncJobs()
time.Sleep(100 * time.Millisecond)
tests := []struct {
name string
payload string
expectError bool
validateJob func(t *testing.T, job *schema.Job)
setupJobFunc func() // Optional: create specific test job
}{
{
name: "valid job stop - completed",
payload: `{
"jobId": 2001,
"cluster": "testcluster",
"startTime": 1234567890,
"jobState": "completed",
"stopTime": 1234571490
}`,
expectError: false,
validateJob: func(t *testing.T, job *schema.Job) {
if job.State != schema.JobStateCompleted {
t.Errorf("expected state completed, got %s", job.State)
}
expectedDuration := int32(1234571490 - 1234567890)
if job.Duration != expectedDuration {
t.Errorf("expected duration %d, got %d", expectedDuration, job.Duration)
}
},
},
{
name: "valid job stop - failed",
setupJobFunc: func() {
startPayloadFailed := `{
"jobId": 2002,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3]
}
],
"startTime": 1234567900
}`
natsAPI.handleStartJob(startPayloadFailed)
natsAPI.JobRepository.SyncJobs()
time.Sleep(100 * time.Millisecond)
},
payload: `{
"jobId": 2002,
"cluster": "testcluster",
"startTime": 1234567900,
"jobState": "failed",
"stopTime": 1234569900
}`,
expectError: false,
validateJob: func(t *testing.T, job *schema.Job) {
if job.State != schema.JobStateFailed {
t.Errorf("expected state failed, got %s", job.State)
}
},
},
{
name: "invalid JSON",
payload: `{
"jobId": "not a number"
}`,
expectError: true,
},
{
name: "missing jobId",
payload: `{
"cluster": "testcluster",
"jobState": "completed",
"stopTime": 1234571490
}`,
expectError: true,
},
{
name: "invalid job state",
setupJobFunc: func() {
startPayloadInvalid := `{
"jobId": 2003,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1]
}
],
"startTime": 1234567910
}`
natsAPI.handleStartJob(startPayloadInvalid)
natsAPI.JobRepository.SyncJobs()
time.Sleep(100 * time.Millisecond)
},
payload: `{
"jobId": 2003,
"cluster": "testcluster",
"startTime": 1234567910,
"jobState": "invalid_state",
"stopTime": 1234571510
}`,
expectError: true,
},
{
name: "stopTime before startTime",
setupJobFunc: func() {
startPayloadTime := `{
"jobId": 2004,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0]
}
],
"startTime": 1234567920
}`
natsAPI.handleStartJob(startPayloadTime)
natsAPI.JobRepository.SyncJobs()
time.Sleep(100 * time.Millisecond)
},
payload: `{
"jobId": 2004,
"cluster": "testcluster",
"startTime": 1234567920,
"jobState": "completed",
"stopTime": 1234567900
}`,
expectError: true,
},
{
name: "job not found",
payload: `{
"jobId": 99999,
"cluster": "testcluster",
"startTime": 1234567890,
"jobState": "completed",
"stopTime": 1234571490
}`,
expectError: true,
},
}
testData := schema.JobData{
"load_one": map[schema.MetricScope]*schema.JobMetric{
schema.MetricScopeNode: {
Unit: schema.Unit{Base: "load"},
Timestep: 60,
Series: []schema.Series{
{
Hostname: "host123",
Statistics: schema.MetricStatistics{Min: 0.1, Avg: 0.2, Max: 0.3},
Data: []schema.Float{0.1, 0.1, 0.1, 0.2, 0.2, 0.2, 0.3, 0.3, 0.3},
},
},
},
},
}
metricstore.TestLoadDataCallback = func(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error) {
return testData, nil
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
if tt.setupJobFunc != nil {
tt.setupJobFunc()
}
natsAPI.handleStopJob(tt.payload)
// Allow some time for async operations
time.Sleep(100 * time.Millisecond)
if !tt.expectError && tt.validateJob != nil {
// Extract job details from payload
var payloadMap map[string]any
if err := json.Unmarshal([]byte(tt.payload), &payloadMap); err != nil {
t.Fatalf("failed to parse test payload: %v", err)
}
jobID := int64(payloadMap["jobId"].(float64))
cluster := payloadMap["cluster"].(string)
var startTime *int64
if st, ok := payloadMap["startTime"]; ok {
t := int64(st.(float64))
startTime = &t
}
job, err := natsAPI.JobRepository.Find(&jobID, &cluster, startTime)
if err != nil {
t.Fatalf("expected to find job, but got error: %v", err)
}
tt.validateJob(t, job)
}
})
}
}
func TestNatsHandleNodeState(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
tests := []struct {
name string
data []byte
expectError bool
validateFn func(t *testing.T)
}{
{
name: "valid node state update",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[{\"hostname\":\"host123\",\"states\":[\"allocated\"],\"cpusAllocated\":8,\"memoryAllocated\":16384,\"gpusAllocated\":0,\"jobsRunning\":1}]}" 1234567890000000000`),
expectError: false,
validateFn: func(t *testing.T) {
// In a full test, we would verify the node state was updated in the database
// For now, just ensure no error occurred
},
},
{
name: "multiple nodes",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[{\"hostname\":\"host123\",\"states\":[\"idle\"],\"cpusAllocated\":0,\"memoryAllocated\":0,\"gpusAllocated\":0,\"jobsRunning\":0},{\"hostname\":\"host124\",\"states\":[\"allocated\"],\"cpusAllocated\":4,\"memoryAllocated\":8192,\"gpusAllocated\":1,\"jobsRunning\":1}]}" 1234567890000000000`),
expectError: false,
},
{
name: "invalid JSON in event field",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":\"not an array\"}" 1234567890000000000`),
expectError: true,
},
{
name: "empty nodes array",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[]}" 1234567890000000000`),
expectError: false, // Empty array should not cause error
},
{
name: "invalid line protocol format",
data: []byte(`invalid line protocol format`),
expectError: true,
},
{
name: "empty data",
data: []byte(``),
expectError: false, // Should be handled gracefully with warning
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
natsAPI.handleNodeState("test.subject", tt.data)
// Allow some time for async operations
time.Sleep(50 * time.Millisecond)
if tt.validateFn != nil {
tt.validateFn(t)
}
})
}
}
func TestNatsProcessJobEvent(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
msgStartJob, err := lp.NewMessage(
"job",
map[string]string{"function": "start_job"},
nil,
map[string]any{
"event": `{
"jobId": 3001,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3]
}
],
"startTime": 1234567890
}`,
},
time.Now(),
)
if err != nil {
t.Fatalf("failed to create test message: %v", err)
}
msgMissingTag, err := lp.NewMessage(
"job",
map[string]string{},
nil,
map[string]any{
"event": `{}`,
},
time.Now(),
)
if err != nil {
t.Fatalf("failed to create test message: %v", err)
}
msgUnknownFunc, err := lp.NewMessage(
"job",
map[string]string{"function": "unknown_function"},
nil,
map[string]any{
"event": `{}`,
},
time.Now(),
)
if err != nil {
t.Fatalf("failed to create test message: %v", err)
}
tests := []struct {
name string
message lp.CCMessage
expectError bool
}{
{
name: "start_job function",
message: msgStartJob,
expectError: false,
},
{
name: "missing function tag",
message: msgMissingTag,
expectError: true,
},
{
name: "unknown function",
message: msgUnknownFunc,
expectError: false,
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
natsAPI.processJobEvent(tt.message)
time.Sleep(50 * time.Millisecond)
})
}
}
func TestNatsHandleJobEvent(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
tests := []struct {
name string
data []byte
expectError bool
}{
{
name: "valid influx line protocol",
data: []byte(`job,function=start_job event="{\"jobId\":4001,\"user\":\"testuser\",\"project\":\"testproj\",\"cluster\":\"testcluster\",\"partition\":\"main\",\"walltime\":3600,\"numNodes\":1,\"numHwthreads\":8,\"numAcc\":0,\"shared\":\"none\",\"monitoringStatus\":1,\"smt\":1,\"resources\":[{\"hostname\":\"host123\",\"hwthreads\":[0,1,2,3]}],\"startTime\":1234567890}" 1234567890000000000`),
expectError: false,
},
{
name: "invalid influx line protocol",
data: []byte(`invalid line protocol format`),
expectError: true,
},
{
name: "empty data",
data: []byte(``),
expectError: false, // Decoder should handle empty input gracefully
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
// HandleJobEvent doesn't return errors, it logs them
// We're just ensuring it doesn't panic
natsAPI.handleJobEvent("test.subject", tt.data)
time.Sleep(50 * time.Millisecond)
})
}
}
func TestNatsHandleJobEventEdgeCases(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
tests := []struct {
name string
data []byte
expectError bool
description string
}{
{
name: "non-event message (metric data)",
data: []byte(`job,function=start_job value=123.45 1234567890000000000`),
expectError: false,
description: "Should skip non-event messages gracefully",
},
{
name: "wrong measurement name",
data: []byte(`wrongmeasurement,function=start_job event="{}" 1234567890000000000`),
expectError: false,
description: "Should warn about unexpected measurement but not fail",
},
{
name: "missing event field",
data: []byte(`job,function=start_job other_field="value" 1234567890000000000`),
expectError: true,
description: "Should error when event field is missing",
},
{
name: "multiple measurements in one message",
data: []byte("job,function=start_job event=\"{}\" 1234567890000000000\njob,function=stop_job event=\"{}\" 1234567890000000000"),
expectError: false,
description: "Should process multiple lines",
},
{
name: "escaped quotes in JSON payload",
data: []byte(`job,function=start_job event="{\"jobId\":6001,\"user\":\"test\\\"user\",\"cluster\":\"test\"}" 1234567890000000000`),
expectError: true,
description: "Should handle escaped quotes (though JSON parsing may fail)",
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
natsAPI.handleJobEvent("test.subject", tt.data)
time.Sleep(50 * time.Millisecond)
})
}
}
func TestNatsHandleNodeStateEdgeCases(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
tests := []struct {
name string
data []byte
expectError bool
description string
}{
{
name: "missing cluster field in JSON",
data: []byte(`nodestate event="{\"nodes\":[]}" 1234567890000000000`),
expectError: true,
description: "Should fail when cluster is missing",
},
{
name: "malformed JSON with unescaped quotes",
data: []byte(`nodestate event="{\"cluster\":\"test"cluster\",\"nodes\":[]}" 1234567890000000000`),
expectError: true,
description: "Should fail on malformed JSON",
},
{
name: "unicode characters in hostname",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[{\"hostname\":\"host-ñ123\",\"states\":[\"idle\"],\"cpusAllocated\":0,\"memoryAllocated\":0,\"gpusAllocated\":0,\"jobsRunning\":0}]}" 1234567890000000000`),
expectError: false,
description: "Should handle unicode characters",
},
{
name: "very large node count",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[{\"hostname\":\"node1\",\"states\":[\"idle\"],\"cpusAllocated\":0,\"memoryAllocated\":0,\"gpusAllocated\":0,\"jobsRunning\":0},{\"hostname\":\"node2\",\"states\":[\"idle\"],\"cpusAllocated\":0,\"memoryAllocated\":0,\"gpusAllocated\":0,\"jobsRunning\":0},{\"hostname\":\"node3\",\"states\":[\"idle\"],\"cpusAllocated\":0,\"memoryAllocated\":0,\"gpusAllocated\":0,\"jobsRunning\":0}]}" 1234567890000000000`),
expectError: false,
description: "Should handle multiple nodes efficiently",
},
{
name: "timestamp in past",
data: []byte(`nodestate event="{\"cluster\":\"testcluster\",\"nodes\":[]}" 1000000000000000000`),
expectError: false,
description: "Should accept any valid timestamp",
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
natsAPI.handleNodeState("test.subject", tt.data)
time.Sleep(50 * time.Millisecond)
})
}
}
func TestNatsHandleStartJobDuplicatePrevention(t *testing.T) {
natsAPI := setupNatsTest(t)
t.Cleanup(cleanupNatsTest)
// Start a job
payload := `{
"jobId": 5001,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3]
}
],
"startTime": 1234567890
}`
natsAPI.handleStartJob(payload)
natsAPI.JobRepository.SyncJobs()
time.Sleep(100 * time.Millisecond)
// Try to start the same job again (within 24 hours)
duplicatePayload := `{
"jobId": 5001,
"user": "testuser",
"project": "testproj",
"cluster": "testcluster",
"partition": "main",
"walltime": 3600,
"numNodes": 1,
"numHwthreads": 8,
"numAcc": 0,
"shared": "none",
"monitoringStatus": 1,
"smt": 1,
"resources": [
{
"hostname": "host123",
"hwthreads": [0, 1, 2, 3]
}
],
"startTime": 1234567900
}`
natsAPI.handleStartJob(duplicatePayload)
natsAPI.JobRepository.SyncJobs()
time.Sleep(100 * time.Millisecond)
// Verify only one job exists
jobID := int64(5001)
cluster := "testcluster"
jobs, err := natsAPI.JobRepository.FindAll(&jobID, &cluster, nil)
if err != nil && err != sql.ErrNoRows {
t.Fatalf("unexpected error: %v", err)
}
if len(jobs) != 1 {
t.Errorf("expected 1 job, got %d", len(jobs))
}
}


@@ -7,17 +7,12 @@ package api
import (
"fmt"
"maps"
"net/http"
"strings"
"time"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/schema"
)
type UpdateNodeStatesRequest struct {
@@ -25,15 +20,6 @@ type UpdateNodeStatesRequest struct {
Cluster string `json:"cluster" example:"fritz"`
}
// metricListToNames converts a map of metric configurations to a list of metric names
func metricListToNames(metricList map[string]*schema.Metric) []string {
names := make([]string, 0, len(metricList))
for name := range metricList {
names = append(names, name)
}
return names
}
// determineState assumes that at most one scheduler state is reported per node
func determineState(states []string) schema.SchedulerState {
for _, state := range states {
@@ -61,7 +47,7 @@ func determineState(states []string) schema.SchedulerState {
// @description Required query-parameter defines if all users or only users with additional special roles are returned.
// @produce json
// @param request body UpdateNodeStatesRequest true "Request body containing nodes and their states"
// @success 200 {object} api.DefaultAPIResponse "Success message"
// @success 200 {object} api.DefaultApiResponse "Success message"
// @failure 400 {object} api.ErrorResponse "Bad Request"
// @failure 401 {object} api.ErrorResponse "Unauthorized"
// @failure 403 {object} api.ErrorResponse "Forbidden"
@@ -76,70 +62,19 @@ func (api *RestAPI) updateNodeStates(rw http.ResponseWriter, r *http.Request) {
http.StatusBadRequest, rw)
return
}
requestReceived := time.Now().Unix()
repo := repository.GetNodeRepository()
m := make(map[string][]string)
metricNames := make(map[string][]string)
healthResults := make(map[string]metricstore.HealthCheckResult)
startMs := time.Now()
// Step 1: Build nodeList and metricList per subcluster
for _, node := range req.Nodes {
if sc, err := archive.GetSubClusterByNode(req.Cluster, node.Hostname); err == nil {
m[sc] = append(m[sc], node.Hostname)
}
}
for sc := range m {
if sc != "" {
metricList := archive.GetMetricConfigSubCluster(req.Cluster, sc)
metricNames[sc] = metricListToNames(metricList)
}
}
// Step 2: Determine which metric store to query and perform health check
healthRepo, err := metricdispatch.GetHealthCheckRepo(req.Cluster)
if err != nil {
cclog.Warnf("updateNodeStates: no metric store for cluster %s, skipping health check: %v", req.Cluster, err)
} else {
for sc, nl := range m {
if sc != "" {
if results, err := healthRepo.HealthCheck(req.Cluster, nl, metricNames[sc]); err == nil {
maps.Copy(healthResults, results)
}
}
}
}
cclog.Debugf("Timer updateNodeStates, MemStore HealthCheck: %s", time.Since(startMs))
startDB := time.Now()
for _, node := range req.Nodes {
state := determineState(node.States)
healthState := schema.MonitoringStateFailed
var healthMetrics string
if result, ok := healthResults[node.Hostname]; ok {
healthState = result.State
healthMetrics = result.HealthMetrics
}
nodeState := schema.NodeStateDB{
TimeStamp: requestReceived,
NodeState: state,
TimeStamp: time.Now().Unix(), NodeState: state,
CpusAllocated: node.CpusAllocated,
MemoryAllocated: node.MemoryAllocated,
GpusAllocated: node.GpusAllocated,
HealthState: healthState,
HealthMetrics: healthMetrics,
HealthState: schema.MonitoringStateFull,
JobsRunning: node.JobsRunning,
}
if err := repo.UpdateNodeState(node.Hostname, req.Cluster, &nodeState); err != nil {
cclog.Errorf("updateNodeStates: updating node state for %s on %s failed: %v",
node.Hostname, req.Cluster, err)
}
repo.UpdateNodeState(node.Hostname, req.Cluster, &nodeState)
}
cclog.Debugf("Timer updateNodeStates, SQLite Inserts: %s", time.Since(startDB))
}


@@ -22,11 +22,10 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/auth"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/internal/tagger"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/v2/util"
"github.com/go-chi/chi/v5"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/ClusterCockpit/cc-lib/util"
"github.com/gorilla/mux"
)
// @title ClusterCockpit REST API
@@ -49,7 +48,6 @@ import (
const (
noticeFilePath = "./var/notice.txt"
noticeFilePerms = 0o644
maxNoticeLength = 10000 // Maximum allowed notice content length in characters
)
type RestAPI struct {
@@ -63,7 +61,6 @@ type RestAPI struct {
RepositoryMutex sync.Mutex
}
// New creates and initializes a new RestAPI instance with configured dependencies.
func New() *RestAPI {
return &RestAPI{
JobRepository: repository.GetJobRepository(),
@@ -72,100 +69,82 @@ func New() *RestAPI {
}
}
// MountAPIRoutes registers REST API endpoints for job and cluster management.
// These routes use JWT token authentication via the X-Auth-Token header.
func (api *RestAPI) MountAPIRoutes(r chi.Router) {
func (api *RestAPI) MountAPIRoutes(r *mux.Router) {
r.StrictSlash(true)
// REST API Uses TokenAuth
// User List
r.Get("/users/", api.getUsers)
r.HandleFunc("/users/", api.getUsers).Methods(http.MethodGet)
// Cluster List
r.Get("/clusters/", api.getClusters)
r.HandleFunc("/clusters/", api.getClusters).Methods(http.MethodGet)
// Slurm node state
r.Post("/nodestate/", api.updateNodeStates)
r.Put("/nodestate/", api.updateNodeStates)
r.HandleFunc("/nodestate/", api.updateNodeStates).Methods(http.MethodPost, http.MethodPut)
// Job Handler
if config.Keys.APISubjects == nil {
cclog.Info("Enabling REST start/stop job API")
r.Post("/jobs/start_job/", api.startJob)
r.Put("/jobs/start_job/", api.startJob)
r.Post("/jobs/stop_job/", api.stopJobByRequest)
r.Put("/jobs/stop_job/", api.stopJobByRequest)
r.HandleFunc("/jobs/start_job/", api.startJob).Methods(http.MethodPost, http.MethodPut)
r.HandleFunc("/jobs/stop_job/", api.stopJobByRequest).Methods(http.MethodPost, http.MethodPut)
}
r.Get("/jobs/", api.getJobs)
r.Get("/jobs/used_nodes", api.getUsedNodes)
r.Post("/jobs/tag_job/{id}", api.tagJob)
r.Patch("/jobs/tag_job/{id}", api.tagJob)
r.Delete("/jobs/tag_job/{id}", api.removeTagJob)
r.Patch("/jobs/edit_meta/{id}", api.editMeta)
r.Patch("/jobs/edit_meta/", api.editMetaByRequest)
r.Get("/jobs/metrics/{id}", api.getJobMetrics)
r.Delete("/jobs/delete_job/", api.deleteJobByRequest)
r.Delete("/jobs/delete_job/{id}", api.deleteJobByID)
r.Delete("/jobs/delete_job_before/{ts}", api.deleteJobBefore)
r.Post("/jobs/{id}", api.getJobByID)
r.Get("/jobs/{id}", api.getCompleteJobByID)
r.HandleFunc("/jobs/", api.getJobs).Methods(http.MethodGet)
r.HandleFunc("/jobs/{id}", api.getJobByID).Methods(http.MethodPost)
r.HandleFunc("/jobs/{id}", api.getCompleteJobByID).Methods(http.MethodGet)
r.HandleFunc("/jobs/tag_job/{id}", api.tagJob).Methods(http.MethodPost, http.MethodPatch)
r.HandleFunc("/jobs/tag_job/{id}", api.removeTagJob).Methods(http.MethodDelete)
r.HandleFunc("/jobs/edit_meta/{id}", api.editMeta).Methods(http.MethodPost, http.MethodPatch)
r.HandleFunc("/jobs/metrics/{id}", api.getJobMetrics).Methods(http.MethodGet)
r.HandleFunc("/jobs/delete_job/", api.deleteJobByRequest).Methods(http.MethodDelete)
r.HandleFunc("/jobs/delete_job/{id}", api.deleteJobByID).Methods(http.MethodDelete)
r.HandleFunc("/jobs/delete_job_before/{ts}", api.deleteJobBefore).Methods(http.MethodDelete)
r.Delete("/tags/", api.removeTags)
r.HandleFunc("/tags/", api.removeTags).Methods(http.MethodDelete)
if api.MachineStateDir != "" {
r.Get("/machine_state/{cluster}/{host}", api.getMachineState)
r.Put("/machine_state/{cluster}/{host}", api.putMachineState)
r.Post("/machine_state/{cluster}/{host}", api.putMachineState)
r.HandleFunc("/machine_state/{cluster}/{host}", api.getMachineState).Methods(http.MethodGet)
r.HandleFunc("/machine_state/{cluster}/{host}", api.putMachineState).Methods(http.MethodPut, http.MethodPost)
}
}
// MountUserAPIRoutes registers user-accessible REST API endpoints.
// These are limited endpoints for regular users with JWT token authentication.
func (api *RestAPI) MountUserAPIRoutes(r chi.Router) {
func (api *RestAPI) MountUserAPIRoutes(r *mux.Router) {
r.StrictSlash(true)
// REST API Uses TokenAuth
r.Get("/jobs/", api.getJobs)
r.Post("/jobs/{id}", api.getJobByID)
r.Get("/jobs/{id}", api.getCompleteJobByID)
r.Get("/jobs/metrics/{id}", api.getJobMetrics)
r.HandleFunc("/jobs/", api.getJobs).Methods(http.MethodGet)
r.HandleFunc("/jobs/{id}", api.getJobByID).Methods(http.MethodPost)
r.HandleFunc("/jobs/{id}", api.getCompleteJobByID).Methods(http.MethodGet)
r.HandleFunc("/jobs/metrics/{id}", api.getJobMetrics).Methods(http.MethodGet)
}
// MountMetricStoreAPIRoutes registers metric storage API endpoints.
// These endpoints handle metric data ingestion and health checks with JWT token authentication.
func (api *RestAPI) MountMetricStoreAPIRoutes(r chi.Router) {
func (api *RestAPI) MountMetricStoreAPIRoutes(r *mux.Router) {
// REST API Uses TokenAuth
r.Post("/free", freeMetrics)
r.Post("/write", writeMetrics)
r.Get("/debug", debugMetrics)
r.Post("/healthcheck", api.updateNodeStates)
// Note: StrictSlash handles trailing slash variations automatically
r.HandleFunc("/api/free", freeMetrics).Methods(http.MethodPost)
r.HandleFunc("/api/write", writeMetrics).Methods(http.MethodPost)
r.HandleFunc("/api/debug", debugMetrics).Methods(http.MethodGet)
r.HandleFunc("/api/healthcheck", metricsHealth).Methods(http.MethodGet)
// Same endpoints but with trailing slash
r.Post("/free/", freeMetrics)
r.Post("/write/", writeMetrics)
r.Get("/debug/", debugMetrics)
r.Post("/healthcheck/", api.updateNodeStates)
r.HandleFunc("/api/free/", freeMetrics).Methods(http.MethodPost)
r.HandleFunc("/api/write/", writeMetrics).Methods(http.MethodPost)
r.HandleFunc("/api/debug/", debugMetrics).Methods(http.MethodGet)
r.HandleFunc("/api/healthcheck/", metricsHealth).Methods(http.MethodGet)
}
// MountConfigAPIRoutes registers configuration and user management endpoints.
// These routes use session-based authentication and require admin privileges.
// Routes use full paths (including /config prefix) to avoid conflicting with
// the /config page route when registered via Group instead of Route.
func (api *RestAPI) MountConfigAPIRoutes(r chi.Router) {
func (api *RestAPI) MountConfigAPIRoutes(r *mux.Router) {
r.StrictSlash(true)
// Settings Frontend Uses SessionAuth
if api.Authentication != nil {
r.Get("/config/roles/", api.getRoles)
r.Post("/config/users/", api.createUser)
r.Put("/config/users/", api.createUser)
r.Get("/config/users/", api.getUsers)
r.Delete("/config/users/", api.deleteUser)
r.Post("/config/user/{id}", api.updateUser)
r.Post("/config/notice/", api.editNotice)
r.Get("/config/taggers/", api.getTaggers)
r.Post("/config/taggers/run/", api.runTagger)
r.HandleFunc("/roles/", api.getRoles).Methods(http.MethodGet)
r.HandleFunc("/users/", api.createUser).Methods(http.MethodPost, http.MethodPut)
r.HandleFunc("/users/", api.getUsers).Methods(http.MethodGet)
r.HandleFunc("/users/", api.deleteUser).Methods(http.MethodDelete)
r.HandleFunc("/user/{id}", api.updateUser).Methods(http.MethodPost)
r.HandleFunc("/notice/", api.editNotice).Methods(http.MethodPost)
}
}
// MountFrontendAPIRoutes registers frontend-specific API endpoints.
// These routes support JWT generation and user configuration updates with session authentication.
func (api *RestAPI) MountFrontendAPIRoutes(r chi.Router) {
r.Get("/logs/", api.getJournalLog)
func (api *RestAPI) MountFrontendAPIRoutes(r *mux.Router) {
r.StrictSlash(true)
// Settings Frontend Uses SessionAuth
if api.Authentication != nil {
r.Get("/jwt/", api.getJWT)
r.Post("/configuration/", api.updateConfiguration)
r.HandleFunc("/jwt/", api.getJWT).Methods(http.MethodGet)
r.HandleFunc("/configuration/", api.updateConfiguration).Methods(http.MethodPost)
}
}
@@ -181,8 +160,6 @@ type DefaultAPIResponse struct {
Message string `json:"msg"`
}
// handleError writes a standardized JSON error response with the given status code.
// It logs the error at WARN level and ensures proper Content-Type headers are set.
func handleError(err error, statusCode int, rw http.ResponseWriter) {
cclog.Warnf("REST ERROR : %s", err.Error())
rw.Header().Add("Content-Type", "application/json")
@@ -195,38 +172,15 @@ func handleError(err error, statusCode int, rw http.ResponseWriter) {
}
}
// decode reads JSON from r into val with strict validation that rejects unknown fields.
func decode(r io.Reader, val any) error {
dec := json.NewDecoder(r)
dec.DisallowUnknownFields()
return dec.Decode(val)
}
// validatePathComponent checks if a path component contains potentially malicious patterns
// that could be used for path traversal attacks. Returns an error if validation fails.
func validatePathComponent(component, componentName string) error {
if strings.Contains(component, "..") ||
strings.Contains(component, "/") ||
strings.Contains(component, "\\") {
return fmt.Errorf("invalid %s", componentName)
}
return nil
}
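`validatePathComponent` can be exercised standalone to see which components pass; the inputs below are hypothetical examples, not values from the repository:

```go
package main

import (
	"fmt"
	"strings"
)

// validatePathComponent mirrors the helper above: it rejects path
// components containing traversal sequences or path separators.
func validatePathComponent(component, componentName string) error {
	if strings.Contains(component, "..") ||
		strings.Contains(component, "/") ||
		strings.Contains(component, "\\") {
		return fmt.Errorf("invalid %s", componentName)
	}
	return nil
}

func main() {
	// A plain name is accepted.
	fmt.Println(validatePathComponent("fritz", "cluster") == nil)
	// Traversal sequences are rejected.
	fmt.Println(validatePathComponent("../etc", "cluster") != nil)
	// Embedded separators are rejected.
	fmt.Println(validatePathComponent(`a\b`, "host") != nil)
}
```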
// editNotice godoc
// @summary Update system notice
// @tags Config
// @description Updates the notice.txt file content. Only admins are allowed. Content is limited to 10000 characters.
// @accept mpfd
// @produce plain
// @param new-content formData string true "New notice content (max 10000 characters)"
// @success 200 {string} string "Update Notice Content Success"
// @failure 400 {object} ErrorResponse "Bad Request"
// @failure 403 {object} ErrorResponse "Forbidden"
// @failure 500 {object} ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /notice/ [post]
func (api *RestAPI) editNotice(rw http.ResponseWriter, r *http.Request) {
// SecuredCheck() only worked with TokenAuth: Removed
if user := repository.GetUserFromContext(r.Context()); !user.HasRole(schema.RoleAdmin) {
handleError(fmt.Errorf("only admins are allowed to update the notice.txt file"), http.StatusForbidden, rw)
return
@@ -235,8 +189,9 @@ func (api *RestAPI) editNotice(rw http.ResponseWriter, r *http.Request) {
// Get Value
newContent := r.FormValue("new-content")
if len(newContent) > maxNoticeLength {
handleError(fmt.Errorf("notice content exceeds maximum length of %d characters", maxNoticeLength), http.StatusBadRequest, rw)
// Validate content length to prevent DoS
if len(newContent) > 10000 {
handleError(fmt.Errorf("notice content exceeds maximum length of 10000 characters"), http.StatusBadRequest, rw)
return
}
@@ -248,9 +203,7 @@ func (api *RestAPI) editNotice(rw http.ResponseWriter, r *http.Request) {
handleError(fmt.Errorf("creating notice file failed: %w", err), http.StatusInternalServerError, rw)
return
}
if err := ntxt.Close(); err != nil {
cclog.Warnf("Failed to close notice file: %v", err)
}
ntxt.Close()
}
if err := os.WriteFile(noticeFilePath, []byte(newContent), noticeFilePerms); err != nil {
@@ -260,66 +213,13 @@ func (api *RestAPI) editNotice(rw http.ResponseWriter, r *http.Request) {
rw.Header().Set("Content-Type", "text/plain")
rw.WriteHeader(http.StatusOK)
var msg []byte
if newContent != "" {
msg = []byte("Update Notice Content Success")
rw.Write([]byte("Update Notice Content Success"))
} else {
msg = []byte("Empty Notice Content Success")
}
if _, err := rw.Write(msg); err != nil {
cclog.Errorf("Failed to write response: %v", err)
rw.Write([]byte("Empty Notice Content Success"))
}
}
func (api *RestAPI) getTaggers(rw http.ResponseWriter, r *http.Request) {
if user := repository.GetUserFromContext(r.Context()); !user.HasRole(schema.RoleAdmin) {
handleError(fmt.Errorf("only admins are allowed to list taggers"), http.StatusForbidden, rw)
return
}
rw.Header().Set("Content-Type", "application/json")
if err := json.NewEncoder(rw).Encode(tagger.ListTaggers()); err != nil {
cclog.Errorf("Failed to encode tagger list: %v", err)
}
}
func (api *RestAPI) runTagger(rw http.ResponseWriter, r *http.Request) {
if user := repository.GetUserFromContext(r.Context()); !user.HasRole(schema.RoleAdmin) {
handleError(fmt.Errorf("only admins are allowed to run taggers"), http.StatusForbidden, rw)
return
}
name := r.FormValue("name")
if name == "" {
handleError(fmt.Errorf("missing required parameter: name"), http.StatusBadRequest, rw)
return
}
if err := tagger.RunTaggerByName(name); err != nil {
handleError(err, http.StatusConflict, rw)
return
}
rw.Header().Set("Content-Type", "text/plain")
rw.WriteHeader(http.StatusOK)
if _, err := rw.Write(fmt.Appendf(nil, "Tagger %s started", name)); err != nil {
cclog.Errorf("Failed to write response: %v", err)
}
}
// getJWT godoc
// @summary Generate JWT token
// @tags Frontend
// @description Generates a JWT token for a user. Admins can generate tokens for any user, regular users only for themselves.
// @accept mpfd
// @produce plain
// @param username formData string true "Username to generate JWT for"
// @success 200 {string} string "JWT token"
// @failure 403 {object} ErrorResponse "Forbidden"
// @failure 404 {object} ErrorResponse "User Not Found"
// @failure 500 {object} ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /jwt/ [get]
func (api *RestAPI) getJWT(rw http.ResponseWriter, r *http.Request) {
rw.Header().Set("Content-Type", "text/plain")
username := r.FormValue("username")
@@ -344,22 +244,12 @@ func (api *RestAPI) getJWT(rw http.ResponseWriter, r *http.Request) {
}
rw.WriteHeader(http.StatusOK)
if _, err := rw.Write([]byte(jwt)); err != nil {
cclog.Errorf("Failed to write JWT response: %v", err)
}
rw.Write([]byte(jwt))
}
// getRoles godoc
// @summary Get available roles
// @tags Config
// @description Returns a list of valid user roles. Only admins are allowed.
// @produce json
// @success 200 {array} string "List of role names"
// @failure 403 {object} ErrorResponse "Forbidden"
// @failure 500 {object} ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /roles/ [get]
func (api *RestAPI) getRoles(rw http.ResponseWriter, r *http.Request) {
// SecuredCheck() only worked with TokenAuth: Removed
user := repository.GetUserFromContext(r.Context())
if !user.HasRole(schema.RoleAdmin) {
handleError(fmt.Errorf("only admins are allowed to fetch a list of roles"), http.StatusForbidden, rw)
@@ -378,18 +268,6 @@ func (api *RestAPI) getRoles(rw http.ResponseWriter, r *http.Request) {
}
}
// updateConfiguration godoc
// @summary Update user configuration
// @tags Frontend
// @description Updates a user's configuration key-value pair.
// @accept mpfd
// @produce plain
// @param key formData string true "Configuration key"
// @param value formData string true "Configuration value"
// @success 200 {string} string "success"
// @failure 500 {object} ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /configuration/ [post]
func (api *RestAPI) updateConfiguration(rw http.ResponseWriter, r *http.Request) {
rw.Header().Set("Content-Type", "text/plain")
key, value := r.FormValue("key"), r.FormValue("value")
@@ -400,40 +278,26 @@ func (api *RestAPI) updateConfiguration(rw http.ResponseWriter, r *http.Request)
}
rw.WriteHeader(http.StatusOK)
if _, err := rw.Write([]byte("success")); err != nil {
cclog.Errorf("Failed to write response: %v", err)
}
rw.Write([]byte("success"))
}
// putMachineState godoc
// @summary Store machine state
// @tags Machine State
// @description Stores machine state data for a specific cluster node. Validates cluster and host names to prevent path traversal.
// @accept json
// @produce plain
// @param cluster path string true "Cluster name"
// @param host path string true "Host name"
// @success 201 "Created"
// @failure 400 {object} ErrorResponse "Bad Request"
// @failure 404 {object} ErrorResponse "Machine state not enabled"
// @failure 500 {object} ErrorResponse "Internal Server Error"
// @security ApiKeyAuth
// @router /machine_state/{cluster}/{host} [put]
func (api *RestAPI) putMachineState(rw http.ResponseWriter, r *http.Request) {
if api.MachineStateDir == "" {
handleError(fmt.Errorf("machine state not enabled"), http.StatusNotFound, rw)
return
}
cluster := chi.URLParam(r, "cluster")
host := chi.URLParam(r, "host")
vars := mux.Vars(r)
cluster := vars["cluster"]
host := vars["host"]
if err := validatePathComponent(cluster, "cluster name"); err != nil {
handleError(err, http.StatusBadRequest, rw)
// Validate cluster and host to prevent path traversal attacks
if strings.Contains(cluster, "..") || strings.Contains(cluster, "/") || strings.Contains(cluster, "\\") {
handleError(fmt.Errorf("invalid cluster name"), http.StatusBadRequest, rw)
return
}
if err := validatePathComponent(host, "host name"); err != nil {
handleError(err, http.StatusBadRequest, rw)
if strings.Contains(host, "..") || strings.Contains(host, "/") || strings.Contains(host, "\\") {
handleError(fmt.Errorf("invalid host name"), http.StatusBadRequest, rw)
return
}
@@ -459,33 +323,23 @@ func (api *RestAPI) putMachineState(rw http.ResponseWriter, r *http.Request) {
rw.WriteHeader(http.StatusCreated)
}
// getMachineState godoc
// @summary Retrieve machine state
// @tags Machine State
// @description Retrieves stored machine state data for a specific cluster node. Validates cluster and host names to prevent path traversal.
// @produce json
// @param cluster path string true "Cluster name"
// @param host path string true "Host name"
// @success 200 {object} object "Machine state JSON data"
// @failure 400 {object} ErrorResponse "Bad Request"
// @failure 404 {object} ErrorResponse "Machine state not enabled or file not found"
// @security ApiKeyAuth
// @router /machine_state/{cluster}/{host} [get]
func (api *RestAPI) getMachineState(rw http.ResponseWriter, r *http.Request) {
if api.MachineStateDir == "" {
handleError(fmt.Errorf("machine state not enabled"), http.StatusNotFound, rw)
return
}
cluster := chi.URLParam(r, "cluster")
host := chi.URLParam(r, "host")
vars := mux.Vars(r)
cluster := vars["cluster"]
host := vars["host"]
if err := validatePathComponent(cluster, "cluster name"); err != nil {
handleError(err, http.StatusBadRequest, rw)
// Validate cluster and host to prevent path traversal attacks
if strings.Contains(cluster, "..") || strings.Contains(cluster, "/") || strings.Contains(cluster, "\\") {
handleError(fmt.Errorf("invalid cluster name"), http.StatusBadRequest, rw)
return
}
if err := validatePathComponent(host, "host name"); err != nil {
handleError(err, http.StatusBadRequest, rw)
if strings.Contains(host, "..") || strings.Contains(host, "/") || strings.Contains(host, "\\") {
handleError(fmt.Errorf("invalid host name"), http.StatusBadRequest, rw)
return
}


@@ -11,9 +11,9 @@ import (
"net/http"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/go-chi/chi/v5"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/gorilla/mux"
)
type APIReturnedUser struct {
@@ -31,7 +31,7 @@ type APIReturnedUser struct {
// @description Required query-parameter defines if all users or only users with additional special roles are returned.
// @produce json
// @param not-just-user query bool true "If returned list should contain all users or only users with additional special roles"
// @success 200 {array} api.APIReturnedUser "List of users returned successfully"
// @success 200 {array} api.ApiReturnedUser "List of users returned successfully"
// @failure 400 {string} string "Bad Request"
// @failure 401 {string} string "Unauthorized"
// @failure 403 {string} string "Forbidden"
@@ -91,7 +91,7 @@ func (api *RestAPI) updateUser(rw http.ResponseWriter, r *http.Request) {
// Handle role updates
if newrole != "" {
if err := repository.GetUserRepository().AddRole(r.Context(), chi.URLParam(r, "id"), newrole); err != nil {
if err := repository.GetUserRepository().AddRole(r.Context(), mux.Vars(r)["id"], newrole); err != nil {
handleError(fmt.Errorf("adding role failed: %w", err), http.StatusUnprocessableEntity, rw)
return
}
@@ -99,7 +99,7 @@ func (api *RestAPI) updateUser(rw http.ResponseWriter, r *http.Request) {
cclog.Errorf("Failed to encode response: %v", err)
}
} else if delrole != "" {
if err := repository.GetUserRepository().RemoveRole(r.Context(), chi.URLParam(r, "id"), delrole); err != nil {
if err := repository.GetUserRepository().RemoveRole(r.Context(), mux.Vars(r)["id"], delrole); err != nil {
handleError(fmt.Errorf("removing role failed: %w", err), http.StatusUnprocessableEntity, rw)
return
}
@@ -107,7 +107,7 @@ func (api *RestAPI) updateUser(rw http.ResponseWriter, r *http.Request) {
cclog.Errorf("Failed to encode response: %v", err)
}
} else if newproj != "" {
if err := repository.GetUserRepository().AddProject(r.Context(), chi.URLParam(r, "id"), newproj); err != nil {
if err := repository.GetUserRepository().AddProject(r.Context(), mux.Vars(r)["id"], newproj); err != nil {
handleError(fmt.Errorf("adding project failed: %w", err), http.StatusUnprocessableEntity, rw)
return
}
@@ -115,7 +115,7 @@ func (api *RestAPI) updateUser(rw http.ResponseWriter, r *http.Request) {
cclog.Errorf("Failed to encode response: %v", err)
}
} else if delproj != "" {
if err := repository.GetUserRepository().RemoveProject(r.Context(), chi.URLParam(r, "id"), delproj); err != nil {
if err := repository.GetUserRepository().RemoveProject(r.Context(), mux.Vars(r)["id"], delproj); err != nil {
handleError(fmt.Errorf("removing project failed: %w", err), http.StatusUnprocessableEntity, rw)
return
}
@@ -164,7 +164,7 @@ func (api *RestAPI) createUser(rw http.ResponseWriter, r *http.Request) {
return
}
if len(password) == 0 && role != schema.GetRoleString(schema.RoleAPI) {
if len(password) == 0 && role != schema.GetRoleString(schema.RoleApi) {
handleError(fmt.Errorf("only API users are allowed to have a blank password (login will be impossible)"), http.StatusBadRequest, rw)
return
}


@@ -106,7 +106,7 @@ Data is archived at the highest available resolution (typically 60s intervals).
```go
// In archiver.go ArchiveJob() function
jobData, err := metricdispatch.LoadData(job, allMetrics, scopes, ctx, 300)
jobData, err := metricdispatcher.LoadData(job, allMetrics, scopes, ctx, 300)
// 0 = highest resolution
// 300 = 5-minute resolution
```
@@ -170,6 +170,7 @@ All exported functions are safe for concurrent use:
- `Start()` - Safe to call once
- `TriggerArchiving()` - Safe from multiple goroutines
- `Shutdown()` - Safe to call once
- `WaitForArchiving()` - Deprecated, but safe
Internal state is protected by:
- Channel synchronization (`archiveChannel`)
@@ -184,6 +185,6 @@ Internal state is protected by:
## Dependencies
- `internal/repository`: Database operations for job metadata
- `internal/metricdispatch`: Loading metric data from various backends
- `internal/metricdispatcher`: Loading metric data from various backends
- `pkg/archive`: Archive backend abstraction (filesystem, S3, SQLite)
- `cc-lib/schema`: Job and metric data structures


@@ -54,8 +54,8 @@ import (
"time"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
sq "github.com/Masterminds/squirrel"
)
@@ -126,7 +126,7 @@ func archivingWorker() {
// Metadata itself is not used here; this call loads JobMeta into the cache.
// It will fail if the job metadata is not in the repository.
if _, err := jobRepo.FetchMetadata(job); err != nil {
cclog.Errorf("archiving job (dbid: %d) failed at check metadata step: %s", *job.ID, err.Error())
cclog.Errorf("archiving job (dbid: %d) failed at check metadata step: %s", job.ID, err.Error())
jobRepo.UpdateMonitoringStatus(*job.ID, schema.MonitoringStatusArchivingFailed)
archivePending.Done()
continue
@@ -136,7 +136,7 @@ func archivingWorker() {
// Use shutdown context to allow cancellation
jobMeta, err := ArchiveJob(job, shutdownCtx)
if err != nil {
cclog.Errorf("archiving job (dbid: %d) failed at archiving job step: %s", *job.ID, err.Error())
cclog.Errorf("archiving job (dbid: %d) failed at archiving job step: %s", job.ID, err.Error())
jobRepo.UpdateMonitoringStatus(*job.ID, schema.MonitoringStatusArchivingFailed)
archivePending.Done()
continue
@@ -145,24 +145,24 @@ func archivingWorker() {
stmt := sq.Update("job").Where("job.id = ?", job.ID)
if stmt, err = jobRepo.UpdateFootprint(stmt, jobMeta); err != nil {
cclog.Errorf("archiving job (dbid: %d) failed at update Footprint step: %s", *job.ID, err.Error())
cclog.Errorf("archiving job (dbid: %d) failed at update Footprint step: %s", job.ID, err.Error())
archivePending.Done()
continue
}
if stmt, err = jobRepo.UpdateEnergy(stmt, jobMeta); err != nil {
cclog.Errorf("archiving job (dbid: %d) failed at update Energy step: %s", *job.ID, err.Error())
cclog.Errorf("archiving job (dbid: %d) failed at update Energy step: %s", job.ID, err.Error())
archivePending.Done()
continue
}
// Update the jobs database entry one last time:
stmt = jobRepo.MarkArchived(stmt, schema.MonitoringStatusArchivingSuccessful)
if err := jobRepo.Execute(stmt); err != nil {
cclog.Errorf("archiving job (dbid: %d) failed at db execute: %s", *job.ID, err.Error())
cclog.Errorf("archiving job (dbid: %d) failed at db execute: %s", job.ID, err.Error())
archivePending.Done()
continue
}
cclog.Debugf("archiving job %d took %s", job.JobID, time.Since(start))
cclog.Infof("archiving job (dbid: %d) successful", *job.ID)
cclog.Infof("archiving job (dbid: %d) successful", job.ID)
repository.CallJobStopHooks(job)
archivePending.Done()


@@ -9,10 +9,11 @@ import (
"context"
"math"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatcher"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
// ArchiveJob archives a completed job's metric data to the configured archive backend.
@@ -59,7 +60,7 @@ func ArchiveJob(job *schema.Job, ctx context.Context) (*schema.Job, error) {
scopes = append(scopes, schema.MetricScopeAccelerator)
}
	jobData, err := metricdispatch.LoadData(job, allMetrics, scopes, ctx, 0) // Resolution value 0 retrieves highest res (60s)
	jobData, err := metricdispatcher.LoadData(job, allMetrics, scopes, ctx, 0) // Resolution value 0 retrieves highest res (60s)
if err != nil {
cclog.Error("Error while loading job data for archiving")
return nil, err
@@ -93,5 +94,12 @@ func ArchiveJob(job *schema.Job, ctx context.Context) (*schema.Job, error) {
}
}
// If the file based archive is disabled,
// only return the JobMeta structure as the
// statistics in there are needed.
if config.Keys.DisableArchive {
return job, nil
}
return job, archive.GetHandle().ImportJob(job, &jobData)
}


@@ -25,9 +25,9 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/v2/util"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/ClusterCockpit/cc-lib/util"
"github.com/gorilla/sessions"
)
@@ -40,7 +40,7 @@ type Authenticator interface {
// authenticator should attempt the login. This method should not perform
// expensive operations or actual authentication.
CanLogin(user *schema.User, username string, rw http.ResponseWriter, r *http.Request) (*schema.User, bool)
// Login performs the actual authentication for the user.
// It returns the authenticated user or an error if authentication fails.
// The user parameter may be nil if the user doesn't exist in the database yet.
@@ -65,13 +65,13 @@ var ipUserLimiters sync.Map
func getIPUserLimiter(ip, username string) *rate.Limiter {
key := ip + ":" + username
now := time.Now()
if entry, ok := ipUserLimiters.Load(key); ok {
rle := entry.(*rateLimiterEntry)
rle.lastUsed = now
return rle.limiter
}
// More aggressive rate limiting: 5 attempts per 15 minutes
newLimiter := rate.NewLimiter(rate.Every(15*time.Minute/5), 5)
ipUserLimiters.Store(key, &rateLimiterEntry{
@@ -176,7 +176,7 @@ func (auth *Authentication) AuthViaSession(
func Init(authCfg *json.RawMessage) {
initOnce.Do(func() {
authInstance = &Authentication{}
// Start background cleanup of rate limiters
startRateLimiterCleanup()
@@ -263,7 +263,7 @@ func GetAuthInstance() *Authentication {
}
// handleUserSync syncs or updates a user in the database based on configuration.
// This is used for LDAP, JWT and OIDC authentications when syncUserOnLogin or updateUserOnLogin is enabled.
// This is used for both JWT and OIDC authentication when syncUserOnLogin or updateUserOnLogin is enabled.
func handleUserSync(user *schema.User, syncUserOnLogin, updateUserOnLogin bool) {
r := repository.GetUserRepository()
dbUser, err := r.GetUser(user.Username)
@@ -272,7 +272,7 @@ func handleUserSync(user *schema.User, syncUserOnLogin, updateUserOnLogin bool)
cclog.Errorf("Error while loading user '%s': %v", user.Username, err)
return
}
if err == sql.ErrNoRows && syncUserOnLogin { // Add new user
if err := r.AddUser(user); err != nil {
cclog.Errorf("Error while adding user '%s' to DB: %v", user.Username, err)
@@ -294,11 +294,6 @@ func handleOIDCUser(OIDCUser *schema.User) {
handleUserSync(OIDCUser, Keys.OpenIDConfig.SyncUserOnLogin, Keys.OpenIDConfig.UpdateUserOnLogin)
}
// handleLdapUser syncs LDAP user with database
func handleLdapUser(ldapUser *schema.User) {
handleUserSync(ldapUser, Keys.LdapConfig.SyncUserOnLogin, Keys.LdapConfig.UpdateUserOnLogin)
}
func (auth *Authentication) SaveSession(rw http.ResponseWriter, r *http.Request, user *schema.User) error {
session, err := auth.sessionStore.New(r, "session")
if err != nil {
@@ -310,13 +305,8 @@ func (auth *Authentication) SaveSession(rw http.ResponseWriter, r *http.Request,
if auth.SessionMaxAge != 0 {
session.Options.MaxAge = int(auth.SessionMaxAge.Seconds())
}
if r.TLS == nil && r.Header.Get("X-Forwarded-Proto") != "https" {
// If neither TLS or an encrypted reverse proxy are used, do not mark cookies as secure.
cclog.Warn("Authenticating with unencrypted request. Session cookies will not have Secure flag set (insecure for production)")
if r.Header.Get("X-Forwarded-Proto") == "" {
// This warning will not be printed if e.g. X-Forwarded-Proto == http
cclog.Warn("If you are using a reverse proxy, make sure X-Forwarded-Proto is set")
}
if config.Keys.HTTPSCertFile == "" && config.Keys.HTTPSKeyFile == "" {
cclog.Warn("HTTPS not configured - session cookies will not have Secure flag set (insecure for production)")
session.Options.Secure = false
}
session.Options.SameSite = http.SameSiteStrictMode
@@ -448,13 +438,13 @@ func (auth *Authentication) AuthAPI(
if user != nil {
switch {
case len(user.Roles) == 1:
if user.HasRole(schema.RoleAPI) {
if user.HasRole(schema.RoleApi) {
ctx := context.WithValue(r.Context(), repository.ContextUserKey, user)
onsuccess.ServeHTTP(rw, r.WithContext(ctx))
return
}
case len(user.Roles) >= 2:
if user.HasAllRoles([]schema.Role{schema.RoleAdmin, schema.RoleAPI}) {
if user.HasAllRoles([]schema.Role{schema.RoleAdmin, schema.RoleApi}) {
ctx := context.WithValue(r.Context(), repository.ContextUserKey, user)
onsuccess.ServeHTTP(rw, r.WithContext(ctx))
return
@@ -484,13 +474,13 @@ func (auth *Authentication) AuthUserAPI(
if user != nil {
switch {
case len(user.Roles) == 1:
if user.HasRole(schema.RoleAPI) {
if user.HasRole(schema.RoleApi) {
ctx := context.WithValue(r.Context(), repository.ContextUserKey, user)
onsuccess.ServeHTTP(rw, r.WithContext(ctx))
return
}
case len(user.Roles) >= 2:
if user.HasRole(schema.RoleAPI) && user.HasAnyRole([]schema.Role{schema.RoleUser, schema.RoleManager, schema.RoleSupport, schema.RoleAdmin}) {
if user.HasRole(schema.RoleApi) && user.HasAnyRole([]schema.Role{schema.RoleUser, schema.RoleManager, schema.RoleSupport, schema.RoleAdmin}) {
ctx := context.WithValue(r.Context(), repository.ContextUserKey, user)
onsuccess.ServeHTTP(rw, r.WithContext(ctx))
return
@@ -520,13 +510,13 @@ func (auth *Authentication) AuthMetricStoreAPI(
if user != nil {
switch {
case len(user.Roles) == 1:
if user.HasRole(schema.RoleAPI) {
if user.HasRole(schema.RoleApi) {
ctx := context.WithValue(r.Context(), repository.ContextUserKey, user)
onsuccess.ServeHTTP(rw, r.WithContext(ctx))
return
}
case len(user.Roles) >= 2:
if user.HasRole(schema.RoleAPI) && user.HasAnyRole([]schema.Role{schema.RoleUser, schema.RoleManager, schema.RoleAdmin}) {
if user.HasRole(schema.RoleApi) && user.HasAnyRole([]schema.Role{schema.RoleUser, schema.RoleManager, schema.RoleAdmin}) {
ctx := context.WithValue(r.Context(), repository.ContextUserKey, user)
onsuccess.ServeHTTP(rw, r.WithContext(ctx))
return
@@ -626,9 +616,9 @@ func securedCheck(user *schema.User, r *http.Request) error {
}
// If SplitHostPort fails, IPAddress is already just a host (no port)
// If nothing declared in config: Continue
// If nothing declared in config: deny all request to this api endpoint
if len(config.Keys.APIAllowedIPs) == 0 {
return nil
return fmt.Errorf("missing configuration key ApiAllowedIPs")
}
// If wildcard declared in config: Continue
if config.Keys.APIAllowedIPs[0] == "*" {


@@ -15,25 +15,25 @@ import (
func TestGetIPUserLimiter(t *testing.T) {
ip := "192.168.1.1"
username := "testuser"
// Get limiter for the first time
limiter1 := getIPUserLimiter(ip, username)
if limiter1 == nil {
t.Fatal("Expected limiter to be created")
}
// Get the same limiter again
limiter2 := getIPUserLimiter(ip, username)
if limiter1 != limiter2 {
t.Error("Expected to get the same limiter instance")
}
// Get a different limiter for different user
limiter3 := getIPUserLimiter(ip, "otheruser")
if limiter1 == limiter3 {
t.Error("Expected different limiter for different user")
}
// Get a different limiter for different IP
limiter4 := getIPUserLimiter("192.168.1.2", username)
if limiter1 == limiter4 {
@@ -45,16 +45,16 @@ func TestGetIPUserLimiter(t *testing.T) {
func TestRateLimiterBehavior(t *testing.T) {
ip := "10.0.0.1"
username := "ratelimituser"
limiter := getIPUserLimiter(ip, username)
// Should allow first 5 attempts
for i := range 5 {
for i := 0; i < 5; i++ {
if !limiter.Allow() {
t.Errorf("Request %d should be allowed within rate limit", i+1)
}
}
// 6th attempt should be blocked
if limiter.Allow() {
t.Error("Request 6 should be blocked by rate limiter")
@@ -65,19 +65,19 @@ func TestRateLimiterBehavior(t *testing.T) {
func TestCleanupOldRateLimiters(t *testing.T) {
// Clear all existing limiters first to avoid interference from other tests
cleanupOldRateLimiters(time.Now().Add(24 * time.Hour))
// Create some new rate limiters
limiter1 := getIPUserLimiter("1.1.1.1", "user1")
limiter2 := getIPUserLimiter("2.2.2.2", "user2")
if limiter1 == nil || limiter2 == nil {
t.Fatal("Failed to create test limiters")
}
// Cleanup limiters older than 1 second from now (should keep both)
time.Sleep(10 * time.Millisecond) // Small delay to ensure timestamp difference
cleanupOldRateLimiters(time.Now().Add(-1 * time.Second))
// Verify they still exist (should get same instance)
if getIPUserLimiter("1.1.1.1", "user1") != limiter1 {
t.Error("Limiter 1 was incorrectly cleaned up")
@@ -85,10 +85,10 @@ func TestCleanupOldRateLimiters(t *testing.T) {
if getIPUserLimiter("2.2.2.2", "user2") != limiter2 {
t.Error("Limiter 2 was incorrectly cleaned up")
}
// Cleanup limiters older than 1 hour from now (should remove both)
cleanupOldRateLimiters(time.Now().Add(2 * time.Hour))
// Getting them again should create new instances
newLimiter1 := getIPUserLimiter("1.1.1.1", "user1")
if newLimiter1 == limiter1 {
@@ -107,14 +107,14 @@ func TestIPv4Extraction(t *testing.T) {
{"IPv4 without port", "192.168.1.1", "192.168.1.1"},
{"Localhost with port", "127.0.0.1:3000", "127.0.0.1"},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := tt.input
if host, _, err := net.SplitHostPort(result); err == nil {
result = host
}
if result != tt.expected {
t.Errorf("Expected %s, got %s", tt.expected, result)
}
@@ -122,7 +122,7 @@ func TestIPv4Extraction(t *testing.T) {
}
}
// TestIPv6Extraction tests extracting IPv6 addresses
func TestIPv6Extraction(t *testing.T) {
tests := []struct {
name string
@@ -134,14 +134,14 @@ func TestIPv6Extraction(t *testing.T) {
{"IPv6 without port", "2001:db8::1", "2001:db8::1"},
{"IPv6 localhost", "::1", "::1"},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := tt.input
if host, _, err := net.SplitHostPort(result); err == nil {
result = host
}
if result != tt.expected {
t.Errorf("Expected %s, got %s", tt.expected, result)
}
@@ -160,14 +160,14 @@ func TestIPExtractionEdgeCases(t *testing.T) {
{"Empty string", "", ""},
{"Just port", ":8080", ""},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := tt.input
if host, _, err := net.SplitHostPort(result); err == nil {
result = host
}
if result != tt.expected {
t.Errorf("Expected %s, got %s", tt.expected, result)
}


@@ -14,8 +14,8 @@ import (
"strings"
"time"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/golang-jwt/jwt/v5"
)
@@ -25,20 +25,20 @@ type JWTAuthConfig struct {
MaxAge string `json:"max-age"`
// Specifies which cookie should be checked for a JWT token (if no authorization header is present)
CookieName string `json:"cookie-name"`
CookieName string `json:"cookieName"`
// Deny login for users not in database (but defined in JWT).
// Ignore user roles defined in JWTs ('roles' claim), get them from db.
ValidateUser bool `json:"validate-user"`
ValidateUser bool `json:"validateUser"`
// Specifies which issuer should be accepted when validating external JWTs ('iss' claim)
TrustedIssuer string `json:"trusted-issuer"`
TrustedIssuer string `json:"trustedIssuer"`
// Should a non-existent user be added to the DB based on the information in the token
SyncUserOnLogin bool `json:"sync-user-on-login"`
SyncUserOnLogin bool `json:"syncUserOnLogin"`
// Should an existing user be updated in the DB based on the information in the token
UpdateUserOnLogin bool `json:"update-user-on-login"`
UpdateUserOnLogin bool `json:"updateUserOnLogin"`
}
type JWTAuthenticator struct {
@@ -101,20 +101,20 @@ func (ja *JWTAuthenticator) AuthViaJWT(
// Token is valid, extract payload
claims := token.Claims.(jwt.MapClaims)
// Use shared helper to get user from JWT claims
var user *schema.User
user, err = getUserFromJWT(claims, Keys.JwtConfig.ValidateUser, schema.AuthToken, -1)
if err != nil {
return nil, err
}
// If not validating user, we only get roles from JWT (no projects for this auth method)
if !Keys.JwtConfig.ValidateUser {
user.Roles = extractRolesFromClaims(claims, false)
user.Projects = nil // Standard JWT auth doesn't include projects
}
return user, nil
}


@@ -12,8 +12,8 @@ import (
"net/http"
"os"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/golang-jwt/jwt/v5"
)
@@ -146,13 +146,13 @@ func (ja *JWTCookieSessionAuthenticator) Login(
}
claims := token.Claims.(jwt.MapClaims)
// Use shared helper to get user from JWT claims
user, err = getUserFromJWT(claims, jc.ValidateUser, schema.AuthSession, schema.AuthViaToken)
if err != nil {
return nil, err
}
// Sync or update user if configured
if !jc.ValidateUser && (jc.SyncUserOnLogin || jc.UpdateUserOnLogin) {
handleTokenUser(user)


@@ -9,11 +9,10 @@ import (
"database/sql"
"errors"
"fmt"
"strings"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/golang-jwt/jwt/v5"
)
@@ -29,7 +28,7 @@ func extractStringFromClaims(claims jwt.MapClaims, key string) string {
// If validateRoles is true, only valid roles are returned
func extractRolesFromClaims(claims jwt.MapClaims, validateRoles bool) []string {
var roles []string
if rawroles, ok := claims["roles"].([]any); ok {
for _, rr := range rawroles {
if r, ok := rr.(string); ok {
@@ -43,14 +42,14 @@ func extractRolesFromClaims(claims jwt.MapClaims, validateRoles bool) []string {
}
}
}
return roles
}
// extractProjectsFromClaims extracts projects from JWT claims
func extractProjectsFromClaims(claims jwt.MapClaims) []string {
projects := make([]string, 0)
if rawprojs, ok := claims["projects"].([]any); ok {
for _, pp := range rawprojs {
if p, ok := pp.(string); ok {
@@ -62,7 +61,7 @@ func extractProjectsFromClaims(claims jwt.MapClaims) []string {
projects = append(projects, projSlice...)
}
}
return projects
}
@@ -73,23 +72,22 @@ func extractNameFromClaims(claims jwt.MapClaims) string {
if name, ok := claims["name"].(string); ok {
return name
}
// Try nested structure: {name: {values: [...]}}
if wrap, ok := claims["name"].(map[string]any); ok {
if vals, ok := wrap["values"].([]any); ok {
if len(vals) == 0 {
return ""
}
var name strings.Builder
name.WriteString(fmt.Sprintf("%v", vals[0]))
name := fmt.Sprintf("%v", vals[0])
for i := 1; i < len(vals); i++ {
name.WriteString(fmt.Sprintf(" %v", vals[i]))
name += fmt.Sprintf(" %v", vals[i])
}
return name.String()
return name
}
}
return ""
}
@@ -102,7 +100,7 @@ func getUserFromJWT(claims jwt.MapClaims, validateUser bool, authType schema.Aut
if sub == "" {
return nil, errors.New("missing 'sub' claim in JWT")
}
if validateUser {
// Validate user against database
ur := repository.GetUserRepository()
@@ -111,22 +109,22 @@ func getUserFromJWT(claims jwt.MapClaims, validateUser bool, authType schema.Aut
cclog.Errorf("Error while loading user '%v': %v", sub, err)
return nil, fmt.Errorf("database error: %w", err)
}
// Deny any logins for unknown usernames
if user == nil || err == sql.ErrNoRows {
cclog.Warn("Could not find user from JWT in internal database.")
return nil, errors.New("unknown user")
}
// Return database user (with database roles)
return user, nil
}
// Create user from JWT claims
name := extractNameFromClaims(claims)
roles := extractRolesFromClaims(claims, true) // Validate roles
projects := extractProjectsFromClaims(claims)
return &schema.User{
Username: sub,
Name: name,


@@ -8,7 +8,7 @@ package auth
import (
"testing"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/golang-jwt/jwt/v5"
)
@@ -19,7 +19,7 @@ func TestExtractStringFromClaims(t *testing.T) {
"email": "test@example.com",
"age": 25, // not a string
}
tests := []struct {
name string
key string
@@ -30,7 +30,7 @@ func TestExtractStringFromClaims(t *testing.T) {
{"Non-existent key", "missing", ""},
{"Non-string value", "age", ""},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := extractStringFromClaims(claims, tt.key)
@@ -88,16 +88,16 @@ func TestExtractRolesFromClaims(t *testing.T) {
expected: []string{},
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := extractRolesFromClaims(tt.claims, tt.validateRoles)
if len(result) != len(tt.expected) {
t.Errorf("Expected %d roles, got %d", len(tt.expected), len(result))
return
}
for i, role := range result {
if i >= len(tt.expected) || role != tt.expected[i] {
t.Errorf("Expected role %s at position %d, got %s", tt.expected[i], i, role)
@@ -141,16 +141,16 @@ func TestExtractProjectsFromClaims(t *testing.T) {
expected: []string{"project1", "project2"}, // Should skip non-strings
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := extractProjectsFromClaims(tt.claims)
if len(result) != len(tt.expected) {
t.Errorf("Expected %d projects, got %d", len(tt.expected), len(result))
return
}
for i, project := range result {
if i >= len(tt.expected) || project != tt.expected[i] {
t.Errorf("Expected project %s at position %d, got %s", tt.expected[i], i, project)
@@ -216,7 +216,7 @@ func TestExtractNameFromClaims(t *testing.T) {
expected: "123 Smith", // Should convert to string
},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result := extractNameFromClaims(tt.claims)
@@ -235,28 +235,29 @@ func TestGetUserFromJWT_NoValidation(t *testing.T) {
"roles": []any{"user", "admin"},
"projects": []any{"project1", "project2"},
}
user, err := getUserFromJWT(claims, false, schema.AuthToken, -1)
if err != nil {
t.Fatalf("Unexpected error: %v", err)
}
if user.Username != "testuser" {
t.Errorf("Expected username 'testuser', got '%s'", user.Username)
}
if user.Name != "Test User" {
t.Errorf("Expected name 'Test User', got '%s'", user.Name)
}
if len(user.Roles) != 2 {
t.Errorf("Expected 2 roles, got %d", len(user.Roles))
}
if len(user.Projects) != 2 {
t.Errorf("Expected 2 projects, got %d", len(user.Projects))
}
if user.AuthType != schema.AuthToken {
t.Errorf("Expected AuthType %v, got %v", schema.AuthToken, user.AuthType)
}
@@ -267,13 +268,13 @@ func TestGetUserFromJWT_MissingSub(t *testing.T) {
claims := jwt.MapClaims{
"name": "Test User",
}
_, err := getUserFromJWT(claims, false, schema.AuthToken, -1)
if err == nil {
t.Error("Expected error for missing sub claim")
}
if err.Error() != "missing 'sub' claim in JWT" {
t.Errorf("Expected specific error message, got: %v", err)
}
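The tests above pin down the tolerant-extraction contract: missing keys and non-string claim values must yield an empty result rather than an error. A self-contained sketch of such a helper (hypothetical, not the actual cc-backend implementation) behaves like this:

```go
package main

import "fmt"

// extractString mimics tolerant claim extraction: only string-typed
// values are returned; missing keys and non-strings yield "".
func extractString(claims map[string]any, key string) string {
	if v, ok := claims[key].(string); ok {
		return v
	}
	return ""
}

func main() {
	claims := map[string]any{"sub": "testuser", "age": 25}
	fmt.Println(extractString(claims, "sub"))           // testuser
	fmt.Println(extractString(claims, "age") == "")     // true: 25 is not a string
	fmt.Println(extractString(claims, "missing") == "") // true: key absent
}
```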

View File

@@ -13,8 +13,8 @@ import (
"os"
"strings"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/golang-jwt/jwt/v5"
)
@@ -75,13 +75,13 @@ func (ja *JWTSessionAuthenticator) Login(
}
claims := token.Claims.(jwt.MapClaims)
// Use shared helper to get user from JWT claims
user, err = getUserFromJWT(claims, Keys.JwtConfig.ValidateUser, schema.AuthSession, schema.AuthViaToken)
if err != nil {
return nil, err
}
// Sync or update user if configured
if !Keys.JwtConfig.ValidateUser && (Keys.JwtConfig.SyncUserOnLogin || Keys.JwtConfig.UpdateUserOnLogin) {
handleTokenUser(user)

View File

@@ -6,39 +6,35 @@
package auth
import (
"errors"
"fmt"
"net"
"net/http"
"os"
"strings"
"time"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/go-ldap/ldap/v3"
)
type LdapConfig struct {
URL string `json:"url"`
UserBase string `json:"user-base"`
SearchDN string `json:"search-dn"`
UserBind string `json:"user-bind"`
UserFilter string `json:"user-filter"`
UserAttr string `json:"username-attr"`
UIDAttr string `json:"uid-attr"`
SyncInterval string `json:"sync-interval"` // Parsed using time.ParseDuration.
SyncDelOldUsers bool `json:"sync-del-old-users"`
UserBase string `json:"user_base"`
SearchDN string `json:"search_dn"`
UserBind string `json:"user_bind"`
UserFilter string `json:"user_filter"`
UserAttr string `json:"username_attr"`
SyncInterval string `json:"sync_interval"` // Parsed using time.ParseDuration.
SyncDelOldUsers bool `json:"sync_del_old_users"`
// Should a non-existent user be added to the DB if the user exists in the LDAP directory
SyncUserOnLogin bool `json:"sync-user-on-login"`
UpdateUserOnLogin bool `json:"update-user-on-login"`
// Should an non-existent user be added to the DB if user exists in ldap directory
SyncUserOnLogin bool `json:"syncUserOnLogin"`
}
type LdapAuthenticator struct {
syncPassword string
UserAttr string
UIDAttr string
}
var _ Authenticator = (*LdapAuthenticator)(nil)
@@ -55,12 +51,6 @@ func (la *LdapAuthenticator) Init() error {
la.UserAttr = "gecos"
}
if Keys.LdapConfig.UIDAttr != "" {
la.UIDAttr = Keys.LdapConfig.UIDAttr
} else {
la.UIDAttr = "uid"
}
return nil
}
@@ -76,44 +66,55 @@ func (la *LdapAuthenticator) CanLogin(
if user.AuthSource == schema.AuthViaLDAP {
return user, true
}
} else if lc.SyncUserOnLogin {
l, err := la.getLdapConnection(true)
if err != nil {
cclog.Error("LDAP connection error")
return nil, false
} else {
if lc.SyncUserOnLogin {
l, err := la.getLdapConnection(true)
if err != nil {
cclog.Error("LDAP connection error")
return nil, false
}
defer l.Close()
// Search for the given username
searchRequest := ldap.NewSearchRequest(
lc.UserBase,
ldap.ScopeWholeSubtree, ldap.NeverDerefAliases, 0, 0, false,
fmt.Sprintf("(&%s(uid=%s))", lc.UserFilter, username),
[]string{"dn", "uid", la.UserAttr}, nil)
sr, err := l.Search(searchRequest)
if err != nil {
cclog.Warn(err)
return nil, false
}
if len(sr.Entries) != 1 {
cclog.Warn("LDAP: User does not exist or too many entries returned")
return nil, false
}
entry := sr.Entries[0]
name := entry.GetAttributeValue(la.UserAttr)
var roles []string
roles = append(roles, schema.GetRoleString(schema.RoleUser))
projects := make([]string, 0)
user = &schema.User{
Username: username,
Name: name,
Roles: roles,
Projects: projects,
AuthType: schema.AuthSession,
AuthSource: schema.AuthViaLDAP,
}
if err := repository.GetUserRepository().AddUser(user); err != nil {
cclog.Errorf("User '%s' LDAP: Insert into DB failed", username)
return nil, false
}
return user, true
}
defer l.Close()
// Search for the given username
searchRequest := ldap.NewSearchRequest(
lc.UserBase,
ldap.ScopeWholeSubtree, ldap.NeverDerefAliases, 0, 0, false,
fmt.Sprintf("(&%s(%s=%s))", lc.UserFilter, la.UIDAttr, ldap.EscapeFilter(username)),
[]string{"dn", la.UIDAttr, la.UserAttr}, nil)
sr, err := l.Search(searchRequest)
if err != nil {
cclog.Warn(err)
return nil, false
}
if len(sr.Entries) != 1 {
cclog.Warn("LDAP: User does not exist or too many entries returned")
return nil, false
}
entry := sr.Entries[0]
user = &schema.User{
Username: username,
Name: entry.GetAttributeValue(la.UserAttr),
Roles: []string{schema.GetRoleString(schema.RoleUser)},
Projects: make([]string, 0),
AuthType: schema.AuthSession,
AuthSource: schema.AuthViaLDAP,
}
handleLdapUser(user)
return user, true
}
return nil, false
@@ -131,7 +132,7 @@ func (la *LdapAuthenticator) Login(
}
defer l.Close()
userDn := strings.ReplaceAll(Keys.LdapConfig.UserBind, "{username}", ldap.EscapeDN(user.Username))
userDn := strings.ReplaceAll(Keys.LdapConfig.UserBind, "{username}", user.Username)
if err := l.Bind(userDn, r.FormValue("password")); err != nil {
cclog.Errorf("AUTH/LDAP > Authentication for user %s failed: %v",
user.Username, err)
@@ -169,7 +170,7 @@ func (la *LdapAuthenticator) Sync() error {
lc.UserBase,
ldap.ScopeWholeSubtree, ldap.NeverDerefAliases, 0, 0, false,
lc.UserFilter,
[]string{"dn", la.UIDAttr, la.UserAttr}, nil))
[]string{"dn", "uid", la.UserAttr}, nil))
if err != nil {
cclog.Warn("LDAP search error")
return err
@@ -177,9 +178,9 @@ func (la *LdapAuthenticator) Sync() error {
newnames := map[string]string{}
for _, entry := range ldapResults.Entries {
username := entry.GetAttributeValue(la.UIDAttr)
username := entry.GetAttributeValue("uid")
if username == "" {
return fmt.Errorf("no attribute '%s'", la.UIDAttr)
return errors.New("no attribute 'uid'")
}
_, ok := users[username]
@@ -193,19 +194,20 @@ func (la *LdapAuthenticator) Sync() error {
for username, where := range users {
if where == InDB && lc.SyncDelOldUsers {
if err := ur.DelUser(username); err != nil {
cclog.Errorf("User '%s' LDAP: Delete from DB failed: %v", username, err)
return err
}
ur.DelUser(username)
cclog.Debugf("sync: remove %v (does not show up in LDAP anymore)", username)
} else if where == InLdap {
name := newnames[username]
var roles []string
roles = append(roles, schema.GetRoleString(schema.RoleUser))
projects := make([]string, 0)
user := &schema.User{
Username: username,
Name: name,
Roles: []string{schema.GetRoleString(schema.RoleUser)},
Projects: make([]string, 0),
Roles: roles,
Projects: projects,
AuthSource: schema.AuthViaLDAP,
}
@@ -222,13 +224,11 @@ func (la *LdapAuthenticator) Sync() error {
func (la *LdapAuthenticator) getLdapConnection(admin bool) (*ldap.Conn, error) {
lc := Keys.LdapConfig
conn, err := ldap.DialURL(lc.URL,
ldap.DialWithDialer(&net.Dialer{Timeout: 10 * time.Second}))
conn, err := ldap.DialURL(lc.URL)
if err != nil {
cclog.Warn("LDAP URL dial failed")
return nil, err
}
conn.SetTimeout(30 * time.Second)
if admin {
if err := conn.Bind(lc.SearchDN, la.syncPassword); err != nil {
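One side of the search-request diff above interpolates the username into the LDAP filter through `ldap.EscapeFilter` while the other uses the raw value; RFC 4515 escaping is what prevents a crafted username from rewriting the filter. A standalone sketch of that escaping — a hypothetical helper mirroring what go-ldap's `EscapeFilter` does, not the library itself:

```go
package main

import (
	"fmt"
	"strings"
)

// escapeFilter hex-escapes the RFC 4515 special characters so a
// user-supplied value cannot alter the structure of an LDAP filter.
func escapeFilter(s string) string {
	var b strings.Builder
	for _, c := range []byte(s) {
		switch c {
		case '\\', '*', '(', ')', 0:
			fmt.Fprintf(&b, `\%02x`, c)
		default:
			b.WriteByte(c)
		}
	}
	return b.String()
}

func main() {
	// Unescaped, this input would turn "(uid=...)" into a wildcard match.
	fmt.Println(escapeFilter("admin)(uid=*")) // admin\29\28uid=\2a
}
```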

View File

@@ -9,8 +9,8 @@ import (
"fmt"
"net/http"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"golang.org/x/crypto/bcrypt"
)

View File

@@ -9,24 +9,23 @@ import (
"context"
"crypto/rand"
"encoding/base64"
"fmt"
"io"
"net/http"
"os"
"time"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/coreos/go-oidc/v3/oidc"
"github.com/go-chi/chi/v5"
"github.com/gorilla/mux"
"golang.org/x/oauth2"
)
type OpenIDConfig struct {
Provider string `json:"provider"`
SyncUserOnLogin bool `json:"sync-user-on-login"`
UpdateUserOnLogin bool `json:"update-user-on-login"`
SyncUserOnLogin bool `json:"syncUserOnLogin"`
UpdateUserOnLogin bool `json:"updateUserOnLogin"`
}
type OIDC struct {
@@ -51,7 +50,6 @@ func setCallbackCookie(w http.ResponseWriter, r *http.Request, name, value strin
MaxAge: int(time.Hour.Seconds()),
Secure: r.TLS != nil,
HttpOnly: true,
SameSite: http.SameSiteLaxMode,
}
http.SetCookie(w, c)
}
@@ -61,7 +59,7 @@ func NewOIDC(a *Authentication) *OIDC {
// Use context with timeout for provider initialization
ctx, cancel := context.WithTimeout(context.Background(), 30*time.Second)
defer cancel()
provider, err := oidc.NewProvider(ctx, Keys.OpenIDConfig.Provider)
if err != nil {
cclog.Fatal(err)
@@ -79,7 +77,8 @@ func NewOIDC(a *Authentication) *OIDC {
ClientID: clientID,
ClientSecret: clientSecret,
Endpoint: provider.Endpoint(),
Scopes: []string{oidc.ScopeOpenID, "profile"},
RedirectURL: "oidc-callback",
Scopes: []string{oidc.ScopeOpenID, "profile", "email"},
}
oa := &OIDC{provider: provider, client: client, clientID: clientID, authentication: a}
@@ -87,7 +86,7 @@ func NewOIDC(a *Authentication) *OIDC {
return oa
}
func (oa *OIDC) RegisterEndpoints(r chi.Router) {
func (oa *OIDC) RegisterEndpoints(r *mux.Router) {
r.HandleFunc("/oidc-login", oa.OAuth2Login)
r.HandleFunc("/oidc-callback", oa.OAuth2Callback)
}
@@ -120,96 +119,57 @@ func (oa *OIDC) OAuth2Callback(rw http.ResponseWriter, r *http.Request) {
// Exchange authorization code for token with timeout
ctx, cancel := context.WithTimeout(context.Background(), 30*time.Second)
defer cancel()
token, err := oa.client.Exchange(ctx, code, oauth2.VerifierOption(codeVerifier))
if err != nil {
cclog.Errorf("token exchange failed: %s", err.Error())
http.Error(rw, "Authentication failed during token exchange", http.StatusInternalServerError)
http.Error(rw, "Failed to exchange token: "+err.Error(), http.StatusInternalServerError)
return
}
// Get user info from OIDC provider with same timeout
userInfo, err := oa.provider.UserInfo(ctx, oauth2.StaticTokenSource(token))
if err != nil {
cclog.Errorf("failed to get userinfo: %s", err.Error())
http.Error(rw, "Failed to retrieve user information", http.StatusInternalServerError)
http.Error(rw, "Failed to get userinfo: "+err.Error(), http.StatusInternalServerError)
return
}
// Verify ID token and nonce to prevent replay attacks
rawIDToken, ok := token.Extra("id_token").(string)
if !ok {
http.Error(rw, "ID token not found in response", http.StatusInternalServerError)
return
}
nonceCookie, err := r.Cookie("nonce")
if err != nil {
http.Error(rw, "nonce cookie not found", http.StatusBadRequest)
return
}
verifier := oa.provider.Verifier(&oidc.Config{ClientID: oa.clientID})
idToken, err := verifier.Verify(ctx, rawIDToken)
if err != nil {
cclog.Errorf("ID token verification failed: %s", err.Error())
http.Error(rw, "ID token verification failed", http.StatusInternalServerError)
return
}
if idToken.Nonce != nonceCookie.Value {
http.Error(rw, "Nonce mismatch", http.StatusBadRequest)
return
}
// // Extract the ID Token from OAuth2 token.
// rawIDToken, ok := token.Extra("id_token").(string)
// if !ok {
// http.Error(rw, "Cannot access idToken", http.StatusInternalServerError)
// }
//
// verifier := oa.provider.Verifier(&oidc.Config{ClientID: oa.clientID})
// // Parse and verify ID Token payload.
// idToken, err := verifier.Verify(context.Background(), rawIDToken)
// if err != nil {
// http.Error(rw, "Failed to extract idToken: "+err.Error(), http.StatusInternalServerError)
// }
projects := make([]string, 0)
// Extract custom claims from userinfo
// Extract custom claims
var claims struct {
Username string `json:"preferred_username"`
Name string `json:"name"`
// Keycloak realm-level roles
RealmAccess struct {
Roles []string `json:"roles"`
} `json:"realm_access"`
// Keycloak client-level roles
ResourceAccess struct {
Profile struct {
Client struct {
Roles []string `json:"roles"`
} `json:"clustercockpit"`
} `json:"resource_access"`
}
if err := userInfo.Claims(&claims); err != nil {
cclog.Errorf("failed to extract claims: %s", err.Error())
http.Error(rw, "Failed to extract user claims", http.StatusInternalServerError)
return
}
if claims.Username == "" {
http.Error(rw, "Username claim missing from OIDC provider", http.StatusBadRequest)
return
}
// Merge roles from both client-level and realm-level access
oidcRoles := append(claims.ResourceAccess.Client.Roles, claims.RealmAccess.Roles...)
roleSet := make(map[string]bool)
for _, r := range oidcRoles {
switch r {
case "user":
roleSet[schema.GetRoleString(schema.RoleUser)] = true
case "admin":
roleSet[schema.GetRoleString(schema.RoleAdmin)] = true
case "manager":
roleSet[schema.GetRoleString(schema.RoleManager)] = true
case "support":
roleSet[schema.GetRoleString(schema.RoleSupport)] = true
}
http.Error(rw, "Failed to extract Claims: "+err.Error(), http.StatusInternalServerError)
}
var roles []string
for role := range roleSet {
roles = append(roles, role)
for _, r := range claims.Profile.Client.Roles {
switch r {
case "user":
roles = append(roles, schema.GetRoleString(schema.RoleUser))
case "admin":
roles = append(roles, schema.GetRoleString(schema.RoleAdmin))
}
}
if len(roles) == 0 {
@@ -228,12 +188,8 @@ func (oa *OIDC) OAuth2Callback(rw http.ResponseWriter, r *http.Request) {
handleOIDCUser(user)
}
if err := oa.authentication.SaveSession(rw, r, user); err != nil {
cclog.Errorf("session save failed for user %q: %s", user.Username, err.Error())
http.Error(rw, "Failed to create session", http.StatusInternalServerError)
return
}
cclog.Infof("login successful: user: %#v (roles: %v, projects: %v)", user.Username, user.Roles, user.Projects)
oa.authentication.SaveSession(rw, r, user)
cclog.Infof("login successfull: user: %#v (roles: %v, projects: %v)", user.Username, user.Roles, user.Projects)
userCtx := context.WithValue(r.Context(), repository.ContextUserKey, user)
http.RedirectHandler("/", http.StatusTemporaryRedirect).ServeHTTP(rw, r.WithContext(userCtx))
}
@@ -250,24 +206,7 @@ func (oa *OIDC) OAuth2Login(rw http.ResponseWriter, r *http.Request) {
codeVerifier := oauth2.GenerateVerifier()
setCallbackCookie(rw, r, "verifier", codeVerifier)
// Generate nonce for ID token replay protection
nonce, err := randString(16)
if err != nil {
http.Error(rw, "Internal error", http.StatusInternalServerError)
return
}
setCallbackCookie(rw, r, "nonce", nonce)
// Build redirect URL from the incoming request
scheme := "https"
if r.TLS == nil && r.Header.Get("X-Forwarded-Proto") != "https" {
scheme = "http"
}
oa.client.RedirectURL = fmt.Sprintf("%s://%s/oidc-callback", scheme, r.Host)
// Redirect user to consent page to ask for permission
url := oa.client.AuthCodeURL(state, oauth2.AccessTypeOffline,
oauth2.S256ChallengeOption(codeVerifier),
oidc.Nonce(nonce))
url := oa.client.AuthCodeURL(state, oauth2.AccessTypeOffline, oauth2.S256ChallengeOption(codeVerifier))
http.Redirect(rw, r, url, http.StatusFound)
}

View File

@@ -15,44 +15,37 @@ var configSchema = `
"description": "Configure how long a token is valid. As string parsable by time.ParseDuration()",
"type": "string"
},
"cookie-name": {
"cookieName": {
"description": "Cookie that should be checked for a JWT token.",
"type": "string"
},
"validate-user": {
"validateUser": {
"description": "Deny login for users not in database (but defined in JWT). Overwrite roles in JWT with database roles.",
"type": "boolean"
},
"trusted-issuer": {
"trustedIssuer": {
"description": "Issuer that should be accepted when validating external JWTs ",
"type": "string"
},
"sync-user-on-login": {
"syncUserOnLogin": {
"description": "Add non-existent user to DB at login attempt with values provided in JWT.",
"type": "boolean"
},
"update-user-on-login": {
"description": "Should an existent user attributes in the DB be updated at login attempt with values provided in JWT.",
"type": "boolean"
}
},
"required": ["max-age"]
},
"oidc": {
"type": "object",
"properties": {
"provider": {
"description": "OpenID Connect provider URL.",
"type": "string"
},
"sync-user-on-login": {
"description": "Add non-existent user to DB at login attempt with values provided.",
"type": "boolean"
},
"update-user-on-login": {
"description": "Should an existent user attributes in the DB be updated at login attempt with values provided.",
"type": "boolean"
}
"provider": {
"description": "",
"type": "string"
},
"syncUserOnLogin": {
"description": "",
"type": "boolean"
},
"updateUserOnLogin": {
"description": "",
"type": "boolean"
},
"required": ["provider"]
},
@@ -64,48 +57,40 @@ var configSchema = `
"description": "URL of LDAP directory server.",
"type": "string"
},
"user-base": {
"user_base": {
"description": "Base DN of user tree root.",
"type": "string"
},
"search-dn": {
"search_dn": {
"description": "DN for authenticating LDAP admin account with general read rights.",
"type": "string"
},
"user-bind": {
"user_bind": {
"description": "Expression used to authenticate users via LDAP bind. Must contain uid={username}.",
"type": "string"
},
"user-filter": {
"user_filter": {
"description": "Filter to extract users for syncing.",
"type": "string"
},
"username-attr": {
"username_attr": {
"description": "Attribute with full username. Default: gecos",
"type": "string"
},
"sync-interval": {
"sync_interval": {
"description": "Interval used for syncing local user table with LDAP directory. Parsed using time.ParseDuration.",
"type": "string"
},
"sync-del-old-users": {
"sync_del_old_users": {
"description": "Delete obsolete users in database.",
"type": "boolean"
},
"uid-attr": {
"description": "LDAP attribute used as login username. Default: uid",
"type": "string"
},
"sync-user-on-login": {
"syncUserOnLogin": {
"description": "Add non-existent user to DB at login attempt if user exists in Ldap directory",
"type": "boolean"
},
"update-user-on-login": {
"description": "Should an existent user attributes in the DB be updated at login attempt with values from LDAP.",
"type": "boolean"
}
},
"required": ["url", "user-base", "search-dn", "user-bind", "user-filter"]
"required": ["url", "user_base", "search_dn", "user_bind", "user_filter"]
},
"required": ["jwts"]
}`

View File

@@ -11,8 +11,8 @@ import (
"encoding/json"
"time"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/resampler"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/resampler"
)
type ProgramConfig struct {
@@ -20,9 +20,9 @@ type ProgramConfig struct {
Addr string `json:"addr"`
// Addresses from which secured admin API endpoints can be reached, can be wildcard "*"
APIAllowedIPs []string `json:"api-allowed-ips"`
APIAllowedIPs []string `json:"apiAllowedIPs"`
APISubjects *NATSConfig `json:"api-subjects"`
APISubjects *NATSConfig `json:"apiSubjects"`
// Drop root permissions once .env was read and the port was taken.
User string `json:"user"`
@@ -37,9 +37,16 @@ type ProgramConfig struct {
EmbedStaticFiles bool `json:"embed-static-files"`
StaticFiles string `json:"static-files"`
// Database driver - only 'sqlite3' is supported
DBDriver string `json:"db-driver"`
// Path to SQLite database file
DB string `json:"db"`
// Keep all metric data in the metric data repositories,
// do not write to the job-archive.
DisableArchive bool `json:"disable-archive"`
EnableJobTaggers bool `json:"enable-job-taggers"`
// Validate json input against schema
@@ -72,30 +79,13 @@ type ProgramConfig struct {
// If exists, will enable dynamic zoom in frontend metric plots using the configured values
EnableResampling *ResampleConfig `json:"resampling"`
// Systemd unit name for log viewer (default: "clustercockpit")
SystemdUnit string `json:"systemd-unit"`
// Node state retention configuration
NodeStateRetention *NodeStateRetention `json:"nodestate-retention"`
}
type NodeStateRetention struct {
Policy string `json:"policy"` // "delete" or "move"
Age int `json:"age"` // hours, default 24
TargetKind string `json:"target-kind"` // "file" or "s3"
TargetPath string `json:"target-path"`
TargetEndpoint string `json:"target-endpoint"`
TargetBucket string `json:"target-bucket"`
TargetAccessKey string `json:"target-access-key"`
TargetSecretKey string `json:"target-secret-key"`
TargetRegion string `json:"target-region"`
TargetUsePathStyle bool `json:"target-use-path-style"`
MaxFileSizeMB int `json:"max-file-size-mb"`
// Global upstream metric repository configuration for metric pull workers
UpstreamMetricRepository *json.RawMessage `json:"upstreamMetricRepository,omitempty"`
}
type ResampleConfig struct {
// Minimum number of points to trigger resampling of data
MinimumPoints int `json:"minimum-points"`
MinimumPoints int `json:"minimumPoints"`
// Array of resampling target resolutions, in seconds; Example: [600,300,60]
Resolutions []int `json:"resolutions"`
// Trigger next zoom level at less than this many visible datapoints
@@ -103,8 +93,9 @@ type ResampleConfig struct {
}
type NATSConfig struct {
SubjectJobEvent string `json:"subject-job-event"`
SubjectNodeState string `json:"subject-node-state"`
SubjectJobStart string `json:"subjectJobStart"`
SubjectJobStop string `json:"subjectJobStop"`
SubjectNodeState string `json:"subjectNodeState"`
}
type IntRange struct {
@@ -120,20 +111,31 @@ type TimeRange struct {
type FilterRanges struct {
Duration *IntRange `json:"duration"`
NumNodes *IntRange `json:"num-nodes"`
StartTime *TimeRange `json:"start-time"`
NumNodes *IntRange `json:"numNodes"`
StartTime *TimeRange `json:"startTime"`
}
type ClusterConfig struct {
Name string `json:"name"`
FilterRanges *FilterRanges `json:"filterRanges"`
}
var Clusters []*ClusterConfig
var Keys ProgramConfig = ProgramConfig{
Addr: "localhost:8080",
DisableAuthentication: false,
EmbedStaticFiles: true,
DBDriver: "sqlite3",
DB: "./var/job.db",
DisableArchive: false,
Validate: false,
SessionMaxAge: "168h",
StopJobsExceedingWalltime: 0,
ShortRunningJobsDuration: 5 * 60,
}
func Init(mainConfig json.RawMessage) {
func Init(mainConfig json.RawMessage, clusterConfig json.RawMessage) {
Validate(configSchema, mainConfig)
dec := json.NewDecoder(bytes.NewReader(mainConfig))
dec.DisallowUnknownFields()
@@ -141,6 +143,17 @@ func Init(mainConfig json.RawMessage) {
cclog.Abortf("Config Init: Could not decode config file '%s'.\nError: %s\n", mainConfig, err.Error())
}
Validate(clustersSchema, clusterConfig)
dec = json.NewDecoder(bytes.NewReader(clusterConfig))
dec.DisallowUnknownFields()
if err := dec.Decode(&Clusters); err != nil {
cclog.Abortf("Config Init: Could not decode config file '%s'.\nError: %s\n", clusterConfig, err.Error())
}
if len(Clusters) < 1 {
cclog.Abort("Config Init: At least one cluster required in config. Exited with error.")
}
if Keys.EnableResampling != nil && Keys.EnableResampling.MinimumPoints > 0 {
resampler.SetMinimumRequiredPoints(Keys.EnableResampling.MinimumPoints)
}
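The decoders above call `DisallowUnknownFields` before decoding, which is what turns a typo'd config key into a hard startup abort instead of a silently ignored setting. A minimal self-contained illustration of that behavior (the `cfg` struct here is a stand-in, not the real `ProgramConfig`):

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
)

type cfg struct {
	Addr string `json:"addr"`
}

// decodeStrict fails on any key not present in the target struct,
// instead of dropping it silently as plain json.Unmarshal would.
func decodeStrict(raw []byte, v any) error {
	dec := json.NewDecoder(bytes.NewReader(raw))
	dec.DisallowUnknownFields()
	return dec.Decode(v)
}

func main() {
	var c cfg
	fmt.Println(decodeStrict([]byte(`{"addr":"localhost:8080"}`), &c)) // <nil>
	fmt.Println(decodeStrict([]byte(`{"adr":"oops"}`), &c) != nil)     // true: unknown key rejected
}
```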

View File

@@ -8,15 +8,19 @@ package config
import (
"testing"
ccconf "github.com/ClusterCockpit/cc-lib/v2/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
ccconf "github.com/ClusterCockpit/cc-lib/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
)
func TestInit(t *testing.T) {
fp := "../../configs/config.json"
ccconf.Init(fp)
if cfg := ccconf.GetPackageConfig("main"); cfg != nil {
Init(cfg)
if clustercfg := ccconf.GetPackageConfig("clusters"); clustercfg != nil {
Init(cfg, clustercfg)
} else {
cclog.Abort("Cluster configuration must be present")
}
} else {
cclog.Abort("Main configuration must be present")
}
@@ -30,7 +34,11 @@ func TestInitMinimal(t *testing.T) {
fp := "../../configs/config-demo.json"
ccconf.Init(fp)
if cfg := ccconf.GetPackageConfig("main"); cfg != nil {
Init(cfg)
if clustercfg := ccconf.GetPackageConfig("clusters"); clustercfg != nil {
Init(cfg, clustercfg)
} else {
cclog.Abort("Cluster configuration must be present")
}
} else {
cclog.Abort("Main configuration must be present")
}

View File

@@ -15,7 +15,7 @@ import (
type DefaultMetricsCluster struct {
Name string `json:"name"`
DefaultMetrics string `json:"default-metrics"`
DefaultMetrics string `json:"default_metrics"`
}
type DefaultMetricsConfig struct {

View File

@@ -6,14 +6,14 @@
package config
var configSchema = `
{
{
"type": "object",
"properties": {
"addr": {
"description": "Address where the http (or https) server will listen on (for example: 'localhost:80').",
"type": "string"
},
"api-allowed-ips": {
"apiAllowedIPs": {
"description": "Addresses from which secured API endpoints can be reached",
"type": "array",
"items": {
@@ -44,6 +44,10 @@ var configSchema = `
"description": "Path to SQLite database file (e.g., './var/job.db')",
"type": "string"
},
"disable-archive": {
"description": "Keep all metric data in the metric data repositories, do not write to the job-archive.",
"type": "boolean"
},
"enable-job-taggers": {
"description": "Turn on automatic application and jobclass taggers",
"type": "boolean"
@@ -77,22 +81,28 @@ var configSchema = `
"type": "integer"
},
"emission-constant": {
"description": "Energy mix CO2 emission constant [g/kWh]. If set, displays estimated CO2 emission for jobs.",
"description": ".",
"type": "integer"
},
"machine-state-dir": {
"description": "Where to store MachineState files.",
"type": "string"
"cron-frequency": {
"description": "Frequency of cron job workers.",
"type": "object",
"properties": {
"duration-worker": {
"description": "Duration Update Worker [Defaults to '5m']",
"type": "string"
},
"footprint-worker": {
"description": "Metric-Footprint Update Worker [Defaults to '10m']",
"type": "string"
}
}
},
"systemd-unit": {
"description": "Systemd unit name for log viewer (default: 'clustercockpit').",
"type": "string"
},
"resampling": {
"enable-resampling": {
"description": "Enable dynamic zoom in frontend metric plots.",
"type": "object",
"properties": {
"minimum-points": {
"minimumPoints": {
"description": "Minimum points to trigger resampling of time-series data.",
"type": "integer"
},
@@ -110,73 +120,103 @@ var configSchema = `
},
"required": ["trigger", "resolutions"]
},
"api-subjects": {
"description": "NATS subjects configuration for subscribing to job and node events.",
"upstreamMetricRepository": {
"description": "Global upstream metric repository configuration for metric pull workers",
"type": "object",
"properties": {
"subject-job-event": {
"description": "NATS subject for job events (start_job, stop_job)",
"kind": {
"type": "string",
"enum": ["influxdb", "prometheus", "cc-metric-store", "cc-metric-store-internal", "test"]
},
"url": {
"type": "string"
},
"subject-node-state": {
"description": "NATS subject for node state updates",
"token": {
"type": "string"
}
},
"required": ["subject-job-event", "subject-node-state"]
},
"nodestate-retention": {
"description": "Node state retention configuration for cleaning up old node_state rows.",
"type": "object",
"properties": {
"policy": {
"description": "Retention policy: 'delete' to remove old rows, 'move' to archive to Parquet then delete.",
"type": "string",
"enum": ["delete", "move"]
},
"age": {
"description": "Retention age in hours (default: 24).",
"type": "integer"
},
"target-kind": {
"description": "Target kind for parquet archiving: 'file' or 's3'.",
"type": "string",
"enum": ["file", "s3"]
},
"target-path": {
"description": "Filesystem path for parquet file target.",
"type": "string"
},
"target-endpoint": {
"description": "S3 endpoint URL.",
"type": "string"
},
"target-bucket": {
"description": "S3 bucket name.",
"type": "string"
},
"target-access-key": {
"description": "S3 access key.",
"type": "string"
},
"target-secret-key": {
"description": "S3 secret key.",
"type": "string"
},
"target-region": {
"description": "S3 region.",
"type": "string"
},
"target-use-path-style": {
"description": "Use path-style S3 addressing.",
"type": "boolean"
},
"max-file-size-mb": {
"description": "Maximum parquet file size in MB (default: 128).",
"type": "integer"
}
},
"required": ["policy"]
"required": ["kind"]
}
}
}`
},
"required": ["apiAllowedIPs"]
}`
var clustersSchema = `
{
"type": "array",
"items": {
"type": "object",
"properties": {
"name": {
"description": "The name of the cluster.",
"type": "string"
},
"metricDataRepository": {
"description": "Type of the metric data repository for this cluster",
"type": "object",
"properties": {
"kind": {
"type": "string",
"enum": ["influxdb", "prometheus", "cc-metric-store", "cc-metric-store-internal", "test"]
},
"url": {
"type": "string"
},
"token": {
"type": "string"
}
},
"required": ["kind"]
},
"filterRanges": {
"description": "This option controls the slider ranges for the UI controls of numNodes, duration, and startTime.",
"type": "object",
"properties": {
"numNodes": {
"description": "UI slider range for number of nodes",
"type": "object",
"properties": {
"from": {
"type": "integer"
},
"to": {
"type": "integer"
}
},
"required": ["from", "to"]
},
"duration": {
"description": "UI slider range for duration",
"type": "object",
"properties": {
"from": {
"type": "integer"
},
"to": {
"type": "integer"
}
},
"required": ["from", "to"]
},
"startTime": {
"description": "UI slider range for start time",
"type": "object",
"properties": {
"from": {
"type": "string",
"format": "date-time"
},
"to": {
"type": "null"
}
},
"required": ["from", "to"]
}
},
"required": ["numNodes", "duration", "startTime"]
}
},
"required": ["name", "filterRanges"],
"minItems": 1
}
}`
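A clusters entry that validates against the `clustersSchema` above might look like the following (hypothetical values; note that `startTime.to` must be JSON `null`, since the schema declares its type as `"null"` while still listing it under `required`):

```json
[
  {
    "name": "testcluster",
    "metricDataRepository": {
      "kind": "cc-metric-store",
      "url": "http://localhost:8082",
      "token": "secret-token"
    },
    "filterRanges": {
      "numNodes": { "from": 1, "to": 64 },
      "duration": { "from": 0, "to": 86400 },
      "startTime": { "from": "2023-01-01T00:00:00Z", "to": null }
    }
  }
]
```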

View File

@@ -8,7 +8,7 @@ package config
import (
"encoding/json"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/santhosh-tekuri/jsonschema/v5"
)

File diff suppressed because it is too large

View File

@@ -10,7 +10,7 @@ import (
"time"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/schema"
)
type ClusterMetricWithName struct {
@@ -71,7 +71,6 @@ type JobFilter struct {
Project *StringInput `json:"project,omitempty"`
JobName *StringInput `json:"jobName,omitempty"`
Cluster *StringInput `json:"cluster,omitempty"`
SubCluster *StringInput `json:"subCluster,omitempty"`
Partition *StringInput `json:"partition,omitempty"`
Duration *config.IntRange `json:"duration,omitempty"`
Energy *FloatRange `json:"energy,omitempty"`
@@ -83,7 +82,6 @@ type JobFilter struct {
State []schema.JobState `json:"state,omitempty"`
MetricStats []*MetricStatItem `json:"metricStats,omitempty"`
Shared *string `json:"shared,omitempty"`
Schedule *string `json:"schedule,omitempty"`
Node *StringInput `json:"node,omitempty"`
}
@@ -187,7 +185,7 @@ type NamedStatsWithScope struct {
type NodeFilter struct {
Hostname *StringInput `json:"hostname,omitempty"`
Cluster *StringInput `json:"cluster,omitempty"`
SubCluster *StringInput `json:"subCluster,omitempty"`
Subcluster *StringInput `json:"subcluster,omitempty"`
SchedulerState *schema.SchedulerState `json:"schedulerState,omitempty"`
HealthState *string `json:"healthState,omitempty"`
TimeStart *int `json:"timeStart,omitempty"`

View File

@@ -4,7 +4,7 @@ import (
"sync"
"github.com/ClusterCockpit/cc-backend/internal/repository"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/jmoiron/sqlx"
)

View File

@@ -3,7 +3,7 @@ package graph
// This file will be automatically regenerated based on the schema, any resolver
// implementations
// will be copied through when generating and any unknown code will be moved to the end.
// Code generated by github.com/99designs/gqlgen version v0.17.85
// Code generated by github.com/99designs/gqlgen version v0.17.84
import (
"context"
@@ -19,12 +19,11 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/graph/generated"
"github.com/ClusterCockpit/cc-backend/internal/graph/model"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatcher"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
ccunit "github.com/ClusterCockpit/cc-lib/v2/ccUnits"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
// Partitions is the resolver for the partitions field.
@@ -284,7 +283,7 @@ func (r *mutationResolver) RemoveTagFromList(ctx context.Context, tagIds []strin
// Test Access: Admins && Admin Tag OR Everyone && Private Tag
if user.HasRole(schema.RoleAdmin) && (tscope == "global" || tscope == "admin") || user.Username == tscope {
// Remove from DB
if err = r.Repo.RemoveTagByID(tid); err != nil {
if err = r.Repo.RemoveTagById(tid); err != nil {
cclog.Warn("Error while removing tag")
return nil, err
} else {
@@ -318,39 +317,18 @@ func (r *nodeResolver) SchedulerState(ctx context.Context, obj *schema.Node) (sc
if obj.NodeState != "" {
return obj.NodeState, nil
} else {
return "", fmt.Errorf("resolver: no SchedulerState (NodeState) on node object")
return "", fmt.Errorf("no SchedulerState (NodeState) on Object")
}
}
// HealthState is the resolver for the healthState field.
func (r *nodeResolver) HealthState(ctx context.Context, obj *schema.Node) (string, error) {
if obj.HealthState != "" {
return string(obj.HealthState), nil
} else {
return "", fmt.Errorf("resolver: no HealthState (NodeState) on node object")
}
panic(fmt.Errorf("not implemented: HealthState - healthState"))
}
// MetaData is the resolver for the metaData field.
func (r *nodeResolver) MetaData(ctx context.Context, obj *schema.Node) (any, error) {
if obj.MetaData != nil {
return obj.MetaData, nil
} else {
cclog.Debug("resolver: no MetaData (NodeState) on node object")
emptyMeta := make(map[string]string, 0)
return emptyMeta, nil
}
}
// HealthData is the resolver for the healthData field.
func (r *nodeResolver) HealthData(ctx context.Context, obj *schema.Node) (any, error) {
if obj.HealthData != nil {
return obj.HealthData, nil
} else {
cclog.Debug("resolver: no HealthData (NodeState) on node object")
emptyHealth := make(map[string][]string, 0)
return emptyHealth, nil
}
panic(fmt.Errorf("not implemented: MetaData - metaData"))
}
// Clusters is the resolver for the clusters field.
@@ -419,15 +397,6 @@ func (r *queryResolver) Nodes(ctx context.Context, filter []*model.NodeFilter, o
return &model.NodeStateResultList{Items: nodes, Count: &count}, err
}
// NodesWithMeta is the resolver for the nodesWithMeta field.
func (r *queryResolver) NodesWithMeta(ctx context.Context, filter []*model.NodeFilter, order *model.OrderByInput) (*model.NodeStateResultList, error) {
// Why Extra Handler? -> graphql.CollectAllFields(ctx) only returns top-level fields (i.e.: items, count), and not subfields like item.metaData
repo := repository.GetNodeRepository()
nodes, err := repo.QueryNodesWithMeta(ctx, filter, nil, order) // Ignore Paging, Order Unused
count := len(nodes)
return &model.NodeStateResultList{Items: nodes, Count: &count}, err
}
// NodeStates is the resolver for the nodeStates field.
func (r *queryResolver) NodeStates(ctx context.Context, filter []*model.NodeFilter) ([]*model.NodeStates, error) {
repo := repository.GetNodeRepository()
@@ -515,7 +484,7 @@ func (r *queryResolver) JobMetrics(ctx context.Context, id string, metrics []str
return nil, err
}
data, err := metricdispatch.LoadData(job, metrics, scopes, ctx, *resolution)
data, err := metricdispatcher.LoadData(job, metrics, scopes, ctx, *resolution)
if err != nil {
cclog.Warn("Error while loading job data")
return nil, err
@@ -543,7 +512,7 @@ func (r *queryResolver) JobStats(ctx context.Context, id string, metrics []strin
return nil, err
}
data, err := metricdispatch.LoadJobStats(job, metrics, ctx)
data, err := metricdispatcher.LoadJobStats(job, metrics, ctx)
if err != nil {
cclog.Warnf("Error while loading jobStats data for job id %s", id)
return nil, err
@@ -568,7 +537,7 @@ func (r *queryResolver) ScopedJobStats(ctx context.Context, id string, metrics [
return nil, err
}
data, err := metricdispatch.LoadScopedJobStats(job, metrics, scopes, ctx)
data, err := metricdispatcher.LoadScopedJobStats(job, metrics, scopes, ctx)
if err != nil {
cclog.Warnf("Error while loading scopedJobStats data for job id %s", id)
return nil, err
@@ -582,7 +551,7 @@ func (r *queryResolver) ScopedJobStats(ctx context.Context, id string, metrics [
for _, stat := range stats {
mdlStats = append(mdlStats, &model.ScopedStats{
Hostname: stat.Hostname,
ID: stat.ID,
ID: stat.Id,
Data: stat.Data,
})
}
@@ -621,24 +590,21 @@ func (r *queryResolver) Jobs(ctx context.Context, filter []*model.JobFilter, pag
// Note: Even if App-Default 'config.Keys.UiDefaults["job_list_usePaging"]' is set, always return hasNextPage boolean.
// Users can decide in frontend to use continuous scroll, even if app-default is paging!
// Skip if page.ItemsPerPage == -1 ("Load All" -> No Next Page required, Status Dashboards)
/*
Example Page 4 @ 10 IpP : Does item 41 exist?
Minimal Page 41 @ 1 IpP : If len(result) is 1, Page 5 @ 10 IpP exists.
*/
hasNextPage := false
if page.ItemsPerPage != -1 {
nextPage := &model.PageRequest{
ItemsPerPage: 1,
Page: ((page.Page * page.ItemsPerPage) + 1),
}
nextJobs, err := r.Repo.QueryJobs(ctx, filter, nextPage, order)
if err != nil {
cclog.Warn("Error while querying next jobs")
return nil, err
}
hasNextPage = len(nextJobs) == 1
nextPage := &model.PageRequest{
ItemsPerPage: 1,
Page: ((page.Page * page.ItemsPerPage) + 1),
}
nextJobs, err := r.Repo.QueryJobs(ctx, filter, nextPage, order)
if err != nil {
cclog.Warn("Error while querying next jobs")
return nil, err
}
hasNextPage := len(nextJobs) == 1
return &model.JobResultList{Items: jobs, Count: &count, HasNextPage: &hasNextPage}, nil
}
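The look-ahead trick in this hunk (fetch exactly one item at the position directly after the current page) can be sketched in isolation. `queryJobs` below is a hypothetical stand-in for `Repo.QueryJobs`, not the real repository method:

```go
package main

import "fmt"

// queryJobs is a hypothetical stand-in for Repo.QueryJobs: it returns up to
// itemsPerPage items starting at the given 1-indexed page.
func queryJobs(total, page, itemsPerPage int) []int {
	start := (page - 1) * itemsPerPage
	jobs := []int{}
	for i := start; i < start+itemsPerPage && i < total; i++ {
		jobs = append(jobs, i)
	}
	return jobs
}

// hasNextPage asks for exactly one item at the position directly after the
// current page: page 4 @ 10 items/page -> "minimal page" 41 @ 1 item/page.
func hasNextPage(total, page, itemsPerPage int) bool {
	next := queryJobs(total, (page*itemsPerPage)+1, 1)
	return len(next) == 1
}

func main() {
	fmt.Println(hasNextPage(41, 4, 10)) // item 41 exists
	fmt.Println(hasNextPage(40, 4, 10)) // no item 41
}
```

The cost is one extra single-item query per page request, which is cheap compared to counting all matching rows.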
@@ -736,7 +702,7 @@ func (r *queryResolver) JobsMetricStats(ctx context.Context, filter []*model.Job
res := []*model.JobStats{}
for _, job := range jobs {
data, err := metricdispatch.LoadJobStats(job, metrics, ctx)
data, err := metricdispatcher.LoadJobStats(job, metrics, ctx)
if err != nil {
cclog.Warnf("Error while loading comparison jobStats data for job id %d", job.JobID)
continue
@@ -787,19 +753,13 @@ func (r *queryResolver) NodeMetrics(ctx context.Context, cluster string, nodes [
return nil, errors.New("you need to be administrator or support staff for this query")
}
defaultMetrics := make([]string, 0)
for _, mc := range archive.GetCluster(cluster).MetricConfig {
defaultMetrics = append(defaultMetrics, mc.Name)
}
if metrics == nil {
metrics = defaultMetrics
} else {
metrics = slices.DeleteFunc(metrics, func(metric string) bool {
return !slices.Contains(defaultMetrics, metric) // Remove undefined metrics.
})
for _, mc := range archive.GetCluster(cluster).MetricConfig {
metrics = append(metrics, mc.Name)
}
}
data, err := metricdispatch.LoadNodeData(cluster, metrics, nodes, scopes, from, to, ctx)
data, err := metricdispatcher.LoadNodeData(cluster, metrics, nodes, scopes, from, to, ctx)
if err != nil {
cclog.Warn("error while loading node data")
return nil, err
@@ -854,7 +814,6 @@ func (r *queryResolver) NodeMetricsList(ctx context.Context, cluster string, sub
}
nodeRepo := repository.GetNodeRepository()
// nodes -> array hostname
nodes, stateMap, countNodes, hasNextPage, nerr := nodeRepo.GetNodesForList(ctx, cluster, subCluster, stateFilter, nodeFilter, page)
if nerr != nil {
return nil, errors.New("could not retrieve node list required for resolving NodeMetricsList")
@@ -866,26 +825,25 @@ func (r *queryResolver) NodeMetricsList(ctx context.Context, cluster string, sub
}
}
// data -> map hostname:jobdata
data, err := metricdispatch.LoadNodeListData(cluster, subCluster, nodes, metrics, scopes, *resolution, from, to, ctx)
data, err := metricdispatcher.LoadNodeListData(cluster, subCluster, nodes, metrics, scopes, *resolution, from, to, ctx)
if err != nil {
cclog.Warn("error while loading node data (Resolver.NodeMetricsList)")
return nil, err
}
nodeMetricsList := make([]*model.NodeMetrics, 0, len(data))
for _, hostname := range nodes {
for hostname, metrics := range data {
host := &model.NodeMetrics{
Host: hostname,
State: stateMap[hostname],
Metrics: make([]*model.JobMetricWithName, 0),
Metrics: make([]*model.JobMetricWithName, 0, len(metrics)*len(scopes)),
}
host.SubCluster, err = archive.GetSubClusterByNode(cluster, hostname)
if err != nil {
cclog.Warnf("error in nodeMetrics resolver: %s", err)
}
for metric, scopedMetrics := range data[hostname] {
for metric, scopedMetrics := range metrics {
for scope, scopedMetric := range scopedMetrics {
host.Metrics = append(host.Metrics, &model.JobMetricWithName{
Name: metric,
@@ -899,8 +857,7 @@ func (r *queryResolver) NodeMetricsList(ctx context.Context, cluster string, sub
}
nodeMetricsListResult := &model.NodesResultList{
Items: nodeMetricsList,
// TotalNodes depends on sum of nodes grouped on latest timestamp, see repo/node.go:357
Items: nodeMetricsList,
TotalNodes: &countNodes,
HasNextPage: &hasNextPage,
}
@@ -923,7 +880,7 @@ func (r *queryResolver) ClusterMetrics(ctx context.Context, cluster string, metr
// 'nodes' == nil -> Defaults to all nodes of cluster for existing query workflow
scopes := []schema.MetricScope{"node"}
data, err := metricdispatch.LoadNodeData(cluster, metrics, nil, scopes, from, to, ctx)
data, err := metricdispatcher.LoadNodeData(cluster, metrics, nil, scopes, from, to, ctx)
if err != nil {
cclog.Warn("error while loading node data")
return nil, err
@@ -939,36 +896,26 @@ func (r *queryResolver) ClusterMetrics(ctx context.Context, cluster string, metr
for _, metrics := range data {
clusterMetrics.NodeCount += 1
for metric, scopedMetrics := range metrics {
for _, scopedMetric := range scopedMetrics {
// Collect Info Once
_, okTimestep := collectorTimestep[metric]
if !okTimestep {
_, ok := collectorData[metric]
if !ok {
collectorData[metric] = make([]schema.Float, 0)
for _, scopedMetric := range scopedMetrics {
// Collect Info
collectorTimestep[metric] = scopedMetric.Timestep
}
_, okUnit := collectorUnit[metric]
if !okUnit {
collectorUnit[metric] = scopedMetric.Unit
}
// Collect Data
for _, ser := range scopedMetric.Series {
_, okData := collectorData[metric]
// Init With Datasize > 0
if !okData && len(ser.Data) != 0 {
collectorData[metric] = make([]schema.Float, len(ser.Data))
} else if !okData {
cclog.Debugf("[SCHEMARESOLVER] clusterMetrics skip init: no data -> %s at %s; size %d", metric, ser.Hostname, len(ser.Data))
// Collect Initial Data
for _, ser := range scopedMetric.Series {
collectorData[metric] = append(collectorData[metric], ser.Data...)
}
// Sum if init'd and matching size
if okData && len(ser.Data) == len(collectorData[metric]) {
}
} else {
// Sum up values by index
for _, scopedMetric := range scopedMetrics {
// For This Purpose (Cluster_Wide-Sum of Node Metrics) OK
for _, ser := range scopedMetric.Series {
for i, val := range ser.Data {
if val.IsNaN() {
continue
} else {
collectorData[metric][i] += val
}
collectorData[metric][i] += val
}
} else if okData {
cclog.Debugf("[SCHEMARESOLVER] clusterMetrics skip sum: data diff -> %s at %s; want size %d, have size %d", metric, ser.Hostname, len(collectorData[metric]), len(ser.Data))
}
}
}
@@ -976,21 +923,15 @@ func (r *queryResolver) ClusterMetrics(ctx context.Context, cluster string, metr
}
for metricName, data := range collectorData {
// use ccUnits for backend normalization to "Tera"
p_old := ccunit.NewPrefix(collectorUnit[metricName].Prefix)
p_new := ccunit.NewPrefix("T")
convFunc := ccunit.GetPrefixPrefixFactor(p_old, p_new)
u_new := schema.Unit{Prefix: p_new.Prefix(), Base: collectorUnit[metricName].Base}
cu := collectorUnit[metricName]
roundedData := make([]schema.Float, 0)
for _, v_old := range data {
v_new := math.Round(convFunc(float64(v_old)).(float64)*100.0) / 100.0
roundedData = append(roundedData, schema.Float(v_new))
for _, val := range data {
roundedData = append(roundedData, schema.Float((math.Round(float64(val)*100.0) / 100.0)))
}
cm := model.ClusterMetricWithName{
Name: metricName,
Unit: &u_new,
Unit: &cu,
Timestep: collectorTimestep[metricName],
Data: roundedData,
}
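The normalization step this hunk introduces (convert a metric unit's prefix to "Tera" via ccUnits, then round to two decimals) can be sketched with the standard library alone. The `prefixFactor` map is a minimal, assumed stand-in for `ccunit.GetPrefixPrefixFactor`, covering only a few SI prefixes:

```go
package main

import (
	"fmt"
	"math"
)

// prefixFactor maps a metric prefix to its power of ten; a minimal stand-in
// for ccunit.GetPrefixPrefixFactor, covering only the prefixes used here.
var prefixFactor = map[string]float64{"": 1, "K": 1e3, "M": 1e6, "G": 1e9, "T": 1e12}

// normalize converts a value from one prefix to another and rounds the
// result to two decimal places, mirroring the resolver's rounding step.
func normalize(v float64, from, to string) float64 {
	conv := prefixFactor[from] / prefixFactor[to]
	return math.Round(v*conv*100.0) / 100.0
}

func main() {
	// 123456 GFlop/s expressed in TFlop/s:
	fmt.Println(normalize(123456, "G", "T")) // 123.46
}
```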
@@ -1031,10 +972,12 @@ func (r *Resolver) Query() generated.QueryResolver { return &queryResolver{r} }
// SubCluster returns generated.SubClusterResolver implementation.
func (r *Resolver) SubCluster() generated.SubClusterResolver { return &subClusterResolver{r} }
type clusterResolver struct{ *Resolver }
type jobResolver struct{ *Resolver }
type metricValueResolver struct{ *Resolver }
type mutationResolver struct{ *Resolver }
type nodeResolver struct{ *Resolver }
type queryResolver struct{ *Resolver }
type subClusterResolver struct{ *Resolver }
type (
clusterResolver struct{ *Resolver }
jobResolver struct{ *Resolver }
metricValueResolver struct{ *Resolver }
mutationResolver struct{ *Resolver }
nodeResolver struct{ *Resolver }
queryResolver struct{ *Resolver }
subClusterResolver struct{ *Resolver }
)

View File

@@ -13,9 +13,9 @@ import (
"github.com/99designs/gqlgen/graphql"
"github.com/ClusterCockpit/cc-backend/internal/graph/model"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatch"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-backend/internal/metricdispatcher"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
const MAX_JOBS_FOR_ANALYSIS = 500
@@ -55,15 +55,15 @@ func (r *queryResolver) rooflineHeatmap(
// resolution = max(resolution, mc.Timestep)
// }
jobdata, err := metricdispatch.LoadData(job, []string{"flops_any", "mem_bw"}, []schema.MetricScope{schema.MetricScopeNode}, ctx, 0)
jobdata, err := metricdispatcher.LoadData(job, []string{"flops_any", "mem_bw"}, []schema.MetricScope{schema.MetricScopeNode}, ctx, 0)
if err != nil {
cclog.Warnf("Error while loading roofline metrics for job %d", *job.ID)
cclog.Errorf("Error while loading roofline metrics for job %d", job.ID)
return nil, err
}
flops_, membw_ := jobdata["flops_any"], jobdata["mem_bw"]
if flops_ == nil && membw_ == nil {
cclog.Warnf("rooflineHeatmap(): 'flops_any' or 'mem_bw' missing for job %d", *job.ID)
cclog.Infof("rooflineHeatmap(): 'flops_any' or 'mem_bw' missing for job %d", job.ID)
continue
// return nil, fmt.Errorf("GRAPH/UTIL > 'flops_any' or 'mem_bw' missing for job %d", job.ID)
}
@@ -128,7 +128,7 @@ func (r *queryResolver) jobsFootprints(ctx context.Context, filter []*model.JobF
continue
}
if err := metricdispatch.LoadAverages(job, metrics, avgs, ctx); err != nil {
if err := metricdispatcher.LoadAverages(job, metrics, avgs, ctx); err != nil {
cclog.Error("Error while loading averages for footprint")
return nil, err
}

View File

@@ -2,7 +2,6 @@
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package importer
import (
@@ -15,8 +14,8 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
// HandleImportFlag imports jobs from file pairs specified in a comma-separated flag string.
@@ -38,7 +37,7 @@ import (
func HandleImportFlag(flag string) error {
r := repository.GetJobRepository()
for pair := range strings.SplitSeq(flag, ",") {
for _, pair := range strings.Split(flag, ",") {
files := strings.Split(pair, ":")
if len(files) != 2 {
return fmt.Errorf("REPOSITORY/INIT > invalid import flag format")
@@ -102,7 +101,7 @@ func HandleImportFlag(flag string) error {
return err
}
id, err := r.InsertJobDirect(&job)
id, err := r.InsertJob(&job)
if err != nil {
cclog.Warn("Error while job db insert")
return err

View File

@@ -16,8 +16,8 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/importer"
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
ccconf "github.com/ClusterCockpit/cc-lib/v2/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
ccconf "github.com/ClusterCockpit/cc-lib/ccConfig"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
)
// copyFile copies a file from source path to destination path.
@@ -50,14 +50,39 @@ func setup(t *testing.T) *repository.JobRepository {
"main": {
"addr": "0.0.0.0:8080",
"validate": false,
"api-allowed-ips": [
"apiAllowedIPs": [
"*"
]},
"archive": {
"kind": "file",
"path": "./var/job-archive"
}
}`
},
"clusters": [
{
"name": "testcluster",
"filterRanges": {
"numNodes": { "from": 1, "to": 64 },
"duration": { "from": 0, "to": 86400 },
"startTime": { "from": "2022-01-01T00:00:00Z", "to": null }
}
},
{
"name": "fritz",
"filterRanges": {
"numNodes": { "from": 1, "to": 944 },
"duration": { "from": 0, "to": 86400 },
"startTime": { "from": "2022-01-01T00:00:00Z", "to": null }
}
},
{
"name": "taurus",
"filterRanges": {
"numNodes": { "from": 1, "to": 4000 },
"duration": { "from": 0, "to": 604800 },
"startTime": { "from": "2010-01-01T00:00:00Z", "to": null }
}
}
]}`
cclog.Init("info", true)
tmpdir := t.TempDir()
@@ -93,18 +118,22 @@ func setup(t *testing.T) *repository.JobRepository {
// Load and check main configuration
if cfg := ccconf.GetPackageConfig("main"); cfg != nil {
config.Init(cfg)
if clustercfg := ccconf.GetPackageConfig("clusters"); clustercfg != nil {
config.Init(cfg, clustercfg)
} else {
t.Fatal("Cluster configuration must be present")
}
} else {
t.Fatal("Main configuration must be present")
}
archiveCfg := fmt.Sprintf("{\"kind\": \"file\",\"path\": \"%s\"}", jobarchive)
if err := archive.Init(json.RawMessage(archiveCfg)); err != nil {
if err := archive.Init(json.RawMessage(archiveCfg), config.Keys.DisableArchive); err != nil {
t.Fatal(err)
}
repository.Connect(dbfilepath)
repository.Connect("sqlite3", dbfilepath)
return repository.GetJobRepository()
}
@@ -165,7 +194,7 @@ func TestHandleImportFlag(t *testing.T) {
}
result := readResult(t, testname)
job, err := r.Find(&result.JobId, &result.Cluster, &result.StartTime)
job, err := r.FindCached(&result.JobId, &result.Cluster, &result.StartTime)
if err != nil {
t.Fatal(err)
}

View File

@@ -22,8 +22,8 @@ import (
"github.com/ClusterCockpit/cc-backend/internal/repository"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
const (
@@ -216,7 +216,7 @@ func enrichJobMetadata(job *schema.Job) error {
metricEnergy = math.Round(rawEnergy*100.0) / 100.0
}
} else {
cclog.Warnf("Error while collecting energy metric %s for job, DB ID '%v', return '0.0'", fp, *job.ID)
cclog.Warnf("Error while collecting energy metric %s for job, DB ID '%v', return '0.0'", fp, job.ID)
}
job.EnergyFootprint[fp] = metricEnergy
@@ -225,7 +225,7 @@ func enrichJobMetadata(job *schema.Job) error {
job.Energy = (math.Round(totalEnergy*100.0) / 100.0)
if job.RawEnergyFootprint, err = json.Marshal(job.EnergyFootprint); err != nil {
cclog.Warnf("Error while marshaling energy footprint for job INTO BYTES, DB ID '%v'", *job.ID)
cclog.Warnf("Error while marshaling energy footprint for job INTO BYTES, DB ID '%v'", job.ID)
return err
}

View File

@@ -2,13 +2,12 @@
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package importer
import (
"math"
ccunits "github.com/ClusterCockpit/cc-lib/v2/ccUnits"
ccunits "github.com/ClusterCockpit/cc-lib/ccUnits"
)
// getNormalizationFactor calculates the scaling factor needed to normalize a value

View File

@@ -8,7 +8,7 @@ import (
"fmt"
"testing"
ccunits "github.com/ClusterCockpit/cc-lib/v2/ccUnits"
ccunits "github.com/ClusterCockpit/cc-lib/ccUnits"
)
// TestNormalizeFactor tests the normalization of large byte values to gigabyte prefix.

View File

@@ -3,35 +3,22 @@
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// This file contains the API types and data fetching logic for querying metric data
// from the in-memory metric store. It provides structures for building complex queries
// with support for aggregation, scaling, padding, and statistics computation.
package metricstore
package memorystore
import (
"errors"
"fmt"
"math"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/v2/util"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/ClusterCockpit/cc-lib/util"
)
var (
// ErrNoHostOrMetric is returned when the metric store does not find the host or the metric
ErrNoHostOrMetric error = errors.New("[METRICSTORE]> metric or host not found")
// ErrInvalidTimeRange is returned when a query has 'from' >= 'to'
ErrInvalidTimeRange = errors.New("[METRICSTORE]> invalid time range: 'from' must be before 'to'")
// ErrEmptyCluster is returned when a query with ForAllNodes has no cluster specified
ErrEmptyCluster = errors.New("[METRICSTORE]> cluster name cannot be empty")
ErrEmptyCluster = errors.New("[METRICSTORE]> cluster name cannot be empty")
)
// APIMetricData represents the response data for a single metric query.
//
// It contains both the time-series data points and computed statistics (avg, min, max).
// If an error occurred during data retrieval, the Error field will be set and other
// fields may be incomplete.
type APIMetricData struct {
Error *string `json:"error,omitempty"`
Data schema.FloatArray `json:"data,omitempty"`
@@ -43,13 +30,6 @@ type APIMetricData struct {
Max schema.Float `json:"max"`
}
// APIQueryRequest represents a batch query request for metric data.
//
// It supports two modes of operation:
// 1. Explicit queries via the Queries field
// 2. Automatic query generation via ForAllNodes (queries all specified metrics for all nodes in the cluster)
//
// The request can be customized with flags to include/exclude statistics, raw data, and padding.
type APIQueryRequest struct {
Cluster string `json:"cluster"`
Queries []APIQuery `json:"queries"`
@@ -61,25 +41,11 @@ type APIQueryRequest struct {
WithPadding bool `json:"with-padding"`
}
// APIQueryResponse represents the response to an APIQueryRequest.
//
// Results is a 2D array where each outer element corresponds to a query,
// and each inner element corresponds to a selector within that query
// (e.g., multiple CPUs or cores).
type APIQueryResponse struct {
Queries []APIQuery `json:"queries,omitempty"`
Results [][]APIMetricData `json:"results"`
}
// APIQuery represents a single metric query with optional hierarchical selectors.
//
// The hierarchical selection works as follows:
// - Hostname: The node to query
// - Type + TypeIds: First level of hierarchy (e.g., "cpu" + ["0", "1", "2"])
// - SubType + SubTypeIds: Second level of hierarchy (e.g., "core" + ["0", "1"])
//
// If Aggregate is true, data from multiple type/subtype IDs will be aggregated according
// to the metric's aggregation strategy. Otherwise, separate results are returned for each combination.
type APIQuery struct {
Type *string `json:"type,omitempty"`
SubType *string `json:"subtype,omitempty"`
@@ -92,11 +58,6 @@ type APIQuery struct {
Aggregate bool `json:"aggreg"`
}
// AddStats computes and populates the Avg, Min, and Max fields from the Data array.
//
// NaN values in the data are ignored during computation. If all values are NaN,
// the statistics fields will be set to NaN.
//
// TODO: Optimize this, just like the stats endpoint!
func (data *APIMetricData) AddStats() {
n := 0
@@ -122,10 +83,6 @@ func (data *APIMetricData) AddStats() {
}
}
// ScaleBy multiplies all data points and statistics by the given factor.
//
// This is commonly used for unit conversion (e.g., bytes to gigabytes).
// Scaling by 0 or 1 is a no-op for performance reasons.
func (data *APIMetricData) ScaleBy(f schema.Float) {
if f == 0 || f == 1 {
return
@@ -139,17 +96,6 @@ func (data *APIMetricData) ScaleBy(f schema.Float) {
}
}
// PadDataWithNull pads the beginning of the data array with NaN values if needed.
//
// This ensures that the data aligns with the requested 'from' timestamp, even if
// the metric store doesn't have data for the earliest time points. This is useful
// for maintaining consistent array indexing across multiple queries.
//
// Parameters:
// - ms: MemoryStore instance to lookup metric configuration
// - from: The requested start timestamp
// - to: The requested end timestamp (unused but kept for API consistency)
// - metric: The metric name to lookup frequency information
func (data *APIMetricData) PadDataWithNull(ms *MemoryStore, from, to int64, metric string) {
minfo, ok := ms.Metrics[metric]
if !ok {
@@ -169,31 +115,6 @@ func (data *APIMetricData) PadDataWithNull(ms *MemoryStore, from, to int64, metr
}
}
// FetchData executes a batch metric query request and returns the results.
//
// This is the primary API for retrieving metric data from the memory store. It supports:
// - Individual queries via req.Queries
// - Batch queries for all nodes via req.ForAllNodes
// - Hierarchical selector construction (cluster → host → type → subtype)
// - Optional statistics computation (avg, min, max)
// - Optional data scaling
// - Optional data padding with NaN values
//
// The function constructs selectors based on the query parameters and calls MemoryStore.Read()
// for each selector. If a query specifies Aggregate=false with multiple type/subtype IDs,
// separate results are returned for each combination.
//
// Parameters:
// - req: The query request containing queries, time range, and options
//
// Returns:
// - APIQueryResponse containing results for each query, or error if validation fails
//
// Errors:
// - ErrInvalidTimeRange if req.From > req.To
// - ErrEmptyCluster if req.ForAllNodes is used without specifying a cluster
// - Error if MemoryStore is not initialized
// - Individual query errors are stored in APIMetricData.Error field
func FetchData(req APIQueryRequest) (*APIQueryResponse, error) {
if req.From > req.To {
return nil, ErrInvalidTimeRange
@@ -205,9 +126,10 @@ func FetchData(req APIQueryRequest) (*APIQueryResponse, error) {
req.WithData = true
ms := GetMemoryStore()
if ms == nil {
return nil, fmt.Errorf("[METRICSTORE]> memorystore not initialized")
return nil, fmt.Errorf("memorystore not initialized")
}
response := APIQueryResponse{
Results: make([][]APIMetricData, 0, len(req.Queries)),
}
@@ -274,6 +196,8 @@ func FetchData(req APIQueryRequest) (*APIQueryResponse, error) {
}
}
// log.Printf("query: %#v\n", query)
// log.Printf("sels: %#v\n", sels)
var err error
res := make([]APIMetricData, 0, len(sels))
for _, sel := range sels {
@@ -281,15 +205,9 @@ func FetchData(req APIQueryRequest) (*APIQueryResponse, error) {
data.Data, data.From, data.To, data.Resolution, err = ms.Read(sel, query.Metric, req.From, req.To, query.Resolution)
if err != nil {
// Skip Error If Just Missing Host or Metric, Continue
// Empty Return For Metric Handled Gracefully By Frontend
if err != ErrNoHostOrMetric {
msg := err.Error()
data.Error = &msg
res = append(res, data)
} else {
cclog.Warnf("failed to fetch '%s' from host '%s' (cluster: %s): %s", query.Metric, query.Hostname, req.Cluster, err.Error())
}
msg := err.Error()
data.Error = &msg
res = append(res, data)
continue
}

View File

@@ -0,0 +1,196 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"archive/zip"
"bufio"
"context"
"errors"
"fmt"
"io"
"os"
"path/filepath"
"sync"
"sync/atomic"
"time"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
)
func Archiving(wg *sync.WaitGroup, ctx context.Context) {
go func() {
defer wg.Done()
d, err := time.ParseDuration(Keys.Archive.Interval)
if err != nil {
cclog.Fatalf("[METRICSTORE]> error parsing archive interval duration: %v\n", err)
}
if d <= 0 {
return
}
ticker := time.NewTicker(d)
defer ticker.Stop()
for {
select {
case <-ctx.Done():
return
case <-ticker.C:
t := time.Now().Add(-d)
cclog.Infof("[METRICSTORE]> start archiving checkpoints (older than %s)...", t.Format(time.RFC3339))
n, err := ArchiveCheckpoints(Keys.Checkpoints.RootDir,
Keys.Archive.RootDir, t.Unix(), Keys.Archive.DeleteInstead)
if err != nil {
cclog.Errorf("[METRICSTORE]> archiving failed: %s", err.Error())
} else {
cclog.Infof("[METRICSTORE]> done: %d files zipped and moved to archive", n)
}
}
}
}()
}
var ErrNoNewArchiveData error = errors.New("all data already archived")
// ZIP all checkpoint files older than `from` together and write them to the `archiveDir`,
// deleting them from the `checkpointsDir`.
func ArchiveCheckpoints(checkpointsDir, archiveDir string, from int64, deleteInstead bool) (int, error) {
entries1, err := os.ReadDir(checkpointsDir)
if err != nil {
return 0, err
}
type workItem struct {
cdir, adir string
cluster, host string
}
var wg sync.WaitGroup
n, errs := int32(0), int32(0)
work := make(chan workItem, Keys.NumWorkers)
wg.Add(Keys.NumWorkers)
for worker := 0; worker < Keys.NumWorkers; worker++ {
go func() {
defer wg.Done()
for workItem := range work {
m, err := archiveCheckpoints(workItem.cdir, workItem.adir, from, deleteInstead)
if err != nil {
cclog.Errorf("error while archiving %s/%s: %s", workItem.cluster, workItem.host, err.Error())
atomic.AddInt32(&errs, 1)
}
atomic.AddInt32(&n, int32(m))
}
}()
}
for _, de1 := range entries1 {
entries2, e := os.ReadDir(filepath.Join(checkpointsDir, de1.Name()))
if e != nil {
err = e
}
for _, de2 := range entries2 {
cdir := filepath.Join(checkpointsDir, de1.Name(), de2.Name())
adir := filepath.Join(archiveDir, de1.Name(), de2.Name())
work <- workItem{
adir: adir, cdir: cdir,
cluster: de1.Name(), host: de2.Name(),
}
}
}
close(work)
wg.Wait()
if err != nil {
return int(n), err
}
if errs > 0 {
return int(n), fmt.Errorf("%d errors happened while archiving (%d successes)", errs, n)
}
return int(n), nil
}
// Helper function for `ArchiveCheckpoints`.
func archiveCheckpoints(dir string, archiveDir string, from int64, deleteInstead bool) (int, error) {
entries, err := os.ReadDir(dir)
if err != nil {
return 0, err
}
extension := Keys.Checkpoints.FileFormat
files, err := findFiles(entries, from, extension, false)
if err != nil {
return 0, err
}
if deleteInstead {
n := 0
for _, checkpoint := range files {
filename := filepath.Join(dir, checkpoint)
if err = os.Remove(filename); err != nil {
return n, err
}
n += 1
}
return n, nil
}
filename := filepath.Join(archiveDir, fmt.Sprintf("%d.zip", from))
f, err := os.OpenFile(filename, os.O_CREATE|os.O_WRONLY, CheckpointFilePerms)
if err != nil && os.IsNotExist(err) {
err = os.MkdirAll(archiveDir, CheckpointDirPerms)
if err == nil {
f, err = os.OpenFile(filename, os.O_CREATE|os.O_WRONLY, CheckpointFilePerms)
}
}
if err != nil {
return 0, err
}
defer f.Close()
bw := bufio.NewWriter(f)
defer bw.Flush()
zw := zip.NewWriter(bw)
defer zw.Close()
n := 0
for _, checkpoint := range files {
// Use closure to ensure file is closed immediately after use,
// avoiding file descriptor leak from defer in loop
err := func() error {
filename := filepath.Join(dir, checkpoint)
r, err := os.Open(filename)
if err != nil {
return err
}
defer r.Close()
w, err := zw.Create(checkpoint)
if err != nil {
return err
}
if _, err = io.Copy(w, r); err != nil {
return err
}
if err = os.Remove(filename); err != nil {
return err
}
return nil
}()
if err != nil {
return n, err
}
n += 1
}
return n, nil
}


@@ -0,0 +1,477 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"bufio"
"encoding/json"
"errors"
"fmt"
"os"
"path"
"sort"
"strconv"
"strings"
"sync"
"sync/atomic"
"time"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/linkedin/goavro/v2"
)
var NumAvroWorkers int = DefaultAvroWorkers
var startUp bool = true
func (as *AvroStore) ToCheckpoint(dir string, dumpAll bool) (int, error) {
levels := make([]*AvroLevel, 0)
selectors := make([][]string, 0)
as.root.lock.RLock()
// Cluster
for sel1, l1 := range as.root.children {
l1.lock.RLock()
// Node
for sel2, l2 := range l1.children {
l2.lock.RLock()
// Frequency
for sel3, l3 := range l2.children {
levels = append(levels, l3)
selectors = append(selectors, []string{sel1, sel2, sel3})
}
l2.lock.RUnlock()
}
l1.lock.RUnlock()
}
as.root.lock.RUnlock()
type workItem struct {
level *AvroLevel
dir string
selector []string
}
n, errs := int32(0), int32(0)
var wg sync.WaitGroup
wg.Add(NumAvroWorkers)
work := make(chan workItem, NumAvroWorkers*2)
for range NumAvroWorkers {
go func() {
defer wg.Done()
for workItem := range work {
from := getTimestamp(workItem.dir)
if err := workItem.level.toCheckpoint(workItem.dir, from, dumpAll); err != nil {
if err == ErrNoNewArchiveData {
continue
}
cclog.Errorf("error while checkpointing %#v: %s", workItem.selector, err.Error())
atomic.AddInt32(&errs, 1)
} else {
atomic.AddInt32(&n, 1)
}
}
}()
}
for i := range len(levels) {
dir := path.Join(dir, path.Join(selectors[i]...))
work <- workItem{
level: levels[i],
dir: dir,
selector: selectors[i],
}
}
close(work)
wg.Wait()
if errs > 0 {
return int(n), fmt.Errorf("%d errors happened while creating avro checkpoints (%d successes)", errs, n)
}
startUp = false
return int(n), nil
}
// getTimestamp returns the most recent checkpoint timestamp for a directory.
// Existing avro file names embed an epoch timestamp; iterate over all files
// in the directory and return the maximum timestamp found.
resolution := path.Base(dir)
dir = path.Dir(dir)
files, err := os.ReadDir(dir)
if err != nil {
return 0
}
var maxTS int64 = 0
if len(files) == 0 {
return 0
}
for _, file := range files {
if file.IsDir() {
continue
}
name := file.Name()
if len(name) < 5 || !strings.HasSuffix(name, ".avro") || !strings.HasPrefix(name, resolution+"_") {
continue
}
ts, err := strconv.ParseInt(name[strings.Index(name, "_")+1:len(name)-5], 10, 64)
if err != nil {
cclog.Errorf("error while parsing timestamp: %s", err.Error())
continue
}
if ts > maxTS {
maxTS = ts
}
}
interval, _ := time.ParseDuration(Keys.Checkpoints.Interval)
updateTime := time.Unix(maxTS, 0).Add(interval).Add(time.Duration(CheckpointBufferMinutes-1) * time.Minute).Unix()
if startUp {
return 0
}
if updateTime < time.Now().Unix() {
return 0
}
return maxTS
}
func (l *AvroLevel) toCheckpoint(dir string, from int64, dumpAll bool) error {
l.lock.Lock()
defer l.lock.Unlock()
// filepath contains the resolution
intRes, convErr := strconv.Atoi(path.Base(dir))
if convErr != nil || intRes <= 0 {
return fmt.Errorf("invalid resolution in checkpoint path %q: %v", dir, convErr)
}
// find smallest overall timestamp in l.data map and delete it from l.data
minTS := int64(1<<63 - 1)
for ts, dat := range l.data {
if ts < minTS && len(dat) != 0 {
minTS = ts
}
}
if from == 0 && minTS != int64(1<<63-1) {
from = minTS
}
if from == 0 {
return ErrNoNewArchiveData
}
var schema string
var codec *goavro.Codec
recordList := make([]map[string]any, 0)
var f *os.File
filePath := dir + fmt.Sprintf("_%d.avro", from)
var err error
fp_, err_ := os.Stat(filePath)
if errors.Is(err_, os.ErrNotExist) {
err = os.MkdirAll(path.Dir(dir), 0o755)
if err != nil {
return fmt.Errorf("failed to create directory: %v", err)
}
} else if fp_.Size() != 0 {
f, err = os.Open(filePath)
if err != nil {
return fmt.Errorf("failed to open existing avro file: %v", err)
}
br := bufio.NewReader(f)
reader, err := goavro.NewOCFReader(br)
if err != nil {
return fmt.Errorf("failed to create OCF reader: %v", err)
}
codec = reader.Codec()
schema = codec.Schema()
f.Close()
}
timeRef := time.Now().Add(time.Duration(-CheckpointBufferMinutes+1) * time.Minute).Unix()
if dumpAll {
timeRef = time.Now().Unix()
}
// Empty values
if len(l.data) == 0 {
// we checkpoint avro files every 60 seconds
repeat := 60 / intRes
for range repeat {
recordList = append(recordList, make(map[string]any))
}
}
readFlag := true
for ts := range l.data {
flag := false
if ts < timeRef {
data := l.data[ts]
schemaGen, err := generateSchema(data)
if err != nil {
return err
}
flag, schema, err = compareSchema(schema, schemaGen)
if err != nil {
return fmt.Errorf("failed to compare read and generated schema: %v", err)
}
if flag && readFlag && !errors.Is(err_, os.ErrNotExist) {
f.Close()
f, err = os.Open(filePath)
if err != nil {
return fmt.Errorf("failed to open Avro file: %v", err)
}
br := bufio.NewReader(f)
ocfReader, err := goavro.NewOCFReader(br)
if err != nil {
return fmt.Errorf("failed to create OCF reader while changing schema: %v", err)
}
for ocfReader.Scan() {
record, err := ocfReader.Read()
if err != nil {
return fmt.Errorf("failed to read record: %v", err)
}
recordList = append(recordList, record.(map[string]any))
}
f.Close()
err = os.Remove(filePath)
if err != nil {
return fmt.Errorf("failed to delete file: %v", err)
}
readFlag = false
}
codec, err = goavro.NewCodec(schema)
if err != nil {
return fmt.Errorf("failed to create codec after merged schema: %v", err)
}
recordList = append(recordList, generateRecord(data))
delete(l.data, ts)
}
}
if len(recordList) == 0 {
return ErrNoNewArchiveData
}
f, err = os.OpenFile(filePath, os.O_CREATE|os.O_APPEND|os.O_RDWR, 0o644)
if err != nil {
return fmt.Errorf("failed to append new avro file: %v", err)
}
writer, err := goavro.NewOCFWriter(goavro.OCFConfig{
W: f,
Codec: codec,
CompressionName: goavro.CompressionDeflateLabel,
})
if err != nil {
return fmt.Errorf("failed to create OCF writer: %v", err)
}
// Append the new record
if err := writer.Append(recordList); err != nil {
return fmt.Errorf("failed to append record: %v", err)
}
f.Close()
return nil
}
func compareSchema(schemaRead, schemaGen string) (bool, string, error) {
var genSchema, readSchema AvroSchema
if schemaRead == "" {
return false, schemaGen, nil
}
// Unmarshal the schema strings into AvroSchema structs
if err := json.Unmarshal([]byte(schemaGen), &genSchema); err != nil {
return false, "", fmt.Errorf("failed to parse generated schema: %v", err)
}
if err := json.Unmarshal([]byte(schemaRead), &readSchema); err != nil {
return false, "", fmt.Errorf("failed to parse read schema: %v", err)
}
sort.Slice(genSchema.Fields, func(i, j int) bool {
return genSchema.Fields[i].Name < genSchema.Fields[j].Name
})
sort.Slice(readSchema.Fields, func(i, j int) bool {
return readSchema.Fields[i].Name < readSchema.Fields[j].Name
})
// Check if schemas are identical
schemasEqual := true
if len(genSchema.Fields) <= len(readSchema.Fields) {
for i := range genSchema.Fields {
if genSchema.Fields[i].Name != readSchema.Fields[i].Name {
schemasEqual = false
break
}
}
// If schemas are identical, return the read schema
if schemasEqual {
return false, schemaRead, nil
}
}
// Create a map to hold unique fields from both schemas
fieldMap := make(map[string]AvroField)
// Add fields from the read schema
for _, field := range readSchema.Fields {
fieldMap[field.Name] = field
}
// Add or update fields from the generated schema
for _, field := range genSchema.Fields {
fieldMap[field.Name] = field
}
// Create a union schema by collecting fields from the map
var mergedFields []AvroField
for _, field := range fieldMap {
mergedFields = append(mergedFields, field)
}
// Sort fields by name for consistency
sort.Slice(mergedFields, func(i, j int) bool {
return mergedFields[i].Name < mergedFields[j].Name
})
// Create the merged schema
mergedSchema := AvroSchema{
Type: "record",
Name: genSchema.Name,
Fields: mergedFields,
}
// Check if schemas are identical
schemasEqual = len(mergedSchema.Fields) == len(readSchema.Fields)
if schemasEqual {
for i := range mergedSchema.Fields {
if mergedSchema.Fields[i].Name != readSchema.Fields[i].Name {
schemasEqual = false
break
}
}
if schemasEqual {
return false, schemaRead, nil
}
}
// Marshal the merged schema back to JSON
mergedSchemaJSON, err := json.Marshal(mergedSchema)
if err != nil {
return false, "", fmt.Errorf("failed to marshal merged schema: %v", err)
}
return true, string(mergedSchemaJSON), nil
}
func generateSchema(data map[string]schema.Float) (string, error) {
// Define the Avro schema structure
schema := map[string]any{
"type": "record",
"name": "DataRecord",
"fields": []map[string]any{},
}
fieldTracker := make(map[string]struct{})
for key := range data {
if _, exists := fieldTracker[key]; !exists {
key = correctKey(key)
field := map[string]any{
"name": key,
"type": "double",
"default": -1.0,
}
schema["fields"] = append(schema["fields"].([]map[string]any), field)
fieldTracker[key] = struct{}{}
}
}
schemaString, err := json.Marshal(schema)
if err != nil {
return "", fmt.Errorf("failed to marshal schema: %v", err)
}
return string(schemaString), nil
}
func generateRecord(data map[string]schema.Float) map[string]any {
record := make(map[string]any)
// Iterate through each map in data
for key, value := range data {
key = correctKey(key)
// Set the value in the record
// avro only accepts basic types
record[key] = value.Double()
}
return record
}
func correctKey(key string) string {
key = strings.ReplaceAll(key, "_", "_0x5F_")
key = strings.ReplaceAll(key, ":", "_0x3A_")
key = strings.ReplaceAll(key, ".", "_0x2E_")
return key
}
func ReplaceKey(key string) string {
key = strings.ReplaceAll(key, "_0x2E_", ".")
key = strings.ReplaceAll(key, "_0x3A_", ":")
key = strings.ReplaceAll(key, "_0x5F_", "_")
return key
}


@@ -0,0 +1,84 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"context"
"slices"
"strconv"
"sync"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
)
func DataStaging(wg *sync.WaitGroup, ctx context.Context) {
// AvroPool is a pool of Avro writers.
go func() {
if Keys.Checkpoints.FileFormat == "json" {
wg.Done() // Mark this goroutine as done
return // Exit the goroutine
}
defer wg.Done()
var avroLevel *AvroLevel
oldSelector := make([]string, 0)
for {
select {
case <-ctx.Done():
return
case val := <-LineProtocolMessages:
// Fetch the frequency of the metric from the global configuration
freq, err := GetMetricFrequency(val.MetricName)
if err != nil {
cclog.Errorf("Error fetching metric frequency: %s\n", err)
continue
}
metricName := ""
for _, selectorName := range val.Selector {
metricName += selectorName + SelectorDelimiter
}
metricName += val.MetricName
// Create a new selector for the Avro level
// The selector is a slice of strings that represents the path to the
// Avro level. It is created by appending the cluster, node, and metric
// name to the selector.
var selector []string
selector = append(selector, val.Cluster, val.Node, strconv.FormatInt(freq, 10))
if !stringSlicesEqual(oldSelector, selector) {
// Get the Avro level for the metric
avroLevel = avroStore.root.findAvroLevelOrCreate(selector)
// If the level could not be found or created, skip this message
if avroLevel == nil {
cclog.Errorf("Error creating or finding the level with cluster: %s, node: %s, metric: %s", val.Cluster, val.Node, val.MetricName)
continue
}
oldSelector = slices.Clone(selector)
}
avroLevel.addMetric(metricName, val.Value, val.Timestamp, int(freq))
}
}
}()
}
func stringSlicesEqual(a, b []string) bool {
if len(a) != len(b) {
return false
}
for i := range a {
if a[i] != b[i] {
return false
}
}
return true
}


@@ -0,0 +1,167 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"sync"
"github.com/ClusterCockpit/cc-lib/schema"
)
var (
LineProtocolMessages = make(chan *AvroStruct)
// SelectorDelimiter separates hierarchical selector components in metric names for Avro encoding
SelectorDelimiter = "_SEL_"
)
var CheckpointBufferMinutes = DefaultCheckpointBufferMin
type AvroStruct struct {
MetricName string
Cluster string
Node string
Selector []string
Value schema.Float
Timestamp int64
}
type AvroStore struct {
root AvroLevel
}
var avroStore AvroStore
type AvroLevel struct {
children map[string]*AvroLevel
data map[int64]map[string]schema.Float
lock sync.RWMutex
}
type AvroField struct {
Name string `json:"name"`
Type any `json:"type"`
Default any `json:"default,omitempty"`
}
type AvroSchema struct {
Type string `json:"type"`
Name string `json:"name"`
Fields []AvroField `json:"fields"`
}
func (l *AvroLevel) findAvroLevelOrCreate(selector []string) *AvroLevel {
if len(selector) == 0 {
return l
}
// Allow concurrent reads:
l.lock.RLock()
var child *AvroLevel
var ok bool
if l.children == nil {
// Children map needs to be created...
l.lock.RUnlock()
} else {
child, ok := l.children[selector[0]]
l.lock.RUnlock()
if ok {
return child.findAvroLevelOrCreate(selector[1:])
}
}
// The level does not exist, take write lock for unique access:
l.lock.Lock()
// While this thread waited for the write lock, another thread
// could have created the child node.
if l.children != nil {
child, ok = l.children[selector[0]]
if ok {
l.lock.Unlock()
return child.findAvroLevelOrCreate(selector[1:])
}
}
child = &AvroLevel{
data: make(map[int64]map[string]schema.Float, 0),
children: nil,
}
if l.children != nil {
l.children[selector[0]] = child
} else {
l.children = map[string]*AvroLevel{selector[0]: child}
}
l.lock.Unlock()
return child.findAvroLevelOrCreate(selector[1:])
}
func (l *AvroLevel) addMetric(metricName string, value schema.Float, timestamp int64, Freq int) {
l.lock.Lock()
defer l.lock.Unlock()
KeyCounter := int(CheckpointBufferMinutes * 60 / Freq)
// Create keys in advance for the given amount of time
if len(l.data) != KeyCounter {
if len(l.data) == 0 {
for i := range KeyCounter {
l.data[timestamp+int64(i*Freq)] = make(map[string]schema.Float, 0)
}
} else {
// Get the last timestamp
var lastTS int64
for ts := range l.data {
if ts > lastTS {
lastTS = ts
}
}
// Create keys for the next KeyCounter timestamps
l.data[lastTS+int64(Freq)] = make(map[string]schema.Float, 0)
}
}
closestTS := int64(0)
minDiff := int64(Freq) + 1 // Start with diff just outside the valid range
found := false
// Iterate over timestamps and choose the one which is within range.
// Timestamps are epoch seconds; a sample fits a slot if the difference
// is within one frequency interval.
for ts, dat := range l.data {
// Check if timestamp is within range
diff := timestamp - ts
if diff < -int64(Freq) || diff > int64(Freq) {
continue
}
// Metric already present at this timestamp — skip
if _, ok := dat[metricName]; ok {
continue
}
// Check if this is the closest timestamp so far
if Abs(diff) < minDiff {
minDiff = Abs(diff)
closestTS = ts
found = true
}
}
if found {
l.data[closestTS][metricName] = value
}
}
func GetAvroStore() *AvroStore {
return &avroStore
}
// Abs returns the absolute value of x.
func Abs(x int64) int64 {
if x < 0 {
return -x
}
return x
}


@@ -0,0 +1,190 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"errors"
"sync"
"github.com/ClusterCockpit/cc-lib/schema"
)
// BufferCap is the default buffer capacity.
// buffer.data will only ever grow up to its capacity and a new link
// in the buffer chain will be created if needed so that no copying
// of data or reallocation needs to happen on writes.
const BufferCap int = DefaultBufferCapacity
var bufferPool sync.Pool = sync.Pool{
New: func() any {
return &buffer{
data: make([]schema.Float, 0, BufferCap),
}
},
}
var (
ErrNoData error = errors.New("[METRICSTORE]> no data for this metric/level")
ErrDataDoesNotAlign error = errors.New("[METRICSTORE]> data from lower granularities does not align")
)
// Each metric on each level has its own buffer.
// This is where the actual values go.
// If `cap(data)` is reached, a new buffer is created and
// becomes the new head of a buffer list.
type buffer struct {
prev *buffer
next *buffer
data []schema.Float
frequency int64
start int64
archived bool
closed bool
}
func newBuffer(ts, freq int64) *buffer {
b := bufferPool.Get().(*buffer)
b.frequency = freq
b.start = ts - (freq / 2)
b.prev = nil
b.next = nil
b.archived = false
b.closed = false
b.data = b.data[:0]
return b
}
// If a new buffer was created, the new head is returned.
// Otherwise, the existing buffer is returned.
// Normally, only "newer" data should be written, but if the value would
// end up in the same buffer anyway it is allowed.
func (b *buffer) write(ts int64, value schema.Float) (*buffer, error) {
if ts < b.start {
return nil, errors.New("[METRICSTORE]> cannot write value to buffer from past")
}
idx := int((ts - b.start) / b.frequency)
if idx >= cap(b.data) {
newbuf := newBuffer(ts, b.frequency)
newbuf.prev = b
b.next = newbuf
b = newbuf
idx = 0
}
// Overwriting value or writing value from past
if idx < len(b.data) {
b.data[idx] = value
return b, nil
}
// Fill up unwritten slots with NaN
for i := len(b.data); i < idx; i++ {
b.data = append(b.data, schema.NaN)
}
b.data = append(b.data, value)
return b, nil
}
func (b *buffer) end() int64 {
return b.firstWrite() + int64(len(b.data))*b.frequency
}
func (b *buffer) firstWrite() int64 {
return b.start + (b.frequency / 2)
}
// Return all known values from `from` to `to`. Gaps of information are represented as NaN.
// Simple linear interpolation is done between the two neighboring cells if possible.
// If values at the start or end are missing, instead of NaN values, the second and third
// return values contain the actual `from`/`to`.
// This function goes back the buffer chain if `from` is older than the current buffer's start.
// The loaded values are added to `data` and `data` is returned, possibly with a shorter length.
// If `data` is not long enough to hold all values, this function will panic!
func (b *buffer) read(from, to int64, data []schema.Float) ([]schema.Float, int64, int64, error) {
if from < b.firstWrite() {
if b.prev != nil {
return b.prev.read(from, to, data)
}
from = b.firstWrite()
}
i := 0
t := from
for ; t < to; t += b.frequency {
idx := int((t - b.start) / b.frequency)
if idx >= cap(b.data) {
if b.next == nil {
break
}
b = b.next
idx = 0
}
if idx >= len(b.data) {
if b.next == nil || to <= b.next.start {
break
}
data[i] += schema.NaN
} else if t < b.start {
data[i] += schema.NaN
} else {
data[i] += b.data[idx]
}
i++
}
return data[:i], from, t, nil
}
// Returns true if this buffer needs to be freed.
func (b *buffer) free(t int64) (delme bool, n int) {
if b.prev != nil {
delme, m := b.prev.free(t)
n += m
if delme {
b.prev.next = nil
if cap(b.prev.data) == BufferCap {
bufferPool.Put(b.prev)
}
b.prev = nil
}
}
end := b.end()
if end < t {
return true, n + 1
}
return false, n
}
// Call `callback` on every buffer that contains data in the range from `from` to `to`.
func (b *buffer) iterFromTo(from, to int64, callback func(b *buffer) error) error {
if b == nil {
return nil
}
if err := b.prev.iterFromTo(from, to, callback); err != nil {
return err
}
if from <= b.end() && b.start <= to {
return callback(b)
}
return nil
}
func (b *buffer) count() int64 {
res := int64(len(b.data))
if b.prev != nil {
res += b.prev.count()
}
return res
}


@@ -0,0 +1,761 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"bufio"
"context"
"encoding/json"
"errors"
"fmt"
"io/fs"
"os"
"path"
"path/filepath"
"runtime"
"sort"
"strconv"
"strings"
"sync"
"sync/atomic"
"time"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/linkedin/goavro/v2"
)
const (
CheckpointFilePerms = 0o644
CheckpointDirPerms = 0o755
GCTriggerInterval = DefaultGCTriggerInterval
)
// Whenever changed, update MarshalJSON as well!
type CheckpointMetrics struct {
Data []schema.Float `json:"data"`
Frequency int64 `json:"frequency"`
Start int64 `json:"start"`
}
type CheckpointFile struct {
Metrics map[string]*CheckpointMetrics `json:"metrics"`
Children map[string]*CheckpointFile `json:"children"`
From int64 `json:"from"`
To int64 `json:"to"`
}
var lastCheckpoint time.Time
func Checkpointing(wg *sync.WaitGroup, ctx context.Context) {
lastCheckpoint = time.Now()
if Keys.Checkpoints.FileFormat == "json" {
ms := GetMemoryStore()
go func() {
defer wg.Done()
d, err := time.ParseDuration(Keys.Checkpoints.Interval)
if err != nil {
cclog.Fatal(err)
}
if d <= 0 {
return
}
ticker := time.NewTicker(d)
defer ticker.Stop()
for {
select {
case <-ctx.Done():
return
case <-ticker.C:
cclog.Infof("[METRICSTORE]> start checkpointing (starting at %s)...", lastCheckpoint.Format(time.RFC3339))
now := time.Now()
n, err := ms.ToCheckpoint(Keys.Checkpoints.RootDir,
lastCheckpoint.Unix(), now.Unix())
if err != nil {
cclog.Errorf("[METRICSTORE]> checkpointing failed: %s", err.Error())
} else {
cclog.Infof("[METRICSTORE]> done: %d checkpoint files created", n)
lastCheckpoint = now
}
}
}
}()
} else {
go func() {
defer wg.Done()
select {
case <-ctx.Done():
return
case <-time.After(time.Duration(CheckpointBufferMinutes) * time.Minute):
GetAvroStore().ToCheckpoint(Keys.Checkpoints.RootDir, false)
}
ticker := time.NewTicker(DefaultAvroCheckpointInterval)
defer ticker.Stop()
for {
select {
case <-ctx.Done():
return
case <-ticker.C:
GetAvroStore().ToCheckpoint(Keys.Checkpoints.RootDir, false)
}
}
}()
}
}
// As `Float` implements a custom MarshalJSON() function,
// serializing an array of such types has more overhead
// than one would assume (because of extra allocations, interfaces and so on).
func (cm *CheckpointMetrics) MarshalJSON() ([]byte, error) {
buf := make([]byte, 0, 128+len(cm.Data)*8)
buf = append(buf, `{"frequency":`...)
buf = strconv.AppendInt(buf, cm.Frequency, 10)
buf = append(buf, `,"start":`...)
buf = strconv.AppendInt(buf, cm.Start, 10)
buf = append(buf, `,"data":[`...)
for i, x := range cm.Data {
if i != 0 {
buf = append(buf, ',')
}
if x.IsNaN() {
buf = append(buf, `null`...)
} else {
buf = strconv.AppendFloat(buf, float64(x), 'f', 1, 32)
}
}
buf = append(buf, `]}`...)
return buf, nil
}
// Metrics stored at the lowest 2 levels are not stored away (root and cluster)!
// On a per-host basis a new JSON file is created. I have no idea if this will scale.
// The good thing: Only a host at a time is locked, so this function can run
// in parallel to writes/reads.
func (m *MemoryStore) ToCheckpoint(dir string, from, to int64) (int, error) {
levels := make([]*Level, 0)
selectors := make([][]string, 0)
m.root.lock.RLock()
for sel1, l1 := range m.root.children {
l1.lock.RLock()
for sel2, l2 := range l1.children {
levels = append(levels, l2)
selectors = append(selectors, []string{sel1, sel2})
}
l1.lock.RUnlock()
}
m.root.lock.RUnlock()
type workItem struct {
level *Level
dir string
selector []string
}
n, errs := int32(0), int32(0)
var wg sync.WaitGroup
wg.Add(Keys.NumWorkers)
work := make(chan workItem, Keys.NumWorkers*2)
for worker := 0; worker < Keys.NumWorkers; worker++ {
go func() {
defer wg.Done()
for workItem := range work {
if err := workItem.level.toCheckpoint(workItem.dir, from, to, m); err != nil {
if err == ErrNoNewArchiveData {
continue
}
cclog.Errorf("[METRICSTORE]> error while checkpointing %#v: %s", workItem.selector, err.Error())
atomic.AddInt32(&errs, 1)
} else {
atomic.AddInt32(&n, 1)
}
}
}()
}
for i := 0; i < len(levels); i++ {
dir := path.Join(dir, path.Join(selectors[i]...))
work <- workItem{
level: levels[i],
dir: dir,
selector: selectors[i],
}
}
close(work)
wg.Wait()
if errs > 0 {
return int(n), fmt.Errorf("[METRICSTORE]> %d errors happened while creating checkpoints (%d successes)", errs, n)
}
return int(n), nil
}
func (l *Level) toCheckpointFile(from, to int64, m *MemoryStore) (*CheckpointFile, error) {
l.lock.RLock()
defer l.lock.RUnlock()
retval := &CheckpointFile{
From: from,
To: to,
Metrics: make(map[string]*CheckpointMetrics),
Children: make(map[string]*CheckpointFile),
}
for metric, minfo := range m.Metrics {
b := l.metrics[minfo.offset]
if b == nil {
continue
}
allArchived := true
b.iterFromTo(from, to, func(b *buffer) error {
if !b.archived {
allArchived = false
}
return nil
})
if allArchived {
continue
}
data := make([]schema.Float, (to-from)/b.frequency+1)
data, start, end, err := b.read(from, to, data)
if err != nil {
return nil, err
}
for i := int((end - start) / b.frequency); i < len(data); i++ {
data[i] = schema.NaN
}
retval.Metrics[metric] = &CheckpointMetrics{
Frequency: b.frequency,
Start: start,
Data: data,
}
}
for name, child := range l.children {
val, err := child.toCheckpointFile(from, to, m)
if err != nil {
return nil, err
}
if val != nil {
retval.Children[name] = val
}
}
if len(retval.Children) == 0 && len(retval.Metrics) == 0 {
return nil, nil
}
return retval, nil
}
func (l *Level) toCheckpoint(dir string, from, to int64, m *MemoryStore) error {
cf, err := l.toCheckpointFile(from, to, m)
if err != nil {
return err
}
if cf == nil {
return ErrNoNewArchiveData
}
filepath := path.Join(dir, fmt.Sprintf("%d.json", from))
f, err := os.OpenFile(filepath, os.O_CREATE|os.O_WRONLY, CheckpointFilePerms)
if err != nil && os.IsNotExist(err) {
err = os.MkdirAll(dir, CheckpointDirPerms)
if err == nil {
f, err = os.OpenFile(filepath, os.O_CREATE|os.O_WRONLY, CheckpointFilePerms)
}
}
if err != nil {
return err
}
defer f.Close()
bw := bufio.NewWriter(f)
if err = json.NewEncoder(bw).Encode(cf); err != nil {
return err
}
return bw.Flush()
}
func (m *MemoryStore) FromCheckpoint(dir string, from int64, extension string) (int, error) {
var wg sync.WaitGroup
work := make(chan [2]string, Keys.NumWorkers)
n, errs := int32(0), int32(0)
wg.Add(Keys.NumWorkers)
for worker := 0; worker < Keys.NumWorkers; worker++ {
go func() {
defer wg.Done()
for host := range work {
lvl := m.root.findLevelOrCreate(host[:], len(m.Metrics))
nn, err := lvl.fromCheckpoint(m, filepath.Join(dir, host[0], host[1]), from, extension)
if err != nil {
cclog.Errorf("[METRICSTORE]> error while loading checkpoints for %s/%s: %s", host[0], host[1], err.Error())
atomic.AddInt32(&errs, 1)
}
atomic.AddInt32(&n, int32(nn))
}
}()
}
i := 0
clustersDir, err := os.ReadDir(dir)
for _, clusterDir := range clustersDir {
if !clusterDir.IsDir() {
err = errors.New("[METRICSTORE]> expected only directories at first level of checkpoints/ directory")
goto done
}
hostsDir, e := os.ReadDir(filepath.Join(dir, clusterDir.Name()))
if e != nil {
err = e
goto done
}
for _, hostDir := range hostsDir {
if !hostDir.IsDir() {
err = errors.New("[METRICSTORE]> expected only directories at second level of checkpoints/ directory")
goto done
}
i++
if i%Keys.NumWorkers == 0 && i > GCTriggerInterval {
// Forcing garbage collection runs here regularly during the loading of checkpoints
// will decrease the total heap size after loading everything back to memory is done.
// While loading data, the heap will grow fast, so the GC target size will double
// almost always. By forcing GCs here, we can keep it growing more slowly so that
// at the end, less memory is wasted.
runtime.GC()
}
work <- [2]string{clusterDir.Name(), hostDir.Name()}
}
}
done:
close(work)
wg.Wait()
if err != nil {
return int(n), err
}
if errs > 0 {
return int(n), fmt.Errorf("[METRICSTORE]> %d errors happened while loading checkpoints (%d successes)", errs, n)
}
return int(n), nil
}
// Metrics stored at the lowest 2 levels are not loaded (root and cluster)!
// This function can only be called once and before the very first write or read.
// Different host's data is loaded to memory in parallel.
func (m *MemoryStore) FromCheckpointFiles(dir string, from int64) (int, error) {
if _, err := os.Stat(dir); os.IsNotExist(err) {
// The directory does not exist, so create it using os.MkdirAll()
err := os.MkdirAll(dir, CheckpointDirPerms) // CheckpointDirPerms sets the permissions for the directory
if err != nil {
cclog.Fatalf("[METRICSTORE]> Error creating directory: %#v\n", err)
}
cclog.Debugf("[METRICSTORE]> %#v Directory created successfully", dir)
}
// Determine the configured checkpoint file format
fileFormat := Keys.Checkpoints.FileFormat
if fileFormat == "" {
fileFormat = "avro"
}
// Map to easily get the fallback format
oppositeFormat := map[string]string{
"json": "avro",
"avro": "json",
}
// First, attempt to load the specified format
if found, err := checkFilesWithExtension(dir, fileFormat); err != nil {
return 0, fmt.Errorf("[METRICSTORE]> error checking files with extension: %v", err)
} else if found {
cclog.Infof("[METRICSTORE]> Loading %s files because fileformat is %s", fileFormat, fileFormat)
return m.FromCheckpoint(dir, from, fileFormat)
}
// If not found, attempt the opposite format
altFormat := oppositeFormat[fileFormat]
if found, err := checkFilesWithExtension(dir, altFormat); err != nil {
return 0, fmt.Errorf("[METRICSTORE]> error checking files with extension: %v", err)
} else if found {
cclog.Infof("[METRICSTORE]> Loading %s files but fileformat is %s", altFormat, fileFormat)
return m.FromCheckpoint(dir, from, altFormat)
}
cclog.Print("[METRICSTORE]> No valid checkpoint files found in the directory")
return 0, nil
}
func checkFilesWithExtension(dir string, extension string) (bool, error) {
found := false
err := filepath.Walk(dir, func(path string, info os.FileInfo, err error) error {
if err != nil {
return fmt.Errorf("[METRICSTORE]> error accessing path %s: %v", path, err)
}
if !info.IsDir() && filepath.Ext(info.Name()) == "."+extension {
found = true
// A match was found; skip the rest of the tree
return fs.SkipAll
}
return nil
})
if err != nil {
return false, fmt.Errorf("[METRICSTORE]> error walking through directories: %s", err)
}
return found, nil
}
func (l *Level) loadAvroFile(m *MemoryStore, f *os.File, from int64) error {
br := bufio.NewReader(f)
fileName := f.Name()[strings.LastIndex(f.Name(), "/")+1:]
resolution, err := strconv.ParseInt(fileName[0:strings.Index(fileName, "_")], 10, 64)
if err != nil {
return fmt.Errorf("[METRICSTORE]> error while reading avro file (resolution parsing) : %s", err)
}
fromTimestamp, err := strconv.ParseInt(fileName[strings.Index(fileName, "_")+1:len(fileName)-5], 10, 64)
if err != nil {
return fmt.Errorf("[METRICSTORE]> error converting timestamp from the avro file : %s", err)
}
// Same half-period offset as the line protocol ingest path
fromTimestamp -= (resolution / 2)
return fmt.Errorf("[METRICSTORE]> error converting timestamp from the avro file : %s", err)
}
var recordCounter int64 = 0
// Create a new OCF reader from the buffered reader
ocfReader, err := goavro.NewOCFReader(br)
if err != nil {
return fmt.Errorf("[METRICSTORE]> error creating OCF reader: %w", err)
}
metricsData := make(map[string]schema.FloatArray)
for ocfReader.Scan() {
datum, err := ocfReader.Read()
if err != nil {
return fmt.Errorf("[METRICSTORE]> error while reading avro file : %s", err)
}
record, ok := datum.(map[string]any)
if !ok {
return fmt.Errorf("[METRICSTORE]> failed to assert datum as map[string]any")
}
for key, value := range record {
v, ok := value.(float64)
if !ok {
return fmt.Errorf("[METRICSTORE]> unexpected value type %T for key %s", value, key)
}
metricsData[key] = append(metricsData[key], schema.ConvertToFloat(v))
}
recordCounter += 1
}
to := (fromTimestamp + (recordCounter / (60 / resolution) * 60))
if to < from {
return nil
}
for key, floatArray := range metricsData {
metricName := ReplaceKey(key)
if strings.Contains(metricName, SelectorDelimiter) {
subString := strings.Split(metricName, SelectorDelimiter)
lvl := l
for i := 0; i < len(subString)-1; i++ {
sel := subString[i]
if lvl.children == nil {
lvl.children = make(map[string]*Level)
}
child, ok := lvl.children[sel]
if !ok {
child = &Level{
metrics: make([]*buffer, len(m.Metrics)),
children: nil,
}
lvl.children[sel] = child
}
lvl = child
}
leafMetricName := subString[len(subString)-1]
err = lvl.createBuffer(m, leafMetricName, floatArray, fromTimestamp, resolution)
if err != nil {
return fmt.Errorf("[METRICSTORE]> error while creating buffers from avroReader : %s", err)
}
} else {
err = l.createBuffer(m, metricName, floatArray, fromTimestamp, resolution)
if err != nil {
return fmt.Errorf("[METRICSTORE]> error while creating buffers from avroReader : %s", err)
}
}
}
return nil
}
func (l *Level) createBuffer(m *MemoryStore, metricName string, floatArray schema.FloatArray, from int64, resolution int64) error {
n := len(floatArray)
b := &buffer{
frequency: resolution,
start: from,
data: floatArray[0:n:n],
prev: nil,
next: nil,
archived: true,
}
minfo, ok := m.Metrics[metricName]
if !ok {
return nil
}
prev := l.metrics[minfo.offset]
if prev != nil {
if prev.start > b.start {
return fmt.Errorf("[METRICSTORE]> buffer start time %d is before previous buffer start %d", b.start, prev.start)
}
b.prev = prev
prev.next = b
// Pad the previous buffer with NaNs if there is a gap between the two buffers
missingCount := ((int(b.start) - int(prev.start)) - len(prev.data)*int(b.frequency))
if missingCount > 0 {
missingCount /= int(b.frequency)
for range missingCount {
prev.data = append(prev.data, schema.NaN)
}
prev.data = prev.data[0:len(prev.data):len(prev.data)]
}
}
l.metrics[minfo.offset] = b
return nil
}
func (l *Level) loadJSONFile(m *MemoryStore, f *os.File, from int64) error {
br := bufio.NewReader(f)
cf := &CheckpointFile{}
if err := json.NewDecoder(br).Decode(cf); err != nil {
return err
}
if cf.To != 0 && cf.To < from {
return nil
}
if err := l.loadFile(cf, m); err != nil {
return err
}
return nil
}
func (l *Level) loadFile(cf *CheckpointFile, m *MemoryStore) error {
for name, metric := range cf.Metrics {
n := len(metric.Data)
b := &buffer{
frequency: metric.Frequency,
start: metric.Start,
data: metric.Data[0:n:n],
prev: nil,
next: nil,
archived: true,
}
minfo, ok := m.Metrics[name]
if !ok {
continue
}
prev := l.metrics[minfo.offset]
if prev != nil {
if prev.start > b.start {
return fmt.Errorf("[METRICSTORE]> buffer start time %d is before previous buffer start %d", b.start, prev.start)
}
b.prev = prev
prev.next = b
}
l.metrics[minfo.offset] = b
}
if len(cf.Children) > 0 && l.children == nil {
l.children = make(map[string]*Level)
}
for sel, childCf := range cf.Children {
child, ok := l.children[sel]
if !ok {
child = &Level{
metrics: make([]*buffer, len(m.Metrics)),
children: nil,
}
l.children[sel] = child
}
if err := child.loadFile(childCf, m); err != nil {
return err
}
}
return nil
}
func (l *Level) fromCheckpoint(m *MemoryStore, dir string, from int64, extension string) (int, error) {
direntries, err := os.ReadDir(dir)
if err != nil {
if os.IsNotExist(err) {
return 0, nil
}
return 0, err
}
allFiles := make([]fs.DirEntry, 0)
filesLoaded := 0
for _, e := range direntries {
if e.IsDir() {
child := &Level{
metrics: make([]*buffer, len(m.Metrics)),
children: make(map[string]*Level),
}
files, err := child.fromCheckpoint(m, path.Join(dir, e.Name()), from, extension)
filesLoaded += files
if err != nil {
return filesLoaded, err
}
l.children[e.Name()] = child
} else if strings.HasSuffix(e.Name(), "."+extension) {
allFiles = append(allFiles, e)
} else {
continue
}
}
files, err := findFiles(allFiles, from, extension, true)
if err != nil {
return filesLoaded, err
}
loaders := map[string]func(*MemoryStore, *os.File, int64) error{
"json": l.loadJSONFile,
"avro": l.loadAvroFile,
}
loader := loaders[extension]
for _, filename := range files {
// Use a closure to ensure file is closed immediately after use
err := func() error {
f, err := os.Open(path.Join(dir, filename))
if err != nil {
return err
}
defer f.Close()
return loader(m, f, from)
}()
if err != nil {
return filesLoaded, err
}
filesLoaded += 1
}
return filesLoaded, nil
}
// This will probably get very slow over time!
// A solution could be some sort of an index file in which all other files
// and the timespan they contain is listed.
func findFiles(direntries []fs.DirEntry, t int64, extension string, findMoreRecentFiles bool) ([]string, error) {
nums := map[string]int64{}
for _, e := range direntries {
if !strings.HasSuffix(e.Name(), "."+extension) {
continue
}
ts, err := strconv.ParseInt(e.Name()[strings.Index(e.Name(), "_")+1:len(e.Name())-(len(extension)+1)], 10, 64)
if err != nil {
return nil, err
}
nums[e.Name()] = ts
}
sort.Slice(direntries, func(i, j int) bool {
a, b := direntries[i], direntries[j]
return nums[a.Name()] < nums[b.Name()]
})
filenames := make([]string, 0)
for i := range direntries {
e := direntries[i]
ts1 := nums[e.Name()]
if findMoreRecentFiles && t <= ts1 {
filenames = append(filenames, e.Name())
}
if i == len(direntries)-1 {
continue
}
enext := direntries[i+1]
ts2 := nums[enext.Name()]
if findMoreRecentFiles {
if ts1 < t && t < ts2 {
filenames = append(filenames, e.Name())
}
} else {
if ts2 < t {
filenames = append(filenames, e.Name())
}
}
}
return filenames, nil
}

View File

@@ -0,0 +1,115 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"fmt"
"time"
)
const (
DefaultMaxWorkers = 10
DefaultBufferCapacity = 512
DefaultGCTriggerInterval = 100
DefaultAvroWorkers = 4
DefaultCheckpointBufferMin = 3
DefaultAvroCheckpointInterval = time.Minute
)
type MetricStoreConfig struct {
// Number of concurrent workers for checkpoint and archive operations.
// If not set or 0, defaults to min(runtime.NumCPU()/2+1, 10)
NumWorkers int `json:"num-workers"`
Checkpoints struct {
FileFormat string `json:"file-format"`
Interval string `json:"interval"`
RootDir string `json:"directory"`
Restore string `json:"restore"`
} `json:"checkpoints"`
Debug struct {
DumpToFile string `json:"dump-to-file"`
EnableGops bool `json:"gops"`
} `json:"debug"`
RetentionInMemory string `json:"retention-in-memory"`
Archive struct {
Interval string `json:"interval"`
RootDir string `json:"directory"`
DeleteInstead bool `json:"delete-instead"`
} `json:"archive"`
Subscriptions []struct {
// Channel name
SubscribeTo string `json:"subscribe-to"`
// Allow lines without a cluster tag, use this as default, optional
ClusterTag string `json:"cluster-tag"`
} `json:"subscriptions"`
}
var Keys MetricStoreConfig
// AggregationStrategy for aggregation over multiple values at different cpus/sockets/..., not time!
type AggregationStrategy int
const (
NoAggregation AggregationStrategy = iota
SumAggregation
AvgAggregation
)
func AssignAggregationStrategy(str string) (AggregationStrategy, error) {
switch str {
case "":
return NoAggregation, nil
case "sum":
return SumAggregation, nil
case "avg":
return AvgAggregation, nil
default:
return NoAggregation, fmt.Errorf("[METRICSTORE]> unknown aggregation strategy: %s", str)
}
}
type MetricConfig struct {
// Interval in seconds at which measurements are stored
Frequency int64
// Can be 'sum', 'avg' or null. Describes how to aggregate metrics from the same timestep over the hierarchy.
Aggregation AggregationStrategy
// Private, used internally...
offset int
}
var Metrics map[string]MetricConfig
func GetMetricFrequency(metricName string) (int64, error) {
if metric, ok := Metrics[metricName]; ok {
return metric.Frequency, nil
}
return 0, fmt.Errorf("[METRICSTORE]> metric %s not found", metricName)
}
// AddMetric registers a metric configuration under `name`.
// If a metric with the same name already exists, the entry with the
// larger frequency is kept.
func AddMetric(name string, metric MetricConfig) error {
if Metrics == nil {
Metrics = make(map[string]MetricConfig)
}
if existingMetric, ok := Metrics[name]; ok {
if existingMetric.Frequency != metric.Frequency {
if existingMetric.Frequency < metric.Frequency {
existingMetric.Frequency = metric.Frequency
Metrics[name] = existingMetric
}
}
} else {
Metrics[name] = metric
}
return nil
}
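The merge rule in AddMetric (on a duplicate name, keep the larger frequency) can be sketched standalone; the metric name and values here are illustrative only:

```go
package main

import "fmt"

type metricConfig struct{ Frequency int64 }

// register mirrors AddMetric above: on a duplicate name, the entry
// with the larger frequency wins; otherwise the metric is added as-is.
func register(m map[string]metricConfig, name string, mc metricConfig) {
	if existing, ok := m[name]; ok {
		if existing.Frequency < mc.Frequency {
			m[name] = mc
		}
		return
	}
	m[name] = mc
}

func main() {
	m := map[string]metricConfig{}
	register(m, "flops_any", metricConfig{Frequency: 30})
	register(m, "flops_any", metricConfig{Frequency: 60}) // larger: replaces
	register(m, "flops_any", metricConfig{Frequency: 15}) // smaller: ignored
	fmt.Println(m["flops_any"].Frequency) // 60
}
```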

View File

@@ -0,0 +1,95 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
const configSchema = `{
"type": "object",
"description": "Configuration specific to built-in metric-store.",
"properties": {
"checkpoints": {
"description": "Configuration for checkpointing the metrics within metric-store",
"type": "object",
"properties": {
"file-format": {
"description": "Specify the type of checkpoint file. There are 2 variants: 'avro' and 'json'. If nothing is specified, 'avro' is default.",
"type": "string"
},
"interval": {
"description": "Interval at which the metrics should be checkpointed.",
"type": "string"
},
"directory": {
"description": "Specify the parent directory in which the checkpointed files should be placed.",
"type": "string"
},
"restore": {
"description": "When cc-backend starts up, look for checkpointed files that are less than X hours old and load metrics from these selected checkpoint files.",
"type": "string"
}
}
},
"archive": {
"description": "Configuration for archiving the already checkpointed files.",
"type": "object",
"properties": {
"interval": {
"description": "Interval at which the checkpointed files should be archived.",
"type": "string"
},
"directory": {
"description": "Specify the parent directory in which the archived files should be placed.",
"type": "string"
}
}
},
"retention-in-memory": {
"description": "Keep metrics in memory for the given time interval. After X hours of retention, the metrics are freed.",
"type": "string"
},
"nats": {
"description": "Configuration for accepting published data through NATS.",
"type": "array",
"items": {
"type": "object",
"properties": {
"address": {
"description": "Address of the NATS server.",
"type": "string"
},
"username": {
"description": "Optional: If configured with username/password method.",
"type": "string"
},
"password": {
"description": "Optional: If configured with username/password method.",
"type": "string"
},
"creds-file-path": {
"description": "Optional: If configured with Credential File method. Path to your NATS cred file.",
"type": "string"
},
"subscriptions": {
"description": "Array of subscriptions. Allows subscribing to different subjects and publishers.",
"type": "array",
"items": {
"type": "object",
"properties": {
"subscribe-to": {
"description": "Channel name",
"type": "string"
},
"cluster-tag": {
"description": "Optional: Allow lines without a cluster tag, use this as default",
"type": "string"
}
}
}
}
}
}
}
}
}`
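A minimal configuration accepted by this schema might look like the following; directory paths and durations are illustrative only (interval, restore, and retention values are Go duration strings, parsed with time.ParseDuration):

```json
{
  "checkpoints": {
    "file-format": "avro",
    "interval": "1h",
    "directory": "./var/checkpoints",
    "restore": "48h"
  },
  "archive": {
    "interval": "24h",
    "directory": "./var/archive"
  },
  "retention-in-memory": "48h"
}
```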

View File

@@ -3,7 +3,7 @@
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricstore
package memorystore
import (
"bufio"

View File

@@ -0,0 +1,92 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"bufio"
"fmt"
"time"
)
// MaxMissingDataPoints is the number of missing data points up to which a
// metric is still considered healthy. If a node did not receive the last 5
// data points of a metric, the healthCheck endpoint still reports it as
// healthy; more than 5 missing points mark that metric as unhealthy.
const MaxMissingDataPoints int64 = 5
// MaxUnhealthyMetrics is the number of unhealthy metrics (including
// submetrics) a node may accumulate before the node itself is reported
// unhealthy. Works together with MaxMissingDataPoints: as long as fewer than
// this many metrics miss their last MaxMissingDataPoints data points, the
// node is reported healthy.
const MaxUnhealthyMetrics int64 = 5
func (b *buffer) healthCheck() int64 {
// Check if the buffer is empty
if b.data == nil {
return 1
}
bufferEnd := b.start + b.frequency*int64(len(b.data))
t := time.Now().Unix()
// Check if the buffer is too old
if t-bufferEnd > MaxMissingDataPoints*b.frequency {
return 1
}
return 0
}
func (l *Level) healthCheck(m *MemoryStore, count int64) (int64, error) {
l.lock.RLock()
defer l.lock.RUnlock()
for _, mc := range m.Metrics {
if b := l.metrics[mc.offset]; b != nil {
count += b.healthCheck()
}
}
for _, lvl := range l.children {
c, err := lvl.healthCheck(m, 0)
if err != nil {
return 0, err
}
count += c
}
return count, nil
}
func (m *MemoryStore) HealthCheck(w *bufio.Writer, selector []string) error {
lvl := m.root.findLevel(selector)
if lvl == nil {
return fmt.Errorf("[METRICSTORE]> not found: %#v", selector)
}
buf := make([]byte, 0, 25)
unhealthyMetricsCount, err := lvl.healthCheck(m, 0)
if err != nil {
return err
}
if unhealthyMetricsCount < MaxUnhealthyMetrics {
buf = append(buf, "Healthy"...)
} else {
buf = append(buf, "Unhealthy"...)
}
if _, err = w.Write(buf); err != nil {
return err
}
return w.Flush()
}

View File

@@ -0,0 +1,192 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"sync"
"unsafe"
"github.com/ClusterCockpit/cc-lib/util"
)
// Level is a node in the metric tree; it is called Level because "node"
// could be confused with compute nodes. It can be either a leaf or an
// inner node. In this tree structure, inner nodes can also hold data
// (in `metrics`).
type Level struct {
children map[string]*Level
metrics []*buffer
lock sync.RWMutex
}
// Find the correct level for the given selector, creating it if
// it does not exist. An example selector in the ClusterCockpit
// context could be: []string{"emmy", "host123", "cpu0"}.
// This function would probably benefit a lot from `level.children` being a `sync.Map`.
func (l *Level) findLevelOrCreate(selector []string, nMetrics int) *Level {
if len(selector) == 0 {
return l
}
// Allow concurrent reads:
l.lock.RLock()
var child *Level
var ok bool
if l.children == nil {
// Children map needs to be created...
l.lock.RUnlock()
} else {
child, ok = l.children[selector[0]]
l.lock.RUnlock()
if ok {
return child.findLevelOrCreate(selector[1:], nMetrics)
}
}
// The level does not exist, take write lock for unique access:
l.lock.Lock()
// While this thread waited for the write lock, another thread
// could have created the child node.
if l.children != nil {
child, ok = l.children[selector[0]]
if ok {
l.lock.Unlock()
return child.findLevelOrCreate(selector[1:], nMetrics)
}
}
child = &Level{
metrics: make([]*buffer, nMetrics),
children: nil,
}
if l.children != nil {
l.children[selector[0]] = child
} else {
l.children = map[string]*Level{selector[0]: child}
}
l.lock.Unlock()
return child.findLevelOrCreate(selector[1:], nMetrics)
}
func (l *Level) free(t int64) (int, error) {
l.lock.Lock()
defer l.lock.Unlock()
n := 0
for i, b := range l.metrics {
if b != nil {
delme, m := b.free(t)
n += m
if delme {
if cap(b.data) == BufferCap {
bufferPool.Put(b)
}
l.metrics[i] = nil
}
}
}
for _, l := range l.children {
m, err := l.free(t)
n += m
if err != nil {
return n, err
}
}
return n, nil
}
func (l *Level) sizeInBytes() int64 {
l.lock.RLock()
defer l.lock.RUnlock()
size := int64(0)
for _, b := range l.metrics {
if b != nil {
size += b.count() * int64(unsafe.Sizeof(util.Float(0)))
}
}
for _, child := range l.children {
size += child.sizeInBytes()
}
return size
}
func (l *Level) findLevel(selector []string) *Level {
if len(selector) == 0 {
return l
}
l.lock.RLock()
defer l.lock.RUnlock()
lvl := l.children[selector[0]]
if lvl == nil {
return nil
}
return lvl.findLevel(selector[1:])
}
func (l *Level) findBuffers(selector util.Selector, offset int, f func(b *buffer) error) error {
l.lock.RLock()
defer l.lock.RUnlock()
if len(selector) == 0 {
b := l.metrics[offset]
if b != nil {
return f(b)
}
for _, lvl := range l.children {
err := lvl.findBuffers(nil, offset, f)
if err != nil {
return err
}
}
return nil
}
sel := selector[0]
if len(sel.String) != 0 && l.children != nil {
lvl, ok := l.children[sel.String]
if ok {
err := lvl.findBuffers(selector[1:], offset, f)
if err != nil {
return err
}
}
return nil
}
if sel.Group != nil && l.children != nil {
for _, key := range sel.Group {
lvl, ok := l.children[key]
if ok {
err := lvl.findBuffers(selector[1:], offset, f)
if err != nil {
return err
}
}
}
return nil
}
if sel.Any && l.children != nil {
for _, lvl := range l.children {
if err := lvl.findBuffers(selector[1:], offset, f); err != nil {
return err
}
}
return nil
}
return nil
}

View File

@@ -0,0 +1,258 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"context"
"fmt"
"sync"
"time"
"github.com/ClusterCockpit/cc-backend/pkg/nats"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/influxdata/line-protocol/v2/lineprotocol"
)
func ReceiveNats(ms *MemoryStore,
workers int,
ctx context.Context,
) error {
nc := nats.GetClient()
if nc == nil {
cclog.Warn("NATS client not initialized")
return nil
}
var wg sync.WaitGroup
msgs := make(chan []byte, workers*2)
for _, sc := range Keys.Subscriptions {
clusterTag := sc.ClusterTag
if workers > 1 {
wg.Add(workers)
for range workers {
go func() {
for m := range msgs {
dec := lineprotocol.NewDecoderWithBytes(m)
if err := DecodeLine(dec, ms, clusterTag); err != nil {
cclog.Errorf("error: %s", err.Error())
}
}
wg.Done()
}()
}
nc.Subscribe(sc.SubscribeTo, func(subject string, data []byte) {
msgs <- data
})
} else {
nc.Subscribe(sc.SubscribeTo, func(subject string, data []byte) {
dec := lineprotocol.NewDecoderWithBytes(data)
if err := DecodeLine(dec, ms, clusterTag); err != nil {
cclog.Errorf("error: %s", err.Error())
}
})
}
cclog.Infof("NATS subscription to '%s' established", sc.SubscribeTo)
}
// Close the worker channel only once the context is cancelled:
// the NATS subscriptions keep feeding msgs until then.
go func() {
<-ctx.Done()
close(msgs)
wg.Wait()
}()
return nil
}
// reorder places `prefix` in front of `buf`, reusing
// `buf`'s backing array in place when its capacity allows.
func reorder(buf, prefix []byte) []byte {
n := len(prefix)
m := len(buf)
if cap(buf) < m+n {
return append(prefix[:n:n], buf...)
} else {
buf = buf[:n+m]
for i := m - 1; i >= 0; i-- {
buf[i+n] = buf[i]
}
for i := range n {
buf[i] = prefix[i]
}
return buf
}
}
// Decode lines using dec and make write calls to the MemoryStore.
// If a line is missing its cluster tag, use clusterDefault as default.
func DecodeLine(dec *lineprotocol.Decoder,
ms *MemoryStore,
clusterDefault string,
) error {
// Reduce allocations in loop:
t := time.Now()
metric, metricBuf := Metric{}, make([]byte, 0, 16)
selector := make([]string, 0, 4)
typeBuf, subTypeBuf := make([]byte, 0, 16), make([]byte, 0)
// Optimize for the case where all lines in a "batch" are about the same
// cluster and host. By using `WriteToLevel` (level = host), we do not need
// to take the root- and cluster-level lock as often.
var lvl *Level = nil
prevCluster, prevHost := "", ""
var ok bool
for dec.Next() {
rawmeasurement, err := dec.Measurement()
if err != nil {
return err
}
// Needs to be copied because another call to dec.* would
// invalidate the returned slice.
metricBuf = append(metricBuf[:0], rawmeasurement...)
// The go compiler optimizes map[string(byteslice)] lookups:
metric.MetricConfig, ok = ms.Metrics[string(rawmeasurement)]
if !ok {
continue
}
typeBuf, subTypeBuf = typeBuf[:0], subTypeBuf[:0]
cluster, host := clusterDefault, ""
for {
key, val, err := dec.NextTag()
if err != nil {
return err
}
if key == nil {
break
}
// The go compiler optimizes string([]byte{...}) == "...":
switch string(key) {
case "cluster":
if string(val) == prevCluster {
cluster = prevCluster
} else {
cluster = string(val)
lvl = nil
}
case "hostname", "host":
if string(val) == prevHost {
host = prevHost
} else {
host = string(val)
lvl = nil
}
case "type":
if string(val) == "node" {
break
}
// We cannot be sure that the "type" tag comes before the "type-id" tag:
if len(typeBuf) == 0 {
typeBuf = append(typeBuf, val...)
} else {
typeBuf = reorder(typeBuf, val)
}
case "type-id":
typeBuf = append(typeBuf, val...)
case "subtype":
// We cannot be sure that the "subtype" tag comes before the "stype-id" tag:
if len(subTypeBuf) == 0 {
subTypeBuf = append(subTypeBuf, val...)
} else {
subTypeBuf = reorder(subTypeBuf, val)
}
case "stype-id":
subTypeBuf = append(subTypeBuf, val...)
default:
}
}
// If the cluster or host changed, the lvl was set to nil
if lvl == nil {
selector = selector[:2]
selector[0], selector[1] = cluster, host
lvl = ms.GetLevel(selector)
prevCluster, prevHost = cluster, host
}
// subtypes:
selector = selector[:0]
if len(typeBuf) > 0 {
selector = append(selector, string(typeBuf)) // <- Allocation :(
if len(subTypeBuf) > 0 {
selector = append(selector, string(subTypeBuf))
}
}
for {
key, val, err := dec.NextField()
if err != nil {
return err
}
if key == nil {
break
}
if string(key) != "value" {
return fmt.Errorf("host %s: unknown field: '%s' (value: %#v)", host, string(key), val)
}
if val.Kind() == lineprotocol.Float {
metric.Value = schema.Float(val.FloatV())
} else if val.Kind() == lineprotocol.Int {
metric.Value = schema.Float(val.IntV())
} else if val.Kind() == lineprotocol.Uint {
metric.Value = schema.Float(val.UintV())
} else {
return fmt.Errorf("host %s: unsupported value type in message: %s", host, val.Kind().String())
}
}
if t, err = dec.Time(lineprotocol.Second, t); err != nil {
t = time.Now()
if t, err = dec.Time(lineprotocol.Millisecond, t); err != nil {
t = time.Now()
if t, err = dec.Time(lineprotocol.Microsecond, t); err != nil {
t = time.Now()
if t, err = dec.Time(lineprotocol.Nanosecond, t); err != nil {
return fmt.Errorf("host %s: timestamp : %#v with error : %#v", host, t, err.Error())
}
}
}
}
ts := t.Unix()
if Keys.Checkpoints.FileFormat != "json" {
LineProtocolMessages <- &AvroStruct{
MetricName: string(metricBuf),
Cluster: cluster,
Node: host,
Selector: append([]string{}, selector...),
Value: metric.Value,
Timestamp: ts,
}
}
if err := ms.WriteToLevel(lvl, selector, ts, []Metric{metric}); err != nil {
return err
}
}
return nil
}
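The reorder helper above (used when a "type-id" or "stype-id" tag arrives before its "type"/"subtype" counterpart) can be exercised standalone; a minimal sketch:

```go
package main

import "fmt"

// reorder mirrors the helper above: place prefix in front of buf,
// reusing buf's backing array when its capacity allows. Go's copy has
// memmove semantics, so the overlapping shift is safe.
func reorder(buf, prefix []byte) []byte {
	n, m := len(prefix), len(buf)
	if cap(buf) < m+n {
		return append(prefix[:n:n], buf...)
	}
	buf = buf[:n+m]
	copy(buf[n:], buf[:m]) // shift existing bytes right by n
	copy(buf, prefix)
	return buf
}

func main() {
	// A "type-id" value arrives before its "type" tag:
	buf := append(make([]byte, 0, 16), "0"...)
	buf = reorder(buf, []byte("cpu"))
	fmt.Println(string(buf)) // cpu0
}
```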

View File

@@ -0,0 +1,429 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// Package memorystore provides an efficient in-memory time-series metric storage system
// with support for hierarchical data organization, checkpointing, and archiving.
//
// The package organizes metrics in a tree structure (cluster → host → component) and
// provides concurrent read/write access to metric data with configurable aggregation strategies.
// Background goroutines handle periodic checkpointing (JSON or Avro format), archiving old data,
// and enforcing retention policies.
//
// Key features:
// - In-memory metric storage with configurable retention
// - Hierarchical data organization (selectors)
// - Concurrent checkpoint/archive workers
// - Support for sum and average aggregation
// - NATS integration for metric ingestion
package memorystore
import (
"bytes"
"context"
"encoding/json"
"errors"
"runtime"
"sync"
"time"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/resampler"
"github.com/ClusterCockpit/cc-lib/schema"
"github.com/ClusterCockpit/cc-lib/util"
)
var (
singleton sync.Once
msInstance *MemoryStore
// shutdownFunc stores the context cancellation function created in Init
// and is called during Shutdown to cancel all background goroutines
shutdownFunc context.CancelFunc
)
type Metric struct {
Name string
Value schema.Float
MetricConfig MetricConfig
}
type MemoryStore struct {
Metrics map[string]MetricConfig
root Level
}
func Init(rawConfig json.RawMessage, wg *sync.WaitGroup) {
startupTime := time.Now()
if rawConfig != nil {
config.Validate(configSchema, rawConfig)
dec := json.NewDecoder(bytes.NewReader(rawConfig))
// dec.DisallowUnknownFields()
if err := dec.Decode(&Keys); err != nil {
cclog.Abortf("[METRICSTORE]> Metric Store Config Init: Could not decode config file '%s'.\nError: %s\n", rawConfig, err.Error())
}
}
// Set NumWorkers from config or use default
if Keys.NumWorkers <= 0 {
Keys.NumWorkers = min(runtime.NumCPU()/2+1, DefaultMaxWorkers)
}
cclog.Debugf("[METRICSTORE]> Using %d workers for checkpoint/archive operations\n", Keys.NumWorkers)
// Helper function to add metric configuration
addMetricConfig := func(mc schema.MetricConfig) {
agg, err := AssignAggregationStrategy(mc.Aggregation)
if err != nil {
cclog.Warnf("Could not find aggregation strategy for metric config '%s': %s", mc.Name, err.Error())
}
AddMetric(mc.Name, MetricConfig{
Frequency: int64(mc.Timestep),
Aggregation: agg,
})
}
for _, c := range archive.Clusters {
for _, mc := range c.MetricConfig {
addMetricConfig(*mc)
}
for _, sc := range c.SubClusters {
for _, mc := range sc.MetricConfig {
addMetricConfig(mc)
}
}
}
// Pass the config.MetricStoreKeys
InitMetrics(Metrics)
ms := GetMemoryStore()
d, err := time.ParseDuration(Keys.Checkpoints.Restore)
if err != nil {
cclog.Fatal(err)
}
restoreFrom := startupTime.Add(-d)
cclog.Infof("[METRICSTORE]> Loading checkpoints newer than %s\n", restoreFrom.Format(time.RFC3339))
files, err := ms.FromCheckpointFiles(Keys.Checkpoints.RootDir, restoreFrom.Unix())
loadedData := ms.SizeInBytes() / 1024 / 1024 // In MB
if err != nil {
cclog.Fatalf("[METRICSTORE]> Loading checkpoints failed: %s\n", err.Error())
} else {
cclog.Infof("[METRICSTORE]> Checkpoints loaded (%d files, %d MB, that took %fs)\n", files, loadedData, time.Since(startupTime).Seconds())
}
// Try to use less memory by forcing a GC run here.
// The default GOGC of 100 means that a GC is only triggered
// once the ratio of new allocations exceeds the previously
// active heap. Forcing a GC here sets the "previously active
// heap" to a minimum.
runtime.GC()
ctx, shutdown := context.WithCancel(context.Background())
wg.Add(4)
Retention(wg, ctx)
Checkpointing(wg, ctx)
Archiving(wg, ctx)
DataStaging(wg, ctx)
// Note: Signal handling has been removed from this function.
// The caller is responsible for handling shutdown signals and calling
// the shutdown() function when appropriate.
// Store the shutdown function for later use by Shutdown()
shutdownFunc = shutdown
err = ReceiveNats(ms, 1, ctx)
if err != nil {
cclog.Fatal(err)
}
}
// InitMetrics creates a new, initialized instance of a MemoryStore.
// Will panic if values in the metric configurations are invalid.
func InitMetrics(metrics map[string]MetricConfig) {
singleton.Do(func() {
offset := 0
for key, cfg := range metrics {
if cfg.Frequency == 0 {
panic("[METRICSTORE]> invalid frequency")
}
metrics[key] = MetricConfig{
Frequency: cfg.Frequency,
Aggregation: cfg.Aggregation,
offset: offset,
}
offset += 1
}
msInstance = &MemoryStore{
root: Level{
metrics: make([]*buffer, len(metrics)),
children: make(map[string]*Level),
},
Metrics: metrics,
}
})
}
func GetMemoryStore() *MemoryStore {
if msInstance == nil {
return nil
}
return msInstance
}
func Shutdown() {
// Check if memorystore was initialized
if msInstance == nil {
cclog.Debug("[METRICSTORE]> MemoryStore not initialized, skipping shutdown")
return
}
// Cancel the context to signal all background goroutines to stop
if shutdownFunc != nil {
shutdownFunc()
}
cclog.Infof("[METRICSTORE]> Writing to '%s'...\n", Keys.Checkpoints.RootDir)
var files int
var err error
ms := GetMemoryStore()
if Keys.Checkpoints.FileFormat == "json" {
files, err = ms.ToCheckpoint(Keys.Checkpoints.RootDir, lastCheckpoint.Unix(), time.Now().Unix())
} else {
files, err = GetAvroStore().ToCheckpoint(Keys.Checkpoints.RootDir, true)
close(LineProtocolMessages)
}
if err != nil {
cclog.Errorf("[METRICSTORE]> Writing checkpoint failed: %s\n", err.Error())
}
cclog.Infof("[METRICSTORE]> Done! (%d files written)\n", files)
}
func getName(m *MemoryStore, i int) string {
for key, val := range m.Metrics {
if val.offset == i {
return key
}
}
return ""
}
func Retention(wg *sync.WaitGroup, ctx context.Context) {
ms := GetMemoryStore()
go func() {
defer wg.Done()
d, err := time.ParseDuration(Keys.RetentionInMemory)
if err != nil {
cclog.Fatal(err)
}
if d <= 0 {
return
}
tickInterval := d / 2
if tickInterval <= 0 {
return
}
ticker := time.NewTicker(tickInterval)
defer ticker.Stop()
for {
select {
case <-ctx.Done():
return
case <-ticker.C:
t := time.Now().Add(-d)
cclog.Infof("[METRICSTORE]> start freeing buffers (older than %s)...\n", t.Format(time.RFC3339))
freed, err := ms.Free(nil, t.Unix())
if err != nil {
cclog.Errorf("[METRICSTORE]> freeing up buffers failed: %s\n", err.Error())
} else {
cclog.Infof("[METRICSTORE]> done: %d buffers freed\n", freed)
}
}
}
}()
}
// Write all values in `metrics` to the level specified by `selector` for time `ts`.
// Look at `findLevelOrCreate` for how selectors work.
func (m *MemoryStore) Write(selector []string, ts int64, metrics []Metric) error {
var ok bool
for i, metric := range metrics {
if metric.MetricConfig.Frequency == 0 {
metric.MetricConfig, ok = m.Metrics[metric.Name]
if !ok {
metric.MetricConfig.Frequency = 0
}
metrics[i] = metric
}
}
return m.WriteToLevel(&m.root, selector, ts, metrics)
}
func (m *MemoryStore) GetLevel(selector []string) *Level {
return m.root.findLevelOrCreate(selector, len(m.Metrics))
}
// WriteToLevel assumes that the MetricConfig in each element of `metrics` is already filled in
func (m *MemoryStore) WriteToLevel(l *Level, selector []string, ts int64, metrics []Metric) error {
l = l.findLevelOrCreate(selector, len(m.Metrics))
l.lock.Lock()
defer l.lock.Unlock()
for _, metric := range metrics {
if metric.MetricConfig.Frequency == 0 {
continue
}
b := l.metrics[metric.MetricConfig.offset]
if b == nil {
// First write to this metric and level
b = newBuffer(ts, metric.MetricConfig.Frequency)
l.metrics[metric.MetricConfig.offset] = b
}
nb, err := b.write(ts, metric.Value)
if err != nil {
return err
}
// Last write created a new buffer...
if b != nb {
l.metrics[metric.MetricConfig.offset] = nb
}
}
return nil
}
// Read returns all values for metric `metric` from `from` to `to` for the selected level(s).
// If the level does not hold the metric itself, the data will be aggregated recursively from the children.
// The second and third return value are the actual from/to for the data. Those can be different from
// the range asked for if no data was available.
func (m *MemoryStore) Read(selector util.Selector, metric string, from, to, resolution int64) ([]schema.Float, int64, int64, int64, error) {
if from > to {
return nil, 0, 0, 0, errors.New("[METRICSTORE]> invalid time range")
}
minfo, ok := m.Metrics[metric]
if !ok {
return nil, 0, 0, 0, errors.New("[METRICSTORE]> unknown metric: " + metric)
}
n, data := 0, make([]schema.Float, (to-from)/minfo.Frequency+1)
err := m.root.findBuffers(selector, minfo.offset, func(b *buffer) error {
cdata, cfrom, cto, err := b.read(from, to, data)
if err != nil {
return err
}
if n == 0 {
from, to = cfrom, cto
} else if from != cfrom || to != cto || len(data) != len(cdata) {
missingfront, missingback := int((from-cfrom)/minfo.Frequency), int((to-cto)/minfo.Frequency)
if missingfront != 0 {
return ErrDataDoesNotAlign
}
newlen := len(cdata) - missingback
if newlen < 1 {
return ErrDataDoesNotAlign
}
cdata = cdata[0:newlen]
if len(cdata) != len(data) {
return ErrDataDoesNotAlign
}
from, to = cfrom, cto
}
data = cdata
n += 1
return nil
})
if err != nil {
return nil, 0, 0, 0, err
} else if n == 0 {
return nil, 0, 0, 0, errors.New("[METRICSTORE]> metric or host not found")
} else if n > 1 {
if minfo.Aggregation == AvgAggregation {
normalize := 1. / schema.Float(n)
for i := 0; i < len(data); i++ {
data[i] *= normalize
}
} else if minfo.Aggregation != SumAggregation {
return nil, 0, 0, 0, errors.New("[METRICSTORE]> invalid aggregation")
}
}
data, resolution, err = resampler.LargestTriangleThreeBucket(data, minfo.Frequency, resolution)
if err != nil {
return nil, 0, 0, 0, err
}
return data, from, to, resolution, nil
}
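When `Read` aggregates over more than one child (n > 1), it sums the aligned series element-wise and, for `AvgAggregation`, multiplies by `1/n`. A minimal sketch of that normalization step on plain `float64` slices (hypothetical `aggregate` helper):

```go
package main

import "fmt"

// aggregate sums aligned series element-wise and, for an "avg"
// strategy, normalizes by the number of contributing series --
// the same normalize = 1/n step used in MemoryStore.Read.
func aggregate(series [][]float64, avg bool) []float64 {
	if len(series) == 0 {
		return nil
	}
	out := make([]float64, len(series[0]))
	for _, s := range series {
		for i, v := range s {
			out[i] += v
		}
	}
	if avg {
		norm := 1.0 / float64(len(series))
		for i := range out {
			out[i] *= norm
		}
	}
	return out
}

func main() {
	fmt.Println(aggregate([][]float64{{2, 4}, {4, 8}}, true)) // [3 6]
}
```

With `avg == false` the same call returns the plain sum, which corresponds to `SumAggregation`; any other strategy with n > 1 is rejected, as in the code above.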
// Free releases all buffers for the selected level and all its children that
// contain only values older than `t`.
func (m *MemoryStore) Free(selector []string, t int64) (int, error) {
return m.GetLevel(selector).free(t)
}
func (m *MemoryStore) FreeAll() error {
for k := range m.root.children {
delete(m.root.children, k)
}
return nil
}
func (m *MemoryStore) SizeInBytes() int64 {
return m.root.sizeInBytes()
}
// ListChildren returns, for the given selector, a list of all children of the
// selected level.
func (m *MemoryStore) ListChildren(selector []string) []string {
lvl := &m.root
for lvl != nil && len(selector) != 0 {
lvl.lock.RLock()
next := lvl.children[selector[0]]
lvl.lock.RUnlock()
lvl = next
selector = selector[1:]
}
if lvl == nil {
return nil
}
lvl.lock.RLock()
defer lvl.lock.RUnlock()
children := make([]string, 0, len(lvl.children))
for child := range lvl.children {
children = append(children, child)
}
return children
}


@@ -0,0 +1,156 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package memorystore
import (
"testing"
"github.com/ClusterCockpit/cc-lib/schema"
)
func TestAssignAggregationStrategy(t *testing.T) {
tests := []struct {
name string
input string
expected AggregationStrategy
wantErr bool
}{
{"empty string", "", NoAggregation, false},
{"sum", "sum", SumAggregation, false},
{"avg", "avg", AvgAggregation, false},
{"invalid", "invalid", NoAggregation, true},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
result, err := AssignAggregationStrategy(tt.input)
if (err != nil) != tt.wantErr {
t.Errorf("AssignAggregationStrategy(%q) error = %v, wantErr %v", tt.input, err, tt.wantErr)
return
}
if result != tt.expected {
t.Errorf("AssignAggregationStrategy(%q) = %v, want %v", tt.input, result, tt.expected)
}
})
}
}
func TestAddMetric(t *testing.T) {
// Reset Metrics before test
Metrics = make(map[string]MetricConfig)
err := AddMetric("test_metric", MetricConfig{
Frequency: 60,
Aggregation: SumAggregation,
})
if err != nil {
t.Errorf("AddMetric() error = %v", err)
}
if _, ok := Metrics["test_metric"]; !ok {
t.Error("AddMetric() did not add metric to Metrics map")
}
// Test updating with higher frequency
err = AddMetric("test_metric", MetricConfig{
Frequency: 120,
Aggregation: SumAggregation,
})
if err != nil {
t.Errorf("AddMetric() error = %v", err)
}
if Metrics["test_metric"].Frequency != 120 {
t.Errorf("AddMetric() frequency = %d, want 120", Metrics["test_metric"].Frequency)
}
// Test updating with lower frequency (should not update)
err = AddMetric("test_metric", MetricConfig{
Frequency: 30,
Aggregation: SumAggregation,
})
if err != nil {
t.Errorf("AddMetric() error = %v", err)
}
if Metrics["test_metric"].Frequency != 120 {
t.Errorf("AddMetric() frequency = %d, want 120 (should not downgrade)", Metrics["test_metric"].Frequency)
}
}
func TestGetMetricFrequency(t *testing.T) {
// Reset Metrics before test
Metrics = map[string]MetricConfig{
"test_metric": {
Frequency: 60,
Aggregation: SumAggregation,
},
}
freq, err := GetMetricFrequency("test_metric")
if err != nil {
t.Errorf("GetMetricFrequency() error = %v", err)
}
if freq != 60 {
t.Errorf("GetMetricFrequency() = %d, want 60", freq)
}
_, err = GetMetricFrequency("nonexistent")
if err == nil {
t.Error("GetMetricFrequency() expected error for nonexistent metric")
}
}
func TestBufferWrite(t *testing.T) {
b := newBuffer(100, 10)
// Test writing value
nb, err := b.write(100, schema.Float(42.0))
if err != nil {
t.Errorf("buffer.write() error = %v", err)
}
if nb != b {
t.Error("buffer.write() created new buffer unexpectedly")
}
if len(b.data) != 1 {
t.Errorf("buffer.write() len(data) = %d, want 1", len(b.data))
}
if b.data[0] != schema.Float(42.0) {
t.Errorf("buffer.write() data[0] = %v, want 42.0", b.data[0])
}
// Test writing value from past (should error)
_, err = b.write(50, schema.Float(10.0))
if err == nil {
t.Error("buffer.write() expected error for past timestamp")
}
}
func TestBufferRead(t *testing.T) {
b := newBuffer(100, 10)
// Write some test data
b.write(100, schema.Float(1.0))
b.write(110, schema.Float(2.0))
b.write(120, schema.Float(3.0))
// Read data
data := make([]schema.Float, 3)
result, from, to, err := b.read(100, 130, data)
if err != nil {
t.Errorf("buffer.read() error = %v", err)
}
// Buffer read should return from as firstWrite (start + freq/2)
if from != 100 {
t.Errorf("buffer.read() from = %d, want 100", from)
}
if to != 130 {
t.Errorf("buffer.read() to = %d, want 130", to)
}
if len(result) != 3 {
t.Errorf("buffer.read() len(result) = %d, want 3", len(result))
}
}


@@ -3,28 +3,7 @@
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// This file implements high-level query functions for loading job metric data
// with automatic scope transformation and aggregation.
//
// Key Concepts:
//
// Metric Scopes: Metrics are collected at different granularities (native scope):
// - HWThread: Per hardware thread
// - Core: Per CPU core
// - Socket: Per CPU socket
// - MemoryDomain: Per memory domain (NUMA)
// - Accelerator: Per GPU/accelerator
// - Node: Per compute node
//
// Scope Transformation: The buildQueries functions transform between native scope
// and requested scope by:
// - Aggregating finer-grained data (e.g., HWThread → Core → Socket → Node)
// - Rejecting requests for finer granularity than available
// - Handling special cases (e.g., Accelerator metrics)
//
// Query Building: Constructs APIQuery structures with proper selectors (Type, TypeIds)
// based on cluster topology and job resources.
package metricstore
package memorystore
import (
"context"
@@ -34,71 +13,23 @@ import (
"time"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
type InternalMetricStore struct{}
var MetricStoreHandle *InternalMetricStore
// HealthCheck delegates to the internal MemoryStore's HealthCheck.
func (ccms *InternalMetricStore) HealthCheck(cluster string,
nodes []string, metrics []string,
) (map[string]HealthCheckResult, error) {
return GetMemoryStore().HealthCheck(cluster, nodes, metrics)
}
// TestLoadDataCallback allows tests to override LoadData behavior for testing purposes.
// When set to a non-nil function, LoadData will call this function instead of the default implementation.
var TestLoadDataCallback func(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error)
// LoadData loads metric data for a specific job with automatic scope transformation.
//
// This is the primary function for retrieving job metric data. It handles:
// - Building queries with scope transformation via buildQueries
// - Fetching data from the metric store
// - Organizing results by metric and scope
// - Converting NaN statistics to 0 for JSON compatibility
// - Partial error handling (returns data for successful queries even if some fail)
//
// Parameters:
// - job: Job metadata including cluster, resources, and time range
// - metrics: List of metric names to load
// - scopes: Requested metric scopes (will be transformed to match native scopes)
// - ctx: Context for cancellation (currently unused but reserved for future use)
// - resolution: Data resolution in seconds (0 for native resolution)
//
// Returns:
// - JobData: Map of metric → scope → JobMetric with time-series data and statistics
// - Error: Returns error if query building or fetching fails, or partial error listing failed hosts
//
// Example:
//
// jobData, err := LoadData(job, []string{"cpu_load", "mem_used"}, []schema.MetricScope{schema.MetricScopeNode}, ctx, 60)
func (ccms *InternalMetricStore) LoadData(
func LoadData(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
resolution int,
) (schema.JobData, error) {
if TestLoadDataCallback != nil {
return TestLoadDataCallback(job, metrics, scopes, ctx, resolution)
}
queries, assignedScope, err := buildQueries(job, metrics, scopes, int64(resolution))
if err != nil {
cclog.Errorf("Error while building queries for jobId %d, Metrics %v, Scopes %v: %s", job.JobID, metrics, scopes, err.Error())
return nil, err
}
// Verify assignment is correct - log any inconsistencies for debugging
if len(queries) != len(assignedScope) {
cclog.Errorf("Critical error: queries and assignedScope have different lengths after buildQueries: %d vs %d",
len(queries), len(assignedScope))
}
req := APIQueryRequest{
Cluster: job.Cluster,
From: job.StartTime,
@@ -116,37 +47,11 @@ func (ccms *InternalMetricStore) LoadData(
var errors []string
jobData := make(schema.JobData)
// Add safety check for potential index out of range errors
if len(resBody.Results) != len(req.Queries) || len(assignedScope) != len(req.Queries) {
cclog.Warnf("Mismatch in query results count: queries=%d, results=%d, assignedScope=%d",
len(req.Queries), len(resBody.Results), len(assignedScope))
if len(resBody.Results) > len(req.Queries) {
resBody.Results = resBody.Results[:len(req.Queries)]
}
if len(assignedScope) > len(req.Queries) {
assignedScope = assignedScope[:len(req.Queries)]
}
}
for i, row := range resBody.Results {
// Safety check to prevent index out of range errors
if i >= len(req.Queries) || i >= len(assignedScope) {
cclog.Warnf("Index out of range prevented: i=%d, queries=%d, assignedScope=%d",
i, len(req.Queries), len(assignedScope))
continue
}
query := req.Queries[i]
metric := query.Metric
scope := assignedScope[i]
mc := archive.GetMetricConfig(job.Cluster, metric)
if mc == nil {
cclog.Warnf("Metric config not found for %s on cluster %s", metric, job.Cluster)
continue
}
if _, ok := jobData[metric]; !ok {
jobData[metric] = make(map[schema.MetricScope]*schema.JobMetric)
}
@@ -175,22 +80,20 @@ func (ccms *InternalMetricStore) LoadData(
id := (*string)(nil)
if query.Type != nil {
// Check if ndx is within the bounds of TypeIds slice
if ndx < len(query.TypeIds) {
id = new(string)
*id = query.TypeIds[ndx]
} else {
// Log the error but continue processing
cclog.Warnf("TypeIds index out of range: %d with length %d for metric %s on host %s",
ndx, len(query.TypeIds), query.Metric, query.Hostname)
}
id = new(string)
*id = query.TypeIds[ndx]
}
sanitizeStats(&res)
if res.Avg.IsNaN() || res.Min.IsNaN() || res.Max.IsNaN() {
// "schema.Float()" because regular float64 can not be JSONed when NaN.
res.Avg = schema.Float(0)
res.Min = schema.Float(0)
res.Max = schema.Float(0)
}
jobMetric.Series = append(jobMetric.Series, schema.Series{
Hostname: query.Hostname,
ID: id,
Id: id,
Statistics: schema.MetricStatistics{
Avg: float64(res.Avg),
Min: float64(res.Min),
@@ -216,10 +119,6 @@ func (ccms *InternalMetricStore) LoadData(
return jobData, nil
}
// Pre-converted scope strings avoid repeated string(MetricScope) allocations during
// query construction. These are used in APIQuery.Type field throughout buildQueries
// and buildNodeQueries functions. Converting once at package initialization improves
// performance for high-volume query building.
var (
hwthreadString = string(schema.MetricScopeHWThread)
coreString = string(schema.MetricScopeCore)
@@ -228,41 +127,12 @@ var (
acceleratorString = string(schema.MetricScopeAccelerator)
)
// buildQueries constructs APIQuery structures with automatic scope transformation for a job.
//
// This function implements the core scope transformation logic, handling all combinations of
// native metric scopes and requested scopes. It uses the cluster topology to determine which
// hardware IDs to include in each query.
//
// Scope Transformation Rules:
// - If native scope >= requested scope: Aggregates data (Aggregate=true in APIQuery)
// - If native scope < requested scope: Returns error (cannot increase granularity)
// - Special handling for Accelerator scope (independent of CPU hierarchy)
//
// The function generates one or more APIQuery per (metric, scope, host) combination:
// - For non-aggregated queries: One query with all relevant IDs
// - For aggregated queries: May generate multiple queries (e.g., one per socket/core)
//
// Parameters:
// - job: Job metadata including cluster, subcluster, and resource allocation
// - metrics: List of metrics to query
// - scopes: Requested scopes for each metric
// - resolution: Data resolution in seconds
//
// Returns:
// - []APIQuery: List of queries to execute
// - []schema.MetricScope: Assigned scope for each query (after transformation)
// - error: Returns error if topology lookup fails or unhandled scope combination encountered
func buildQueries(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
resolution int64,
) ([]APIQuery, []schema.MetricScope, error) {
if len(job.Resources) == 0 {
return nil, nil, fmt.Errorf("METRICDATA/CCMS > no resources allocated for job %d", job.JobID)
}
queries := make([]APIQuery, 0, len(metrics)*len(scopes)*len(job.Resources))
assignedScope := []schema.MetricScope{}
@@ -275,7 +145,8 @@ func buildQueries(
for _, metric := range metrics {
mc := archive.GetMetricConfig(job.Cluster, metric)
if mc == nil {
cclog.Warnf("metric '%s' is not specified for cluster '%s'", metric, job.Cluster)
// return nil, fmt.Errorf("METRICDATA/CCMS > metric '%s' is not specified for cluster '%s'", metric, job.Cluster)
cclog.Infof("metric '%s' is not specified for cluster '%s'", metric, job.Cluster)
continue
}
@@ -293,9 +164,10 @@ func buildQueries(
}
}
// Avoid duplicates using map for O(1) lookup
handledScopes := make(map[schema.MetricScope]bool, 3)
// Avoid duplicates...
handledScopes := make([]schema.MetricScope, 0, 3)
scopesLoop:
for _, requestedScope := range scopes {
nativeScope := mc.Scope
if nativeScope == schema.MetricScopeAccelerator && job.NumAcc == 0 {
@@ -303,10 +175,12 @@ func buildQueries(
}
scope := nativeScope.Max(requestedScope)
if handledScopes[scope] {
continue
for _, s := range handledScopes {
if scope == s {
continue scopesLoop
}
}
handledScopes[scope] = true
handledScopes = append(handledScopes, scope)
for _, host := range job.Resources {
hwthreads := host.HWThreads
@@ -351,7 +225,7 @@ func buildQueries(
continue
}
// HWThread -> HWThread
// HWThread -> HWThread
if nativeScope == schema.MetricScopeHWThread && scope == schema.MetricScopeHWThread {
queries = append(queries, APIQuery{
Metric: metric,
@@ -475,7 +349,7 @@ func buildQueries(
continue
}
// MemoryDomain -> Node
// MemoryDomain -> Node
if nativeScope == schema.MetricScopeMemoryDomain && scope == schema.MetricScopeNode {
sockets, _ := topology.GetMemoryDomainsFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
@@ -539,26 +413,12 @@ func buildQueries(
return queries, assignedScope, nil
}
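The `handledScopes` slice in `buildQueries` above deduplicates scopes with a linear scan and a labeled `continue`. The same pattern in isolation, with scope names reduced to plain strings (hypothetical `dedup` helper):

```go
package main

import "fmt"

// dedup keeps the first occurrence of each scope using the labeled
// continue + linear scan pattern from buildQueries. For the handful
// of scopes a query ever sees, the scan is as good as a map lookup.
func dedup(requested []string) []string {
	handled := make([]string, 0, 3)

scopesLoop:
	for _, scope := range requested {
		for _, s := range handled {
			if s == scope {
				continue scopesLoop // already handled, skip the rest of this iteration
			}
		}
		handled = append(handled, scope)
	}
	return handled
}

func main() {
	fmt.Println(dedup([]string{"core", "node", "core", "socket", "node"})) // [core node socket]
}
```

The labeled `continue` is what lets the inner scan skip the remainder of the *outer* loop body; a plain `continue` would only exit the inner loop.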
// LoadStats loads only metric statistics (avg/min/max) for a job at node scope.
//
// This is an optimized version of LoadData that fetches only statistics without
// time-series data, reducing bandwidth and memory usage. Always queries at node scope.
//
// Parameters:
// - job: Job metadata
// - metrics: List of metric names
// - ctx: Context (currently unused)
//
// Returns:
// - Map of metric → hostname → statistics
// - Error on query building or fetching failure
func (ccms *InternalMetricStore) LoadStats(
func LoadStats(
job *schema.Job,
metrics []string,
ctx context.Context,
) (map[string]map[string]schema.MetricStatistics, error) {
// TODO(#166): Add scope parameter for analysis view accelerator normalization
queries, _, err := buildQueries(job, metrics, []schema.MetricScope{schema.MetricScopeNode}, 0)
queries, _, err := buildQueries(job, metrics, []schema.MetricScope{schema.MetricScopeNode}, 0) // #166 Add scope here for analysis view accelerator normalization?
if err != nil {
cclog.Errorf("Error while building queries for jobId %d, Metrics %v: %s", job.JobID, metrics, err.Error())
return nil, err
@@ -581,15 +441,11 @@ func (ccms *InternalMetricStore) LoadStats(
stats := make(map[string]map[string]schema.MetricStatistics, len(metrics))
for i, res := range resBody.Results {
if len(res) == 0 {
// No Data Found For Metric, Logged in FetchData to Warn
continue
}
query := req.Queries[i]
metric := query.Metric
data := res[0]
if data.Error != nil {
cclog.Warnf("fetching %s for node %s failed: %s", metric, query.Hostname, *data.Error)
cclog.Errorf("fetching %s for node %s failed: %s", metric, query.Hostname, *data.Error)
continue
}
@@ -614,21 +470,7 @@ func (ccms *InternalMetricStore) LoadStats(
return stats, nil
}
// LoadScopedStats loads metric statistics for a job with scope-aware grouping.
//
// Similar to LoadStats but supports multiple scopes and returns statistics grouped
// by scope with hardware IDs (e.g., per-core, per-socket statistics).
//
// Parameters:
// - job: Job metadata
// - metrics: List of metric names
// - scopes: Requested metric scopes
// - ctx: Context (currently unused)
//
// Returns:
// - ScopedJobStats: Map of metric → scope → []ScopedStats (with hostname and ID)
// - Error or partial error listing failed queries
func (ccms *InternalMetricStore) LoadScopedStats(
func LoadScopedStats(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
@@ -659,10 +501,6 @@ func (ccms *InternalMetricStore) LoadScopedStats(
scopedJobStats := make(schema.ScopedJobStats)
for i, row := range resBody.Results {
if len(row) == 0 {
// No Data Found For Metric, Logged in FetchData to Warn
continue
}
query := req.Queries[i]
metric := query.Metric
scope := assignedScope[i]
@@ -684,22 +522,20 @@ func (ccms *InternalMetricStore) LoadScopedStats(
id := (*string)(nil)
if query.Type != nil {
// Check if ndx is within the bounds of TypeIds slice
if ndx < len(query.TypeIds) {
id = new(string)
*id = query.TypeIds[ndx]
} else {
// Log the error but continue processing
cclog.Warnf("TypeIds index out of range: %d with length %d for metric %s on host %s",
ndx, len(query.TypeIds), query.Metric, query.Hostname)
}
id = new(string)
*id = query.TypeIds[ndx]
}
sanitizeStats(&res)
if res.Avg.IsNaN() || res.Min.IsNaN() || res.Max.IsNaN() {
// "schema.Float()" because regular float64 can not be JSONed when NaN.
res.Avg = schema.Float(0)
res.Min = schema.Float(0)
res.Max = schema.Float(0)
}
scopedJobStats[metric][scope] = append(scopedJobStats[metric][scope], &schema.ScopedStats{
Hostname: query.Hostname,
ID: id,
Id: id,
Data: &schema.MetricStatistics{
Avg: float64(res.Avg),
Min: float64(res.Min),
@@ -724,23 +560,7 @@ func (ccms *InternalMetricStore) LoadScopedStats(
return scopedJobStats, nil
}
// LoadNodeData loads metric data for specific nodes in a cluster over a time range.
//
// Unlike LoadData which operates on job resources, this function queries arbitrary nodes
// directly. Useful for system monitoring and node status views.
//
// Parameters:
// - cluster: Cluster name
// - metrics: List of metric names
// - nodes: List of node hostnames (nil = all nodes in cluster via ForAllNodes)
// - scopes: Requested metric scopes (currently unused - always node scope)
// - from, to: Time range
// - ctx: Context (currently unused)
//
// Returns:
// - Map of hostname → metric → []JobMetric
// - Error or partial error listing failed queries
func (ccms *InternalMetricStore) LoadNodeData(
func LoadNodeData(
cluster string,
metrics, nodes []string,
scopes []schema.MetricScope,
@@ -778,11 +598,6 @@ func (ccms *InternalMetricStore) LoadNodeData(
var errors []string
data := make(map[string]map[string][]*schema.JobMetric)
for i, res := range resBody.Results {
if len(res) == 0 {
// No Data Found For Metric, Logged in FetchData to Warn
continue
}
var query APIQuery
if resBody.Queries != nil {
query = resBody.Queries[i]
@@ -793,10 +608,14 @@ func (ccms *InternalMetricStore) LoadNodeData(
metric := query.Metric
qdata := res[0]
if qdata.Error != nil {
/* Build list for "partial errors", if any */
errors = append(errors, fmt.Sprintf("fetching %s for node %s failed: %s", metric, query.Hostname, *qdata.Error))
}
sanitizeStats(&qdata)
if qdata.Avg.IsNaN() || qdata.Min.IsNaN() || qdata.Max.IsNaN() {
// return nil, fmt.Errorf("METRICDATA/CCMS > fetching %s for node %s failed: %s", metric, query.Hostname, "avg/min/max is NaN")
qdata.Avg, qdata.Min, qdata.Max = 0., 0., 0.
}
hostdata, ok := data[query.Hostname]
if !ok {
@@ -830,25 +649,7 @@ func (ccms *InternalMetricStore) LoadNodeData(
return data, nil
}
// LoadNodeListData loads metric data for a list of nodes with full scope transformation support.
//
// This is the most flexible node data loading function, supporting arbitrary scopes and
// resolution. Uses buildNodeQueries for proper scope transformation based on topology.
//
// Parameters:
// - cluster: Cluster name
// - subCluster: SubCluster name (empty string to infer from node names)
// - nodes: List of node hostnames
// - metrics: List of metric names
// - scopes: Requested metric scopes
// - resolution: Data resolution in seconds
// - from, to: Time range
// - ctx: Context (currently unused)
//
// Returns:
// - Map of hostname → JobData (metric → scope → JobMetric)
// - Error or partial error listing failed queries
func (ccms *InternalMetricStore) LoadNodeListData(
func LoadNodeListData(
cluster, subCluster string,
nodes []string,
metrics []string,
@@ -864,12 +665,6 @@ func (ccms *InternalMetricStore) LoadNodeListData(
return nil, err
}
// Verify assignment is correct - log any inconsistencies for debugging
if len(queries) != len(assignedScope) {
cclog.Errorf("Critical error: queries and assignedScope have different lengths after buildNodeQueries: %d vs %d",
len(queries), len(assignedScope))
}
req := APIQueryRequest{
Cluster: cluster,
Queries: queries,
@@ -887,47 +682,17 @@ func (ccms *InternalMetricStore) LoadNodeListData(
var errors []string
data := make(map[string]schema.JobData)
// Add safety check for index out of range issues
if len(resBody.Results) != len(req.Queries) || len(assignedScope) != len(req.Queries) {
cclog.Warnf("Mismatch in query results count: queries=%d, results=%d, assignedScope=%d",
len(req.Queries), len(resBody.Results), len(assignedScope))
if len(resBody.Results) > len(req.Queries) {
resBody.Results = resBody.Results[:len(req.Queries)]
}
if len(assignedScope) > len(req.Queries) {
assignedScope = assignedScope[:len(req.Queries)]
}
}
for i, row := range resBody.Results {
// Safety check to prevent index out of range errors
if i >= len(req.Queries) || i >= len(assignedScope) {
cclog.Warnf("Index out of range prevented: i=%d, queries=%d, assignedScope=%d",
i, len(req.Queries), len(assignedScope))
continue
}
var query APIQuery
if resBody.Queries != nil {
if i < len(resBody.Queries) {
query = resBody.Queries[i]
} else {
cclog.Warnf("Index out of range prevented for resBody.Queries: i=%d, len=%d",
i, len(resBody.Queries))
continue
}
query = resBody.Queries[i]
} else {
query = req.Queries[i]
}
// qdata := res[0]
metric := query.Metric
scope := assignedScope[i]
mc := archive.GetMetricConfig(cluster, metric)
if mc == nil {
cclog.Warnf("Metric config not found for %s on cluster %s", metric, cluster)
continue
}
res := mc.Timestep
if len(row) > 0 {
@@ -966,22 +731,20 @@ func (ccms *InternalMetricStore) LoadNodeListData(
id := (*string)(nil)
if query.Type != nil {
// Check if ndx is within the bounds of TypeIds slice
if ndx < len(query.TypeIds) {
id = new(string)
*id = query.TypeIds[ndx]
} else {
// Log the error but continue processing
cclog.Warnf("TypeIds index out of range: %d with length %d for metric %s on host %s",
ndx, len(query.TypeIds), query.Metric, query.Hostname)
}
id = new(string)
*id = query.TypeIds[ndx]
}
sanitizeStats(&res)
if res.Avg.IsNaN() || res.Min.IsNaN() || res.Max.IsNaN() {
// "schema.Float()" because regular float64 can not be JSONed when NaN.
res.Avg = schema.Float(0)
res.Min = schema.Float(0)
res.Max = schema.Float(0)
}
scopeData.Series = append(scopeData.Series, schema.Series{
Hostname: query.Hostname,
ID: id,
Id: id,
Statistics: schema.MetricStatistics{
Avg: float64(res.Avg),
Min: float64(res.Min),
@@ -1000,23 +763,6 @@ func (ccms *InternalMetricStore) LoadNodeListData(
return data, nil
}
// buildNodeQueries constructs APIQuery structures for node-based queries with scope transformation.
//
// Similar to buildQueries but operates on node lists rather than job resources.
// Supports dynamic subcluster lookup when subCluster parameter is empty.
//
// Parameters:
// - cluster: Cluster name
// - subCluster: SubCluster name (empty = infer from node hostnames)
// - nodes: List of node hostnames
// - metrics: List of metric names
// - scopes: Requested metric scopes
// - resolution: Data resolution in seconds
//
// Returns:
// - []APIQuery: List of queries to execute
// - []schema.MetricScope: Assigned scope for each query
// - error: Returns error if topology lookup fails or unhandled scope combination
func buildNodeQueries(
cluster string,
subCluster string,
@@ -1042,6 +788,7 @@ func buildNodeQueries(
for _, metric := range metrics {
mc := archive.GetMetricConfig(cluster, metric)
if mc == nil {
// return nil, fmt.Errorf("METRICDATA/CCMS > metric '%s' is not specified for cluster '%s'", metric, cluster)
cclog.Warnf("metric '%s' is not specified for cluster '%s'", metric, cluster)
continue
}
@@ -1060,17 +807,20 @@ func buildNodeQueries(
}
}
// Avoid duplicates using map for O(1) lookup
handledScopes := make(map[schema.MetricScope]bool, 3)
// Avoid duplicates...
handledScopes := make([]schema.MetricScope, 0, 3)
scopesLoop:
for _, requestedScope := range scopes {
nativeScope := mc.Scope
scope := nativeScope.Max(requestedScope)
if handledScopes[scope] {
continue
for _, s := range handledScopes {
if scope == s {
continue scopesLoop
}
}
handledScopes[scope] = true
handledScopes = append(handledScopes, scope)
for _, hostname := range nodes {
@@ -1093,7 +843,7 @@ func buildNodeQueries(
// Moved check here if metric matches hardware specs
if nativeScope == schema.MetricScopeAccelerator && len(acceleratorIds) == 0 {
continue
continue scopesLoop
}
// Accelerator -> Accelerator (Use "accelerator" scope if requested scope is lower than node)
@@ -1133,7 +883,7 @@ func buildNodeQueries(
continue
}
// HWThread -> HWThread
// HWThread -> HWThread
if nativeScope == schema.MetricScopeHWThread && scope == schema.MetricScopeHWThread {
queries = append(queries, APIQuery{
Metric: metric,
@@ -1257,7 +1007,7 @@ func buildNodeQueries(
continue
}
// MemoryDomain -> Node
// MemoryDomain -> Node
if nativeScope == schema.MetricScopeMemoryDomain && scope == schema.MetricScopeNode {
sockets, _ := topology.GetMemoryDomainsFromHWThreads(topology.Node)
queries = append(queries, APIQuery{
@@ -1321,37 +1071,10 @@ func buildNodeQueries(
return queries, assignedScope, nil
}
// sanitizeStats converts NaN statistics to zero for JSON compatibility.
//
// schema.Float with NaN values cannot be properly JSON-encoded, so we convert
// NaN to 0. This loses the distinction between "no data" and "zero value",
// but maintains API compatibility.
func sanitizeStats(data *APIMetricData) {
if data.Avg.IsNaN() {
data.Avg = schema.Float(0)
}
if data.Min.IsNaN() {
data.Min = schema.Float(0)
}
if data.Max.IsNaN() {
data.Max = schema.Float(0)
}
}
// intToStringSlice converts a slice of integers to a slice of strings.
// Used to convert hardware thread/core/socket IDs from topology (int) to APIQuery TypeIds (string).
//
// Optimized to reuse a byte buffer for string conversion, reducing allocations.
func intToStringSlice(is []int) []string {
if len(is) == 0 {
return nil
}
ss := make([]string, len(is))
buf := make([]byte, 0, 16) // Reusable buffer for integer conversion
for i, x := range is {
buf = strconv.AppendInt(buf[:0], int64(x), 10)
ss[i] = string(buf)
ss[i] = strconv.Itoa(x)
}
return ss
}
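The buffer-reuse version of the ID conversion shown above (the side of the diff using `strconv.AppendInt`) can be tried standalone: `AppendInt` writes into the reused byte slice, so only the final `string(buf)` copy allocates per element.

```go
package main

import (
	"fmt"
	"strconv"
)

// intToStringSlice converts hardware thread/core/socket IDs to strings,
// reusing one byte buffer across iterations to reduce allocations.
func intToStringSlice(is []int) []string {
	if len(is) == 0 {
		return nil
	}
	ss := make([]string, len(is))
	buf := make([]byte, 0, 16) // reusable buffer for integer conversion
	for i, x := range is {
		buf = strconv.AppendInt(buf[:0], int64(x), 10)
		ss[i] = string(buf)
	}
	return ss
}

func main() {
	fmt.Println(intToStringSlice([]int{0, 7, 42})) // [0 7 42]
}
```

`buf[:0]` resets the length while keeping the capacity, which is what makes the buffer reusable across loop iterations.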


@@ -3,21 +3,20 @@
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricstore
package memorystore
import (
"errors"
"math"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/v2/util"
"github.com/ClusterCockpit/cc-lib/util"
)
type Stats struct {
Samples int
Avg schema.Float
Min schema.Float
Max schema.Float
Avg util.Float
Min util.Float
Max util.Float
}
func (b *buffer) stats(from, to int64) (Stats, int64, int64, error) {
@@ -62,9 +61,9 @@ func (b *buffer) stats(from, to int64) (Stats, int64, int64, error) {
return Stats{
Samples: samples,
Avg: schema.Float(sum) / schema.Float(samples),
Min: schema.Float(min),
Max: schema.Float(max),
Avg: util.Float(sum) / util.Float(samples),
Min: util.Float(min),
Max: util.Float(max),
}, from, t, nil
}
@@ -82,7 +81,7 @@ func (m *MemoryStore) Stats(selector util.Selector, metric string, from, to int6
}
n, samples := 0, 0
avg, min, max := schema.Float(0), math.MaxFloat32, -math.MaxFloat32
avg, min, max := util.Float(0), math.MaxFloat32, -math.MaxFloat32
err := m.root.findBuffers(selector, minfo.offset, func(b *buffer) error {
stats, cfrom, cto, err := b.stats(from, to)
if err != nil {
@@ -111,7 +110,7 @@ func (m *MemoryStore) Stats(selector util.Selector, metric string, from, to int6
}
if minfo.Aggregation == AvgAggregation {
avg /= schema.Float(n)
avg /= util.Float(n)
} else if n > 1 && minfo.Aggregation != SumAggregation {
return nil, 0, 0, errors.New("invalid aggregation")
}
@@ -119,7 +118,7 @@ func (m *MemoryStore) Stats(selector util.Selector, metric string, from, to int6
return &Stats{
Samples: samples,
Avg: avg,
Min: schema.Float(min),
Max: schema.Float(max),
Min: util.Float(min),
Max: util.Float(max),
}, from, to, nil
}



@@ -0,0 +1,127 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricdata
import (
"context"
"encoding/json"
"fmt"
"time"
"github.com/ClusterCockpit/cc-backend/internal/config"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
)
type MetricDataRepository interface {
// Initialize this MetricDataRepository. One instance of
// this interface will only ever be responsible for one cluster.
Init(rawConfig json.RawMessage) error
// Return the JobData for the given job, only with the requested metrics.
LoadData(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error)
// Return a map of metrics to a map of nodes to the metric statistics of the job. node scope only.
LoadStats(job *schema.Job, metrics []string, ctx context.Context) (map[string]map[string]schema.MetricStatistics, error)
// Return a map of metrics to a map of scopes to the scoped metric statistics of the job.
LoadScopedStats(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context) (schema.ScopedJobStats, error)
// Return a map of hosts to a map of metrics at the requested scopes (currently only node) for that node.
LoadNodeData(cluster string, metrics, nodes []string, scopes []schema.MetricScope, from, to time.Time, ctx context.Context) (map[string]map[string][]*schema.JobMetric, error)
// Return a map of hosts to a map of metrics to a map of scopes for multiple nodes.
LoadNodeListData(cluster, subCluster string, nodes, metrics []string, scopes []schema.MetricScope, resolution int, from, to time.Time, ctx context.Context) (map[string]schema.JobData, error)
}
var upstreamMetricDataRepo MetricDataRepository
// func Init() error {
// for _, cluster := range config.Clusters {
// if cluster.MetricDataRepository != nil {
// var kind struct {
// Kind string `json:"kind"`
// }
// if err := json.Unmarshal(cluster.MetricDataRepository, &kind); err != nil {
// cclog.Warn("Error while unmarshaling raw json MetricDataRepository")
// return err
// }
//
// var mdr MetricDataRepository
// switch kind.Kind {
// case "cc-metric-store":
// mdr = &CCMetricStore{}
// case "prometheus":
// mdr = &PrometheusDataRepository{}
// case "test":
// mdr = &TestMetricDataRepository{}
// default:
// return fmt.Errorf("METRICDATA/METRICDATA > Unknown MetricDataRepository %v for cluster %v", kind.Kind, cluster.Name)
// }
//
// if err := mdr.Init(cluster.MetricDataRepository); err != nil {
// cclog.Errorf("Error initializing MetricDataRepository %v for cluster %v", kind.Kind, cluster.Name)
// return err
// }
// metricDataRepos[cluster.Name] = mdr
// }
// }
// return nil
// }
// func GetMetricDataRepo(cluster string) (MetricDataRepository, error) {
// var err error
// repo, ok := metricDataRepos[cluster]
//
// if !ok {
// err = fmt.Errorf("METRICDATA/METRICDATA > no metric data repository configured for '%s'", cluster)
// }
//
// return repo, err
// }
// InitUpstreamRepos initializes global upstream metric data repository for the pull worker
func InitUpstreamRepos() error {
if config.Keys.UpstreamMetricRepository == nil {
return nil
}
var kind struct {
Kind string `json:"kind"`
}
if err := json.Unmarshal(*config.Keys.UpstreamMetricRepository, &kind); err != nil {
cclog.Warn("Error while unmarshaling raw json UpstreamMetricRepository")
return err
}
var mdr MetricDataRepository
switch kind.Kind {
case "cc-metric-store":
mdr = &CCMetricStore{}
case "prometheus":
mdr = &PrometheusDataRepository{}
case "test":
mdr = &TestMetricDataRepository{}
default:
return fmt.Errorf("METRICDATA/METRICDATA > Unknown UpstreamMetricRepository %v", kind.Kind)
}
if err := mdr.Init(*config.Keys.UpstreamMetricRepository); err != nil {
cclog.Errorf("Error initializing UpstreamMetricRepository %v", kind.Kind)
return err
}
upstreamMetricDataRepo = mdr
cclog.Infof("Initialized global upstream metric repository '%s'", kind.Kind)
return nil
}
// GetUpstreamMetricDataRepo returns the global upstream metric data repository
func GetUpstreamMetricDataRepo() (MetricDataRepository, error) {
if upstreamMetricDataRepo == nil {
return nil, fmt.Errorf("METRICDATA/METRICDATA > no upstream metric data repository configured")
}
return upstreamMetricDataRepo, nil
}


@@ -0,0 +1,588 @@
// Copyright (C) 2022 DKRZ
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricdata
import (
"bytes"
"context"
"encoding/json"
"errors"
"fmt"
"math"
"net/http"
"os"
"regexp"
"sort"
"strings"
"sync"
"text/template"
"time"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
promapi "github.com/prometheus/client_golang/api"
promv1 "github.com/prometheus/client_golang/api/prometheus/v1"
promcfg "github.com/prometheus/common/config"
promm "github.com/prometheus/common/model"
)
type PrometheusDataRepositoryConfig struct {
Url string `json:"url"`
Username string `json:"username,omitempty"`
Suffix string `json:"suffix,omitempty"`
Templates map[string]string `json:"query-templates"`
}
type PrometheusDataRepository struct {
client promapi.Client
queryClient promv1.API
suffix string
templates map[string]*template.Template
}
type PromQLArgs struct {
Nodes string
}
type Trie map[rune]Trie
var logOnce sync.Once
func contains(s []schema.MetricScope, str schema.MetricScope) bool {
for _, v := range s {
if v == str {
return true
}
}
return false
}
func MinMaxMean(data []schema.Float) (float64, float64, float64) {
if len(data) == 0 {
return 0.0, 0.0, 0.0
}
min := math.MaxFloat64
max := -math.MaxFloat64
var sum float64
var n float64
for _, val := range data {
if val.IsNaN() {
continue
}
sum += float64(val)
n += 1
if float64(val) > max {
max = float64(val)
}
if float64(val) < min {
min = float64(val)
}
}
return min, max, sum / n
}
// Rewritten from
// https://github.com/ermanh/trieregex/blob/master/trieregex/trieregex.py
func nodeRegex(nodes []string) string {
root := Trie{}
// add runes of each compute node to trie
for _, node := range nodes {
_trie := root
for _, c := range node {
if _, ok := _trie[c]; !ok {
_trie[c] = Trie{}
}
_trie = _trie[c]
}
_trie['*'] = Trie{}
}
// recursively build regex from rune trie
var trieRegex func(trie Trie, reset bool) string
trieRegex = func(trie Trie, reset bool) string {
if reset {
trie = root
}
if len(trie) == 0 {
return ""
}
if len(trie) == 1 {
for key, _trie := range trie {
if key == '*' {
return ""
}
return regexp.QuoteMeta(string(key)) + trieRegex(_trie, false)
}
} else {
sequences := []string{}
for key, _trie := range trie {
if key != '*' {
sequences = append(sequences, regexp.QuoteMeta(string(key))+trieRegex(_trie, false))
}
}
sort.Slice(sequences, func(i, j int) bool {
return (-len(sequences[i]) < -len(sequences[j])) || (sequences[i] < sequences[j])
})
var result string
// single edge from this tree node
if len(sequences) == 1 {
result = sequences[0]
if len(result) > 1 {
result = "(?:" + result + ")"
}
// multiple edges, each length 1
} else if s := strings.Join(sequences, ""); len(s) == len(sequences) {
// char or numeric range
if len(s)-1 == int(s[len(s)-1])-int(s[0]) {
result = fmt.Sprintf("[%c-%c]", s[0], s[len(s)-1])
// char or numeric set
} else {
result = "[" + s + "]"
}
// multiple edges of different lengths
} else {
result = "(?:" + strings.Join(sequences, "|") + ")"
}
if _, ok := trie['*']; ok {
result += "?"
}
return result
}
return ""
}
return trieRegex(root, true)
}
func (pdb *PrometheusDataRepository) Init(rawConfig json.RawMessage) error {
var config PrometheusDataRepositoryConfig
// parse config
if err := json.Unmarshal(rawConfig, &config); err != nil {
cclog.Warn("Error while unmarshaling raw json config")
return err
}
// support basic authentication
var rt http.RoundTripper = nil
if prom_pw := os.Getenv("PROMETHEUS_PASSWORD"); prom_pw != "" && config.Username != "" {
prom_pw := promcfg.Secret(prom_pw)
rt = promcfg.NewBasicAuthRoundTripper(promcfg.NewInlineSecret(config.Username), promcfg.NewInlineSecret(string(prom_pw)), promapi.DefaultRoundTripper)
} else {
if config.Username != "" {
return errors.New("METRICDATA/PROMETHEUS > Prometheus username provided, but PROMETHEUS_PASSWORD not set")
}
}
// init client
client, err := promapi.NewClient(promapi.Config{
Address: config.Url,
RoundTripper: rt,
})
if err != nil {
cclog.Error("Error while initializing new prometheus client")
return err
}
// init query client
pdb.client = client
pdb.queryClient = promv1.NewAPI(pdb.client)
// site config
pdb.suffix = config.Suffix
// init query templates
pdb.templates = make(map[string]*template.Template)
for metric, templ := range config.Templates {
pdb.templates[metric], err = template.New(metric).Parse(templ)
if err == nil {
cclog.Debugf("Added PromQL template for %s: %s", metric, templ)
} else {
cclog.Warnf("Failed to parse PromQL template %s for metric %s", templ, metric)
}
}
return nil
}
// TODO: respect scope argument
func (pdb *PrometheusDataRepository) FormatQuery(
metric string,
scope schema.MetricScope,
nodes []string,
cluster string,
) (string, error) {
args := PromQLArgs{}
if len(nodes) > 0 {
args.Nodes = fmt.Sprintf("(%s)%s", nodeRegex(nodes), pdb.suffix)
} else {
args.Nodes = fmt.Sprintf(".*%s", pdb.suffix)
}
buf := &bytes.Buffer{}
if templ, ok := pdb.templates[metric]; ok {
err := templ.Execute(buf, args)
if err != nil {
return "", fmt.Errorf("METRICDATA/PROMETHEUS > Error compiling template %v", templ)
} else {
query := buf.String()
cclog.Debugf("PromQL: %s", query)
return query, nil
}
} else {
return "", fmt.Errorf("METRICDATA/PROMETHEUS > No PromQL for metric %s configured", metric)
}
}
// Convert PromAPI row to CC schema.Series
func (pdb *PrometheusDataRepository) RowToSeries(
from time.Time,
step int64,
steps int64,
row *promm.SampleStream,
) schema.Series {
ts := from.Unix()
hostname := strings.TrimSuffix(string(row.Metric["exported_instance"]), pdb.suffix)
// init array of expected length with NaN
values := make([]schema.Float, steps+1)
for i := range values {
values[i] = schema.NaN
}
// copy recorded values from prom sample pair
for _, v := range row.Values {
idx := (v.Timestamp.Unix() - ts) / step
if idx < 0 || idx >= int64(len(values)) {
continue // ignore samples outside the requested range
}
values[idx] = schema.Float(v.Value)
}
min, max, mean := MinMaxMean(values)
// output struct
return schema.Series{
Hostname: hostname,
Data: values,
Statistics: schema.MetricStatistics{
Avg: mean,
Min: min,
Max: max,
},
}
}
func (pdb *PrometheusDataRepository) LoadData(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
resolution int,
) (schema.JobData, error) {
// TODO respect requested scope
if len(scopes) == 0 || !contains(scopes, schema.MetricScopeNode) {
scopes = append(scopes, schema.MetricScopeNode)
}
jobData := make(schema.JobData)
// parse job specs
nodes := make([]string, len(job.Resources))
for i, resource := range job.Resources {
nodes[i] = resource.Hostname
}
from := time.Unix(job.StartTime, 0)
to := time.Unix(job.StartTime+int64(job.Duration), 0)
for _, scope := range scopes {
if scope != schema.MetricScopeNode {
logOnce.Do(func() {
cclog.Infof("Scope '%s' requested, but not yet supported: Will return 'node' scope only.", scope)
})
continue
}
for _, metric := range metrics {
metricConfig := archive.GetMetricConfig(job.Cluster, metric)
if metricConfig == nil {
cclog.Warnf("Error in LoadData: Metric %s for cluster %s not configured", metric, job.Cluster)
return nil, errors.New("Prometheus config error")
}
query, err := pdb.FormatQuery(metric, scope, nodes, job.Cluster)
if err != nil {
cclog.Warn("Error while formatting prometheus query")
return nil, err
}
// ranged query over all job nodes
r := promv1.Range{
Start: from,
End: to,
Step: time.Duration(metricConfig.Timestep * 1e9),
}
result, warnings, err := pdb.queryClient.QueryRange(ctx, query, r)
if err != nil {
cclog.Errorf("Prometheus query error in LoadData: %v\nQuery: %s", err, query)
return nil, errors.New("Prometheus query error")
}
if len(warnings) > 0 {
cclog.Warnf("Warnings: %v\n", warnings)
}
// init data structures
if _, ok := jobData[metric]; !ok {
jobData[metric] = make(map[schema.MetricScope]*schema.JobMetric)
}
jobMetric, ok := jobData[metric][scope]
if !ok {
jobMetric = &schema.JobMetric{
Unit: metricConfig.Unit,
Timestep: metricConfig.Timestep,
Series: make([]schema.Series, 0),
}
}
step := int64(metricConfig.Timestep)
steps := int64(to.Sub(from).Seconds()) / step
// iter rows of host, metric, values
for _, row := range result.(promm.Matrix) {
jobMetric.Series = append(jobMetric.Series,
pdb.RowToSeries(from, step, steps, row))
}
// only add metric if at least one host returned data
if !ok && len(jobMetric.Series) > 0 {
jobData[metric][scope] = jobMetric
}
// sort by hostname to get uniform coloring
sort.Slice(jobMetric.Series, func(i, j int) bool {
return (jobMetric.Series[i].Hostname < jobMetric.Series[j].Hostname)
})
}
}
return jobData, nil
}
// TODO change implementation to precomputed/cached stats
func (pdb *PrometheusDataRepository) LoadStats(
job *schema.Job,
metrics []string,
ctx context.Context,
) (map[string]map[string]schema.MetricStatistics, error) {
// map of metrics of nodes of stats
stats := map[string]map[string]schema.MetricStatistics{}
data, err := pdb.LoadData(job, metrics, []schema.MetricScope{schema.MetricScopeNode}, ctx, 0 /*resolution here*/)
if err != nil {
cclog.Warn("Error while loading job for stats")
return nil, err
}
for metric, metricData := range data {
stats[metric] = make(map[string]schema.MetricStatistics)
for _, series := range metricData[schema.MetricScopeNode].Series {
stats[metric][series.Hostname] = series.Statistics
}
}
return stats, nil
}
func (pdb *PrometheusDataRepository) LoadNodeData(
cluster string,
metrics, nodes []string,
scopes []schema.MetricScope,
from, to time.Time,
ctx context.Context,
) (map[string]map[string][]*schema.JobMetric, error) {
t0 := time.Now()
// Map of hosts of metrics of value slices
data := make(map[string]map[string][]*schema.JobMetric)
// query db for each metric
// TODO: scopes seems to be always empty
if len(scopes) == 0 || !contains(scopes, schema.MetricScopeNode) {
scopes = append(scopes, schema.MetricScopeNode)
}
for _, scope := range scopes {
if scope != schema.MetricScopeNode {
logOnce.Do(func() {
cclog.Infof("Note: Scope '%s' requested, but not yet supported: Will return 'node' scope only.", scope)
})
continue
}
for _, metric := range metrics {
metricConfig := archive.GetMetricConfig(cluster, metric)
if metricConfig == nil {
cclog.Warnf("Error in LoadNodeData: Metric %s for cluster %s not configured", metric, cluster)
return nil, errors.New("Prometheus config error")
}
query, err := pdb.FormatQuery(metric, scope, nodes, cluster)
if err != nil {
cclog.Warn("Error while formatting prometheus query")
return nil, err
}
// ranged query over all nodes
r := promv1.Range{
Start: from,
End: to,
Step: time.Duration(metricConfig.Timestep * 1e9),
}
result, warnings, err := pdb.queryClient.QueryRange(ctx, query, r)
if err != nil {
cclog.Errorf("Prometheus query error in LoadNodeData: %v\n", err)
return nil, errors.New("Prometheus query error")
}
if len(warnings) > 0 {
cclog.Warnf("Warnings: %v\n", warnings)
}
step := int64(metricConfig.Timestep)
steps := int64(to.Sub(from).Seconds()) / step
// iter rows of host, metric, values
for _, row := range result.(promm.Matrix) {
hostname := strings.TrimSuffix(string(row.Metric["exported_instance"]), pdb.suffix)
hostdata, ok := data[hostname]
if !ok {
hostdata = make(map[string][]*schema.JobMetric)
data[hostname] = hostdata
}
// output per host and metric
hostdata[metric] = append(hostdata[metric], &schema.JobMetric{
Unit: metricConfig.Unit,
Timestep: metricConfig.Timestep,
Series: []schema.Series{pdb.RowToSeries(from, step, steps, row)},
},
)
}
}
}
t1 := time.Since(t0)
cclog.Debugf("LoadNodeData of %v nodes took %s", len(data), t1)
return data, nil
}
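LoadNodeData groups Prometheus result rows into a two-level map (host, then metric) using the comma-ok idiom to create inner maps lazily. A stripped-down sketch of that pattern; `appendMetric` is a hypothetical helper name:

```go
package main

import "fmt"

// appendMetric groups a value under host -> metric, creating the inner
// map on first use, as LoadNodeData does when iterating result rows.
func appendMetric(data map[string]map[string][]float64, host, metric string, v float64) {
	hostdata, ok := data[host]
	if !ok {
		hostdata = make(map[string][]float64)
		data[host] = hostdata
	}
	hostdata[metric] = append(hostdata[metric], v)
}

func main() {
	data := make(map[string]map[string][]float64)
	appendMetric(data, "node01", "load", 1.0)
	appendMetric(data, "node01", "load", 2.0)
	appendMetric(data, "node02", "load", 0.5)
	fmt.Println(data["node01"]["load"]) // [1 2]
}
```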
// Implemented by NHR@FAU; Used in Job-View StatsTable
func (pdb *PrometheusDataRepository) LoadScopedStats(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
) (schema.ScopedJobStats, error) {
// Assumption: pdb.loadData() only returns series node-scope - use node scope for statsTable
scopedJobStats := make(schema.ScopedJobStats)
data, err := pdb.LoadData(job, metrics, []schema.MetricScope{schema.MetricScopeNode}, ctx, 0 /*resolution here*/)
if err != nil {
cclog.Warn("Error while loading job for scopedJobStats")
return nil, err
}
for metric, metricData := range data {
for _, scope := range scopes {
if scope != schema.MetricScopeNode {
logOnce.Do(func() {
cclog.Infof("Note: Scope '%s' requested, but not yet supported: Will return 'node' scope only.", scope)
})
continue
}
if _, ok := scopedJobStats[metric]; !ok {
scopedJobStats[metric] = make(map[schema.MetricScope][]*schema.ScopedStats)
}
if _, ok := scopedJobStats[metric][scope]; !ok {
scopedJobStats[metric][scope] = make([]*schema.ScopedStats, 0)
}
for _, series := range metricData[scope].Series {
scopedJobStats[metric][scope] = append(scopedJobStats[metric][scope], &schema.ScopedStats{
Hostname: series.Hostname,
Data: &series.Statistics,
})
}
}
}
return scopedJobStats, nil
}
// Implemented by NHR@FAU; Used in NodeList-View
func (pdb *PrometheusDataRepository) LoadNodeListData(
cluster, subCluster string,
nodes []string,
metrics []string,
scopes []schema.MetricScope,
resolution int,
from, to time.Time,
ctx context.Context,
) (map[string]schema.JobData, error) {
// Assumption: pdb.loadData() only returns series node-scope - use node scope for NodeList
// Fetch Data, based on pdb.LoadNodeData()
t0 := time.Now()
// Map of hosts of jobData
data := make(map[string]schema.JobData)
// query db for each metric
// TODO: scopes seems to be always empty
if len(scopes) == 0 || !contains(scopes, schema.MetricScopeNode) {
scopes = append(scopes, schema.MetricScopeNode)
}
for _, scope := range scopes {
if scope != schema.MetricScopeNode {
logOnce.Do(func() {
cclog.Infof("Note: Scope '%s' requested, but not yet supported: Will return 'node' scope only.", scope)
})
continue
}
for _, metric := range metrics {
metricConfig := archive.GetMetricConfig(cluster, metric)
if metricConfig == nil {
cclog.Warnf("Error in LoadNodeListData: Metric %s for cluster %s not configured", metric, cluster)
return nil, errors.New("Prometheus config error")
}
query, err := pdb.FormatQuery(metric, scope, nodes, cluster)
if err != nil {
cclog.Warn("Error while formatting prometheus query")
return nil, err
}
// ranged query over all nodes
r := promv1.Range{
Start: from,
End: to,
Step: time.Duration(metricConfig.Timestep * 1e9),
}
result, warnings, err := pdb.queryClient.QueryRange(ctx, query, r)
if err != nil {
cclog.Errorf("Prometheus query error in LoadNodeData: %v\n", err)
return nil, errors.New("Prometheus query error")
}
if len(warnings) > 0 {
cclog.Warnf("Warnings: %v\n", warnings)
}
step := int64(metricConfig.Timestep)
steps := int64(to.Sub(from).Seconds()) / step
// iter rows of host, metric, values
for _, row := range result.(promm.Matrix) {
hostname := strings.TrimSuffix(string(row.Metric["exported_instance"]), pdb.suffix)
hostdata, ok := data[hostname]
if !ok {
hostdata = make(schema.JobData)
data[hostname] = hostdata
}
metricdata, ok := hostdata[metric]
if !ok {
metricdata = make(map[schema.MetricScope]*schema.JobMetric)
data[hostname][metric] = metricdata
}
// output per host, metric and scope
scopeData, ok := metricdata[scope]
if !ok {
scopeData = &schema.JobMetric{
Unit: metricConfig.Unit,
Timestep: metricConfig.Timestep,
Series: []schema.Series{pdb.RowToSeries(from, step, steps, row)},
}
data[hostname][metric][scope] = scopeData
}
}
}
}
t1 := time.Since(t0)
cclog.Debugf("LoadNodeListData of %v nodes took %s", len(data), t1)
return data, nil
}


@@ -0,0 +1,74 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricdata
import (
"context"
"encoding/json"
"time"
"github.com/ClusterCockpit/cc-lib/schema"
)
var TestLoadDataCallback func(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error) = func(job *schema.Job, metrics []string, scopes []schema.MetricScope, ctx context.Context, resolution int) (schema.JobData, error) {
panic("TODO")
}
// TestMetricDataRepository is only a mock for unit-testing.
type TestMetricDataRepository struct{}
func (tmdr *TestMetricDataRepository) Init(_ json.RawMessage) error {
return nil
}
func (tmdr *TestMetricDataRepository) LoadData(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
resolution int,
) (schema.JobData, error) {
return TestLoadDataCallback(job, metrics, scopes, ctx, resolution)
}
func (tmdr *TestMetricDataRepository) LoadStats(
job *schema.Job,
metrics []string,
ctx context.Context,
) (map[string]map[string]schema.MetricStatistics, error) {
panic("TODO")
}
func (tmdr *TestMetricDataRepository) LoadScopedStats(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
) (schema.ScopedJobStats, error) {
panic("TODO")
}
func (tmdr *TestMetricDataRepository) LoadNodeData(
cluster string,
metrics, nodes []string,
scopes []schema.MetricScope,
from, to time.Time,
ctx context.Context,
) (map[string]map[string][]*schema.JobMetric, error) {
panic("TODO")
}
func (tmdr *TestMetricDataRepository) LoadNodeListData(
cluster, subCluster string,
nodes []string,
metrics []string,
scopes []schema.MetricScope,
resolution int,
from, to time.Time,
ctx context.Context,
) (map[string]schema.JobData, error) {
panic("TODO")
}


@@ -1,29 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricdispatch
const configSchema = `{
"type": "array",
"description": "Array of metric store configurations with scope-based routing.",
"items": {
"type": "object",
"properties": {
"scope": {
"description": "Scope identifier for routing metrics (e.g., cluster name, '*' for default)",
"type": "string"
},
"url": {
"description": "URL of the metric store endpoint",
"type": "string"
},
"token": {
"description": "Authentication token for the metric store",
"type": "string"
}
},
"required": ["scope", "url", "token"]
}
}`


@@ -1,123 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricdispatch
import (
"bytes"
"context"
"encoding/json"
"fmt"
"time"
"github.com/ClusterCockpit/cc-backend/internal/config"
ccms "github.com/ClusterCockpit/cc-backend/internal/metricstoreclient"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
)
type MetricDataRepository interface {
// Return the JobData for the given job, only with the requested metrics.
LoadData(job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
resolution int) (schema.JobData, error)
// Return a map of metrics to a map of nodes to the metric statistics of the job. node scope only.
LoadStats(job *schema.Job,
metrics []string,
ctx context.Context) (map[string]map[string]schema.MetricStatistics, error)
// Return a map of metrics to a map of scopes to the scoped metric statistics of the job.
LoadScopedStats(job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context) (schema.ScopedJobStats, error)
// Return a map of hosts to a map of metrics at the requested scopes (currently only node) for that node.
LoadNodeData(cluster string,
metrics, nodes []string,
scopes []schema.MetricScope,
from, to time.Time,
ctx context.Context) (map[string]map[string][]*schema.JobMetric, error)
// Return a map of hosts to a map of metrics to a map of scopes for multiple nodes.
LoadNodeListData(cluster, subCluster string,
nodes []string,
metrics []string,
scopes []schema.MetricScope,
resolution int,
from, to time.Time,
ctx context.Context) (map[string]schema.JobData, error)
// HealthCheck evaluates the monitoring state for a set of nodes against expected metrics.
HealthCheck(cluster string,
nodes []string,
metrics []string) (map[string]metricstore.HealthCheckResult, error)
}
type CCMetricStoreConfig struct {
Scope string `json:"scope"`
URL string `json:"url"`
Token string `json:"token"`
}
var metricDataRepos map[string]MetricDataRepository = map[string]MetricDataRepository{}
func Init(rawConfig json.RawMessage) error {
if rawConfig != nil {
var configs []CCMetricStoreConfig
config.Validate(configSchema, rawConfig)
dec := json.NewDecoder(bytes.NewReader(rawConfig))
dec.DisallowUnknownFields()
if err := dec.Decode(&configs); err != nil {
return fmt.Errorf("[METRICDISPATCH]> Metric Store Config Init: Could not decode config file '%s' Error: %s", rawConfig, err.Error())
}
if len(configs) == 0 {
return fmt.Errorf("[METRICDISPATCH]> No metric store configurations found in config file")
}
for _, config := range configs {
metricDataRepos[config.Scope] = ccms.NewCCMetricStore(config.URL, config.Token)
}
}
return nil
}
func GetMetricDataRepo(cluster string, subcluster string) (MetricDataRepository, error) {
var repo MetricDataRepository
var ok bool
key := cluster + "-" + subcluster
repo, ok = metricDataRepos[key]
if !ok {
repo, ok = metricDataRepos[cluster]
if !ok {
repo, ok = metricDataRepos["*"]
if !ok {
if metricstore.MetricStoreHandle == nil {
return nil, fmt.Errorf("[METRICDISPATCH]> no metric data repository configured '%s'", key)
}
repo = metricstore.MetricStoreHandle
cclog.Debugf("[METRICDISPATCH]> Using internal metric data repository for '%s'", key)
}
}
}
return repo, nil
}
// GetHealthCheckRepo returns the MetricDataRepository for performing health checks on a cluster.
// It uses the same fallback logic as GetMetricDataRepo: cluster → wildcard → internal.
func GetHealthCheckRepo(cluster string) (MetricDataRepository, error) {
return GetMetricDataRepo(cluster, "")
}


@@ -3,7 +3,7 @@
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// Package metricdispatch provides a unified interface for loading and caching job metric data.
// Package metricdispatcher provides a unified interface for loading and caching job metric data.
//
// This package serves as a central dispatcher that routes metric data requests to the appropriate
// backend based on job state. For running jobs, data is fetched from the metric store (e.g., cc-metric-store).
@@ -29,13 +29,13 @@
//
// The primary entry point is LoadData, which automatically handles both running and archived jobs:
//
// jobData, err := metricdispatch.LoadData(job, metrics, scopes, ctx, resolution)
// jobData, err := metricdispatcher.LoadData(job, metrics, scopes, ctx, resolution)
// if err != nil {
// // Handle error
// }
//
// For statistics only, use LoadJobStats, LoadScopedJobStats, or LoadAverages depending on the required format.
package metricdispatch
package metricdispatcher
import (
"context"
@@ -43,11 +43,13 @@ import (
"math"
"time"
"github.com/ClusterCockpit/cc-backend/internal/config"
"github.com/ClusterCockpit/cc-backend/internal/memorystore"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/lrucache"
"github.com/ClusterCockpit/cc-lib/v2/resampler"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/lrucache"
"github.com/ClusterCockpit/cc-lib/resampler"
"github.com/ClusterCockpit/cc-lib/schema"
)
// cache is an LRU cache with 128 MB capacity for storing loaded job metric data.
@@ -64,7 +66,7 @@ func cacheKey(
resolution int,
) string {
return fmt.Sprintf("%d(%s):[%v],[%v]-%d",
*job.ID, job.State, metrics, scopes, resolution)
job.ID, job.State, metrics, scopes, resolution)
}
// LoadData retrieves metric data for a job from the appropriate backend (memory store for running jobs,
@@ -93,14 +95,8 @@ func LoadData(job *schema.Job,
var err error
if job.State == schema.JobStateRunning ||
job.MonitoringStatus == schema.MonitoringStatusRunningOrArchiving {
ms, err := GetMetricDataRepo(job.Cluster, job.SubCluster)
if err != nil {
cclog.Errorf("failed to access metricDataRepo for cluster %s-%s: %s",
job.Cluster, job.SubCluster, err.Error())
return err, 0, 0
}
job.MonitoringStatus == schema.MonitoringStatusRunningOrArchiving ||
config.Keys.DisableArchive {
if scopes == nil {
scopes = append(scopes, schema.MetricScopeNode)
@@ -113,14 +109,14 @@ func LoadData(job *schema.Job,
}
}
jd, err = ms.LoadData(job, metrics, scopes, ctx, resolution)
jd, err = memorystore.LoadData(job, metrics, scopes, ctx, resolution)
if err != nil {
if len(jd) != 0 {
cclog.Warnf("partial error loading metrics from store for job %d (user: %s, project: %s, cluster: %s-%s): %s",
job.JobID, job.User, job.Project, job.Cluster, job.SubCluster, err.Error())
cclog.Warnf("partial error loading metrics from store for job %d (user: %s, project: %s): %s",
job.JobID, job.User, job.Project, err.Error())
} else {
cclog.Warnf("failed to load job data from metric store for job %d (user: %s, project: %s, cluster: %s-%s): %s",
job.JobID, job.User, job.Project, job.Cluster, job.SubCluster, err.Error())
cclog.Errorf("failed to load job data from metric store for job %d (user: %s, project: %s): %s",
job.JobID, job.User, job.Project, err.Error())
return err, 0, 0
}
}
@@ -129,8 +125,8 @@ func LoadData(job *schema.Job,
var jdTemp schema.JobData
jdTemp, err = archive.GetHandle().LoadJobData(job)
if err != nil {
cclog.Warnf("failed to load job data from archive for job %d (user: %s, project: %s, cluster: %s-%s): %s",
job.JobID, job.User, job.Project, job.Cluster, job.SubCluster, err.Error())
cclog.Errorf("failed to load job data from archive for job %d (user: %s, project: %s): %s",
job.JobID, job.User, job.Project, err.Error())
return err, 0, 0
}
@@ -238,21 +234,14 @@ func LoadAverages(
data [][]schema.Float,
ctx context.Context,
) error {
if job.State != schema.JobStateRunning {
if job.State != schema.JobStateRunning && !config.Keys.DisableArchive {
return archive.LoadAveragesFromArchive(job, metrics, data) // #166 change also here?
}
ms, err := GetMetricDataRepo(job.Cluster, job.SubCluster)
stats, err := memorystore.LoadStats(job, metrics, ctx)
if err != nil {
cclog.Errorf("failed to access metricDataRepo for cluster %s-%s: %s",
job.Cluster, job.SubCluster, err.Error())
return err
}
stats, err := ms.LoadStats(job, metrics, ctx)
if err != nil {
cclog.Warnf("failed to load statistics from metric store for job %d (user: %s, project: %s, cluster: %s-%s): %s",
job.JobID, job.User, job.Project, job.Cluster, job.SubCluster, err.Error())
cclog.Errorf("failed to load statistics from metric store for job %d (user: %s, project: %s): %s",
job.JobID, job.User, job.Project, err.Error())
return err
}
@@ -282,27 +271,17 @@ func LoadScopedJobStats(
scopes []schema.MetricScope,
ctx context.Context,
) (schema.ScopedJobStats, error) {
if job.State != schema.JobStateRunning {
if job.State != schema.JobStateRunning && !config.Keys.DisableArchive {
return archive.LoadScopedStatsFromArchive(job, metrics, scopes)
}
ms, err := GetMetricDataRepo(job.Cluster, job.SubCluster)
scopedStats, err := memorystore.LoadScopedStats(job, metrics, scopes, ctx)
if err != nil {
cclog.Errorf("failed to access metricDataRepo for cluster %s-%s: %s",
job.Cluster, job.SubCluster, err.Error())
cclog.Errorf("failed to load scoped statistics from metric store for job %d (user: %s, project: %s): %s",
job.JobID, job.User, job.Project, err.Error())
return nil, err
}
scopedStats, err := ms.LoadScopedStats(job, metrics, scopes, ctx)
if err != nil {
cclog.Warnf("failed to load scoped statistics from metric store for job %d (user: %s, project: %s, cluster: %s-%s): %s",
job.JobID, job.User, job.Project, job.Cluster, job.SubCluster, err.Error())
return nil, err
}
// Round Resulting Stat Values
scopedStats.RoundScopedMetricStats()
return scopedStats, nil
}
@@ -314,23 +293,16 @@ func LoadJobStats(
metrics []string,
ctx context.Context,
) (map[string]schema.MetricStatistics, error) {
if job.State != schema.JobStateRunning {
if job.State != schema.JobStateRunning && !config.Keys.DisableArchive {
return archive.LoadStatsFromArchive(job, metrics)
}
ms, err := GetMetricDataRepo(job.Cluster, job.SubCluster)
if err != nil {
cclog.Errorf("failed to access metricDataRepo for cluster %s-%s: %s",
job.Cluster, job.SubCluster, err.Error())
return nil, err
}
data := make(map[string]schema.MetricStatistics, len(metrics))
stats, err := ms.LoadStats(job, metrics, ctx)
stats, err := memorystore.LoadStats(job, metrics, ctx)
if err != nil {
cclog.Warnf("failed to load statistics from metric store for job %d (user: %s, project: %s, cluster: %s-%s): %s",
job.JobID, job.User, job.Project, job.Cluster, job.SubCluster, err.Error())
cclog.Errorf("failed to load statistics from metric store for job %d (user: %s, project: %s): %s",
job.JobID, job.User, job.Project, err.Error())
return data, err
}
@@ -363,7 +335,6 @@ func LoadJobStats(
// the metric store (not the archive) since it's for current/recent node status monitoring.
//
// Returns a nested map structure: node -> metric -> scoped data.
// FIXME: Add support for subcluster specific cc-metric-stores
func LoadNodeData(
cluster string,
metrics, nodes []string,
@@ -377,19 +348,12 @@ func LoadNodeData(
}
}
ms, err := GetMetricDataRepo(cluster, "")
if err != nil {
cclog.Errorf("failed to access metricDataRepo for cluster %s: %s",
cluster, err.Error())
return nil, err
}
data, err := ms.LoadNodeData(cluster, metrics, nodes, scopes, from, to, ctx)
data, err := memorystore.LoadNodeData(cluster, metrics, nodes, scopes, from, to, ctx)
if err != nil {
if len(data) != 0 {
cclog.Warnf("partial error loading node data from metric store for cluster %s: %s", cluster, err.Error())
} else {
cclog.Warnf("failed to load node data from metric store for cluster %s: %s", cluster, err.Error())
cclog.Errorf("failed to load node data from metric store for cluster %s: %s", cluster, err.Error())
return nil, err
}
}
@@ -421,20 +385,13 @@ func LoadNodeListData(
}
}
ms, err := GetMetricDataRepo(cluster, subCluster)
if err != nil {
cclog.Errorf("failed to access metricDataRepo for cluster %s-%s: %s",
cluster, subCluster, err.Error())
return nil, err
}
data, err := ms.LoadNodeListData(cluster, subCluster, nodes, metrics, scopes, resolution, from, to, ctx)
data, err := memorystore.LoadNodeListData(cluster, subCluster, nodes, metrics, scopes, resolution, from, to, ctx)
if err != nil {
if len(data) != 0 {
cclog.Warnf("partial error loading node list data from metric store for cluster %s, subcluster %s: %s",
cluster, subCluster, err.Error())
} else {
cclog.Warnf("failed to load node list data from metric store for cluster %s, subcluster %s: %s",
cclog.Errorf("failed to load node list data from metric store for cluster %s, subcluster %s: %s",
cluster, subCluster, err.Error())
return nil, err
}
@@ -499,7 +456,7 @@ func copyJobMetric(src *schema.JobMetric) *schema.JobMetric {
func copySeries(src *schema.Series) schema.Series {
dst := schema.Series{
Hostname: src.Hostname,
ID: src.ID,
Id: src.Id,
Statistics: src.Statistics,
Data: make([]schema.Float, len(src.Data)),
}


@@ -3,12 +3,12 @@
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package metricdispatch
package metricdispatcher
import (
"testing"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/schema"
)
func TestDeepCopy(t *testing.T) {
@@ -21,7 +21,7 @@ func TestDeepCopy(t *testing.T) {
Series: []schema.Series{
{
Hostname: "node001",
ID: &nodeId,
Id: &nodeId,
Data: []schema.Float{1.0, 2.0, 3.0},
Statistics: schema.MetricStatistics{
Min: 1.0,


@@ -1,522 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// Package metricstoreclient - Query Building
//
// This file contains the query construction and scope transformation logic for cc-metric-store queries.
// It handles the complex mapping between requested metric scopes and native hardware topology,
// automatically aggregating or filtering metrics as needed.
//
// # Scope Transformations
//
// The buildScopeQueries function implements the core scope transformation algorithm.
// It handles 25+ different transformation cases, mapping between:
// - Accelerator (GPU) scope
// - HWThread (hardware thread/SMT) scope
// - Core (CPU core) scope
// - Socket (CPU package) scope
// - MemoryDomain (NUMA domain) scope
// - Node (full system) scope
//
// Transformations follow these rules:
// - Same scope: Return data as-is (e.g., Core → Core)
// - Coarser scope: Aggregate data (e.g., Core → Socket with Aggregate=true)
// - Finer scope: Error - cannot increase granularity
//
// # Query Building
//
// buildQueries and buildNodeQueries are the main entry points, handling job-specific
// and node-specific query construction respectively. They:
// - Validate metric configurations
// - Handle subcluster-specific metric filtering
// - Detect and skip duplicate scope requests
// - Call buildScopeQueries for each metric/scope/host combination
package metricstoreclient
import (
"fmt"
"strconv"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
)
// Scope string constants used in API queries.
// Pre-converted to avoid repeated allocations during query building.
var (
hwthreadString = string(schema.MetricScopeHWThread)
coreString = string(schema.MetricScopeCore)
memoryDomainString = string(schema.MetricScopeMemoryDomain)
socketString = string(schema.MetricScopeSocket)
acceleratorString = string(schema.MetricScopeAccelerator)
)
// buildQueries constructs API queries for job-specific metric data.
// It iterates through metrics, scopes, and job resources to build the complete query set.
//
// The function handles:
// - Metric configuration validation and subcluster filtering
// - Scope deduplication to avoid redundant queries
// - Hardware thread list resolution (job-allocated vs full node)
// - Delegation to buildScopeQueries for scope transformations
//
// Returns queries and their corresponding assigned scopes (which may differ from requested scopes).
func (ccms *CCMetricStore) buildQueries(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
resolution int,
) ([]APIQuery, []schema.MetricScope, error) {
// Initialize both slices together
queries := make([]APIQuery, 0, len(metrics)*len(scopes)*len(job.Resources))
assignedScope := make([]schema.MetricScope, 0, len(metrics)*len(scopes)*len(job.Resources))
topology, err := ccms.getTopology(job.Cluster, job.SubCluster)
if err != nil {
cclog.Errorf("could not load cluster %s subCluster %s topology: %s", job.Cluster, job.SubCluster, err.Error())
return nil, nil, err
}
for _, metric := range metrics {
remoteName := metric
mc := archive.GetMetricConfig(job.Cluster, metric)
if mc == nil {
cclog.Warnf("metric '%s' is not specified for cluster '%s' - skipping", metric, job.Cluster)
continue
}
// Skip if metric is removed for subcluster
if len(mc.SubClusters) != 0 {
isRemoved := false
for _, scConfig := range mc.SubClusters {
if scConfig.Name == job.SubCluster && scConfig.Remove {
isRemoved = true
break
}
}
if isRemoved {
continue
}
}
// Avoid duplicates...
handledScopes := make([]schema.MetricScope, 0, 3)
scopesLoop:
for _, requestedScope := range scopes {
nativeScope := mc.Scope
if nativeScope == schema.MetricScopeAccelerator && job.NumAcc == 0 {
continue
}
scope := nativeScope.Max(requestedScope)
for _, s := range handledScopes {
if scope == s {
continue scopesLoop
}
}
handledScopes = append(handledScopes, scope)
for _, host := range job.Resources {
hwthreads := host.HWThreads
if hwthreads == nil {
hwthreads = topology.Node
}
hostQueries, hostScopes := buildScopeQueries(
nativeScope, requestedScope,
remoteName, host.Hostname,
topology, hwthreads, host.Accelerators,
resolution,
)
if len(hostQueries) == 0 && len(hostScopes) == 0 {
return nil, nil, fmt.Errorf("METRICDATA/CCMS > TODO: unhandled case: native-scope=%s, requested-scope=%s", nativeScope, requestedScope)
}
queries = append(queries, hostQueries...)
assignedScope = append(assignedScope, hostScopes...)
}
}
}
return queries, assignedScope, nil
}
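The `scopesLoop:` label with `continue scopesLoop` is how `buildQueries` skips requested scopes that collapse to an already-handled effective scope. A standalone sketch of the same deduplication pattern, using plain strings instead of `schema.MetricScope`:

```go
package main

import "fmt"

// dedupe mirrors the labeled-continue pattern in buildQueries: when a scope
// has already been handled, jump straight to the next requested scope.
func dedupe(scopes []string) []string {
	handled := make([]string, 0, len(scopes))
scopesLoop:
	for _, s := range scopes {
		for _, h := range handled {
			if h == s {
				continue scopesLoop // duplicate: skip to next requested scope
			}
		}
		handled = append(handled, s)
	}
	return handled
}

func main() {
	fmt.Println(dedupe([]string{"core", "socket", "core", "node", "socket"}))
	// [core socket node]
}
```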
// buildNodeQueries constructs API queries for node-specific metric data (Systems View).
// Similar to buildQueries but uses full node topology instead of job-allocated resources.
//
// The function handles:
// - SubCluster topology resolution (either pre-loaded or per-node lookup)
// - Full node hardware thread lists (not job-specific subsets)
// - All accelerators on each node
// - Metric configuration validation with subcluster filtering
//
// Returns queries and their corresponding assigned scopes.
func (ccms *CCMetricStore) buildNodeQueries(
cluster string,
subCluster string,
nodes []string,
metrics []string,
scopes []schema.MetricScope,
resolution int,
) ([]APIQuery, []schema.MetricScope, error) {
// Initialize both slices together
queries := make([]APIQuery, 0, len(metrics)*len(scopes)*len(nodes))
assignedScope := make([]schema.MetricScope, 0, len(metrics)*len(scopes)*len(nodes))
for _, metric := range metrics {
remoteName := metric
mc := archive.GetMetricConfig(cluster, metric)
if mc == nil {
cclog.Warnf("metric '%s' is not specified for cluster '%s'", metric, cluster)
continue
}
// Skip if metric is removed for subcluster
if mc.SubClusters != nil {
isRemoved := false
for _, scConfig := range mc.SubClusters {
if scConfig.Name == subCluster && scConfig.Remove {
isRemoved = true
break
}
}
if isRemoved {
continue
}
}
// Avoid duplicates...
handledScopes := make([]schema.MetricScope, 0, 3)
scopesLoop:
for _, requestedScope := range scopes {
nativeScope := mc.Scope
scope := nativeScope.Max(requestedScope)
for _, s := range handledScopes {
if scope == s {
continue scopesLoop
}
}
handledScopes = append(handledScopes, scope)
for _, hostname := range nodes {
var topology *schema.Topology
var err error
// If no subCluster given, get it by node
if subCluster == "" {
topology, err = ccms.getTopologyByNode(cluster, hostname)
} else {
topology, err = ccms.getTopology(cluster, subCluster)
}
if err != nil {
return nil, nil, err
}
// Always full node hwthread id list, no partial queries expected -> Use "topology.Node" directly where applicable
// Always full accelerator id list, no partial queries expected -> Use "acceleratorIds" directly where applicable
acceleratorIds := topology.GetAcceleratorIDs()
// Check moved here: skip accelerator-scope metrics on nodes without accelerators
if nativeScope == schema.MetricScopeAccelerator && len(acceleratorIds) == 0 {
continue scopesLoop
}
nodeQueries, nodeScopes := buildScopeQueries(
nativeScope, requestedScope,
remoteName, hostname,
topology, topology.Node, acceleratorIds,
resolution,
)
if len(nodeQueries) == 0 && len(nodeScopes) == 0 {
return nil, nil, fmt.Errorf("METRICDATA/CCMS > TODO: unhandled case: native-scope=%s, requested-scope=%s", nativeScope, requestedScope)
}
queries = append(queries, nodeQueries...)
assignedScope = append(assignedScope, nodeScopes...)
}
}
}
return queries, assignedScope, nil
}
// buildScopeQueries generates API queries for a given scope transformation.
// It returns a slice of queries and corresponding assigned scopes.
// Some transformations (e.g., HWThread -> Core/Socket) may generate multiple queries.
func buildScopeQueries(
nativeScope, requestedScope schema.MetricScope,
metric, hostname string,
topology *schema.Topology,
hwthreads []int,
accelerators []string,
resolution int,
) ([]APIQuery, []schema.MetricScope) {
scope := nativeScope.Max(requestedScope)
queries := []APIQuery{}
scopes := []schema.MetricScope{}
hwthreadsStr := intToStringSlice(hwthreads)
// Accelerator -> Accelerator (Use "accelerator" scope if requested scope is lower than node)
if nativeScope == schema.MetricScopeAccelerator && scope.LT(schema.MetricScopeNode) {
if scope != schema.MetricScopeAccelerator {
return queries, scopes
}
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: false,
Type: &acceleratorString,
TypeIds: accelerators,
Resolution: resolution,
})
scopes = append(scopes, schema.MetricScopeAccelerator)
return queries, scopes
}
// Accelerator -> Node
if nativeScope == schema.MetricScopeAccelerator && scope == schema.MetricScopeNode {
if len(accelerators) == 0 {
return queries, scopes
}
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &acceleratorString,
TypeIds: accelerators,
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// HWThread -> HWThread
if nativeScope == schema.MetricScopeHWThread && scope == schema.MetricScopeHWThread {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: false,
Type: &hwthreadString,
TypeIds: hwthreadsStr,
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// HWThread -> Core
if nativeScope == schema.MetricScopeHWThread && scope == schema.MetricScopeCore {
cores, _ := topology.GetCoresFromHWThreads(hwthreads)
for _, core := range cores {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &hwthreadString,
TypeIds: intToStringSlice(topology.Core[core]),
Resolution: resolution,
})
scopes = append(scopes, scope)
}
return queries, scopes
}
// HWThread -> Socket
if nativeScope == schema.MetricScopeHWThread && scope == schema.MetricScopeSocket {
sockets, _ := topology.GetSocketsFromHWThreads(hwthreads)
for _, socket := range sockets {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &hwthreadString,
TypeIds: intToStringSlice(topology.Socket[socket]),
Resolution: resolution,
})
scopes = append(scopes, scope)
}
return queries, scopes
}
// HWThread -> Node
if nativeScope == schema.MetricScopeHWThread && scope == schema.MetricScopeNode {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &hwthreadString,
TypeIds: hwthreadsStr,
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// Core -> Core
if nativeScope == schema.MetricScopeCore && scope == schema.MetricScopeCore {
cores, _ := topology.GetCoresFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: false,
Type: &coreString,
TypeIds: intToStringSlice(cores),
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// Core -> Socket
if nativeScope == schema.MetricScopeCore && scope == schema.MetricScopeSocket {
sockets, _ := topology.GetSocketsFromCores(hwthreads)
for _, socket := range sockets {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &coreString,
TypeIds: intToStringSlice(topology.Socket[socket]),
Resolution: resolution,
})
scopes = append(scopes, scope)
}
return queries, scopes
}
// Core -> Node
if nativeScope == schema.MetricScopeCore && scope == schema.MetricScopeNode {
cores, _ := topology.GetCoresFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &coreString,
TypeIds: intToStringSlice(cores),
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// MemoryDomain -> MemoryDomain
if nativeScope == schema.MetricScopeMemoryDomain && scope == schema.MetricScopeMemoryDomain {
memDomains, _ := topology.GetMemoryDomainsFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: false,
Type: &memoryDomainString,
TypeIds: intToStringSlice(memDomains),
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// MemoryDomain -> Node
if nativeScope == schema.MetricScopeMemoryDomain && scope == schema.MetricScopeNode {
memDomains, _ := topology.GetMemoryDomainsFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &memoryDomainString,
TypeIds: intToStringSlice(memDomains),
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// MemoryDomain -> Socket
if nativeScope == schema.MetricScopeMemoryDomain && scope == schema.MetricScopeSocket {
memDomains, _ := topology.GetMemoryDomainsFromHWThreads(hwthreads)
socketToDomains, err := topology.GetMemoryDomainsBySocket(memDomains)
if err != nil {
cclog.Errorf("Error mapping memory domains to sockets, returning unchanged: %v", err)
return queries, scopes
}
// Create a query for each socket
for _, domains := range socketToDomains {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &memoryDomainString,
TypeIds: intToStringSlice(domains),
Resolution: resolution,
})
// Add scope for each query, not just once
scopes = append(scopes, scope)
}
return queries, scopes
}
// Socket -> Socket
if nativeScope == schema.MetricScopeSocket && scope == schema.MetricScopeSocket {
sockets, _ := topology.GetSocketsFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: false,
Type: &socketString,
TypeIds: intToStringSlice(sockets),
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// Socket -> Node
if nativeScope == schema.MetricScopeSocket && scope == schema.MetricScopeNode {
sockets, _ := topology.GetSocketsFromHWThreads(hwthreads)
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Aggregate: true,
Type: &socketString,
TypeIds: intToStringSlice(sockets),
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// Node -> Node
if nativeScope == schema.MetricScopeNode && scope == schema.MetricScopeNode {
queries = append(queries, APIQuery{
Metric: metric,
Hostname: hostname,
Resolution: resolution,
})
scopes = append(scopes, scope)
return queries, scopes
}
// Unhandled case - return empty slices
return queries, scopes
}
// intToStringSlice converts a slice of integers to a slice of strings.
// Used to convert hardware IDs (core IDs, socket IDs, etc.) to the string format required by the API.
func intToStringSlice(is []int) []string {
ss := make([]string, len(is))
for i, x := range is {
ss[i] = strconv.Itoa(x)
}
return ss
}
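A minimal usage sketch of the helper above (the function body is copied verbatim so the example is self-contained):

```go
package main

import (
	"fmt"
	"strconv"
)

// intToStringSlice: hardware IDs are integers in the topology, but the
// cc-metric-store API expects string type-ids.
func intToStringSlice(is []int) []string {
	ss := make([]string, len(is))
	for i, x := range is {
		ss[i] = strconv.Itoa(x)
	}
	return ss
}

func main() {
	// e.g. the hwthread IDs of a job allocation become API type-ids
	fmt.Println(intToStringSlice([]int{0, 1, 64, 65})) // [0 1 64 65]
}
```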


@@ -1,835 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
// Package metricstoreclient provides a client for querying the cc-metric-store time series database.
//
// The cc-metric-store is a high-performance time series database optimized for HPC metric data.
// This client handles HTTP communication, query construction, scope transformations, and data retrieval
// for job and node metrics across different metric scopes (node, socket, core, hwthread, accelerator).
//
// # Architecture
//
// The package is split into two main components:
// - Client Operations (cc-metric-store.go): HTTP client, request handling, data loading methods
// - Query Building (cc-metric-store-queries.go): Query construction and scope transformation logic
//
// # Basic Usage
//
// store := NewCCMetricStore("http://localhost:8080", "jwt-token")
//
// // Load job data
// jobData, err := store.LoadData(job, metrics, scopes, ctx, resolution)
// if err != nil {
// log.Fatal(err)
// }
//
// # Metric Scopes
//
// The client supports hierarchical metric scopes that map to HPC hardware topology:
// - MetricScopeAccelerator: GPU/accelerator level metrics
// - MetricScopeHWThread: Hardware thread (SMT) level metrics
// - MetricScopeCore: CPU core level metrics
// - MetricScopeSocket: CPU socket level metrics
// - MetricScopeMemoryDomain: NUMA domain level metrics
// - MetricScopeNode: Full node level metrics
//
// The client automatically handles scope transformations, aggregating finer-grained metrics
// to coarser scopes when needed (e.g., aggregating core metrics to socket level).
//
// # Error Handling
//
// The client supports partial errors - if some queries fail, it returns both the successful
// data and an error listing the failed queries. This allows processing partial results
// when some nodes or metrics are temporarily unavailable.
//
// # API Versioning
//
// The client uses cc-metric-store API v2, which includes support for:
// - Data resampling for bandwidth optimization
// - Multi-scope queries in a single request
// - Aggregation across hardware topology levels
package metricstoreclient
import (
"bufio"
"bytes"
"context"
"encoding/json"
"fmt"
"net/http"
"strings"
"time"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
"github.com/ClusterCockpit/cc-backend/pkg/metricstore"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
)
// CCMetricStore is the HTTP client for communicating with cc-metric-store.
// It manages connection details, authentication, and provides methods for querying metrics.
type CCMetricStore struct {
client http.Client // HTTP client with 10-second timeout
jwt string // JWT Bearer token for authentication
url string // Base URL of cc-metric-store instance
queryEndpoint string // Full URL to query API endpoint
topologyCache map[string]*schema.Topology // cluster -> topology cache
}
// APIQueryRequest represents a request to the cc-metric-store query API.
// It supports both explicit queries and "for-all-nodes" bulk queries.
type APIQueryRequest struct {
Cluster string `json:"cluster"` // Target cluster name
Queries []APIQuery `json:"queries"` // Explicit list of metric queries
ForAllNodes []string `json:"for-all-nodes"` // Metrics to query for all nodes
From int64 `json:"from"` // Start time (Unix timestamp)
To int64 `json:"to"` // End time (Unix timestamp)
WithStats bool `json:"with-stats"` // Include min/avg/max statistics
WithData bool `json:"with-data"` // Include time series data points
}
// APIQuery specifies a single metric query with optional scope filtering.
// Type and TypeIds define the hardware scope (core, socket, accelerator, etc.).
type APIQuery struct {
Type *string `json:"type,omitempty"` // Scope type (e.g., "core", "socket")
SubType *string `json:"subtype,omitempty"` // Sub-scope type (reserved for future use)
Metric string `json:"metric"` // Metric name
Hostname string `json:"host"` // Target hostname
Resolution int `json:"resolution"` // Data resolution in seconds (0 = native)
TypeIds []string `json:"type-ids,omitempty"` // IDs for the scope type (e.g., core IDs)
SubTypeIds []string `json:"subtype-ids,omitempty"` // IDs for sub-scope (reserved)
Aggregate bool `json:"aggreg"` // Aggregate across TypeIds
}
// APIQueryResponse contains the results from a cc-metric-store query.
// Results align with the Queries slice by index.
type APIQueryResponse struct {
Queries []APIQuery `json:"queries,omitempty"` // Echoed queries (for bulk requests)
Results [][]APIMetricData `json:"results"` // Result data, indexed by query
}
// APIMetricData represents time series data and statistics for a single metric series.
// Error is set if this particular series failed to load.
type APIMetricData struct {
Error *string `json:"error"` // Error message if query failed
Data []schema.Float `json:"data"` // Time series data points
From int64 `json:"from"` // Actual start time of data
To int64 `json:"to"` // Actual end time of data
Resolution int `json:"resolution"` // Actual resolution of data in seconds
Avg schema.Float `json:"avg"` // Average value across time range
Min schema.Float `json:"min"` // Minimum value in time range
Max schema.Float `json:"max"` // Maximum value in time range
}
// NewCCMetricStore creates and initializes a new CCMetricStore client.
// The url parameter should include the protocol and port (e.g., "http://localhost:8080").
// The token parameter is a JWT used for Bearer authentication; pass empty string if auth is disabled.
func NewCCMetricStore(url string, token string) *CCMetricStore {
return &CCMetricStore{
url: url,
queryEndpoint: fmt.Sprintf("%s/api/query", url),
jwt: token,
client: http.Client{
Timeout: 10 * time.Second,
},
topologyCache: make(map[string]*schema.Topology),
}
}
// doRequest executes an HTTP GET request with a JSON body against the cc-metric-store query API.
// It handles JSON encoding/decoding, authentication, and API versioning.
// The response body is automatically closed to prevent resource leaks.
func (ccms *CCMetricStore) doRequest(
ctx context.Context,
body *APIQueryRequest,
) (*APIQueryResponse, error) {
buf := &bytes.Buffer{}
if err := json.NewEncoder(buf).Encode(body); err != nil {
cclog.Errorf("Error while encoding request body: %s", err.Error())
return nil, err
}
req, err := http.NewRequestWithContext(ctx, http.MethodGet, ccms.queryEndpoint, buf)
if err != nil {
cclog.Errorf("Error while building request body: %s", err.Error())
return nil, err
}
if ccms.jwt != "" {
req.Header.Add("Authorization", fmt.Sprintf("Bearer %s", ccms.jwt))
}
// Version the cc-metric-store query API:
// v2 = data with resampling
// v1 = data without resampling
q := req.URL.Query()
q.Add("version", "v2")
req.URL.RawQuery = q.Encode()
res, err := ccms.client.Do(req)
if err != nil {
cclog.Errorf("Error while performing request: %s", err.Error())
return nil, err
}
defer res.Body.Close()
if res.StatusCode != http.StatusOK {
return nil, fmt.Errorf("'%s': HTTP Status: %s", ccms.queryEndpoint, res.Status)
}
var resBody APIQueryResponse
if err := json.NewDecoder(bufio.NewReader(res.Body)).Decode(&resBody); err != nil {
cclog.Errorf("Error while decoding result body: %s", err.Error())
return nil, err
}
return &resBody, nil
}
// getTopology returns the topology for a given cluster and subcluster, caching it if not already present
func (ccms *CCMetricStore) getTopology(cluster, subCluster string) (*schema.Topology, error) {
cacheKey := fmt.Sprintf("%s:%s", cluster, subCluster)
if topology, ok := ccms.topologyCache[cacheKey]; ok {
return topology, nil
}
subcluster, err := archive.GetSubCluster(cluster, subCluster)
if err != nil {
return nil, err
}
ccms.topologyCache[cacheKey] = &subcluster.Topology
return &subcluster.Topology, nil
}
// getTopologyByNode returns the topology for a given cluster and node, caching it if not already present
func (ccms *CCMetricStore) getTopologyByNode(cluster, node string) (*schema.Topology, error) {
subCluster, err := archive.GetSubClusterByNode(cluster, node)
if err != nil {
return nil, err
}
return ccms.getTopology(cluster, subCluster)
}
// LoadData retrieves time series data and statistics for the specified job and metrics.
// It queries data for the job's time range and resources, handling scope transformations automatically.
//
// Parameters:
// - job: Job metadata including cluster, time range, and allocated resources
// - metrics: List of metric names to retrieve
// - scopes: Requested metric scopes (node, socket, core, etc.)
// - ctx: Context for cancellation and timeouts
// - resolution: Data resolution in seconds (0 for native resolution)
//
// Returns JobData organized as: metric -> scope -> series list.
// Supports partial errors: returns available data even if some queries fail.
func (ccms *CCMetricStore) LoadData(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
resolution int,
) (schema.JobData, error) {
queries, assignedScope, err := ccms.buildQueries(job, metrics, scopes, resolution)
if err != nil {
cclog.Errorf("Error while building queries for jobId %d, Metrics %v, Scopes %v: %s", job.JobID, metrics, scopes, err.Error())
return nil, err
}
// Verify assignment is correct - log any inconsistencies for debugging
if len(queries) != len(assignedScope) {
cclog.Errorf("Critical error: queries and assignedScope have different lengths after buildQueries: %d vs %d",
len(queries), len(assignedScope))
}
req := APIQueryRequest{
Cluster: job.Cluster,
From: job.StartTime,
To: job.StartTime + int64(job.Duration),
Queries: queries,
WithStats: true,
WithData: true,
}
resBody, err := ccms.doRequest(ctx, &req)
if err != nil {
cclog.Errorf("Error while performing request for job %d: %s", job.JobID, err.Error())
return nil, err
}
var errors []string
jobData := make(schema.JobData)
// Add safety check for potential index out of range errors
if len(resBody.Results) != len(req.Queries) || len(assignedScope) != len(req.Queries) {
cclog.Warnf("Mismatch in query results count: queries=%d, results=%d, assignedScope=%d",
len(req.Queries), len(resBody.Results), len(assignedScope))
if len(resBody.Results) > len(req.Queries) {
resBody.Results = resBody.Results[:len(req.Queries)]
}
if len(assignedScope) > len(req.Queries) {
assignedScope = assignedScope[:len(req.Queries)]
}
}
for i, row := range resBody.Results {
// Safety check to prevent index out of range errors
if i >= len(req.Queries) || i >= len(assignedScope) {
cclog.Warnf("Index out of range prevented: i=%d, queries=%d, assignedScope=%d",
i, len(req.Queries), len(assignedScope))
continue
}
query := req.Queries[i]
metric := query.Metric
scope := assignedScope[i]
mc := archive.GetMetricConfig(job.Cluster, metric)
if mc == nil {
cclog.Warnf("Metric config not found for %s on cluster %s", metric, job.Cluster)
continue
}
if _, ok := jobData[metric]; !ok {
jobData[metric] = make(map[schema.MetricScope]*schema.JobMetric)
}
res := mc.Timestep
if len(row) > 0 {
res = row[0].Resolution
}
jobMetric, ok := jobData[metric][scope]
if !ok {
jobMetric = &schema.JobMetric{
Unit: mc.Unit,
Timestep: res,
Series: make([]schema.Series, 0),
}
jobData[metric][scope] = jobMetric
}
for ndx, res := range row {
if res.Error != nil {
/* Build list for "partial errors", if any */
errors = append(errors, fmt.Sprintf("failed to fetch '%s' from host '%s': %s", query.Metric, query.Hostname, *res.Error))
continue
}
id := (*string)(nil)
if query.Type != nil {
// Check if ndx is within the bounds of TypeIds slice
if ndx < len(query.TypeIds) {
id = new(string)
*id = query.TypeIds[ndx]
} else {
// Log the error but continue processing
cclog.Warnf("TypeIds index out of range: %d with length %d for metric %s on host %s",
ndx, len(query.TypeIds), query.Metric, query.Hostname)
}
}
sanitizeStats(&res.Avg, &res.Min, &res.Max)
jobMetric.Series = append(jobMetric.Series, schema.Series{
Hostname: query.Hostname,
ID: id,
Statistics: schema.MetricStatistics{
Avg: float64(res.Avg),
Min: float64(res.Min),
Max: float64(res.Max),
},
Data: res.Data,
})
}
// So that one can later check len(jobData):
if len(jobMetric.Series) == 0 {
delete(jobData[metric], scope)
if len(jobData[metric]) == 0 {
delete(jobData, metric)
}
}
}
if len(errors) != 0 {
/* Returns list for "partial errors" */
return jobData, fmt.Errorf("METRICDATA/CCMS > Errors: %s", strings.Join(errors, ", "))
}
return jobData, nil
}
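`LoadData`'s partial-error handling collects per-series failures while keeping the series that succeeded, then folds the failure list into a single error so the caller receives both. A minimal sketch of that fold (message format taken from the code above):

```go
package main

import (
	"fmt"
	"strings"
)

// partialError mirrors LoadData's tail: no failures yields nil, otherwise
// all failure messages are joined into one error alongside the usable data.
func partialError(errs []string) error {
	if len(errs) == 0 {
		return nil
	}
	return fmt.Errorf("METRICDATA/CCMS > Errors: %s", strings.Join(errs, ", "))
}

func main() {
	fmt.Println(partialError([]string{
		"failed to fetch 'flops_any' from host 'node001': timeout",
		"failed to fetch 'mem_bw' from host 'node002': timeout",
	}))
}
```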
// LoadStats retrieves min/avg/max statistics for job metrics at node scope.
// This is faster than LoadData when only statistical summaries are needed (no time series data).
//
// Returns statistics organized as: metric -> hostname -> statistics.
func (ccms *CCMetricStore) LoadStats(
job *schema.Job,
metrics []string,
ctx context.Context,
) (map[string]map[string]schema.MetricStatistics, error) {
queries, _, err := ccms.buildQueries(job, metrics, []schema.MetricScope{schema.MetricScopeNode}, 0) // #166 Add scope here for analysis view accelerator normalization?
if err != nil {
cclog.Errorf("Error while building queries for jobId %d, Metrics %v: %s", job.JobID, metrics, err.Error())
return nil, err
}
req := APIQueryRequest{
Cluster: job.Cluster,
From: job.StartTime,
To: job.StartTime + int64(job.Duration),
Queries: queries,
WithStats: true,
WithData: false,
}
resBody, err := ccms.doRequest(ctx, &req)
if err != nil {
cclog.Errorf("Error while performing request for job %d: %s", job.JobID, err.Error())
return nil, err
}
stats := make(map[string]map[string]schema.MetricStatistics, len(metrics))
for i, res := range resBody.Results {
query := req.Queries[i]
metric := query.Metric
data := res[0]
if data.Error != nil {
cclog.Warnf("fetching %s for node %s failed: %s", metric, query.Hostname, *data.Error)
continue
}
metricdata, ok := stats[metric]
if !ok {
metricdata = make(map[string]schema.MetricStatistics, job.NumNodes)
stats[metric] = metricdata
}
if hasNaNStats(data.Avg, data.Min, data.Max) {
cclog.Warnf("fetching %s for node %s failed: one of avg/min/max is NaN", metric, query.Hostname)
continue
}
metricdata[query.Hostname] = schema.MetricStatistics{
Avg: float64(data.Avg),
Min: float64(data.Min),
Max: float64(data.Max),
}
}
return stats, nil
}
// LoadScopedStats retrieves statistics for job metrics across multiple scopes.
// Used for the Job-View Statistics Table to display per-scope breakdowns.
//
// Returns statistics organized as: metric -> scope -> list of scoped statistics.
// Each scoped statistic includes hostname, hardware ID (if applicable), and min/avg/max values.
func (ccms *CCMetricStore) LoadScopedStats(
job *schema.Job,
metrics []string,
scopes []schema.MetricScope,
ctx context.Context,
) (schema.ScopedJobStats, error) {
queries, assignedScope, err := ccms.buildQueries(job, metrics, scopes, 0)
if err != nil {
cclog.Errorf("Error while building queries for jobId %d, Metrics %v, Scopes %v: %s", job.JobID, metrics, scopes, err.Error())
return nil, err
}
req := APIQueryRequest{
Cluster: job.Cluster,
From: job.StartTime,
To: job.StartTime + int64(job.Duration),
Queries: queries,
WithStats: true,
WithData: false,
}
resBody, err := ccms.doRequest(ctx, &req)
if err != nil {
cclog.Errorf("Error while performing request for job %d: %s", job.JobID, err.Error())
return nil, err
}
var errors []string
scopedJobStats := make(schema.ScopedJobStats)
for i, row := range resBody.Results {
query := req.Queries[i]
metric := query.Metric
scope := assignedScope[i]
if _, ok := scopedJobStats[metric]; !ok {
scopedJobStats[metric] = make(map[schema.MetricScope][]*schema.ScopedStats)
}
if _, ok := scopedJobStats[metric][scope]; !ok {
scopedJobStats[metric][scope] = make([]*schema.ScopedStats, 0)
}
for ndx, res := range row {
if res.Error != nil {
/* Build list for "partial errors", if any */
errors = append(errors, fmt.Sprintf("failed to fetch '%s' from host '%s': %s", query.Metric, query.Hostname, *res.Error))
continue
}
id := (*string)(nil)
if query.Type != nil {
// Check if ndx is within the bounds of TypeIds slice
if ndx < len(query.TypeIds) {
id = new(string)
*id = query.TypeIds[ndx]
} else {
// Log the error but continue processing
cclog.Warnf("TypeIds index out of range: %d with length %d for metric %s on host %s",
ndx, len(query.TypeIds), query.Metric, query.Hostname)
}
}
sanitizeStats(&res.Avg, &res.Min, &res.Max)
scopedJobStats[metric][scope] = append(scopedJobStats[metric][scope], &schema.ScopedStats{
Hostname: query.Hostname,
ID: id,
Data: &schema.MetricStatistics{
Avg: float64(res.Avg),
Min: float64(res.Min),
Max: float64(res.Max),
},
})
}
		// Remove empty entries so that one can later check len(scopedJobStats[metric][scope]):
if len(scopedJobStats[metric][scope]) == 0 {
delete(scopedJobStats[metric], scope)
if len(scopedJobStats[metric]) == 0 {
delete(scopedJobStats, metric)
}
}
}
if len(errors) != 0 {
/* Returns list for "partial errors" */
return scopedJobStats, fmt.Errorf("METRICDATA/CCMS > Errors: %s", strings.Join(errors, ", "))
}
return scopedJobStats, nil
}
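Both LoadScopedStats and LoadNodeListData derive an optional series ID from `query.TypeIds` with an explicit bounds check, leaving the pointer nil when the index is out of range. The guarded assignment in isolation (function name hypothetical):

```go
package main

import "fmt"

// typeID returns a pointer to typeIds[ndx] when hasType is set and ndx is
// in range; otherwise nil, matching the guarded id assignment in the
// loaders (which logs and continues on out-of-range indices).
func typeID(hasType bool, typeIds []string, ndx int) *string {
	if !hasType {
		return nil // node-scope query: no hardware ID
	}
	if ndx < len(typeIds) {
		id := typeIds[ndx]
		return &id
	}
	// Out of range: the real code logs a warning and keeps id nil.
	return nil
}

func main() {
	ids := []string{"0", "1"}
	fmt.Println(typeID(true, ids, 1) != nil, typeID(true, ids, 5) == nil)
}
```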
// LoadNodeData retrieves current metric data for specified nodes in a cluster.
// Used for the Systems-View Node-Overview to display real-time node status.
//
// If nodes is nil, queries all metrics for all nodes in the cluster (bulk query).
// Returns data organized as: hostname -> metric -> list of JobMetric (with time series and stats).
func (ccms *CCMetricStore) LoadNodeData(
cluster string,
metrics, nodes []string,
scopes []schema.MetricScope,
from, to time.Time,
ctx context.Context,
) (map[string]map[string][]*schema.JobMetric, error) {
req := APIQueryRequest{
Cluster: cluster,
From: from.Unix(),
To: to.Unix(),
WithStats: true,
WithData: true,
}
if nodes == nil {
req.ForAllNodes = append(req.ForAllNodes, metrics...)
} else {
for _, node := range nodes {
for _, metric := range metrics {
req.Queries = append(req.Queries, APIQuery{
Hostname: node,
Metric: metric,
					Resolution: 0, // Default for node queries: returns data at the metric's native timestep resolution
})
}
}
}
resBody, err := ccms.doRequest(ctx, &req)
if err != nil {
cclog.Errorf("Error while performing request for cluster %s: %s", cluster, err.Error())
return nil, err
}
var errors []string
data := make(map[string]map[string][]*schema.JobMetric)
for i, res := range resBody.Results {
var query APIQuery
if resBody.Queries != nil {
query = resBody.Queries[i]
} else {
query = req.Queries[i]
}
metric := query.Metric
qdata := res[0]
if qdata.Error != nil {
/* Build list for "partial errors", if any */
errors = append(errors, fmt.Sprintf("fetching %s for node %s failed: %s", metric, query.Hostname, *qdata.Error))
}
sanitizeStats(&qdata.Avg, &qdata.Min, &qdata.Max)
hostdata, ok := data[query.Hostname]
if !ok {
hostdata = make(map[string][]*schema.JobMetric)
data[query.Hostname] = hostdata
}
mc := archive.GetMetricConfig(cluster, metric)
hostdata[metric] = append(hostdata[metric], &schema.JobMetric{
Unit: mc.Unit,
Timestep: mc.Timestep,
Series: []schema.Series{
{
Hostname: query.Hostname,
Data: qdata.Data,
Statistics: schema.MetricStatistics{
Avg: float64(qdata.Avg),
Min: float64(qdata.Min),
Max: float64(qdata.Max),
},
},
},
})
}
if len(errors) != 0 {
/* Returns list of "partial errors" */
return data, fmt.Errorf("METRICDATA/CCMS > Errors: %s", strings.Join(errors, ", "))
}
return data, nil
}
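When an explicit node list is given, LoadNodeData builds one query per (node, metric) pair instead of using the ForAllNodes bulk path. The cross-product construction as a standalone sketch (the `query` type is a trimmed-down stand-in for `APIQuery`):

```go
package main

import "fmt"

type query struct {
	Hostname, Metric string
	Resolution       int
}

// nodeQueries builds one query per (node, metric) pair, with resolution 0
// meaning "native timestep", as LoadNodeData does for explicit node lists.
func nodeQueries(nodes, metrics []string) []query {
	qs := make([]query, 0, len(nodes)*len(metrics))
	for _, n := range nodes {
		for _, m := range metrics {
			qs = append(qs, query{Hostname: n, Metric: m, Resolution: 0})
		}
	}
	return qs
}

func main() {
	qs := nodeQueries([]string{"n01", "n02"}, []string{"flops", "mem_bw"})
	fmt.Println(len(qs), qs[0].Hostname, qs[3].Metric)
}
```

Results then arrive in the same order, which is why the response loop can index `req.Queries[i]`.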
// LoadNodeListData retrieves paginated node metrics for the Systems-View Node-List.
//
// Supports filtering by subcluster and node name pattern. The nodeFilter performs
// substring matching on hostnames.
//
// Returns:
// - Node data organized as: hostname -> JobData (metric -> scope -> series)
// - Total node count (before pagination)
// - HasNextPage flag indicating if more pages are available
// - Error (may be partial error with some data returned)
func (ccms *CCMetricStore) LoadNodeListData(
cluster, subCluster string,
nodes []string,
metrics []string,
scopes []schema.MetricScope,
resolution int,
from, to time.Time,
ctx context.Context,
) (map[string]schema.JobData, error) {
queries, assignedScope, err := ccms.buildNodeQueries(cluster, subCluster, nodes, metrics, scopes, resolution)
if err != nil {
cclog.Errorf("Error while building node queries for Cluster %s, SubCluster %s, Metrics %v, Scopes %v: %s", cluster, subCluster, metrics, scopes, err.Error())
return nil, err
}
// Verify assignment is correct - log any inconsistencies for debugging
if len(queries) != len(assignedScope) {
cclog.Errorf("Critical error: queries and assignedScope have different lengths after buildNodeQueries: %d vs %d",
len(queries), len(assignedScope))
}
req := APIQueryRequest{
Cluster: cluster,
Queries: queries,
From: from.Unix(),
To: to.Unix(),
WithStats: true,
WithData: true,
}
resBody, err := ccms.doRequest(ctx, &req)
if err != nil {
cclog.Errorf("Error while performing request for cluster %s: %s", cluster, err.Error())
return nil, err
}
var errors []string
data := make(map[string]schema.JobData)
// Add safety check for index out of range issues
if len(resBody.Results) != len(req.Queries) || len(assignedScope) != len(req.Queries) {
cclog.Warnf("Mismatch in query results count: queries=%d, results=%d, assignedScope=%d",
len(req.Queries), len(resBody.Results), len(assignedScope))
if len(resBody.Results) > len(req.Queries) {
resBody.Results = resBody.Results[:len(req.Queries)]
}
if len(assignedScope) > len(req.Queries) {
assignedScope = assignedScope[:len(req.Queries)]
}
}
for i, row := range resBody.Results {
// Safety check to prevent index out of range errors
if i >= len(req.Queries) || i >= len(assignedScope) {
cclog.Warnf("Index out of range prevented: i=%d, queries=%d, assignedScope=%d",
i, len(req.Queries), len(assignedScope))
continue
}
var query APIQuery
if resBody.Queries != nil {
if i < len(resBody.Queries) {
query = resBody.Queries[i]
} else {
cclog.Warnf("Index out of range prevented for resBody.Queries: i=%d, len=%d",
i, len(resBody.Queries))
continue
}
} else {
query = req.Queries[i]
}
metric := query.Metric
scope := assignedScope[i]
mc := archive.GetMetricConfig(cluster, metric)
if mc == nil {
cclog.Warnf("Metric config not found for %s on cluster %s", metric, cluster)
continue
}
res := mc.Timestep
if len(row) > 0 {
res = row[0].Resolution
}
		// Initialize nested map data structures if not present
hostData, ok := data[query.Hostname]
if !ok {
hostData = make(schema.JobData)
data[query.Hostname] = hostData
}
metricData, ok := hostData[metric]
if !ok {
metricData = make(map[schema.MetricScope]*schema.JobMetric)
data[query.Hostname][metric] = metricData
}
scopeData, ok := metricData[scope]
if !ok {
scopeData = &schema.JobMetric{
Unit: mc.Unit,
Timestep: res,
Series: make([]schema.Series, 0),
}
data[query.Hostname][metric][scope] = scopeData
}
for ndx, res := range row {
if res.Error != nil {
/* Build list for "partial errors", if any */
errors = append(errors, fmt.Sprintf("failed to fetch '%s' from host '%s': %s", query.Metric, query.Hostname, *res.Error))
continue
}
id := (*string)(nil)
if query.Type != nil {
// Check if ndx is within the bounds of TypeIds slice
if ndx < len(query.TypeIds) {
id = new(string)
*id = query.TypeIds[ndx]
} else {
// Log the error but continue processing
cclog.Warnf("TypeIds index out of range: %d with length %d for metric %s on host %s",
ndx, len(query.TypeIds), query.Metric, query.Hostname)
}
}
sanitizeStats(&res.Avg, &res.Min, &res.Max)
scopeData.Series = append(scopeData.Series, schema.Series{
Hostname: query.Hostname,
ID: id,
Statistics: schema.MetricStatistics{
Avg: float64(res.Avg),
Min: float64(res.Min),
Max: float64(res.Max),
},
Data: res.Data,
})
}
}
if len(errors) != 0 {
/* Returns list of "partial errors" */
return data, fmt.Errorf("METRICDATA/CCMS > Errors: %s", strings.Join(errors, ", "))
}
return data, nil
}
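The hostData/metricData/scopeData lookups above all use the comma-ok idiom to create nested maps exactly once per key. A minimal sketch of that lazy-initialization pattern (names hypothetical):

```go
package main

import "fmt"

// ensure returns the inner map for key, creating it on first access --
// the same comma-ok pattern used for hostData and metricData above.
func ensure(outer map[string]map[string]int, key string) map[string]int {
	inner, ok := outer[key]
	if !ok {
		inner = make(map[string]int)
		outer[key] = inner
	}
	return inner
}

func main() {
	data := make(map[string]map[string]int)
	ensure(data, "n01")["flops"] = 1
	ensure(data, "n01")["mem_bw"] = 2 // reuses the existing inner map
	fmt.Println(len(data), len(data["n01"]))
}
```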
// HealthCheck queries the external cc-metric-store's health check endpoint.
// It sends a HealthCheckReq as the request body to /api/healthcheck and
// returns the per-node health check results.
func (ccms *CCMetricStore) HealthCheck(cluster string,
nodes []string, metrics []string,
) (map[string]metricstore.HealthCheckResult, error) {
req := metricstore.HealthCheckReq{
Cluster: cluster,
Nodes: nodes,
MetricNames: metrics,
}
buf := &bytes.Buffer{}
if err := json.NewEncoder(buf).Encode(req); err != nil {
cclog.Errorf("Error while encoding health check request body: %s", err.Error())
return nil, err
}
endpoint := fmt.Sprintf("%s/api/healthcheck", ccms.url)
httpReq, err := http.NewRequest(http.MethodGet, endpoint, buf)
if err != nil {
cclog.Errorf("Error while building health check request: %s", err.Error())
return nil, err
}
if ccms.jwt != "" {
httpReq.Header.Add("Authorization", fmt.Sprintf("Bearer %s", ccms.jwt))
}
res, err := ccms.client.Do(httpReq)
if err != nil {
cclog.Errorf("Error while performing health check request: %s", err.Error())
return nil, err
}
defer res.Body.Close()
if res.StatusCode != http.StatusOK {
return nil, fmt.Errorf("'%s': HTTP Status: %s", endpoint, res.Status)
}
var results map[string]metricstore.HealthCheckResult
if err := json.NewDecoder(bufio.NewReader(res.Body)).Decode(&results); err != nil {
cclog.Errorf("Error while decoding health check response: %s", err.Error())
return nil, err
}
return results, nil
}
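HealthCheck is unusual in sending a JSON body with a GET request, which Go's HTTP client permits. A self-contained round-trip against an httptest server illustrates the shape of that exchange; the endpoint path and payload fields here are assumptions modeled on the code above, not the real cc-metric-store API:

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"net/http/httptest"
)

// healthRoundTrip performs a GET with a JSON body and a bearer token against
// a stub server, mimicking the HealthCheck request/response flow.
func healthRoundTrip() (int, string) {
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		var req map[string]any
		_ = json.NewDecoder(r.Body).Decode(&req) // GET bodies are delivered normally
		_ = json.NewEncoder(w).Encode(map[string]string{"n01": "ok"})
	}))
	defer srv.Close()

	buf := &bytes.Buffer{}
	_ = json.NewEncoder(buf).Encode(map[string]any{"cluster": "alex", "nodes": []string{"n01"}})

	httpReq, err := http.NewRequest(http.MethodGet, srv.URL+"/api/healthcheck", buf)
	if err != nil {
		panic(err)
	}
	httpReq.Header.Add("Authorization", "Bearer dummy-jwt")

	res, err := http.DefaultClient.Do(httpReq)
	if err != nil {
		panic(err)
	}
	defer res.Body.Close()

	var results map[string]string
	_ = json.NewDecoder(res.Body).Decode(&results)
	return res.StatusCode, results["n01"]
}

func main() {
	code, status := healthRoundTrip()
	fmt.Println(code, status)
}
```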
// sanitizeStats replaces NaN values in statistics with 0 to enable JSON marshaling.
// encoding/json cannot marshal float64 NaN values, so they must be replaced first.
func sanitizeStats(avg, min, max *schema.Float) {
if avg.IsNaN() || min.IsNaN() || max.IsNaN() {
*avg = schema.Float(0)
*min = schema.Float(0)
*max = schema.Float(0)
}
}
// hasNaNStats returns true if any of the statistics contain NaN values.
func hasNaNStats(avg, min, max schema.Float) bool {
return avg.IsNaN() || min.IsNaN() || max.IsNaN()
}


@@ -12,7 +12,7 @@ import (
"sync"
"time"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/jmoiron/sqlx"
"github.com/mattn/go-sqlite3"
"github.com/qustavo/sqlhooks/v2"
@@ -51,10 +51,14 @@ func setupSqlite(db *sql.DB) error {
return nil
}
func Connect(db string) {
func Connect(driver string, db string) {
var err error
var dbHandle *sqlx.DB
if driver != "sqlite3" {
cclog.Abortf("Unsupported database driver '%s'. Only 'sqlite3' is supported.\n", driver)
}
dbConnOnce.Do(func() {
opts := DatabaseOptions{
URL: db,
@@ -96,7 +100,7 @@ func Connect(db string) {
dbHandle.SetConnMaxLifetime(opts.ConnectionMaxLifetime)
dbHandle.SetConnMaxIdleTime(opts.ConnectionMaxIdleTime)
dbConnInstance = &DBConnection{DB: dbHandle}
dbConnInstance = &DBConnection{DB: dbHandle, Driver: driver}
err = checkDBVersion(dbHandle.DB)
if err != nil {
cclog.Abortf("DB Connection: Failed DB version check.\nError: %s\n", err.Error())
@@ -111,26 +115,3 @@ func GetConnection() *DBConnection {
return dbConnInstance
}
// ResetConnection closes the current database connection and resets the connection state.
// This function is intended for testing purposes only to allow test isolation.
func ResetConnection() error {
if dbConnInstance != nil && dbConnInstance.DB != nil {
if err := dbConnInstance.DB.Close(); err != nil {
return fmt.Errorf("failed to close database connection: %w", err)
}
}
dbConnInstance = nil
dbConnOnce = sync.Once{}
jobRepoInstance = nil
jobRepoOnce = sync.Once{}
nodeRepoInstance = nil
nodeRepoOnce = sync.Once{}
userRepoInstance = nil
userRepoOnce = sync.Once{}
userCfgRepoInstance = nil
userCfgRepoOnce = sync.Once{}
return nil
}
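ResetConnection works by replacing each `sync.Once` with a fresh zero value, so the next `GetConnection`/`Get*Repository` call re-initializes. That reset trick in isolation, a sketch for test-only use:

```go
package main

import (
	"fmt"
	"sync"
)

var (
	initOnce sync.Once
	inits    int
)

// connect initializes at most once per Once instance, like Connect above.
func connect() {
	initOnce.Do(func() { inits++ })
}

// reset swaps in a fresh sync.Once so the next connect() runs again --
// the same technique ResetConnection uses for test isolation.
func reset() {
	initOnce = sync.Once{}
}

func main() {
	connect()
	connect() // no-op: Once already fired
	reset()
	connect() // runs again after reset
	fmt.Println(inits)
}
```

As in the real code, this is only safe when no other goroutine is using the singleton concurrently.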


@@ -2,14 +2,13 @@
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package repository
import (
"context"
"time"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
)
// Hooks satisfies the sqlhook.Hooks interface


@@ -1,274 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package repository
import (
"context"
"testing"
"time"
"github.com/ClusterCockpit/cc-lib/v2/schema"
_ "github.com/mattn/go-sqlite3"
"github.com/stretchr/testify/assert"
"github.com/stretchr/testify/require"
)
type MockJobHook struct {
startCalled bool
stopCalled bool
startJobs []*schema.Job
stopJobs []*schema.Job
}
func (m *MockJobHook) JobStartCallback(job *schema.Job) {
m.startCalled = true
m.startJobs = append(m.startJobs, job)
}
func (m *MockJobHook) JobStopCallback(job *schema.Job) {
m.stopCalled = true
m.stopJobs = append(m.stopJobs, job)
}
func TestRegisterJobHook(t *testing.T) {
t.Run("register single hook", func(t *testing.T) {
hooks = nil
mock := &MockJobHook{}
RegisterJobHook(mock)
assert.NotNil(t, hooks)
assert.Len(t, hooks, 1)
assert.Equal(t, mock, hooks[0])
hooks = nil
})
t.Run("register multiple hooks", func(t *testing.T) {
hooks = nil
mock1 := &MockJobHook{}
mock2 := &MockJobHook{}
RegisterJobHook(mock1)
RegisterJobHook(mock2)
assert.Len(t, hooks, 2)
assert.Equal(t, mock1, hooks[0])
assert.Equal(t, mock2, hooks[1])
hooks = nil
})
t.Run("register nil hook does not add to hooks", func(t *testing.T) {
hooks = nil
RegisterJobHook(nil)
if hooks != nil {
assert.Len(t, hooks, 0, "Nil hook should not be added")
}
hooks = nil
})
}
func TestCallJobStartHooks(t *testing.T) {
t.Run("call start hooks with single job", func(t *testing.T) {
hooks = nil
mock := &MockJobHook{}
RegisterJobHook(mock)
job := &schema.Job{
JobID: 123,
User: "testuser",
Cluster: "testcluster",
}
CallJobStartHooks([]*schema.Job{job})
assert.True(t, mock.startCalled)
assert.False(t, mock.stopCalled)
assert.Len(t, mock.startJobs, 1)
assert.Equal(t, int64(123), mock.startJobs[0].JobID)
hooks = nil
})
t.Run("call start hooks with multiple jobs", func(t *testing.T) {
hooks = nil
mock := &MockJobHook{}
RegisterJobHook(mock)
jobs := []*schema.Job{
{JobID: 1, User: "user1", Cluster: "cluster1"},
{JobID: 2, User: "user2", Cluster: "cluster2"},
{JobID: 3, User: "user3", Cluster: "cluster3"},
}
CallJobStartHooks(jobs)
assert.True(t, mock.startCalled)
assert.Len(t, mock.startJobs, 3)
assert.Equal(t, int64(1), mock.startJobs[0].JobID)
assert.Equal(t, int64(2), mock.startJobs[1].JobID)
assert.Equal(t, int64(3), mock.startJobs[2].JobID)
hooks = nil
})
t.Run("call start hooks with multiple registered hooks", func(t *testing.T) {
hooks = nil
mock1 := &MockJobHook{}
mock2 := &MockJobHook{}
RegisterJobHook(mock1)
RegisterJobHook(mock2)
job := &schema.Job{
JobID: 456, User: "testuser", Cluster: "testcluster",
}
CallJobStartHooks([]*schema.Job{job})
assert.True(t, mock1.startCalled)
assert.True(t, mock2.startCalled)
assert.Len(t, mock1.startJobs, 1)
assert.Len(t, mock2.startJobs, 1)
hooks = nil
})
t.Run("call start hooks with nil hooks", func(t *testing.T) {
hooks = nil
job := &schema.Job{
JobID: 789, User: "testuser", Cluster: "testcluster",
}
CallJobStartHooks([]*schema.Job{job})
hooks = nil
})
t.Run("call start hooks with empty job list", func(t *testing.T) {
hooks = nil
mock := &MockJobHook{}
RegisterJobHook(mock)
CallJobStartHooks([]*schema.Job{})
assert.False(t, mock.startCalled)
assert.Len(t, mock.startJobs, 0)
hooks = nil
})
}
func TestCallJobStopHooks(t *testing.T) {
t.Run("call stop hooks with single job", func(t *testing.T) {
hooks = nil
mock := &MockJobHook{}
RegisterJobHook(mock)
job := &schema.Job{
JobID: 123,
User: "testuser",
Cluster: "testcluster",
}
CallJobStopHooks(job)
assert.True(t, mock.stopCalled)
assert.False(t, mock.startCalled)
assert.Len(t, mock.stopJobs, 1)
assert.Equal(t, int64(123), mock.stopJobs[0].JobID)
hooks = nil
})
t.Run("call stop hooks with multiple registered hooks", func(t *testing.T) {
hooks = nil
mock1 := &MockJobHook{}
mock2 := &MockJobHook{}
RegisterJobHook(mock1)
RegisterJobHook(mock2)
job := &schema.Job{
JobID: 456, User: "testuser", Cluster: "testcluster",
}
CallJobStopHooks(job)
assert.True(t, mock1.stopCalled)
assert.True(t, mock2.stopCalled)
assert.Len(t, mock1.stopJobs, 1)
assert.Len(t, mock2.stopJobs, 1)
hooks = nil
})
t.Run("call stop hooks with nil hooks", func(t *testing.T) {
hooks = nil
job := &schema.Job{
JobID: 789, User: "testuser", Cluster: "testcluster",
}
CallJobStopHooks(job)
hooks = nil
})
}
func TestSQLHooks(t *testing.T) {
_ = setup(t)
t.Run("hooks log queries in debug mode", func(t *testing.T) {
h := &Hooks{}
ctx := context.Background()
query := "SELECT * FROM job WHERE job_id = ?"
args := []any{123}
ctxWithTime, err := h.Before(ctx, query, args...)
require.NoError(t, err)
assert.NotNil(t, ctxWithTime)
beginTime := ctxWithTime.Value("begin")
require.NotNil(t, beginTime)
_, ok := beginTime.(time.Time)
assert.True(t, ok, "Begin time should be time.Time")
time.Sleep(10 * time.Millisecond)
ctxAfter, err := h.After(ctxWithTime, query, args...)
require.NoError(t, err)
assert.NotNil(t, ctxAfter)
})
}
func TestHookIntegration(t *testing.T) {
t.Run("hooks are called during job lifecycle", func(t *testing.T) {
hooks = nil
mock := &MockJobHook{}
RegisterJobHook(mock)
job := &schema.Job{
JobID: 999,
User: "integrationuser",
Cluster: "integrationcluster",
}
CallJobStartHooks([]*schema.Job{job})
assert.True(t, mock.startCalled)
assert.Equal(t, 1, len(mock.startJobs))
CallJobStopHooks(job)
assert.True(t, mock.stopCalled)
assert.Equal(t, 1, len(mock.stopJobs))
assert.Equal(t, mock.startJobs[0].JobID, mock.stopJobs[0].JobID)
hooks = nil
})
}


@@ -66,47 +66,31 @@ import (
"fmt"
"maps"
"math"
"sort"
"strconv"
"sync"
"time"
"github.com/ClusterCockpit/cc-backend/pkg/archive"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/lrucache"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/lrucache"
"github.com/ClusterCockpit/cc-lib/schema"
sq "github.com/Masterminds/squirrel"
"github.com/jmoiron/sqlx"
)
var (
// jobRepoOnce ensures singleton initialization of the JobRepository
jobRepoOnce sync.Once
// jobRepoInstance holds the single instance of JobRepository
jobRepoOnce sync.Once
jobRepoInstance *JobRepository
)
// JobRepository provides database access for job-related operations.
// It implements the repository pattern to abstract database interactions
// and provides caching for improved performance.
//
// The repository is a singleton initialized via GetJobRepository().
// All database queries use prepared statements via stmtCache for efficiency.
// Frequently accessed data (metadata, energy footprints) is cached in an LRU cache.
type JobRepository struct {
DB *sqlx.DB // Database connection pool
stmtCache *sq.StmtCache // Prepared statement cache for query optimization
cache *lrucache.Cache // LRU cache for metadata and footprint data
driver string // Database driver name (e.g., "sqlite3")
Mutex sync.Mutex // Mutex for thread-safe operations
DB *sqlx.DB
stmtCache *sq.StmtCache
cache *lrucache.Cache
driver string
Mutex sync.Mutex
}
// GetJobRepository returns the singleton instance of JobRepository.
// The repository is initialized lazily on first access with database connection,
// prepared statement cache, and LRU cache configured from repoConfig.
//
// This function is thread-safe and ensures only one instance is created.
// It must be called after Connect() has established a database connection.
func GetJobRepository() *JobRepository {
jobRepoOnce.Do(func() {
db := GetConnection()
@@ -122,8 +106,6 @@ func GetJobRepository() *JobRepository {
return jobRepoInstance
}
// jobColumns defines the standard set of columns selected from the job table.
// Used consistently across all job queries to ensure uniform data retrieval.
var jobColumns []string = []string{
"job.id", "job.job_id", "job.hpc_user", "job.project", "job.cluster", "job.subcluster",
"job.start_time", "job.cluster_partition", "job.array_job_id", "job.num_nodes",
@@ -132,8 +114,6 @@ var jobColumns []string = []string{
"job.footprint", "job.energy",
}
// jobCacheColumns defines columns from the job_cache table, mirroring jobColumns.
// Used for queries against cached job data for performance optimization.
var jobCacheColumns []string = []string{
"job_cache.id", "job_cache.job_id", "job_cache.hpc_user", "job_cache.project", "job_cache.cluster",
"job_cache.subcluster", "job_cache.start_time", "job_cache.cluster_partition",
@@ -143,14 +123,6 @@ var jobCacheColumns []string = []string{
"job_cache.footprint", "job_cache.energy",
}
// scanJob converts a database row into a schema.Job struct.
// It handles JSON unmarshaling of resources and footprint fields,
// and calculates accurate duration for running jobs.
//
// Parameters:
// - row: Database row implementing Scan() interface (sql.Row or sql.Rows)
//
// Returns the populated Job struct or an error if scanning or unmarshaling fails.
func scanJob(row interface{ Scan(...any) error }) (*schema.Job, error) {
job := &schema.Job{}
@@ -183,53 +155,29 @@ func scanJob(row interface{ Scan(...any) error }) (*schema.Job, error) {
return job, nil
}
// Optimize performs database optimization by running VACUUM command.
// This reclaims unused space and defragments the database file.
// Should be run periodically during maintenance windows.
func (r *JobRepository) Optimize() error {
if _, err := r.DB.Exec(`VACUUM`); err != nil {
cclog.Errorf("Error while executing VACUUM: %v", err)
return fmt.Errorf("failed to optimize database: %w", err)
return err
}
return nil
}
// Flush removes all data from job-related tables (jobtag, tag, job).
// WARNING: This is a destructive operation that deletes all job data.
// Use with extreme caution, typically only for testing or complete resets.
func (r *JobRepository) Flush() error {
if _, err := r.DB.Exec(`DELETE FROM jobtag`); err != nil {
cclog.Errorf("Error while deleting from jobtag table: %v", err)
return fmt.Errorf("failed to flush jobtag table: %w", err)
return err
}
if _, err := r.DB.Exec(`DELETE FROM tag`); err != nil {
cclog.Errorf("Error while deleting from tag table: %v", err)
return fmt.Errorf("failed to flush tag table: %w", err)
return err
}
if _, err := r.DB.Exec(`DELETE FROM job`); err != nil {
cclog.Errorf("Error while deleting from job table: %v", err)
return fmt.Errorf("failed to flush job table: %w", err)
return err
}
return nil
}
// FetchMetadata retrieves and unmarshals the metadata JSON for a job.
// Metadata is cached with a 24-hour TTL to improve performance.
//
// The metadata field stores arbitrary key-value pairs associated with a job,
// such as tags, labels, or custom attributes added by external systems.
//
// Parameters:
// - job: Job struct with valid ID field, metadata will be populated in job.MetaData
//
// Returns the metadata map or an error if the job is nil or database query fails.
func (r *JobRepository) FetchMetadata(job *schema.Job) (map[string]string, error) {
if job == nil {
return nil, fmt.Errorf("job cannot be nil")
}
start := time.Now()
cachekey := fmt.Sprintf("metadata:%d", *job.ID)
cachekey := fmt.Sprintf("metadata:%d", job.ID)
if cached := r.cache.Get(cachekey, nil); cached != nil {
job.MetaData = cached.(map[string]string)
return job.MetaData, nil
@@ -237,8 +185,8 @@ func (r *JobRepository) FetchMetadata(job *schema.Job) (map[string]string, error
if err := sq.Select("job.meta_data").From("job").Where("job.id = ?", job.ID).
RunWith(r.stmtCache).QueryRow().Scan(&job.RawMetaData); err != nil {
cclog.Warnf("Error while scanning for job metadata (ID=%d): %v", *job.ID, err)
return nil, fmt.Errorf("failed to fetch metadata for job %d: %w", *job.ID, err)
cclog.Warn("Error while scanning for job metadata")
return nil, err
}
if len(job.RawMetaData) == 0 {
@@ -246,8 +194,8 @@ func (r *JobRepository) FetchMetadata(job *schema.Job) (map[string]string, error
}
if err := json.Unmarshal(job.RawMetaData, &job.MetaData); err != nil {
cclog.Warnf("Error while unmarshaling raw metadata json (ID=%d): %v", *job.ID, err)
return nil, fmt.Errorf("failed to unmarshal metadata for job %d: %w", *job.ID, err)
cclog.Warn("Error while unmarshaling raw metadata json")
return nil, err
}
r.cache.Put(cachekey, job.MetaData, len(job.RawMetaData), 24*time.Hour)
@@ -255,27 +203,13 @@ func (r *JobRepository) FetchMetadata(job *schema.Job) (map[string]string, error
return job.MetaData, nil
}
// UpdateMetadata adds or updates a single metadata key-value pair for a job.
// The entire metadata map is re-marshaled and stored, and the cache is invalidated.
// Also triggers archive metadata update via archive.UpdateMetadata.
//
// Parameters:
// - job: Job struct with valid ID, existing metadata will be fetched if not present
// - key: Metadata key to set
// - val: Metadata value to set
//
// Returns an error if the job is nil, metadata fetch fails, or database update fails.
func (r *JobRepository) UpdateMetadata(job *schema.Job, key, val string) (err error) {
if job == nil {
return fmt.Errorf("job cannot be nil")
}
cachekey := fmt.Sprintf("metadata:%d", *job.ID)
cachekey := fmt.Sprintf("metadata:%d", job.ID)
r.cache.Del(cachekey)
if job.MetaData == nil {
if _, err = r.FetchMetadata(job); err != nil {
cclog.Warnf("Error while fetching metadata for job, DB ID '%v'", *job.ID)
return fmt.Errorf("failed to fetch metadata for job %d: %w", *job.ID, err)
cclog.Warnf("Error while fetching metadata for job, DB ID '%v'", job.ID)
return err
}
}
@@ -289,43 +223,29 @@ func (r *JobRepository) UpdateMetadata(job *schema.Job, key, val string) (err er
}
if job.RawMetaData, err = json.Marshal(job.MetaData); err != nil {
cclog.Warnf("Error while marshaling metadata for job, DB ID '%v'", *job.ID)
return fmt.Errorf("failed to marshal metadata for job %d: %w", *job.ID, err)
cclog.Warnf("Error while marshaling metadata for job, DB ID '%v'", job.ID)
return err
}
if _, err = sq.Update("job").
Set("meta_data", job.RawMetaData).
Where("job.id = ?", job.ID).
RunWith(r.stmtCache).Exec(); err != nil {
cclog.Warnf("Error while updating metadata for job, DB ID '%v'", *job.ID)
return fmt.Errorf("failed to update metadata in database for job %d: %w", *job.ID, err)
cclog.Warnf("Error while updating metadata for job, DB ID '%v'", job.ID)
return err
}
r.cache.Put(cachekey, job.MetaData, len(job.RawMetaData), 24*time.Hour)
return archive.UpdateMetadata(job, job.MetaData)
}
// FetchFootprint retrieves and unmarshals the performance footprint JSON for a job.
// Unlike FetchMetadata, footprints are NOT cached as they can be large and change frequently.
//
// The footprint contains summary statistics (avg/min/max) for monitored metrics,
// stored as JSON with keys like "cpu_load_avg", "mem_used_max", etc.
//
// Parameters:
// - job: Job struct with valid ID, footprint will be populated in job.Footprint
//
// Returns the footprint map or an error if the job is nil or database query fails.
func (r *JobRepository) FetchFootprint(job *schema.Job) (map[string]float64, error) {
if job == nil {
return nil, fmt.Errorf("job cannot be nil")
}
start := time.Now()
if err := sq.Select("job.footprint").From("job").Where("job.id = ?", job.ID).
RunWith(r.stmtCache).QueryRow().Scan(&job.RawFootprint); err != nil {
cclog.Warnf("Error while scanning for job footprint (ID=%d): %v", *job.ID, err)
return nil, fmt.Errorf("failed to fetch footprint for job %d: %w", *job.ID, err)
cclog.Warn("Error while scanning for job footprint")
return nil, err
}
if len(job.RawFootprint) == 0 {
@@ -333,31 +253,17 @@ func (r *JobRepository) FetchFootprint(job *schema.Job) (map[string]float64, err
}
if err := json.Unmarshal(job.RawFootprint, &job.Footprint); err != nil {
cclog.Warnf("Error while unmarshaling raw footprint json (ID=%d): %v", *job.ID, err)
return nil, fmt.Errorf("failed to unmarshal footprint for job %d: %w", *job.ID, err)
cclog.Warn("Error while unmarshaling raw footprint json")
return nil, err
}
cclog.Debugf("Timer FetchFootprint %s", time.Since(start))
return job.Footprint, nil
}
// FetchEnergyFootprint retrieves and unmarshals the energy footprint JSON for a job.
// Energy footprints are cached with a 24-hour TTL as they are frequently accessed but rarely change.
//
// The energy footprint contains calculated energy consumption (in kWh) per metric,
// stored as JSON with keys like "power_avg", "acc_power_avg", etc.
//
// Parameters:
// - job: Job struct with valid ID, energy footprint will be populated in job.EnergyFootprint
//
// Returns the energy footprint map or an error if the job is nil or database query fails.
func (r *JobRepository) FetchEnergyFootprint(job *schema.Job) (map[string]float64, error) {
if job == nil {
return nil, fmt.Errorf("job cannot be nil")
}
start := time.Now()
cachekey := fmt.Sprintf("energyFootprint:%d", *job.ID)
cachekey := fmt.Sprintf("energyFootprint:%d", job.ID)
if cached := r.cache.Get(cachekey, nil); cached != nil {
job.EnergyFootprint = cached.(map[string]float64)
return job.EnergyFootprint, nil
@@ -365,8 +271,8 @@ func (r *JobRepository) FetchEnergyFootprint(job *schema.Job) (map[string]float6
if err := sq.Select("job.energy_footprint").From("job").Where("job.id = ?", job.ID).
RunWith(r.stmtCache).QueryRow().Scan(&job.RawEnergyFootprint); err != nil {
cclog.Warnf("Error while scanning for job energy_footprint (ID=%d): %v", *job.ID, err)
return nil, fmt.Errorf("failed to fetch energy footprint for job %d: %w", *job.ID, err)
cclog.Warn("Error while scanning for job energy_footprint")
return nil, err
}
if len(job.RawEnergyFootprint) == 0 {
@@ -374,8 +280,8 @@ func (r *JobRepository) FetchEnergyFootprint(job *schema.Job) (map[string]float6
}
if err := json.Unmarshal(job.RawEnergyFootprint, &job.EnergyFootprint); err != nil {
cclog.Warnf("Error while unmarshaling raw energy footprint json (ID=%d): %v", *job.ID, err)
return nil, fmt.Errorf("failed to unmarshal energy footprint for job %d: %w", *job.ID, err)
cclog.Warn("Error while unmarshaling raw energy footprint json")
return nil, err
}
r.cache.Put(cachekey, job.EnergyFootprint, len(job.EnergyFootprint), 24*time.Hour)
@@ -383,28 +289,12 @@ func (r *JobRepository) FetchEnergyFootprint(job *schema.Job) (map[string]float6
return job.EnergyFootprint, nil
}
// DeleteJobsBefore removes jobs older than the specified start time.
// Optionally preserves tagged jobs to protect important data from deletion.
// Cache entries for deleted jobs are automatically invalidated.
//
// This is typically used for data retention policies and cleanup operations.
// WARNING: This is a destructive operation that permanently deletes job records.
//
// Parameters:
// - startTime: Unix timestamp, jobs with start_time < this value will be deleted
// - omitTagged: "none" = delete all jobs, "all" = skip any tagged jobs,
// "user" = skip jobs with user-created tags (not auto-tagger types "app"/"jobClass")
//
// Returns the count of deleted jobs or an error if the operation fails.
func (r *JobRepository) DeleteJobsBefore(startTime int64, omitTagged string) (int, error) {
func (r *JobRepository) DeleteJobsBefore(startTime int64, omitTagged bool) (int, error) {
var cnt int
q := sq.Select("count(*)").From("job").Where("job.start_time < ?", startTime)
switch omitTagged {
case "all":
if omitTagged {
q = q.Where("NOT EXISTS (SELECT 1 FROM jobtag WHERE jobtag.job_id = job.id)")
case "user":
q = q.Where("NOT EXISTS (SELECT 1 FROM jobtag JOIN tag ON tag.id = jobtag.tag_id WHERE jobtag.job_id = job.id AND tag.tag_type NOT IN ('app', 'jobClass'))")
}
if err := q.RunWith(r.DB).QueryRow().Scan(&cnt); err != nil {
@@ -417,11 +307,8 @@ func (r *JobRepository) DeleteJobsBefore(startTime int64, omitTagged string) (in
var jobIds []int64
selectQuery := sq.Select("id").From("job").Where("job.start_time < ?", startTime)
switch omitTagged {
case "all":
if omitTagged {
selectQuery = selectQuery.Where("NOT EXISTS (SELECT 1 FROM jobtag WHERE jobtag.job_id = job.id)")
case "user":
selectQuery = selectQuery.Where("NOT EXISTS (SELECT 1 FROM jobtag JOIN tag ON tag.id = jobtag.tag_id WHERE jobtag.job_id = job.id AND tag.tag_type NOT IN ('app', 'jobClass'))")
}
rows, err := selectQuery.RunWith(r.DB).Query()
@@ -443,11 +330,8 @@ func (r *JobRepository) DeleteJobsBefore(startTime int64, omitTagged string) (in
qd := sq.Delete("job").Where("job.start_time < ?", startTime)
switch omitTagged {
case "all":
if omitTagged {
qd = qd.Where("NOT EXISTS (SELECT 1 FROM jobtag WHERE jobtag.job_id = job.id)")
case "user":
qd = qd.Where("NOT EXISTS (SELECT 1 FROM jobtag JOIN tag ON tag.id = jobtag.tag_id WHERE jobtag.job_id = job.id AND tag.tag_type NOT IN ('app', 'jobClass'))")
}
_, err := qd.RunWith(r.DB).Exec()
@@ -460,13 +344,6 @@ func (r *JobRepository) DeleteJobsBefore(startTime int64, omitTagged string) (in
return cnt, err
}
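The three `omitTagged` modes above each map to a different `NOT EXISTS` sub-query. A minimal, self-contained sketch of that selection (the helper name `omitTaggedClause` is illustrative, not part of the repository API; the SQL strings are taken from the switch above):

```go
package main

import "fmt"

// omitTaggedClause mirrors the omitTagged switch: "all" skips any tagged
// job, "user" skips only jobs with user-created tags (auto-tagger types
// "app"/"jobClass" are ignored), and "none" keeps every job.
func omitTaggedClause(mode string) string {
	switch mode {
	case "all":
		return "NOT EXISTS (SELECT 1 FROM jobtag WHERE jobtag.job_id = job.id)"
	case "user":
		return "NOT EXISTS (SELECT 1 FROM jobtag JOIN tag ON tag.id = jobtag.tag_id WHERE jobtag.job_id = job.id AND tag.tag_type NOT IN ('app', 'jobClass'))"
	default: // "none"
		return ""
	}
}

func main() {
	// An empty clause means no filter is appended to the builder.
	fmt.Println(omitTaggedClause("all"))
	fmt.Println(omitTaggedClause("none") == "")
}
```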
// DeleteJobByID permanently removes a single job by its database ID.
// Cache entries for the deleted job are automatically invalidated.
//
// Parameters:
// - id: Database ID (primary key) of the job to delete
//
// Returns an error if the deletion fails.
func (r *JobRepository) DeleteJobByID(id int64) error {
// Invalidate cache entries before deletion
r.cache.Del(fmt.Sprintf("metadata:%d", id))
@@ -484,29 +361,7 @@ func (r *JobRepository) DeleteJobByID(id int64) error {
return err
}
// FindUserOrProjectOrJobname attempts to interpret a search term as a job ID,
// username, project ID, or job name by querying the database.
//
// Search logic (in priority order):
// 1. If searchterm is numeric, treat as job ID (returned immediately)
// 2. Try exact match in job.hpc_user column (username)
// 3. Try LIKE match in hpc_user.name column (real name)
// 4. Try exact match in job.project column (project ID)
// 5. If no matches, return searchterm as jobname for GraphQL query
//
// This powers the searchbar functionality for flexible job searching.
// Requires authenticated user for database lookups (returns empty if user is nil).
//
// Parameters:
// - user: Authenticated user context, required for database access
// - searchterm: Search string to interpret
//
// Returns up to one non-empty value among (jobid, username, project, jobname).
func (r *JobRepository) FindUserOrProjectOrJobname(user *schema.User, searchterm string) (jobid string, username string, project string, jobname string) {
if searchterm == "" {
return "", "", "", ""
}
if _, err := strconv.Atoi(searchterm); err == nil { // Return searchterm as jobid on successful conversion: parent method will redirect for integer jobId
return searchterm, "", "", ""
} else { // Has to have letters and logged-in user for other guesses
@@ -537,24 +392,7 @@ var (
ErrForbidden = errors.New("not authorized")
)
// FindColumnValue performs a generic column lookup in a database table with role-based access control.
// Only users with admin, support, or manager roles can execute this query.
//
// Parameters:
// - user: User context for authorization check
// - searchterm: Value to search for (exact match or LIKE pattern)
// - table: Database table name to query
// - selectColumn: Column name to return in results
// - whereColumn: Column name to filter on
// - isLike: If true, use LIKE with wildcards; if false, use exact equality
//
// Returns the first matching value, ErrForbidden if user lacks permission,
// or ErrNotFound if no matches are found.
func (r *JobRepository) FindColumnValue(user *schema.User, searchterm string, table string, selectColumn string, whereColumn string, isLike bool) (result string, err error) {
if user == nil {
return "", fmt.Errorf("user cannot be nil")
}
compareStr := " = ?"
query := searchterm
if isLike {
@@ -565,11 +403,17 @@ func (r *JobRepository) FindColumnValue(user *schema.User, searchterm string, ta
theQuery := sq.Select(table+"."+selectColumn).Distinct().From(table).
Where(table+"."+whereColumn+compareStr, query)
// theSql, args, theErr := theQuery.ToSql()
// if theErr != nil {
// cclog.Warn("Error while converting query to sql")
// return "", err
// }
// cclog.Debugf("SQL query (FindColumnValue): `%s`, args: %#v", theSql, args)
err := theQuery.RunWith(r.stmtCache).QueryRow().Scan(&result)
if err != nil && err != sql.ErrNoRows {
cclog.Warnf("Error while querying FindColumnValue (table=%s, column=%s): %v", table, selectColumn, err)
return "", fmt.Errorf("failed to find column value: %w", err)
return "", err
} else if err == nil {
return result, nil
}
@@ -580,40 +424,22 @@ func (r *JobRepository) FindColumnValue(user *schema.User, searchterm string, ta
}
}
// FindColumnValues performs a generic column lookup returning multiple matches with role-based access control.
// Similar to FindColumnValue but returns all matching values instead of just the first.
// Only users with admin, support, or manager roles can execute this query.
//
// Parameters:
// - user: User context for authorization check
// - query: Search pattern (always uses LIKE with wildcards)
// - table: Database table name to query
// - selectColumn: Column name to return in results
// - whereColumn: Column name to filter on
//
// Returns a slice of matching values, ErrForbidden if user lacks permission,
// or ErrNotFound if no matches are found.
func (r *JobRepository) FindColumnValues(user *schema.User, query string, table string, selectColumn string, whereColumn string) (results []string, err error) {
if user == nil {
return nil, fmt.Errorf("user cannot be nil")
}
emptyResult := make([]string, 0)
if user.HasAnyRole([]schema.Role{schema.RoleAdmin, schema.RoleSupport, schema.RoleManager}) {
rows, err := sq.Select(table+"."+selectColumn).Distinct().From(table).
Where(table+"."+whereColumn+" LIKE ?", fmt.Sprint("%", query, "%")).
RunWith(r.stmtCache).Query()
if err != nil && err != sql.ErrNoRows {
cclog.Errorf("Error while querying FindColumnValues (table=%s, column=%s): %v", table, selectColumn, err)
return emptyResult, fmt.Errorf("failed to find column values: %w", err)
return emptyResult, err
} else if err == nil {
defer rows.Close()
for rows.Next() {
var result string
err := rows.Scan(&result)
if err != nil {
cclog.Warnf("Error while scanning rows in FindColumnValues: %v", err)
return emptyResult, fmt.Errorf("failed to scan column value: %w", err)
rows.Close()
cclog.Warnf("Error while scanning rows: %v", err)
return emptyResult, err
}
results = append(results, result)
}
@@ -627,13 +453,6 @@ func (r *JobRepository) FindColumnValues(user *schema.User, query string, table
}
}
// Partitions returns a list of distinct cluster partitions for a given cluster.
// Results are cached with a 1-hour TTL to improve performance.
//
// Parameters:
// - cluster: Cluster name to query partitions for
//
// Returns a slice of partition names or an error if the database query fails.
func (r *JobRepository) Partitions(cluster string) ([]string, error) {
var err error
start := time.Now()
@@ -662,8 +481,8 @@ func (r *JobRepository) AllocatedNodes(cluster string) (map[string]map[string]in
Where("job.cluster = ?", cluster).
RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running AllocatedNodes query for cluster=%s: %v", cluster, err)
return nil, fmt.Errorf("failed to query allocated nodes for cluster %s: %w", cluster, err)
cclog.Error("Error while running query")
return nil, err
}
var raw []byte
@@ -673,12 +492,12 @@ func (r *JobRepository) AllocatedNodes(cluster string) (map[string]map[string]in
var resources []*schema.Resource
var subcluster string
if err := rows.Scan(&raw, &subcluster); err != nil {
cclog.Warnf("Error while scanning rows in AllocatedNodes: %v", err)
return nil, fmt.Errorf("failed to scan allocated nodes row: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
if err := json.Unmarshal(raw, &resources); err != nil {
cclog.Warnf("Error while unmarshaling raw resources json in AllocatedNodes: %v", err)
return nil, fmt.Errorf("failed to unmarshal resources in AllocatedNodes: %w", err)
cclog.Warn("Error while unmarshaling raw resources json")
return nil, err
}
hosts, ok := subclusters[subcluster]
@@ -696,19 +515,7 @@ func (r *JobRepository) AllocatedNodes(cluster string) (map[string]map[string]in
return subclusters, nil
}
// StopJobsExceedingWalltimeBy marks running jobs as failed if they exceed their walltime limit.
// This is typically called periodically to clean up stuck or orphaned jobs.
//
// Jobs are marked with:
// - monitoring_status: MonitoringStatusArchivingFailed
// - duration: 0
// - job_state: JobStateFailed
//
// Parameters:
// - seconds: Grace period beyond walltime before marking as failed
//
// Returns an error if the database update fails.
// Logs the number of jobs marked as failed if any were affected.
// FIXME: Set duration to requested walltime?
func (r *JobRepository) StopJobsExceedingWalltimeBy(seconds int) error {
start := time.Now()
currentTime := time.Now().Unix()
@@ -721,14 +528,14 @@ func (r *JobRepository) StopJobsExceedingWalltimeBy(seconds int) error {
Where("(? - job.start_time) > (job.walltime + ?)", currentTime, seconds).
RunWith(r.DB).Exec()
if err != nil {
cclog.Warnf("Error while stopping jobs exceeding walltime: %v", err)
return fmt.Errorf("failed to stop jobs exceeding walltime: %w", err)
cclog.Warn("Error while stopping jobs exceeding walltime")
return err
}
rowsAffected, err := res.RowsAffected()
if err != nil {
cclog.Warnf("Error while fetching affected rows after stopping due to exceeded walltime: %v", err)
return fmt.Errorf("failed to get rows affected count: %w", err)
cclog.Warn("Error while fetching affected rows after stopping due to exceeded walltime")
return err
}
if rowsAffected > 0 {
@@ -738,31 +545,24 @@ func (r *JobRepository) StopJobsExceedingWalltimeBy(seconds int) error {
return nil
}
// FindJobIdsByTag returns all job database IDs associated with a specific tag.
//
// Parameters:
// - tagID: Database ID of the tag to search for
//
// Returns a slice of job IDs or an error if the query fails.
func (r *JobRepository) FindJobIdsByTag(tagID int64) ([]int64, error) {
query := sq.Select("job.id").From("job").
Join("jobtag ON jobtag.job_id = job.id").
Where(sq.Eq{"jobtag.tag_id": tagID}).Distinct()
rows, err := query.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running FindJobIdsByTag query for tagID=%d: %v", tagID, err)
return nil, fmt.Errorf("failed to find job IDs by tag %d: %w", tagID, err)
cclog.Error("Error while running query")
return nil, err
}
defer rows.Close()
jobIds := make([]int64, 0, 100)
for rows.Next() {
var jobID int64
if err := rows.Scan(&jobID); err != nil {
cclog.Warnf("Error while scanning rows in FindJobIdsByTag: %v", err)
return nil, fmt.Errorf("failed to scan job ID in FindJobIdsByTag: %w", err)
rows.Close()
cclog.Warn("Error while scanning rows")
return nil, err
}
jobIds = append(jobIds, jobID)
@@ -771,13 +571,7 @@ func (r *JobRepository) FindJobIdsByTag(tagID int64) ([]int64, error) {
return jobIds, nil
}
// FindRunningJobs returns all currently running jobs for a specific cluster.
// Filters out short-running jobs based on repoConfig.MinRunningJobDuration threshold.
//
// Parameters:
// - cluster: Cluster name to filter jobs
//
// Returns a slice of running job objects or an error if the query fails.
// FIXME: Reconsider filtering short jobs with hardcoded threshold
func (r *JobRepository) FindRunningJobs(cluster string) ([]*schema.Job, error) {
query := sq.Select(jobColumns...).From("job").
Where("job.cluster = ?", cluster).
@@ -786,8 +580,8 @@ func (r *JobRepository) FindRunningJobs(cluster string) ([]*schema.Job, error) {
rows, err := query.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running FindRunningJobs query for cluster=%s: %v", cluster, err)
return nil, fmt.Errorf("failed to find running jobs for cluster %s: %w", cluster, err)
cclog.Error("Error while running query")
return nil, err
}
defer rows.Close()
@@ -795,22 +589,16 @@ func (r *JobRepository) FindRunningJobs(cluster string) ([]*schema.Job, error) {
for rows.Next() {
job, err := scanJob(rows)
if err != nil {
cclog.Warnf("Error while scanning rows in FindRunningJobs: %v", err)
return nil, fmt.Errorf("failed to scan job in FindRunningJobs: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
jobs = append(jobs, job)
}
cclog.Debugf("JobRepository.FindRunningJobs(): Return job count %d (cluster: %s)", len(jobs), cluster)
cclog.Infof("Return job count %d", len(jobs))
return jobs, nil
}
// UpdateDuration recalculates and updates the duration field for all running jobs.
// Called periodically to keep job durations current without querying individual jobs.
//
// Duration is calculated as: current_time - job.start_time
//
// Returns an error if the database update fails.
func (r *JobRepository) UpdateDuration() error {
stmnt := sq.Update("job").
Set("duration", sq.Expr("? - job.start_time", time.Now().Unix())).
@@ -818,25 +606,13 @@ func (r *JobRepository) UpdateDuration() error {
_, err := stmnt.RunWith(r.stmtCache).Exec()
if err != nil {
cclog.Errorf("Error while updating duration for running jobs: %v", err)
return fmt.Errorf("failed to update duration for running jobs: %w", err)
return err
}
return nil
}
// FindJobsBetween returns jobs within a specified time range.
// If startTimeBegin is 0, returns all jobs before startTimeEnd.
// Optionally excludes tagged jobs from results.
//
// Parameters:
// - startTimeBegin: Unix timestamp for range start (use 0 for unbounded start)
// - startTimeEnd: Unix timestamp for range end
// - omitTagged: "none" = include all jobs, "all" = exclude any tagged jobs,
// "user" = exclude jobs with user-created tags (not auto-tagger types "app"/"jobClass")
//
// Returns a slice of jobs or an error if the time range is invalid or query fails.
func (r *JobRepository) FindJobsBetween(startTimeBegin int64, startTimeEnd int64, omitTagged string) ([]*schema.Job, error) {
func (r *JobRepository) FindJobsBetween(startTimeBegin int64, startTimeEnd int64, omitTagged bool) ([]*schema.Job, error) {
var query sq.SelectBuilder
if startTimeBegin == startTimeEnd || startTimeBegin > startTimeEnd {
@@ -851,19 +627,14 @@ func (r *JobRepository) FindJobsBetween(startTimeBegin int64, startTimeEnd int64
query = sq.Select(jobColumns...).From("job").Where("job.start_time BETWEEN ? AND ?", startTimeBegin, startTimeEnd)
}
switch omitTagged {
case "all":
if omitTagged {
query = query.Where("NOT EXISTS (SELECT 1 FROM jobtag WHERE jobtag.job_id = job.id)")
case "user":
query = query.Where("NOT EXISTS (SELECT 1 FROM jobtag JOIN tag ON tag.id = jobtag.tag_id WHERE jobtag.job_id = job.id AND tag.tag_type NOT IN ('app', 'jobClass'))")
}
query = query.OrderBy("job.cluster ASC", "job.subcluster ASC", "job.project ASC", "job.start_time ASC")
rows, err := query.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running FindJobsBetween query: %v", err)
return nil, fmt.Errorf("failed to find jobs between %d and %d: %w", startTimeBegin, startTimeEnd, err)
cclog.Error("Error while running query")
return nil, err
}
defer rows.Close()
@@ -871,24 +642,16 @@ func (r *JobRepository) FindJobsBetween(startTimeBegin int64, startTimeEnd int64
for rows.Next() {
job, err := scanJob(rows)
if err != nil {
cclog.Warnf("Error while scanning rows in FindJobsBetween: %v", err)
return nil, fmt.Errorf("failed to scan job in FindJobsBetween: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
jobs = append(jobs, job)
}
cclog.Debugf("JobRepository.FindJobsBetween(): Return job count %d (omitTagged: %v)", len(jobs), omitTagged)
cclog.Infof("Return job count %d", len(jobs))
return jobs, nil
}
// UpdateMonitoringStatus updates the monitoring status for a job and invalidates its cache entries.
// Cache invalidation affects both metadata and energy footprint to ensure consistency.
//
// Parameters:
// - job: Database ID of the job to update
// - monitoringStatus: New monitoring status value (see schema.MonitoringStatus constants)
//
// Returns an error if the database update fails.
func (r *JobRepository) UpdateMonitoringStatus(job int64, monitoringStatus int32) (err error) {
// Invalidate cache entries as monitoring status affects job state
r.cache.Del(fmt.Sprintf("metadata:%d", job))
@@ -898,37 +661,18 @@ func (r *JobRepository) UpdateMonitoringStatus(job int64, monitoringStatus int32
Set("monitoring_status", monitoringStatus).
Where("job.id = ?", job)
if _, err = stmt.RunWith(r.stmtCache).Exec(); err != nil {
cclog.Errorf("Error while updating monitoring status for job %d: %v", job, err)
return fmt.Errorf("failed to update monitoring status for job %d: %w", job, err)
}
return nil
_, err = stmt.RunWith(r.stmtCache).Exec()
return err
}
// Execute runs a Squirrel UpdateBuilder statement against the database.
// This is a generic helper for executing pre-built update queries.
//
// Parameters:
// - stmt: Squirrel UpdateBuilder with prepared update query
//
// Returns an error if the execution fails.
func (r *JobRepository) Execute(stmt sq.UpdateBuilder) error {
if _, err := stmt.RunWith(r.stmtCache).Exec(); err != nil {
cclog.Errorf("Error while executing statement: %v", err)
return fmt.Errorf("failed to execute update statement: %w", err)
return err
}
return nil
}
// MarkArchived adds monitoring status update to an existing UpdateBuilder statement.
// This is a builder helper used when constructing multi-field update queries.
//
// Parameters:
// - stmt: Existing UpdateBuilder to modify
// - monitoringStatus: Monitoring status value to set
//
// Returns the modified UpdateBuilder for method chaining.
func (r *JobRepository) MarkArchived(
stmt sq.UpdateBuilder,
monitoringStatus int32,
@@ -936,22 +680,11 @@ func (r *JobRepository) MarkArchived(
return stmt.Set("monitoring_status", monitoringStatus)
}
// UpdateEnergy calculates and updates the energy consumption for a job.
// This is called for running jobs during intermediate updates or when archiving.
//
// Energy calculation formula:
// - For "power" metrics: Energy (kWh) = (Power_avg * NumNodes * Duration_hours) / 1000
// - For "energy" metrics: Currently not implemented (would need sum statistics)
//
// The calculation accounts for:
// - Multi-node jobs: Multiplies by NumNodes to get total cluster energy
// - Shared jobs: Node average is already based on partial resources, so NumNodes=1
// - Unit conversion: Watts * hours / 1000 = kilowatt-hours (kWh)
// - Rounding: Results rounded to 2 decimal places
func (r *JobRepository) UpdateEnergy(
stmt sq.UpdateBuilder,
jobMeta *schema.Job,
) (sq.UpdateBuilder, error) {
/* Note: Only Called for Running Jobs during Intermediate Update or on Archiving */
sc, err := archive.GetSubCluster(jobMeta.Cluster, jobMeta.SubCluster)
if err != nil {
cclog.Errorf("cannot get subcluster: %s", err.Error())
@@ -959,27 +692,25 @@ func (r *JobRepository) UpdateEnergy(
}
energyFootprint := make(map[string]float64)
// Accumulate total energy across all energy-related metrics
// Total Job Energy Outside Loop
totalEnergy := 0.0
for _, fp := range sc.EnergyFootprint {
// Calculate energy for this specific metric
// Always Init Metric Energy Inside Loop
metricEnergy := 0.0
if i, err := archive.MetricIndex(sc.MetricConfig, fp); err == nil {
// Note: For DB data, calculate and save as kWh
switch sc.MetricConfig[i].Energy {
case "energy": // Metric already in energy units (Joules or Wh)
case "energy": // this metric has energy as unit (Joules or Wh)
cclog.Warnf("Update EnergyFootprint for Job %d and Metric %s on cluster %s: Set to 'energy' in cluster.json: Not implemented, will return 0.0", jobMeta.JobID, fp, jobMeta.Cluster)
// FIXME: Needs sum as stats type to accumulate energy values over time
case "power": // Metric in power units (Watts)
// Energy (kWh) = Power (W) × Time (h) / 1000
// Formula: (avg_power_per_node * num_nodes) * (duration_sec / 3600) / 1000
//
// Breakdown:
// LoadJobStat(jobMeta, fp, "avg") = average power per node (W)
// jobMeta.NumNodes = number of nodes (1 for shared jobs)
// jobMeta.Duration / 3600.0 = duration in hours
// / 1000.0 = convert Wh to kWh
// FIXME: Needs sum as stats type
case "power": // this metric has power as unit (Watt)
// Energy: Power (in Watts) * Time (in Seconds)
// Unit: (W * (s / 3600)) / 1000 = kWh
// Round 2 Digits: round(Energy * 100) / 100
// Here: (All-Node Metric Average * Number of Nodes) * (Job Duration in Seconds / 3600) / 1000
// Note: Shared Jobs handled correctly since "Node Average" is based on partial resources, while "numNodes" factor is 1
rawEnergy := ((LoadJobStat(jobMeta, fp, "avg") * float64(jobMeta.NumNodes)) * (float64(jobMeta.Duration) / 3600.0)) / 1000.0
metricEnergy = math.Round(rawEnergy*100.0) / 100.0 // Round to 2 decimal places
metricEnergy = math.Round(rawEnergy*100.0) / 100.0
}
} else {
cclog.Warnf("Error while collecting energy metric %s for job, DB ID '%v', return '0.0'", fp, jobMeta.ID)
@@ -987,6 +718,8 @@ func (r *JobRepository) UpdateEnergy(
energyFootprint[fp] = metricEnergy
totalEnergy += metricEnergy
// cclog.Infof("Metric %s Average %f -> %f kWh | Job %d Total -> %f kWh", fp, LoadJobStat(jobMeta, fp, "avg"), energy, jobMeta.JobID, totalEnergy)
}
var rawFootprint []byte
@@ -998,19 +731,11 @@ func (r *JobRepository) UpdateEnergy(
return stmt.Set("energy_footprint", string(rawFootprint)).Set("energy", (math.Round(totalEnergy*100.0) / 100.0)), nil
}
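The power-to-energy conversion above can be isolated into a few lines. A minimal sketch, assuming the formula documented in the comments (kWh = average per-node power in W × number of nodes × duration in hours ÷ 1000, rounded to two decimals); the helper name `computeJobEnergyKWh` is hypothetical:

```go
package main

import (
	"fmt"
	"math"
)

// computeJobEnergyKWh mirrors the UpdateEnergy formula for "power" metrics:
// (avg power per node [W] * num nodes) * (duration [s] / 3600) / 1000,
// rounded to two decimal places. For shared jobs numNodes is 1, since the
// node average is already based on partial resources.
func computeJobEnergyKWh(avgPowerWatts float64, numNodes int32, durationSec int32) float64 {
	raw := (avgPowerWatts * float64(numNodes)) * (float64(durationSec) / 3600.0) / 1000.0
	return math.Round(raw*100.0) / 100.0
}

func main() {
	// 350 W average per node on 4 nodes for 2 hours -> 2.8 kWh
	fmt.Println(computeJobEnergyKWh(350.0, 4, 7200))
}
```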
// UpdateFootprint calculates and updates the performance footprint for a job.
// This is called for running jobs during intermediate updates or when archiving.
//
// A footprint is a summary statistic (avg/min/max) for each monitored metric.
// The specific statistic type is defined in the cluster config's Footprint field.
// Results are stored as JSON with keys like "metric_avg", "metric_max", etc.
//
// Example: For a "cpu_load" metric with Footprint="avg", this stores
// the average CPU load across all nodes as "cpu_load_avg": 85.3
func (r *JobRepository) UpdateFootprint(
stmt sq.UpdateBuilder,
jobMeta *schema.Job,
) (sq.UpdateBuilder, error) {
/* Note: Only Called for Running Jobs during Intermediate Update or on Archiving */
sc, err := archive.GetSubCluster(jobMeta.Cluster, jobMeta.SubCluster)
if err != nil {
cclog.Errorf("cannot get subcluster: %s", err.Error())
@@ -1018,10 +743,7 @@ func (r *JobRepository) UpdateFootprint(
}
footprint := make(map[string]float64)
// Build footprint map with metric_stattype as keys
for _, fp := range sc.Footprint {
// Determine which statistic to use: avg, min, or max
// First check global metric config, then cluster-specific config
var statType string
for _, gm := range archive.GlobalMetricList {
if gm.Name == fp {
@@ -1029,18 +751,15 @@ func (r *JobRepository) UpdateFootprint(
}
}
// Validate statistic type
if statType != "avg" && statType != "min" && statType != "max" {
cclog.Warnf("unknown statType for footprint update: %s", statType)
return stmt, fmt.Errorf("unknown statType for footprint update: %s", statType)
}
// Override with cluster-specific config if available
if i, err := archive.MetricIndex(sc.MetricConfig, fp); err == nil {
statType = sc.MetricConfig[i].Footprint
}
// Store as "metric_stattype": value (e.g., "cpu_load_avg": 85.3)
name := fmt.Sprintf("%s_%s", fp, statType)
footprint[name] = LoadJobStat(jobMeta, fp, statType)
}
@@ -1053,84 +772,3 @@ func (r *JobRepository) UpdateFootprint(
return stmt.Set("footprint", string(rawFootprint)), nil
}
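The footprint key naming described above ("metric_stattype") is simple enough to sketch directly; `footprintKey` is an illustrative name, not a function in the repository:

```go
package main

import "fmt"

// footprintKey builds the JSON key used for footprint entries:
// "<metric>_<statType>", e.g. "cpu_load_avg".
func footprintKey(metric, statType string) string {
	return fmt.Sprintf("%s_%s", metric, statType)
}

func main() {
	footprint := make(map[string]float64)
	// For a "cpu_load" metric with Footprint="avg", the average CPU load
	// across all nodes is stored under "cpu_load_avg".
	footprint[footprintKey("cpu_load", "avg")] = 85.3
	fmt.Println(footprint["cpu_load_avg"])
}
```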
// GetUsedNodes returns a map of cluster names to sorted lists of unique hostnames
// that are currently in use by jobs that started before the given timestamp and
// are still in running state.
//
// The timestamp parameter (ts) is compared against job.start_time to find
// relevant jobs. Returns an error if the database query fails or row iteration
// encounters errors. Individual row parsing errors are logged but don't fail
// the entire operation.
func (r *JobRepository) GetUsedNodes(ts int64) (map[string][]string, error) {
// Note: Query expects index on (job_state, start_time) for optimal performance
q := sq.Select("job.cluster", "job.resources").From("job").
Where("job.start_time < ?", ts).
Where(sq.Eq{"job.job_state": "running"})
rows, err := q.RunWith(r.stmtCache).Query()
if err != nil {
queryString, queryVars, _ := q.ToSql()
return nil, fmt.Errorf("query failed [%s] %v: %w", queryString, queryVars, err)
}
defer rows.Close()
// Use a map of sets for efficient deduplication
nodeSet := make(map[string]map[string]struct{})
var (
cluster string
rawResources []byte
resources []*schema.Resource
skippedRows int
)
for rows.Next() {
if err := rows.Scan(&cluster, &rawResources); err != nil {
cclog.Warnf("Error scanning job row in GetUsedNodes: %v", err)
skippedRows++
continue
}
resources = resources[:0] // Clear slice, keep capacity
if err := json.Unmarshal(rawResources, &resources); err != nil {
cclog.Warnf("Error unmarshaling resources for cluster %s: %v", cluster, err)
skippedRows++
continue
}
if len(resources) == 0 {
cclog.Debugf("Job in cluster %s has no resources", cluster)
continue
}
if _, ok := nodeSet[cluster]; !ok {
nodeSet[cluster] = make(map[string]struct{})
}
for _, res := range resources {
nodeSet[cluster][res.Hostname] = struct{}{}
}
}
if err := rows.Err(); err != nil {
return nil, fmt.Errorf("error iterating rows: %w", err)
}
if skippedRows > 0 {
cclog.Warnf("GetUsedNodes: Skipped %d rows due to parsing errors", skippedRows)
}
// Convert sets to sorted slices
nodeList := make(map[string][]string, len(nodeSet))
for cluster, nodes := range nodeSet {
list := make([]string, 0, len(nodes))
for node := range nodes {
list = append(list, node)
}
sort.Strings(list)
nodeList[cluster] = list
}
return nodeList, nil
}
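The map-of-sets deduplication used in GetUsedNodes can be demonstrated in isolation. A pure-Go sketch (the function name `dedupNodes` and the `[cluster, hostname]` row shape are illustrative stand-ins for the scanned SQL rows):

```go
package main

import (
	"fmt"
	"sort"
)

// dedupNodes collects hostnames per cluster in a set for O(1) deduplication,
// then emits each cluster's hostnames as a sorted slice, mirroring the
// nodeSet/nodeList logic in GetUsedNodes.
func dedupNodes(rows [][2]string) map[string][]string {
	nodeSet := make(map[string]map[string]struct{})
	for _, row := range rows {
		cluster, host := row[0], row[1]
		if _, ok := nodeSet[cluster]; !ok {
			nodeSet[cluster] = make(map[string]struct{})
		}
		nodeSet[cluster][host] = struct{}{}
	}
	nodeList := make(map[string][]string, len(nodeSet))
	for cluster, nodes := range nodeSet {
		list := make([]string, 0, len(nodes))
		for node := range nodes {
			list = append(list, node)
		}
		sort.Strings(list)
		nodeList[cluster] = list
	}
	return nodeList
}

func main() {
	rows := [][2]string{{"fritz", "f0102"}, {"fritz", "f0101"}, {"fritz", "f0102"}}
	fmt.Println(dedupNodes(rows)["fritz"]) // duplicate f0102 collapsed, output sorted
}
```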

View File

@@ -9,8 +9,8 @@ import (
"encoding/json"
"fmt"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
sq "github.com/Masterminds/squirrel"
)
@@ -30,27 +30,6 @@ const NamedJobInsert string = `INSERT INTO job (
:shared, :monitoring_status, :smt, :job_state, :start_time, :duration, :walltime, :footprint, :energy, :energy_footprint, :resources, :meta_data
);`
// InsertJobDirect inserts a job directly into the job table (not job_cache).
// Use this when the returned ID will be used for operations on the job table
// (e.g., adding tags), or for imported jobs that are already completed.
func (r *JobRepository) InsertJobDirect(job *schema.Job) (int64, error) {
r.Mutex.Lock()
defer r.Mutex.Unlock()
res, err := r.DB.NamedExec(NamedJobInsert, job)
if err != nil {
cclog.Warn("Error while NamedJobInsert (direct)")
return 0, err
}
id, err := res.LastInsertId()
if err != nil {
cclog.Warn("Error while getting last insert ID (direct)")
return 0, err
}
return id, nil
}
func (r *JobRepository) InsertJob(job *schema.Job) (int64, error) {
r.Mutex.Lock()
defer r.Mutex.Unlock()
@@ -92,9 +71,8 @@ func (r *JobRepository) SyncJobs() ([]*schema.Job, error) {
jobs = append(jobs, job)
}
// Use INSERT OR IGNORE to skip jobs already transferred by the stop path
_, err = r.DB.Exec(
"INSERT OR IGNORE INTO job (job_id, cluster, subcluster, start_time, hpc_user, project, cluster_partition, array_job_id, num_nodes, num_hwthreads, num_acc, shared, monitoring_status, smt, job_state, duration, walltime, footprint, energy, energy_footprint, resources, meta_data) SELECT job_id, cluster, subcluster, start_time, hpc_user, project, cluster_partition, array_job_id, num_nodes, num_hwthreads, num_acc, shared, monitoring_status, smt, job_state, duration, walltime, footprint, energy, energy_footprint, resources, meta_data FROM job_cache")
"INSERT INTO job (job_id, cluster, subcluster, start_time, hpc_user, project, cluster_partition, array_job_id, num_nodes, num_hwthreads, num_acc, shared, monitoring_status, smt, job_state, duration, walltime, footprint, energy, energy_footprint, resources, meta_data) SELECT job_id, cluster, subcluster, start_time, hpc_user, project, cluster_partition, array_job_id, num_nodes, num_hwthreads, num_acc, shared, monitoring_status, smt, job_state, duration, walltime, footprint, energy, energy_footprint, resources, meta_data FROM job_cache")
if err != nil {
cclog.Warnf("Error while Job sync: %v", err)
return nil, err
@@ -106,48 +84,9 @@ func (r *JobRepository) SyncJobs() ([]*schema.Job, error) {
return nil, err
}
// Resolve correct job.id from the job table. The IDs read from job_cache
// are from a different auto-increment sequence and must not be used to
// query the job table.
for _, job := range jobs {
var newID int64
if err := sq.Select("job.id").From("job").
Where("job.job_id = ? AND job.cluster = ? AND job.start_time = ?",
job.JobID, job.Cluster, job.StartTime).
RunWith(r.stmtCache).QueryRow().Scan(&newID); err != nil {
cclog.Warnf("SyncJobs: could not resolve job table id for job %d on %s: %v",
job.JobID, job.Cluster, err)
continue
}
job.ID = &newID
}
return jobs, nil
}
// TransferCachedJobToMain moves a job from job_cache to the job table.
// Caller must hold r.Mutex. Returns the new job table ID.
func (r *JobRepository) TransferCachedJobToMain(cacheID int64) (int64, error) {
res, err := r.DB.Exec(
"INSERT INTO job (job_id, cluster, subcluster, start_time, hpc_user, project, cluster_partition, array_job_id, num_nodes, num_hwthreads, num_acc, shared, monitoring_status, smt, job_state, duration, walltime, footprint, energy, energy_footprint, resources, meta_data) SELECT job_id, cluster, subcluster, start_time, hpc_user, project, cluster_partition, array_job_id, num_nodes, num_hwthreads, num_acc, shared, monitoring_status, smt, job_state, duration, walltime, footprint, energy, energy_footprint, resources, meta_data FROM job_cache WHERE id = ?",
cacheID)
if err != nil {
return 0, fmt.Errorf("transferring cached job %d to main table failed: %w", cacheID, err)
}
newID, err := res.LastInsertId()
if err != nil {
return 0, fmt.Errorf("getting new job ID after transfer failed: %w", err)
}
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", cacheID)
if err != nil {
return 0, fmt.Errorf("deleting cached job %d after transfer failed: %w", cacheID, err)
}
return newID, nil
}
// Start inserts a new job in the table, returning the unique job ID.
// Statistics are not transferred!
func (r *JobRepository) Start(job *schema.Job) (id int64, err error) {
@@ -169,28 +108,6 @@ func (r *JobRepository) Start(job *schema.Job) (id int64, err error) {
return r.InsertJob(job)
}
// StartDirect inserts a new job directly into the job table (not job_cache).
// Use this when the returned ID will immediately be used for job table
// operations such as adding tags.
func (r *JobRepository) StartDirect(job *schema.Job) (id int64, err error) {
job.RawFootprint, err = json.Marshal(job.Footprint)
if err != nil {
return -1, fmt.Errorf("REPOSITORY/JOB > encoding footprint field failed: %w", err)
}
job.RawResources, err = json.Marshal(job.Resources)
if err != nil {
return -1, fmt.Errorf("REPOSITORY/JOB > encoding resources field failed: %w", err)
}
job.RawMetaData, err = json.Marshal(job.MetaData)
if err != nil {
return -1, fmt.Errorf("REPOSITORY/JOB > encoding metaData field failed: %w", err)
}
return r.InsertJobDirect(job)
}
// Stop updates the job with the database id jobId using the provided arguments.
func (r *JobRepository) Stop(
jobID int64,
@@ -212,3 +129,20 @@ func (r *JobRepository) Stop(
return err
}
func (r *JobRepository) StopCached(
jobID int64,
duration int32,
state schema.JobState,
monitoringStatus int32,
) (err error) {
// Note: StopCached updates job_cache table, not the main job table
// Cache invalidation happens when job is synced to main table
stmt := sq.Update("job_cache").
Set("job_state", state).
Set("duration", duration).
Set("monitoring_status", monitoringStatus).
Where("job_cache.id = ?", jobID)
_, err = stmt.RunWith(r.stmtCache).Exec()
return err
}


@@ -1,607 +0,0 @@
// Copyright (C) NHR@FAU, University Erlangen-Nuremberg.
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package repository
import (
"encoding/json"
"testing"
"github.com/ClusterCockpit/cc-lib/v2/schema"
_ "github.com/mattn/go-sqlite3"
"github.com/stretchr/testify/assert"
"github.com/stretchr/testify/require"
)
// createTestJob creates a minimal valid job for testing
func createTestJob(jobID int64, cluster string) *schema.Job {
return &schema.Job{
JobID: jobID,
User: "testuser",
Project: "testproject",
Cluster: cluster,
SubCluster: "main",
Partition: "batch",
NumNodes: 1,
NumHWThreads: 4,
NumAcc: 0,
Shared: "none",
MonitoringStatus: schema.MonitoringStatusRunningOrArchiving,
SMT: 1,
State: schema.JobStateRunning,
StartTime: 1234567890,
Duration: 0,
Walltime: 3600,
Resources: []*schema.Resource{
{
Hostname: "node01",
HWThreads: []int{0, 1, 2, 3},
},
},
Footprint: map[string]float64{
"cpu_load": 50.0,
"mem_used": 8000.0,
"flops_any": 0.5,
"mem_bw": 10.0,
"net_bw": 2.0,
"file_bw": 1.0,
"cpu_used": 2.0,
"cpu_load_core": 12.5,
},
MetaData: map[string]string{
"jobName": "test_job",
"queue": "normal",
"qosName": "default",
"accountName": "testaccount",
},
}
}
func TestInsertJob(t *testing.T) {
r := setup(t)
t.Run("successful insertion", func(t *testing.T) {
job := createTestJob(999001, "testcluster")
job.RawResources, _ = json.Marshal(job.Resources)
job.RawFootprint, _ = json.Marshal(job.Footprint)
job.RawMetaData, _ = json.Marshal(job.MetaData)
id, err := r.InsertJob(job)
require.NoError(t, err, "InsertJob should succeed")
assert.Greater(t, id, int64(0), "Should return valid insert ID")
// Verify job was inserted into job_cache
var count int
err = r.DB.QueryRow("SELECT COUNT(*) FROM job_cache WHERE job_id = ? AND cluster = ?",
job.JobID, job.Cluster).Scan(&count)
require.NoError(t, err)
assert.Equal(t, 1, count, "Job should be in job_cache table")
// Clean up
_, err = r.DB.Exec("DELETE FROM job_cache WHERE job_id = ? AND cluster = ?", job.JobID, job.Cluster)
require.NoError(t, err)
})
t.Run("insertion with all fields", func(t *testing.T) {
job := createTestJob(999002, "testcluster")
job.ArrayJobID = 5000
job.Energy = 1500.5
job.RawResources, _ = json.Marshal(job.Resources)
job.RawFootprint, _ = json.Marshal(job.Footprint)
job.RawMetaData, _ = json.Marshal(job.MetaData)
id, err := r.InsertJob(job)
require.NoError(t, err)
assert.Greater(t, id, int64(0))
// Verify all fields were stored correctly
var retrievedJob schema.Job
err = r.DB.QueryRow(`SELECT job_id, hpc_user, project, cluster, array_job_id, energy
FROM job_cache WHERE id = ?`, id).Scan(
&retrievedJob.JobID, &retrievedJob.User, &retrievedJob.Project,
&retrievedJob.Cluster, &retrievedJob.ArrayJobID, &retrievedJob.Energy)
require.NoError(t, err)
assert.Equal(t, job.JobID, retrievedJob.JobID)
assert.Equal(t, job.User, retrievedJob.User)
assert.Equal(t, job.Project, retrievedJob.Project)
assert.Equal(t, job.Cluster, retrievedJob.Cluster)
assert.Equal(t, job.ArrayJobID, retrievedJob.ArrayJobID)
assert.Equal(t, job.Energy, retrievedJob.Energy)
// Clean up
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
})
}
func TestStart(t *testing.T) {
r := setup(t)
t.Run("successful job start with JSON encoding", func(t *testing.T) {
job := createTestJob(999003, "testcluster")
id, err := r.Start(job)
require.NoError(t, err, "Start should succeed")
assert.Greater(t, id, int64(0), "Should return valid insert ID")
// Verify job was inserted and JSON fields were encoded
var rawResources, rawFootprint, rawMetaData []byte
err = r.DB.QueryRow(`SELECT resources, footprint, meta_data FROM job_cache WHERE id = ?`, id).Scan(
&rawResources, &rawFootprint, &rawMetaData)
require.NoError(t, err)
// Verify resources JSON
var resources []*schema.Resource
err = json.Unmarshal(rawResources, &resources)
require.NoError(t, err, "Resources should be valid JSON")
assert.Equal(t, 1, len(resources))
assert.Equal(t, "node01", resources[0].Hostname)
// Verify footprint JSON
var footprint map[string]float64
err = json.Unmarshal(rawFootprint, &footprint)
require.NoError(t, err, "Footprint should be valid JSON")
assert.Equal(t, 50.0, footprint["cpu_load"])
assert.Equal(t, 8000.0, footprint["mem_used"])
// Verify metadata JSON
var metaData map[string]string
err = json.Unmarshal(rawMetaData, &metaData)
require.NoError(t, err, "MetaData should be valid JSON")
assert.Equal(t, "test_job", metaData["jobName"])
// Clean up
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
})
t.Run("job start with empty footprint", func(t *testing.T) {
job := createTestJob(999004, "testcluster")
job.Footprint = map[string]float64{}
id, err := r.Start(job)
require.NoError(t, err)
assert.Greater(t, id, int64(0))
// Verify empty footprint was encoded as empty JSON object
var rawFootprint []byte
err = r.DB.QueryRow(`SELECT footprint FROM job_cache WHERE id = ?`, id).Scan(&rawFootprint)
require.NoError(t, err)
assert.Equal(t, []byte("{}"), rawFootprint)
// Clean up
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
})
t.Run("job start with nil metadata", func(t *testing.T) {
job := createTestJob(999005, "testcluster")
job.MetaData = nil
id, err := r.Start(job)
require.NoError(t, err)
assert.Greater(t, id, int64(0))
// Clean up
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
})
}
func TestStop(t *testing.T) {
r := setup(t)
t.Run("successful job stop", func(t *testing.T) {
// First insert a job using Start
job := createTestJob(999106, "testcluster")
id, err := r.Start(job)
require.NoError(t, err)
// Move from job_cache to job table (simulate SyncJobs) - exclude id to let it auto-increment
_, err = r.DB.Exec(`INSERT INTO job (job_id, cluster, subcluster, submit_time, start_time, hpc_user, project,
cluster_partition, array_job_id, duration, walltime, job_state, meta_data, resources, num_nodes,
num_hwthreads, num_acc, smt, shared, monitoring_status, energy, energy_footprint, footprint)
SELECT job_id, cluster, subcluster, submit_time, start_time, hpc_user, project,
cluster_partition, array_job_id, duration, walltime, job_state, meta_data, resources, num_nodes,
num_hwthreads, num_acc, smt, shared, monitoring_status, energy, energy_footprint, footprint
FROM job_cache WHERE id = ?`, id)
require.NoError(t, err)
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
// Get the new job id in the job table
err = r.DB.QueryRow("SELECT id FROM job WHERE job_id = ? AND cluster = ? AND start_time = ?",
job.JobID, job.Cluster, job.StartTime).Scan(&id)
require.NoError(t, err)
// Stop the job
duration := int32(3600)
state := schema.JobStateCompleted
monitoringStatus := int32(schema.MonitoringStatusArchivingSuccessful)
err = r.Stop(id, duration, state, monitoringStatus)
require.NoError(t, err, "Stop should succeed")
// Verify job was updated
var retrievedDuration int32
var retrievedState string
var retrievedMonStatus int32
err = r.DB.QueryRow(`SELECT duration, job_state, monitoring_status FROM job WHERE id = ?`, id).Scan(
&retrievedDuration, &retrievedState, &retrievedMonStatus)
require.NoError(t, err)
assert.Equal(t, duration, retrievedDuration)
assert.Equal(t, string(state), retrievedState)
assert.Equal(t, monitoringStatus, retrievedMonStatus)
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", id)
require.NoError(t, err)
})
t.Run("stop updates job state transitions", func(t *testing.T) {
// Insert a job
job := createTestJob(999107, "testcluster")
id, err := r.Start(job)
require.NoError(t, err)
// Move to job table
_, err = r.DB.Exec(`INSERT INTO job (job_id, cluster, subcluster, submit_time, start_time, hpc_user, project,
cluster_partition, array_job_id, duration, walltime, job_state, meta_data, resources, num_nodes,
num_hwthreads, num_acc, smt, shared, monitoring_status, energy, energy_footprint, footprint)
SELECT job_id, cluster, subcluster, submit_time, start_time, hpc_user, project,
cluster_partition, array_job_id, duration, walltime, job_state, meta_data, resources, num_nodes,
num_hwthreads, num_acc, smt, shared, monitoring_status, energy, energy_footprint, footprint
FROM job_cache WHERE id = ?`, id)
require.NoError(t, err)
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
// Get the new job id in the job table
err = r.DB.QueryRow("SELECT id FROM job WHERE job_id = ? AND cluster = ? AND start_time = ?",
job.JobID, job.Cluster, job.StartTime).Scan(&id)
require.NoError(t, err)
// Stop the job with different duration
err = r.Stop(id, 7200, schema.JobStateCompleted, int32(schema.MonitoringStatusArchivingSuccessful))
require.NoError(t, err)
// Verify the duration was updated correctly
var duration int32
err = r.DB.QueryRow(`SELECT duration FROM job WHERE id = ?`, id).Scan(&duration)
require.NoError(t, err)
assert.Equal(t, int32(7200), duration, "Duration should be updated to 7200")
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", id)
require.NoError(t, err)
})
t.Run("stop with different states", func(t *testing.T) {
testCases := []struct {
name string
jobID int64
state schema.JobState
monitoringStatus int32
}{
{"completed", 999108, schema.JobStateCompleted, int32(schema.MonitoringStatusArchivingSuccessful)},
{"failed", 999118, schema.JobStateFailed, int32(schema.MonitoringStatusArchivingSuccessful)},
{"cancelled", 999119, schema.JobStateCancelled, int32(schema.MonitoringStatusArchivingSuccessful)},
{"timeout", 999120, schema.JobStateTimeout, int32(schema.MonitoringStatusArchivingSuccessful)},
}
for _, tc := range testCases {
t.Run(tc.name, func(t *testing.T) {
job := createTestJob(tc.jobID, "testcluster")
id, err := r.Start(job)
require.NoError(t, err)
// Move to job table
_, err = r.DB.Exec(`INSERT INTO job (job_id, cluster, subcluster, submit_time, start_time, hpc_user, project,
cluster_partition, array_job_id, duration, walltime, job_state, meta_data, resources, num_nodes,
num_hwthreads, num_acc, smt, shared, monitoring_status, energy, energy_footprint, footprint)
SELECT job_id, cluster, subcluster, submit_time, start_time, hpc_user, project,
cluster_partition, array_job_id, duration, walltime, job_state, meta_data, resources, num_nodes,
num_hwthreads, num_acc, smt, shared, monitoring_status, energy, energy_footprint, footprint
FROM job_cache WHERE id = ?`, id)
require.NoError(t, err)
_, err = r.DB.Exec("DELETE FROM job_cache WHERE id = ?", id)
require.NoError(t, err)
// Get the new job id in the job table
err = r.DB.QueryRow("SELECT id FROM job WHERE job_id = ? AND cluster = ? AND start_time = ?",
job.JobID, job.Cluster, job.StartTime).Scan(&id)
require.NoError(t, err)
// Stop with specific state
err = r.Stop(id, 1800, tc.state, tc.monitoringStatus)
require.NoError(t, err)
// Verify state was set correctly
var retrievedState string
err = r.DB.QueryRow(`SELECT job_state FROM job WHERE id = ?`, id).Scan(&retrievedState)
require.NoError(t, err)
assert.Equal(t, string(tc.state), retrievedState)
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", id)
require.NoError(t, err)
})
}
})
}
func TestTransferCachedJobToMain(t *testing.T) {
r := setup(t)
t.Run("successful transfer from cache to main", func(t *testing.T) {
// Insert a job in job_cache
job := createTestJob(999009, "testcluster")
cacheID, err := r.Start(job)
require.NoError(t, err)
// Transfer the cached job to the main table
r.Mutex.Lock()
newID, err := r.TransferCachedJobToMain(cacheID)
r.Mutex.Unlock()
require.NoError(t, err, "TransferCachedJobToMain should succeed")
assert.NotEqual(t, cacheID, newID, "New ID should differ from cache ID")
// Verify job exists in job table
var count int
err = r.DB.QueryRow(`SELECT COUNT(*) FROM job WHERE id = ?`, newID).Scan(&count)
require.NoError(t, err)
assert.Equal(t, 1, count, "Job should exist in main table")
// Verify job was removed from job_cache
err = r.DB.QueryRow(`SELECT COUNT(*) FROM job_cache WHERE id = ?`, cacheID).Scan(&count)
require.NoError(t, err)
assert.Equal(t, 0, count, "Job should be removed from cache")
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", newID)
require.NoError(t, err)
})
t.Run("transfer preserves job data", func(t *testing.T) {
// Insert a job in job_cache
job := createTestJob(999010, "testcluster")
cacheID, err := r.Start(job)
require.NoError(t, err)
// Transfer the cached job
r.Mutex.Lock()
newID, err := r.TransferCachedJobToMain(cacheID)
r.Mutex.Unlock()
require.NoError(t, err)
// Verify the transferred job has the correct data
var jobID int64
var cluster string
err = r.DB.QueryRow(`SELECT job_id, cluster FROM job WHERE id = ?`, newID).Scan(&jobID, &cluster)
require.NoError(t, err)
assert.Equal(t, job.JobID, jobID)
assert.Equal(t, job.Cluster, cluster)
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", newID)
require.NoError(t, err)
})
}
func TestSyncJobs(t *testing.T) {
r := setup(t)
t.Run("sync jobs from cache to main table", func(t *testing.T) {
// Ensure cache is empty first
_, err := r.DB.Exec("DELETE FROM job_cache")
require.NoError(t, err)
// Insert multiple jobs in job_cache
job1 := createTestJob(999011, "testcluster")
job2 := createTestJob(999012, "testcluster")
job3 := createTestJob(999013, "testcluster")
_, err = r.Start(job1)
require.NoError(t, err)
_, err = r.Start(job2)
require.NoError(t, err)
_, err = r.Start(job3)
require.NoError(t, err)
// Verify jobs are in job_cache
var cacheCount int
err = r.DB.QueryRow("SELECT COUNT(*) FROM job_cache WHERE job_id IN (?, ?, ?)",
job1.JobID, job2.JobID, job3.JobID).Scan(&cacheCount)
require.NoError(t, err)
assert.Equal(t, 3, cacheCount, "All jobs should be in job_cache")
// Sync jobs
jobs, err := r.SyncJobs()
require.NoError(t, err, "SyncJobs should succeed")
assert.Equal(t, 3, len(jobs), "Should return 3 synced jobs")
// Verify jobs were moved to job table
var jobCount int
err = r.DB.QueryRow("SELECT COUNT(*) FROM job WHERE job_id IN (?, ?, ?)",
job1.JobID, job2.JobID, job3.JobID).Scan(&jobCount)
require.NoError(t, err)
assert.Equal(t, 3, jobCount, "All jobs should be in job table")
// Verify job_cache was cleared
err = r.DB.QueryRow("SELECT COUNT(*) FROM job_cache WHERE job_id IN (?, ?, ?)",
job1.JobID, job2.JobID, job3.JobID).Scan(&cacheCount)
require.NoError(t, err)
assert.Equal(t, 0, cacheCount, "job_cache should be empty after sync")
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE job_id IN (?, ?, ?)", job1.JobID, job2.JobID, job3.JobID)
require.NoError(t, err)
})
t.Run("sync preserves job data", func(t *testing.T) {
// Ensure cache is empty first
_, err := r.DB.Exec("DELETE FROM job_cache")
require.NoError(t, err)
// Insert a job with specific data
job := createTestJob(999014, "testcluster")
job.ArrayJobID = 7777
job.Energy = 2500.75
job.Duration = 1800
id, err := r.Start(job)
require.NoError(t, err)
// Update some fields to simulate job progress
result, err := r.DB.Exec(`UPDATE job_cache SET duration = ?, energy = ? WHERE id = ?`,
3600, 3000.5, id)
require.NoError(t, err)
rowsAffected, _ := result.RowsAffected()
require.Equal(t, int64(1), rowsAffected, "UPDATE should affect exactly 1 row")
// Verify the update worked
var checkDuration int32
var checkEnergy float64
err = r.DB.QueryRow(`SELECT duration, energy FROM job_cache WHERE id = ?`, id).Scan(&checkDuration, &checkEnergy)
require.NoError(t, err)
require.Equal(t, int32(3600), checkDuration, "Duration should be updated to 3600 before sync")
require.Equal(t, 3000.5, checkEnergy, "Energy should be updated to 3000.5 before sync")
// Sync jobs
jobs, err := r.SyncJobs()
require.NoError(t, err)
require.Equal(t, 1, len(jobs), "Should return exactly 1 synced job")
// Verify in database
var dbJob schema.Job
err = r.DB.QueryRow(`SELECT job_id, hpc_user, project, cluster, array_job_id, duration, energy
FROM job WHERE job_id = ? AND cluster = ?`, job.JobID, job.Cluster).Scan(
&dbJob.JobID, &dbJob.User, &dbJob.Project, &dbJob.Cluster,
&dbJob.ArrayJobID, &dbJob.Duration, &dbJob.Energy)
require.NoError(t, err)
assert.Equal(t, job.JobID, dbJob.JobID)
assert.Equal(t, int32(3600), dbJob.Duration)
assert.Equal(t, 3000.5, dbJob.Energy)
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE job_id = ? AND cluster = ?", job.JobID, job.Cluster)
require.NoError(t, err)
})
t.Run("sync returns job table IDs not cache IDs", func(t *testing.T) {
// Ensure cache is empty first
_, err := r.DB.Exec("DELETE FROM job_cache")
require.NoError(t, err)
// Insert a job into job_cache
job := createTestJob(999015, "testcluster")
cacheID, err := r.Start(job)
require.NoError(t, err)
// Sync jobs
jobs, err := r.SyncJobs()
require.NoError(t, err)
require.Equal(t, 1, len(jobs))
// The returned ID must refer to the job table, not job_cache
var jobTableID int64
err = r.DB.QueryRow("SELECT id FROM job WHERE job_id = ? AND cluster = ? AND start_time = ?",
jobs[0].JobID, jobs[0].Cluster, jobs[0].StartTime).Scan(&jobTableID)
require.NoError(t, err)
assert.Equal(t, jobTableID, *jobs[0].ID,
"returned ID should match the job table row, not the cache ID (%d)", cacheID)
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE job_id = ? AND cluster = ?", job.JobID, job.Cluster)
require.NoError(t, err)
})
t.Run("sync with empty cache returns empty list", func(t *testing.T) {
// Ensure cache is empty
_, err := r.DB.Exec("DELETE FROM job_cache")
require.NoError(t, err)
// Sync should return empty list
jobs, err := r.SyncJobs()
require.NoError(t, err)
assert.Equal(t, 0, len(jobs), "Should return empty list when cache is empty")
})
}
func TestInsertJobDirect(t *testing.T) {
r := setup(t)
t.Run("inserts into job table not cache", func(t *testing.T) {
job := createTestJob(999020, "testcluster")
job.RawResources, _ = json.Marshal(job.Resources)
job.RawFootprint, _ = json.Marshal(job.Footprint)
job.RawMetaData, _ = json.Marshal(job.MetaData)
id, err := r.InsertJobDirect(job)
require.NoError(t, err, "InsertJobDirect should succeed")
assert.Greater(t, id, int64(0), "Should return valid insert ID")
// Verify job is in job table
var count int
err = r.DB.QueryRow("SELECT COUNT(*) FROM job WHERE id = ?", id).Scan(&count)
require.NoError(t, err)
assert.Equal(t, 1, count, "Job should be in job table")
// Verify job is NOT in job_cache
err = r.DB.QueryRow("SELECT COUNT(*) FROM job_cache WHERE job_id = ? AND cluster = ?",
job.JobID, job.Cluster).Scan(&count)
require.NoError(t, err)
assert.Equal(t, 0, count, "Job should NOT be in job_cache")
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", id)
require.NoError(t, err)
})
t.Run("returned ID works for tag operations", func(t *testing.T) {
job := createTestJob(999021, "testcluster")
job.RawResources, _ = json.Marshal(job.Resources)
job.RawFootprint, _ = json.Marshal(job.Footprint)
job.RawMetaData, _ = json.Marshal(job.MetaData)
id, err := r.InsertJobDirect(job)
require.NoError(t, err)
// Adding a tag using the returned ID should succeed (FK constraint on jobtag)
err = r.ImportTag(id, "test_type", "test_name", "global")
require.NoError(t, err, "ImportTag should succeed with direct insert ID")
// Clean up
_, err = r.DB.Exec("DELETE FROM jobtag WHERE job_id = ?", id)
require.NoError(t, err)
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", id)
require.NoError(t, err)
})
}
func TestStartDirect(t *testing.T) {
r := setup(t)
t.Run("inserts into job table with JSON encoding", func(t *testing.T) {
job := createTestJob(999022, "testcluster")
id, err := r.StartDirect(job)
require.NoError(t, err, "StartDirect should succeed")
assert.Greater(t, id, int64(0))
// Verify job is in job table with encoded JSON
var rawResources []byte
err = r.DB.QueryRow("SELECT resources FROM job WHERE id = ?", id).Scan(&rawResources)
require.NoError(t, err)
var resources []*schema.Resource
err = json.Unmarshal(rawResources, &resources)
require.NoError(t, err, "Resources should be valid JSON")
assert.Equal(t, "node01", resources[0].Hostname)
// Clean up
_, err = r.DB.Exec("DELETE FROM job WHERE id = ?", id)
require.NoError(t, err)
})
}


@@ -12,8 +12,8 @@ import (
"time"
"github.com/ClusterCockpit/cc-backend/internal/graph/model"
cclog "github.com/ClusterCockpit/cc-lib/v2/ccLogger"
"github.com/ClusterCockpit/cc-lib/v2/schema"
cclog "github.com/ClusterCockpit/cc-lib/ccLogger"
"github.com/ClusterCockpit/cc-lib/schema"
sq "github.com/Masterminds/squirrel"
)
@@ -27,10 +27,6 @@ func (r *JobRepository) Find(
cluster *string,
startTime *int64,
) (*schema.Job, error) {
if jobID == nil {
return nil, fmt.Errorf("jobID cannot be nil")
}
start := time.Now()
q := sq.Select(jobColumns...).From("job").
Where("job.job_id = ?", *jobID)
@@ -42,27 +38,17 @@ func (r *JobRepository) Find(
q = q.Where("job.start_time = ?", *startTime)
}
q = q.OrderBy("job.id DESC").Limit(1) // always use newest matching job by db id if more than one match
q = q.OrderBy("job.id DESC") // always use newest matching job by db id if more than one match
cclog.Debugf("Timer Find %s", time.Since(start))
return scanJob(q.RunWith(r.stmtCache).QueryRow())
}
// FindCached executes a SQL query to find a specific batch job in the job_cache table.
// The job is queried using the batch job id, and optionally filtered by cluster name
// and start time (UNIX epoch time seconds). This method uses cached job data, which
// may be stale but provides faster access than Find().
// It returns a pointer to a schema.Job data structure and an error variable.
// To check if no job was found, test err == sql.ErrNoRows
func (r *JobRepository) FindCached(
jobID *int64,
cluster *string,
startTime *int64,
) (*schema.Job, error) {
if jobID == nil {
return nil, fmt.Errorf("jobID cannot be nil")
}
q := sq.Select(jobCacheColumns...).From("job_cache").
Where("job_cache.job_id = ?", *jobID)
@@ -73,7 +59,7 @@ func (r *JobRepository) FindCached(
q = q.Where("job_cache.start_time = ?", *startTime)
}
q = q.OrderBy("job_cache.id DESC").Limit(1) // always use newest matching job by db id if more than one match
q = q.OrderBy("job_cache.id DESC") // always use newest matching job by db id if more than one match
return scanJob(q.RunWith(r.stmtCache).QueryRow())
}
@@ -88,10 +74,6 @@ func (r *JobRepository) FindAll(
cluster *string,
startTime *int64,
) ([]*schema.Job, error) {
if jobID == nil {
return nil, fmt.Errorf("jobID cannot be nil")
}
start := time.Now()
q := sq.Select(jobColumns...).From("job").
Where("job.job_id = ?", *jobID)
@@ -105,8 +87,8 @@ func (r *JobRepository) FindAll(
rows, err := q.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running FindAll query for jobID=%d: %v", *jobID, err)
return nil, fmt.Errorf("failed to execute FindAll query: %w", err)
cclog.Error("Error while running query")
return nil, err
}
defer rows.Close()
@@ -114,8 +96,8 @@ func (r *JobRepository) FindAll(
for rows.Next() {
job, err := scanJob(rows)
if err != nil {
cclog.Warnf("Error while scanning rows in FindAll: %v", err)
return nil, fmt.Errorf("failed to scan job row: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
jobs = append(jobs, job)
}
@@ -138,8 +120,8 @@ func (r *JobRepository) GetJobList(limit int, offset int) ([]int64, error) {
rows, err := query.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running GetJobList query (limit=%d, offset=%d): %v", limit, offset, err)
return nil, fmt.Errorf("failed to execute GetJobList query: %w", err)
cclog.Error("Error while running query")
return nil, err
}
defer rows.Close()
@@ -148,13 +130,13 @@ func (r *JobRepository) GetJobList(limit int, offset int) ([]int64, error) {
var id int64
err := rows.Scan(&id)
if err != nil {
cclog.Warnf("Error while scanning rows in GetJobList: %v", err)
return nil, fmt.Errorf("failed to scan job ID: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
jl = append(jl, id)
}
cclog.Debugf("JobRepository.GetJobList(): Return job count %d", len(jl))
cclog.Infof("Return job count %d", len(jl))
return jl, nil
}
@@ -220,10 +202,10 @@ func (r *JobRepository) FindByJobID(ctx context.Context, jobID int64, startTime
return scanJob(q.RunWith(r.stmtCache).QueryRow())
}
// IsJobOwner checks if the specified user owns the batch job identified by jobID,
// startTime, and cluster. Returns true if the user is the owner, false otherwise.
// This method does not return errors; it returns false for both non-existent jobs
// and jobs owned by other users.
// IsJobOwner executes a SQL query to find a specific batch job.
// The job is queried using the slurm id, a username, and the cluster.
// It returns a bool.
// If the job was found, the user is the owner: test err != sql.ErrNoRows
func (r *JobRepository) IsJobOwner(jobID int64, startTime int64, user string, cluster string) bool {
q := sq.Select("id").
From("job").
@@ -233,9 +215,6 @@ func (r *JobRepository) IsJobOwner(jobID int64, startTime int64, user string, cl
Where("job.start_time = ?", startTime)
_, err := scanJob(q.RunWith(r.stmtCache).QueryRow())
if err != nil && err != sql.ErrNoRows {
cclog.Warnf("IsJobOwner: unexpected error for jobID=%d, user=%s, cluster=%s: %v", jobID, user, cluster, err)
}
return err != sql.ErrNoRows
}
@@ -253,11 +232,6 @@ func (r *JobRepository) FindConcurrentJobs(
}
query = query.Where("cluster = ?", job.Cluster)
if len(job.Resources) == 0 {
return nil, fmt.Errorf("job has no resources defined")
}
var startTime int64
var stopTime int64
@@ -270,17 +244,11 @@ func (r *JobRepository) FindConcurrentJobs(
stopTime = startTime + int64(job.Duration)
}
// Time buffer constants for finding overlapping jobs
// overlapBufferStart: 10s grace period at job start to catch jobs starting just after
// overlapBufferEnd: 200s buffer at job end to account for scheduling/cleanup overlap
const overlapBufferStart = 10
const overlapBufferEnd = 200
// Add 200s overlap for job start times at the end
startTimeTail := startTime + 10
stopTimeTail := stopTime - 200
startTimeFront := startTime + 200
startTimeTail := startTime + overlapBufferStart
stopTimeTail := stopTime - overlapBufferEnd
startTimeFront := startTime + overlapBufferEnd
// Reminder: BETWEEN queries are slower and don't use indices as frequently: Can this be optimized?
queryRunning := query.Where("job.job_state = ?").Where("(job.start_time BETWEEN ? AND ? OR job.start_time < ?)",
"running", startTimeTail, stopTimeTail, startTime)
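The buffer arithmetic used by FindConcurrentJobs can be checked in isolation. This sketch reuses the 10s/200s constants from the hunk above; `overlapWindow` is illustrative, not part of the repository:

```go
package main

import "fmt"

// Buffer constants as in FindConcurrentJobs: a short grace period at job
// start, and a larger buffer to absorb scheduling/cleanup overlap.
const (
	overlapBufferStart = 10
	overlapBufferEnd   = 200
)

// overlapWindow computes the three boundaries the query compares against.
func overlapWindow(startTime, stopTime int64) (tail, stopTail, front int64) {
	tail = startTime + overlapBufferStart
	stopTail = stopTime - overlapBufferEnd
	front = startTime + overlapBufferEnd
	return
}

func main() {
	// A job starting at t=1000 and stopping at t=4600 (1h duration).
	tail, stopTail, front := overlapWindow(1000, 4600)
	fmt.Println(tail, stopTail, front)
}
```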
// Get At Least One Exact Hostname Match from JSON Resources Array in Database
@@ -293,8 +261,8 @@ func (r *JobRepository) FindConcurrentJobs(
rows, err := query.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running concurrent jobs query: %v", err)
return nil, fmt.Errorf("failed to execute concurrent jobs query: %w", err)
cclog.Errorf("Error while running query: %v", err)
return nil, err
}
defer rows.Close()
@@ -305,8 +273,8 @@ func (r *JobRepository) FindConcurrentJobs(
var id, jobID, startTime sql.NullInt64
if err = rows.Scan(&id, &jobID, &startTime); err != nil {
cclog.Warnf("Error while scanning concurrent job rows: %v", err)
return nil, fmt.Errorf("failed to scan concurrent job row: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
if id.Valid {
@@ -321,8 +289,8 @@ func (r *JobRepository) FindConcurrentJobs(
rows, err = queryRunning.RunWith(r.stmtCache).Query()
if err != nil {
cclog.Errorf("Error while running concurrent running jobs query: %v", err)
return nil, fmt.Errorf("failed to execute concurrent running jobs query: %w", err)
cclog.Errorf("Error while running query: %v", err)
return nil, err
}
defer rows.Close()
@@ -330,8 +298,8 @@ func (r *JobRepository) FindConcurrentJobs(
var id, jobID, startTime sql.NullInt64
if err := rows.Scan(&id, &jobID, &startTime); err != nil {
cclog.Warnf("Error while scanning running concurrent job rows: %v", err)
return nil, fmt.Errorf("failed to scan running concurrent job row: %w", err)
cclog.Warn("Error while scanning rows")
return nil, err
}
if id.Valid {


@@ -2,45 +2,16 @@
// All rights reserved. This file is part of cc-backend.
// Use of this source code is governed by a MIT-style
// license that can be found in the LICENSE file.
package repository
import (
"sync"
"github.com/ClusterCockpit/cc-lib/v2/schema"
"github.com/ClusterCockpit/cc-lib/schema"
)
// JobHook interface allows external components to hook into job lifecycle events.
// Implementations can perform actions when jobs start or stop, such as tagging,
// logging, notifications, or triggering external workflows.
//
// Example implementation:
//
// type MyJobTagger struct{}
//
// func (t *MyJobTagger) JobStartCallback(job *schema.Job) {
// if job.NumNodes > 100 {
// // Tag large jobs automatically
// }
// }
//
// func (t *MyJobTagger) JobStopCallback(job *schema.Job) {
// if job.State == schema.JobStateFailed {
// // Log or alert on failed jobs
// }
// }
//
// Register hooks during application initialization:
//
// repository.RegisterJobHook(&MyJobTagger{})
type JobHook interface {
// JobStartCallback is invoked when one or more jobs start.
// This is called synchronously, so implementations should be fast.
JobStartCallback(job *schema.Job)
// JobStopCallback is invoked when a job completes.
// This is called synchronously, so implementations should be fast.
JobStopCallback(job *schema.Job)
}
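The JobHook interface above, together with the registration and Call* functions that follow, forms a small synchronous observer pattern. A runnable, self-contained sketch of the same idea; `Job` and `largeJobTagger` are minimal stand-ins (the real code uses schema.Job):

```go
package main

import (
	"fmt"
	"sync"
)

// Job is a stand-in for schema.Job (illustrative only).
type Job struct {
	JobID    int64
	NumNodes int
}

// JobHook mirrors the interface from the repository package.
type JobHook interface {
	JobStartCallback(job *Job)
	JobStopCallback(job *Job)
}

var (
	initOnce sync.Once
	hooks    []JobHook
)

// RegisterJobHook appends a hook; nil hooks are silently ignored.
func RegisterJobHook(hook JobHook) {
	initOnce.Do(func() { hooks = make([]JobHook, 0) })
	if hook != nil {
		hooks = append(hooks, hook)
	}
}

// CallJobStartHooks invokes every registered start callback, synchronously,
// in registration order.
func CallJobStartHooks(jobs []*Job) {
	if hooks == nil {
		return
	}
	for _, job := range jobs {
		for _, hook := range hooks {
			hook.JobStartCallback(job)
		}
	}
}

// largeJobTagger records jobs above a node threshold (hypothetical hook).
type largeJobTagger struct{ tagged []int64 }

func (t *largeJobTagger) JobStartCallback(job *Job) {
	if job.NumNodes > 100 {
		t.tagged = append(t.tagged, job.JobID)
	}
}
func (t *largeJobTagger) JobStopCallback(job *Job) {}

func main() {
	tagger := &largeJobTagger{}
	RegisterJobHook(tagger)
	RegisterJobHook(nil) // ignored
	CallJobStartHooks([]*Job{{JobID: 1, NumNodes: 200}, {JobID: 2, NumNodes: 4}})
	fmt.Println(tagger.tagged)
}
```

Because the callbacks run synchronously on the job-start path, hook implementations should stay fast or dispatch to their own goroutines.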
@@ -49,13 +20,7 @@ var (
hooks []JobHook
)
// RegisterJobHook registers a JobHook to receive job lifecycle callbacks.
// Multiple hooks can be registered and will be called in registration order.
// This function is safe to call multiple times and is typically called during
// application initialization.
//
// Nil hooks are silently ignored to simplify conditional registration.
func RegisterJobHook(hook JobHook) {
func RegisterJobJook(hook JobHook) {
initOnce.Do(func() {
hooks = make([]JobHook, 0)
})
@@ -65,12 +30,6 @@ func RegisterJobHook(hook JobHook) {
}
}
// CallJobStartHooks invokes all registered JobHook.JobStartCallback methods
// for each job in the provided slice. This is called internally by the repository
// when jobs are started (e.g., via StartJob or batch job imports).
//
// Hooks are called synchronously in registration order. If a hook panics,
// the panic will propagate to the caller.
func CallJobStartHooks(jobs []*schema.Job) {
if hooks == nil {
return
@@ -85,12 +44,6 @@ func CallJobStartHooks(jobs []*schema.Job) {
}
}
// CallJobStopHooks invokes all registered JobHook.JobStopCallback methods
// for the provided job. This is called internally by the repository when a
// job completes (e.g., via StopJob or job state updates).
//
// Hooks are called synchronously in registration order. If a hook panics,
// the panic will propagate to the caller.
func CallJobStopHooks(job *schema.Job) {
if hooks == nil {
return

Some files were not shown because too many files have changed in this diff.