mirror of
https://github.com/ClusterCockpit/cc-metric-store.git
synced 2024-11-14 23:17:25 +01:00
2.1 KiB
2.1 KiB
Possible Tasks and Improvements
Importance:
-
I Important
-
N Nice to have
-
W Won't do. Probably not necessary.
-
Benchmarking
- Benchmark and compare common timeseries DBs with our data and our queries (N)
-
Web interface
- Provide simple http endpoint with a status and debug view (Start with Basic Authentication)
-
Configuration
- Consolidate configuration with cc-backend, remove redundant information
- Support to receive configuration via NATS channel
-
Memory management
- To overcome garbage collection overhead: Reimplement in Rust (N)
- Request memory directly batchwise via mmap (started in branch) (W)
-
Archive
- S3 backend for archive (I)
- Store information in each buffer if already archived (N)
- Do not create new checkpoint if all buffers already archived (N)
-
Checkpoints
- S3 backend for checkpoints (I)
- Combine checkpoints into larger files (I)
- Binary checkpoints (started in branch) (W)
-
API
- Redesign query interface (N)
- Provide an endpoint for node health based on received metric data (I)
- Introduce JWT authentication for REST and NATS (I)
-
Testing
- General tests (I)
- Test data generator for regression tests (I)
- Check for corner cases that should fail gracefully (N)
- Write a more realistic
ToArchive
/FromArchive
Tests (N)
-
Aggregation
- Calculate averages buffer-wise as soon as full, average weighted by length of buffer (N)
- Only the head-buffer needs to be fully traversed (N)
- If aggregating over hwthreads/cores/sockets cache those results and reuse some of that for new queries aggregating only over the newer data (W)
-
Core functionality
- Implement a health checker component that provides information to the web interface and REST API (I)
- Support units for metrics including to request unit conversions (I)
-
Compression
- Enable compression for http API requests (N)
- Enable compression for checkpoints/archive (I)
-
Sampling
- Support data re sampling to reduce data points (I)
- Use re sampling algorithms that preserve min/max as far as possible (I)