- pnpm monorepo: apps/api (Fastify + SQLite + SSE), apps/web (React+Vite), packages/shared, packages/pi-adapter - Local auth (admin/webhook-runner roles) + Keycloak JWT ready - Multi-session chat with reliable history (user persisted before LLM, assistant persisted after stream) - Markdown knowledge base with /api/docs/search + /api/docs/:id - YAML webhook catalog with backend-only execution, retry/backoff, audit (webhook_runs), and per-user rate limit - Skills config (sre-on-call, blameless-postmortem, security-incident) injected into LLM system prompt - LLM provider failover chain (config/models.yml fallback + LLM_FALLBACK_CHAIN override) - Context-aware webhooks panel + backend id-mention safety net - Per-message stats (time/duration/tokens/model), Markdown+GFM render, code & table copy/download buttons - Vitest suite, end-to-end smoke test (scripts/smoke.mjs), per-session system prompt override - /metrics Prometheus endpoint + /api/metrics JSON, request-id correlation - dotenv with explicit repo-root path; envString/envNumber helpers (handles empty-string env) - Runbooks + SOPs under knowledge/ in English; README, docs, and INDEX.md in English
26 lines
538 B
Markdown
26 lines
538 B
Markdown
---
|
|
title: Disk Cleanup Runbook
|
|
tags: [disk, cleanup, storage, operations]
|
|
owner: sre
|
|
updated: 2026-06-12
|
|
---
|
|
|
|
# Disk Cleanup Runbook
|
|
|
|
## When to use it
|
|
|
|
- `disk usage > 85%` alert on /tmp or /var.
|
|
- Job failures with `No space left on device`.
|
|
- Before scheduled node maintenance.
|
|
|
|
## Procedure
|
|
|
|
1. List candidate files: `find /tmp -type f -mtime +7`.
|
|
2. Confirm none are in use by an active process.
|
|
3. Run the `disk-cleanup` webhook to remove /tmp files older than 7 days.
|
|
4. Re-verify disk usage.
|
|
|
|
## Related webhooks
|
|
|
|
- disk-cleanup
|