Initial commit: SIC harness (backend, web, pi-adapter, configs, docs)
- pnpm monorepo: apps/api (Fastify + SQLite + SSE), apps/web (React+Vite), packages/shared, packages/pi-adapter - Local auth (admin/webhook-runner roles) + Keycloak JWT ready - Multi-session chat with reliable history (user persisted before LLM, assistant persisted after stream) - Markdown knowledge base with /api/docs/search + /api/docs/:id - YAML webhook catalog with backend-only execution, retry/backoff, audit (webhook_runs), and per-user rate limit - Skills config (sre-on-call, blameless-postmortem, security-incident) injected into LLM system prompt - LLM provider failover chain (config/models.yml fallback + LLM_FALLBACK_CHAIN override) - Context-aware webhooks panel + backend id-mention safety net - Per-message stats (time/duration/tokens/model), Markdown+GFM render, code & table copy/download buttons - Vitest suite, end-to-end smoke test (scripts/smoke.mjs), per-session system prompt override - /metrics Prometheus endpoint + /api/metrics JSON, request-id correlation - dotenv with explicit repo-root path; envString/envNumber helpers (handles empty-string env) - Runbooks + SOPs under knowledge/ in English; README, docs, and INDEX.md in English
This commit is contained in:
29
config/models.yml
Normal file
29
config/models.yml
Normal file
@@ -0,0 +1,29 @@
|
||||
models:
|
||||
- id: fast
|
||||
label: MiniMax Fast
|
||||
provider: openai-compatible
|
||||
base_url: ${LLM_BASE_URL}
|
||||
model: MiniMax-M2.7-highspeed
|
||||
max_tokens: 1024
|
||||
- id: balanced
|
||||
label: MiniMax Balanced
|
||||
provider: openai-compatible
|
||||
base_url: ${LLM_BASE_URL}
|
||||
model: MiniMax-M2.7
|
||||
max_tokens: 2048
|
||||
# If MiniMax fails (5xx, 429, timeout, network) fall through to mr-auto.
|
||||
fallback:
|
||||
- mr-auto
|
||||
- id: reasoning
|
||||
label: MiniMax Reasoning
|
||||
provider: openai-compatible
|
||||
base_url: ${LLM_BASE_URL}
|
||||
model: MiniMax-M3
|
||||
max_tokens: 4096
|
||||
- id: mr-auto
|
||||
label: MR Auto (llm.rikrdo.com)
|
||||
provider: openai-compatible
|
||||
base_url: https://llm.rikrdo.com/v1
|
||||
api_key_env: MR_AUTO_API_KEY
|
||||
model: mr-auto
|
||||
max_tokens: 8192
|
||||
Reference in New Issue
Block a user