ARIA-AGENT

Hacker-Software/ARIA-AGENT

Fork 0

Commit Graph

Author	SHA1	Message	Date
duffyduck	b2f7d6dda2	feat(brain+diagnostic): Token/Call-Metrics mit Subscription-Plan-Tracking Stefan hat den Max 5x Plan (~\$90-100/Monat), ungefaehres Limit 225 Calls pro 5h-Fenster fuer Sonnet. Damit nicht in eine Tool-Loop-Schleife laufen ohne es zu merken: kleine Metrics-Pipeline, sichtbar in der Diagnostic. aria-brain/metrics.py Append-only JSONL Logger unter /data/metrics.jsonl. Pro Claude-Call eine Zeile {ts, model, in, out} mit Token-Schaetzung (chars/4, Anthropic- Heuristik). aggregate(window) zaehlt die letzten N Sekunden. Auto-Rotate bei 50k Zeilen → 25k behalten (~70 KB/Monat bei 1k Calls/Tag, Cap also weit oben). aria-brain/proxy_client.py chat_full() ruft am Ende metrics.log_call(model, messages_in, reply). Failed/exception-Pfade loggen nicht (sonst false positives). aria-brain/main.py GET /metrics/calls → {h1, h5, h24, d30}, jedes Window mit calls, tokens_in, tokens_out, by_model. diagnostic/index.html Neue Card "Token / Calls" im Gehirn-Tab. Plan-Dropdown (Pro / Max 5x / Max 20x / Custom), localStorage-persistiert. 4 Metric- Zellen fuer 1h/5h/24h/30d mit Calls + Tokens. Progress-Bar oben zeigt 5h-Counter gegen Plan-Limit. Warn-Klassen: gelb bei 80%, rot bei 90%. Auto-Refresh alle 30s wenn Gehirn-Tab offen, plus bei Tab-Wechsel. Info-Modal erklaert die Limits + dass HTTP-Call != User-Frage (Tool-Use kann pro Frage bis zu 8 Calls verursachen). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-11 23:43:56 +02:00
duffyduck	aa077f60e6	fix(diagnostic+brain): Sprachmodell-Einstellung auf runtime.json umgestellt War kaputt nach OpenClaw-Abriss: handleGetModel/handleSetModel haben gegen aria-core (dockerExec + node-script in den Container) gearbeitet, der gibt's nicht mehr. diagnostic/server.js - handleGetModel/handleSetModel lesen/schreiben jetzt brainModel in /shared/config/runtime.json - RUNTIME_CONFIG_FIELDS um "brainModel" erweitert - Tote Variante (findSettingsFile + base64-node-script) komplett raus aria-brain/proxy_client.py - Liest brainModel aus runtime.json beim Container-Start - Fallback: BRAIN_MODEL env → "claude-sonnet-4" Default - Bei Aenderung in Diagnostic: aria-brain restarten damit's greift (Hinweis steht in der UI) diagnostic/index.html - Section "Model" → "Sprachmodell (Brain)" - Hinweis-Block mit Default-Erklaerung und Restart-Hinweis - Modelle: claude-sonnet-4 (default), claude-opus-4, claude-haiku-4-5 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-11 22:36:14 +02:00
duffyduck	70d1500096	feat(brain): Phase B — Vector-DB-Memory, Conversation-Loop, Skills, Tool-Use OpenClaw (aria-core) ist raus, ARIA laeuft jetzt mit eigenem Agent-Framework im aria-brain Container. Vector-DB-basiertes Gedaechtnis statt Sessions, eigener Conversation-Loop mit Hot+Cold-Memory + Rolling Window, Tool-Use fuer Skills, Memory-Destillat-Pipeline. aria-brain/ (neuer Container) - main.py FastAPI auf 8080, alle Endpoints - agent.py Conversation-Loop mit Tool-Use (skill_create + run_<skill>) - conversation.py Rolling Window, JSONL-Persistenz, Distill-Marker - proxy_client.py httpx-Wrapper zum Claude-Proxy, OpenAI-Format - prompts.py System-Prompt aus Hot+Cold+Skills - migration.py Markdown-Parser fuer brain-import/ → atomare Memories - skills.py Filesystem-Layer fuer /data/skills/<name>/ (Python-only, venv pro Skill, tar.gz Export/Import, Run-Logs) - memory/ Embedder (sentence-transformers, multilingual MiniLM) + VectorStore (Qdrant-Wrapper) docker-compose.yml - aria-core (OpenClaw) raus, openclaw-config Volume raus - aria-brain Service (FastAPI + Memory) - aria-qdrant Service (Vector-DB) mit Bind-Mount aria-data/brain/qdrant/ - Diagnostic teilt jetzt Netzwerk mit Bridge (vorher: aria-core) - Brain bekommt SSH-Mount fuer aria-wohnung + /import fuer brain-import/ bridge/aria_bridge.py - send_to_core → HTTP-Call an aria-brain:8080/chat (statt OpenClaw-WS) - aria-core-spezifische Handler raus: doctor_fix, aria_restart, aria_session_reset, Auto-Compact-Logik, OpenClaw-Handshake - Generischer container_restart-Handler (Whitelist Bridge/Brain/Qdrant) - Side-Channel-Events aus /chat-Response (z.B. skill_created) werden als RVS-Events forwarded - file_list_request / file_delete_request → an Diagnostic forwarded - Tote OpenClaw-Connection-Logik bleibt im Code als Referenz (nicht aktiv) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-11 22:23:17 +02:00

Author

SHA1

Message

Date

duffyduck

b2f7d6dda2

feat(brain+diagnostic): Token/Call-Metrics mit Subscription-Plan-Tracking

Stefan hat den Max 5x Plan (~\$90-100/Monat), ungefaehres Limit 225 Calls pro
5h-Fenster fuer Sonnet. Damit nicht in eine Tool-Loop-Schleife laufen ohne
es zu merken: kleine Metrics-Pipeline, sichtbar in der Diagnostic.

aria-brain/metrics.py
  Append-only JSONL Logger unter /data/metrics.jsonl. Pro Claude-Call eine
  Zeile {ts, model, in, out} mit Token-Schaetzung (chars/4, Anthropic-
  Heuristik). aggregate(window) zaehlt die letzten N Sekunden.
  Auto-Rotate bei 50k Zeilen → 25k behalten (~70 KB/Monat bei 1k Calls/Tag,
  Cap also weit oben).

aria-brain/proxy_client.py
  chat_full() ruft am Ende metrics.log_call(model, messages_in, reply).
  Failed/exception-Pfade loggen nicht (sonst false positives).

aria-brain/main.py
  GET /metrics/calls → {h1, h5, h24, d30}, jedes Window mit calls,
  tokens_in, tokens_out, by_model.

diagnostic/index.html
  Neue Card "Token / Calls" im Gehirn-Tab. Plan-Dropdown
  (Pro / Max 5x / Max 20x / Custom), localStorage-persistiert. 4 Metric-
  Zellen fuer 1h/5h/24h/30d mit Calls + Tokens. Progress-Bar oben zeigt
  5h-Counter gegen Plan-Limit. Warn-Klassen: gelb bei 80%, rot bei 90%.
  Auto-Refresh alle 30s wenn Gehirn-Tab offen, plus bei Tab-Wechsel.
  Info-Modal erklaert die Limits + dass HTTP-Call != User-Frage (Tool-Use
  kann pro Frage bis zu 8 Calls verursachen).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-11 23:43:56 +02:00

duffyduck

aa077f60e6

fix(diagnostic+brain): Sprachmodell-Einstellung auf runtime.json umgestellt

War kaputt nach OpenClaw-Abriss: handleGetModel/handleSetModel haben gegen
aria-core (dockerExec + node-script in den Container) gearbeitet, der gibt's
nicht mehr.

diagnostic/server.js
  - handleGetModel/handleSetModel lesen/schreiben jetzt brainModel in
    /shared/config/runtime.json
  - RUNTIME_CONFIG_FIELDS um "brainModel" erweitert
  - Tote Variante (findSettingsFile + base64-node-script) komplett raus

aria-brain/proxy_client.py
  - Liest brainModel aus runtime.json beim Container-Start
  - Fallback: BRAIN_MODEL env → "claude-sonnet-4" Default
  - Bei Aenderung in Diagnostic: aria-brain restarten damit's greift
    (Hinweis steht in der UI)

diagnostic/index.html
  - Section "Model" → "Sprachmodell (Brain)"
  - Hinweis-Block mit Default-Erklaerung und Restart-Hinweis
  - Modelle: claude-sonnet-4 (default), claude-opus-4, claude-haiku-4-5

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-11 22:36:14 +02:00

duffyduck

70d1500096

feat(brain): Phase B — Vector-DB-Memory, Conversation-Loop, Skills, Tool-Use

OpenClaw (aria-core) ist raus, ARIA laeuft jetzt mit eigenem Agent-Framework
im aria-brain Container. Vector-DB-basiertes Gedaechtnis statt Sessions,
eigener Conversation-Loop mit Hot+Cold-Memory + Rolling Window, Tool-Use
fuer Skills, Memory-Destillat-Pipeline.

aria-brain/ (neuer Container)
  - main.py            FastAPI auf 8080, alle Endpoints
  - agent.py           Conversation-Loop mit Tool-Use (skill_create + run_<skill>)
  - conversation.py    Rolling Window, JSONL-Persistenz, Distill-Marker
  - proxy_client.py    httpx-Wrapper zum Claude-Proxy, OpenAI-Format
  - prompts.py         System-Prompt aus Hot+Cold+Skills
  - migration.py       Markdown-Parser fuer brain-import/ → atomare Memories
  - skills.py          Filesystem-Layer fuer /data/skills/<name>/ (Python-only,
                       venv pro Skill, tar.gz Export/Import, Run-Logs)
  - memory/            Embedder (sentence-transformers, multilingual MiniLM)
                       + VectorStore (Qdrant-Wrapper)

docker-compose.yml
  - aria-core (OpenClaw) raus, openclaw-config Volume raus
  - aria-brain Service (FastAPI + Memory)
  - aria-qdrant Service (Vector-DB) mit Bind-Mount aria-data/brain/qdrant/
  - Diagnostic teilt jetzt Netzwerk mit Bridge (vorher: aria-core)
  - Brain bekommt SSH-Mount fuer aria-wohnung + /import fuer brain-import/

bridge/aria_bridge.py
  - send_to_core → HTTP-Call an aria-brain:8080/chat (statt OpenClaw-WS)
  - aria-core-spezifische Handler raus: doctor_fix, aria_restart,
    aria_session_reset, Auto-Compact-Logik, OpenClaw-Handshake
  - Generischer container_restart-Handler (Whitelist Bridge/Brain/Qdrant)
  - Side-Channel-Events aus /chat-Response (z.B. skill_created) werden
    als RVS-Events forwarded
  - file_list_request / file_delete_request → an Diagnostic forwarded
  - Tote OpenClaw-Connection-Logik bleibt im Code als Referenz (nicht aktiv)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-11 22:23:17 +02:00

3 Commits