ARIA-AGENT

Hacker-Software/ARIA-AGENT

Fork 0

Commit Graph

Author	SHA1	Message	Date
duffyduck	6549fcbce8	feat(brain): Volltext-Suche zusaetzlich zu Semantic — Default ist jetzt Wortlich Stefan wollte ne richtige Suche statt nur "klingt aehnlich". Beide Modi sind jetzt verfuegbar, Default ist Volltext: - 📝 Wortlich (Substring, case-insensitive ueber Title + Content + Category + Tags) — neuer Endpoint /memory/search-text. Full-Scan via Qdrant scroll, k=50. Findet "cessna" exakt im Content. Bei kleiner DB (<1000 Eintraege) unkritisch performant. - 🧠 Semantisch (Embedder + score_threshold 0.30) — bestehender /memory/search Endpoint. Findet konzeptuell verwandte Eintraege. Diagnostic UI: Dropdown neben dem Suchfeld zum Modus-Wechsel. Info-Banner zeigt klar welcher Modus aktiv ist. Warum Wortlich Default: bei kleiner DB liefert Semantic gern False Positives mit Score 0.30-0.45 fuer komplett unverwandte Begriffe (z.B. "cessna" matched "Tageslog fuehren" mit 0.43). Wortlich ist deterministisch und vermeidet das Rauschen. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 16:59:31 +02:00
duffyduck	daf0d44dd7	fix(brain): Memory-Suche filtert jetzt Rauschen — score_threshold + kleineres k Bug: bei kleiner DB (31 Eintraege) lieferte die Suche fuer JEDES Wort fast alles als Treffer zurueck — k=20 Top-N ohne Threshold sorgte dafuer dass auch "banane" zehn vermeintliche Treffer mit Scores 0.09-0.22 (= Rauschen) zurueckgab. Fix: - vector_store.search() bekommt optional score_threshold (an Qdrant durchgereicht, das nimmt's nativ) - /memory/search endpoint hat score_threshold-Query-Param (default 0.30) - Diagnostic schickt k=10 + score_threshold=0.30 statt k=20 ohne Threshold - "Keine Treffer"-Info-Box wenn alle Treffer < Threshold MiniLM-multilingual liefert typischerweise: >0.50 → starker Treffer 0.30-0.50 → relevant 0.20-0.30 → grenzwertig <0.20 → Rauschen Mit score_threshold=0 (oder None) bleibt die alte Top-N-Semantik fuer Aufrufer die Rauschen explizit wollen. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 15:59:05 +02:00
duffyduck	70d1500096	feat(brain): Phase B — Vector-DB-Memory, Conversation-Loop, Skills, Tool-Use OpenClaw (aria-core) ist raus, ARIA laeuft jetzt mit eigenem Agent-Framework im aria-brain Container. Vector-DB-basiertes Gedaechtnis statt Sessions, eigener Conversation-Loop mit Hot+Cold-Memory + Rolling Window, Tool-Use fuer Skills, Memory-Destillat-Pipeline. aria-brain/ (neuer Container) - main.py FastAPI auf 8080, alle Endpoints - agent.py Conversation-Loop mit Tool-Use (skill_create + run_<skill>) - conversation.py Rolling Window, JSONL-Persistenz, Distill-Marker - proxy_client.py httpx-Wrapper zum Claude-Proxy, OpenAI-Format - prompts.py System-Prompt aus Hot+Cold+Skills - migration.py Markdown-Parser fuer brain-import/ → atomare Memories - skills.py Filesystem-Layer fuer /data/skills/<name>/ (Python-only, venv pro Skill, tar.gz Export/Import, Run-Logs) - memory/ Embedder (sentence-transformers, multilingual MiniLM) + VectorStore (Qdrant-Wrapper) docker-compose.yml - aria-core (OpenClaw) raus, openclaw-config Volume raus - aria-brain Service (FastAPI + Memory) - aria-qdrant Service (Vector-DB) mit Bind-Mount aria-data/brain/qdrant/ - Diagnostic teilt jetzt Netzwerk mit Bridge (vorher: aria-core) - Brain bekommt SSH-Mount fuer aria-wohnung + /import fuer brain-import/ bridge/aria_bridge.py - send_to_core → HTTP-Call an aria-brain:8080/chat (statt OpenClaw-WS) - aria-core-spezifische Handler raus: doctor_fix, aria_restart, aria_session_reset, Auto-Compact-Logik, OpenClaw-Handshake - Generischer container_restart-Handler (Whitelist Bridge/Brain/Qdrant) - Side-Channel-Events aus /chat-Response (z.B. skill_created) werden als RVS-Events forwarded - file_list_request / file_delete_request → an Diagnostic forwarded - Tote OpenClaw-Connection-Logik bleibt im Code als Referenz (nicht aktiv) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-11 22:23:17 +02:00

Author

SHA1

Message

Date

duffyduck

6549fcbce8

feat(brain): Volltext-Suche zusaetzlich zu Semantic — Default ist jetzt Wortlich

Stefan wollte ne richtige Suche statt nur "klingt aehnlich". Beide
Modi sind jetzt verfuegbar, Default ist Volltext:

- 📝 Wortlich (Substring, case-insensitive ueber Title + Content +
  Category + Tags) — neuer Endpoint /memory/search-text. Full-Scan
  via Qdrant scroll, k=50. Findet "cessna" exakt im Content. Bei
  kleiner DB (<1000 Eintraege) unkritisch performant.

- 🧠 Semantisch (Embedder + score_threshold 0.30) — bestehender
  /memory/search Endpoint. Findet konzeptuell verwandte Eintraege.

Diagnostic UI: Dropdown neben dem Suchfeld zum Modus-Wechsel.
Info-Banner zeigt klar welcher Modus aktiv ist.

Warum Wortlich Default: bei kleiner DB liefert Semantic gern False
Positives mit Score 0.30-0.45 fuer komplett unverwandte Begriffe
(z.B. "cessna" matched "Tageslog fuehren" mit 0.43). Wortlich ist
deterministisch und vermeidet das Rauschen.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-12 16:59:31 +02:00

duffyduck

daf0d44dd7

fix(brain): Memory-Suche filtert jetzt Rauschen — score_threshold + kleineres k

Bug: bei kleiner DB (31 Eintraege) lieferte die Suche fuer JEDES Wort
fast alles als Treffer zurueck — k=20 Top-N ohne Threshold sorgte
dafuer dass auch "banane" zehn vermeintliche Treffer mit Scores
0.09-0.22 (= Rauschen) zurueckgab.

Fix:
- vector_store.search() bekommt optional score_threshold (an Qdrant
  durchgereicht, das nimmt's nativ)
- /memory/search endpoint hat score_threshold-Query-Param (default 0.30)
- Diagnostic schickt k=10 + score_threshold=0.30 statt k=20 ohne Threshold
- "Keine Treffer"-Info-Box wenn alle Treffer < Threshold

MiniLM-multilingual liefert typischerweise:
  >0.50 → starker Treffer
  0.30-0.50 → relevant
  0.20-0.30 → grenzwertig
  <0.20 → Rauschen

Mit score_threshold=0 (oder None) bleibt die alte Top-N-Semantik
fuer Aufrufer die Rauschen explizit wollen.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-12 15:59:05 +02:00

duffyduck

70d1500096

feat(brain): Phase B — Vector-DB-Memory, Conversation-Loop, Skills, Tool-Use

OpenClaw (aria-core) ist raus, ARIA laeuft jetzt mit eigenem Agent-Framework
im aria-brain Container. Vector-DB-basiertes Gedaechtnis statt Sessions,
eigener Conversation-Loop mit Hot+Cold-Memory + Rolling Window, Tool-Use
fuer Skills, Memory-Destillat-Pipeline.

aria-brain/ (neuer Container)
  - main.py            FastAPI auf 8080, alle Endpoints
  - agent.py           Conversation-Loop mit Tool-Use (skill_create + run_<skill>)
  - conversation.py    Rolling Window, JSONL-Persistenz, Distill-Marker
  - proxy_client.py    httpx-Wrapper zum Claude-Proxy, OpenAI-Format
  - prompts.py         System-Prompt aus Hot+Cold+Skills
  - migration.py       Markdown-Parser fuer brain-import/ → atomare Memories
  - skills.py          Filesystem-Layer fuer /data/skills/<name>/ (Python-only,
                       venv pro Skill, tar.gz Export/Import, Run-Logs)
  - memory/            Embedder (sentence-transformers, multilingual MiniLM)
                       + VectorStore (Qdrant-Wrapper)

docker-compose.yml
  - aria-core (OpenClaw) raus, openclaw-config Volume raus
  - aria-brain Service (FastAPI + Memory)
  - aria-qdrant Service (Vector-DB) mit Bind-Mount aria-data/brain/qdrant/
  - Diagnostic teilt jetzt Netzwerk mit Bridge (vorher: aria-core)
  - Brain bekommt SSH-Mount fuer aria-wohnung + /import fuer brain-import/

bridge/aria_bridge.py
  - send_to_core → HTTP-Call an aria-brain:8080/chat (statt OpenClaw-WS)
  - aria-core-spezifische Handler raus: doctor_fix, aria_restart,
    aria_session_reset, Auto-Compact-Logik, OpenClaw-Handshake
  - Generischer container_restart-Handler (Whitelist Bridge/Brain/Qdrant)
  - Side-Channel-Events aus /chat-Response (z.B. skill_created) werden
    als RVS-Events forwarded
  - file_list_request / file_delete_request → an Diagnostic forwarded
  - Tote OpenClaw-Connection-Logik bleibt im Code als Referenz (nicht aktiv)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-11 22:23:17 +02:00

3 Commits