feat(phase1): offload Whisper STT to the Gamebox

New container aria-whisper-bridge on the Gamebox, running faster-whisper
on CUDA with float16. The container connects to the RVS over WebSocket,
accepts stt_request, runs ffmpeg + Whisper, and replies with
stt_response. It also listens for config broadcasts and hot-swaps the
model when the selection in Diagnostic changes.
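
The request/response and hot-swap flow described above could look roughly like the following. This is a minimal sketch, written in JavaScript only to match rvs/server.js; the actual container may be implemented differently. The message fields (`id`, `audio`, `text`), the `"config"` type name, and the injected helpers `transcribe`, `loadModel`, and `send` are all assumptions for illustration.

```javascript
// Hypothetical message handler for the whisper bridge.
// Field names and injected helpers are assumptions, not taken from the commit.
function createBridgeHandler({ transcribe, loadModel, send, initialModel }) {
  let currentModel = initialModel;

  return async function handleMessage(raw) {
    const msg = JSON.parse(raw);

    if (msg.type === "stt_request") {
      // ffmpeg decoding and Whisper inference are hidden behind `transcribe`.
      const text = await transcribe(msg.audio, currentModel);
      send(JSON.stringify({ type: "stt_response", id: msg.id, text }));
    } else if (msg.type === "config") {
      const wanted = msg.whisperModel;
      // Hot-swap: reload only when the selected model actually changed.
      if (wanted && wanted !== currentModel) {
        await loadModel(wanted);
        currentModel = wanted;
      }
    }
    // All other message types are ignored by this sketch.
  };
}
```

Passing the helpers in as arguments keeps the handler independent of the WebSocket and of the model runtime, so it can be exercised with stubs.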

aria-bridge now calls the Gamebox first; only if it does not answer
within 45s does it fall back to local Whisper (CPU). The local model is
loaded lazily, which saves RAM on the VM.
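
The timeout-then-fallback behaviour with lazy loading can be sketched as below. This is an illustration under assumptions, not the code from this commit: `remoteStt`, `loadLocalModel`, and the `transcribe` method on the local model are hypothetical names, and the sketch treats any rejection from the remote call like a timeout, which the commit message does not state.

```javascript
// Resolve with `promise`, or reject once `ms` milliseconds have passed.
function withTimeout(promise, ms) {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(() => reject(new Error("stt timeout")), ms);
  });
  // Always clear the timer so it cannot keep the process alive.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

// Hypothetical factory: both helpers are injected by the caller.
function createTranscriber({ remoteStt, loadLocalModel, timeoutMs = 45_000 }) {
  let localModelPromise = null; // nothing is loaded until the first fallback

  return async function transcribe(audio) {
    try {
      return await withTimeout(remoteStt(audio), timeoutMs);
    } catch (err) {
      // Assumption: every failure of the remote path triggers the fallback.
      // Storing the promise lets concurrent callers share a single load.
      if (!localModelPromise) localModelPromise = loadLocalModel();
      const model = await localModelPromise;
      return model.transcribe(audio);
    }
  };
}
```

As long as the Gamebox answers in time, `loadLocalModel` is never called, so the VM pays the memory cost of the CPU model only after the first fallback.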

RVS: added stt_request/stt_response to the ALLOWED_TYPES list.
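
For context, an allow-list like this is typically consulted before a message is relayed. The following is a minimal sketch of that idea, not the actual relay code: the set shown is only a subset of the real one, and `parseAllowed` is a hypothetical helper.

```javascript
// Subset of the allow-list; the real set in rvs/server.js holds more entries.
const ALLOWED_TYPES = new Set([
  "audio_pcm",
  "stt_request", "stt_response",
]);

// Hypothetical helper: returns the parsed message if its type is allowed,
// otherwise null (also for malformed JSON or non-object payloads).
function parseAllowed(raw) {
  let msg;
  try {
    msg = JSON.parse(raw);
  } catch {
    return null;
  }
  if (msg === null || typeof msg !== "object") return null;
  return ALLOWED_TYPES.has(msg.type) ? msg : null;
}
```

Without the two new entries, a relay that filters this way would drop both the request from aria-bridge and the response from the Gamebox.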

The Diagnostic voice config (whisperModel field) stays unchanged;
the selection is passed through to the Gamebox.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

duffyduck

2026-04-24 13:42:07 +02:00

parent 97a1a3089a

commit e544992c9f

6 changed files with 422 additions and 43 deletions

									
rvs/server.js (+1)
@@ -20,6 +20,7 @@ const ALLOWED_TYPES = new Set([
   "audio_pcm",
   "xtts_delete_voice",
   "voice_preload", "voice_ready",
+  "stt_request", "stt_response",
 ]);

 // Token room: token -> { clients: Set<ws> }