first commit

Stefan Hacker 2026-03-04 21:55:49 +01:00
commit bb7c1d5c3f
12 changed files with 2247 additions and 0 deletions

README.md (new file, +245 lines)
# Proxmox Cluster Network Changer
Migrates a complete Proxmox cluster (including Ceph) from one network to another.
**Problem:** When you change the IPs of a Proxmox cluster, the cluster loses quorum and `/etc/pve` becomes read-only, so neither Corosync nor Ceph can be configured through the cluster filesystem. This tool solves the problem with a coordinated migration of all nodes.
## Features
- Automatic discovery of all nodes, IPs, and configurations
- Coordinated migration of all nodes in a single pass
- Ceph support (public network, cluster network, MON addresses)
- Works even with **broken quorum** (e.g. when one node was already changed manually)
- Automatic backups of all configurations before the migration
- Dry-run mode for risk-free testing
- Post-migration verification
## Prerequisites
- Python 3.9+ (installed by default on Proxmox)
- Root access on the node running the tool
- Key-based SSH access to all other cluster nodes
- No external Python packages required (stdlib only)
## Installation
```bash
# Copy onto a Proxmox node
scp -r proxmox-cluster-network-changer/ root@pve1:/root/
# Or clone directly
cd /root
git clone <repo-url> proxmox-cluster-network-changer
```
## Usage
### Show the current configuration (discovery)
```bash
python3 main.py --discover
```
Displays:
- All cluster nodes with their IPs
- The Corosync configuration
- Ceph networks and MON hosts
- Quorum status
- Which nodes are reachable
### Dry run (nothing is changed)
```bash
python3 main.py --dry-run
```
Walks through the complete process and shows all planned changes, but writes nothing.
### Run the migration
```bash
python3 main.py
```
The tool guides you interactively through the process:
```
=== Phase 1: Discovery ===
[Corosync]
Cluster: mycluster
Nodes gefunden: 4
- pve1 (ID: 1) -> 192.168.0.101
- pve2 (ID: 2) -> 192.168.0.102
- pve3 (ID: 3) -> 192.168.0.103
- pve4 (ID: 4) -> 192.168.0.104
[Ceph]
Public Network: 192.168.0.0/24
Cluster Network: 192.168.0.0/24
=== Phase 2: Migration planen ===
Neues Netzwerk (z.B. 172.0.2.0/16): 172.0.2.0/16
Neues Gateway [172.0.0.1]: 172.0.2.1
[IP-Mapping]
pve1: 192.168.0.101 -> [172.0.2.101]:
pve2: 192.168.0.102 -> [172.0.2.102]:
pve3: 192.168.0.103 -> [172.0.2.103]:
pve4: 192.168.0.104 -> [172.0.2.104]:
Migration durchführen? [j/N]: j
```
### Options
| Option | Description |
|---|---|
| `--dry-run` | Show only, change nothing |
| `--discover` | Only display the current config |
| `--rescue` | Rescue mode: set up an emergency network |
| `--rescue-commands SUBNET` | Only print the rescue commands (e.g. `10.99.99.0/24`) |
| `--ssh-key PATH` | Path to the SSH key (default: default key) |
| `--ssh-port PORT` | SSH port (default: 22) |
## What gets changed?
| File | Where | What |
|---|---|---|
| `/etc/network/interfaces` | every node | bridge IP, gateway |
| `/etc/hosts` | every node | hostname-to-IP mapping |
| `/etc/corosync/corosync.conf` | every node | Corosync ring addresses |
| `/etc/pve/ceph.conf` | cluster FS | public_network, cluster_network, MON addresses |
## Migration sequence (phase 4)
1. The new configurations are distributed to all nodes (staging)
2. Corosync is stopped on all nodes
3. pve-cluster (pmxcfs) is stopped
4. The Corosync config is written directly (`/etc/corosync/corosync.conf`)
5. `/etc/hosts` is updated
6. `/etc/network/interfaces` is updated, followed by a network reload (`ifreload -a`)
7. Services are started, quorum is awaited, Ceph is updated
## Rescue network (emergency mode)
**Scenario:** PVE01 already has a new IP while PVE02-04 are still on the old network. No node can reach the others.
### Quick: only print the commands
```bash
python3 main.py --rescue-commands 10.99.99.0/24
```
Output:
```
RESCUE BEFEHLE
Subnetz: 10.99.99.0/24 | Bridge: vmbr0
pve1 (192.168.0.101):
ip addr add 10.99.99.1/24 dev vmbr0
pve2 (192.168.0.102):
ip addr add 10.99.99.2/24 dev vmbr0
pve3 (192.168.0.103):
ip addr add 10.99.99.3/24 dev vmbr0
pve4 (192.168.0.104):
ip addr add 10.99.99.4/24 dev vmbr0
Zum Entfernen:
ip addr del 10.99.99.1/24 dev vmbr0 # pve1
ip addr del 10.99.99.2/24 dev vmbr0 # pve2
ip addr del 10.99.99.3/24 dev vmbr0 # pve3
ip addr del 10.99.99.4/24 dev vmbr0 # pve4
```
Run these commands on each node via the IPMI/iLO/iDRAC/KVM console.
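The per-node addresses in the output above follow a simple scheme: the i-th node in the cluster list gets the i-th usable host address of the chosen subnet. A minimal standalone sketch of that derivation (`rescue_ips` is an illustrative helper, not the tool's actual API):

```python
import ipaddress

def rescue_ips(subnet: str, node_names: list[str], bridge: str = "vmbr0") -> dict[str, str]:
    """Assign the i-th usable host address of `subnet` to the i-th node."""
    net = ipaddress.ip_network(subnet, strict=False)
    hosts = net.hosts()  # iterator over usable host addresses (.1, .2, ...)
    return {
        name: f"ip addr add {next(hosts)}/{net.prefixlen} dev {bridge}"
        for name in node_names
    }

print(rescue_ips("10.99.99.0/24", ["pve1", "pve2"]))
```

For a `/24`, `net.hosts()` starts at `.1`, which matches the command list shown above.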
### Interactive: rescue + migration
```bash
python3 main.py --rescue
```
or simply start the tool; if nodes are unreachable, it asks automatically:
```bash
python3 main.py
```
```
3 Node(s) nicht erreichbar.
Rescue-Netzwerk einrichten? [J/n]: j
```
Sequence:
1. You provide a free subnet (e.g. `10.99.99.0/24`)
2. The tool prints the `ip addr add` command for each node
3. On the local node the IP is set automatically
4. You run the commands on the other nodes via their consoles
5. The tool tests connectivity and reads the configs
6. The normal migration then proceeds
7. At the end the emergency IPs are removed automatically
### When do I need this?
- One or more nodes already received a new IP manually
- The nodes sit in different subnets
- SSH between the nodes no longer works
- But you still have console access (IPMI/iLO/iDRAC/KVM)
## Broken quorum
If a node was already changed manually and quorum is lost:
- The tool detects this state automatically during the discovery phase
- Unreachable nodes are looked up by hostname
- Configs are written directly (not via `/etc/pve/`)
- After the network reload, `pvecm expected 1` is used to force quorum
- Ceph is then updated through the cluster filesystem
## Backups
Backups are created automatically before the migration:
```
/root/network-migration-backup-20260304_143022/
├── etc_network_interfaces
├── etc_hosts
├── etc_corosync_corosync.conf
├── etc_ceph_ceph.conf
├── etc_pve_corosync.conf
└── etc_pve_ceph.conf
```
### Restore (manual)
```bash
# Example: restore the network config
cp /root/network-migration-backup-*/etc_network_interfaces /etc/network/interfaces
ifreload -a
# Restore Corosync
cp /root/network-migration-backup-*/etc_corosync_corosync.conf /etc/corosync/corosync.conf
systemctl restart corosync
```
## Recommended order when troubleshooting
1. `pvecm status`: check the cluster status
2. `pvecm expected 1`: force quorum (emergency only)
3. `ceph -s`: check the Ceph status
4. `ceph -w`: watch the Ceph recovery
5. `journalctl -u corosync`: check the Corosync logs
6. `journalctl -u pve-cluster`: check the pmxcfs logs
## Notes
- The tool must be run as **root**
- SSH keys must be set up between the nodes **beforehand** (the default in Proxmox clusters)
- VMs/CTs are **not** migrated or stopped automatically; the network is changed during live operation
- After the migration, check the VM networks (bridges in VM configs) in case they reference specific IPs
- Tested with Proxmox VE 7.x and 8.x
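The IP mapping suggested in the phase-2 session shown above keeps each node's last octet and moves it into the new network (192.168.0.101 becomes 172.0.2.101). A minimal sketch of that last-octet scheme, assuming it is indeed how the defaults are derived; `suggest_ip` is a hypothetical helper, not the tool's actual function:

```python
def suggest_ip(old_ip: str, new_network: str) -> str:
    """Keep the last octet of old_ip and graft it onto the new network prefix,
    e.g. 192.168.0.101 + 172.0.2.0/16 -> 172.0.2.101 (last-octet scheme)."""
    last_octet = old_ip.rsplit(".", 1)[1]
    base = new_network.split("/")[0].rsplit(".", 1)[0]  # "172.0.2.0/16" -> "172.0.2"
    return f"{base}.{last_octet}"

print(suggest_ip("192.168.0.101", "172.0.2.0/16"))  # -> 172.0.2.101
```

This only works cleanly when no two nodes share a last octet; the interactive prompt lets you override each suggestion anyway.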

backup.py (new file, +86 lines)
"""Phase 3: Backup all configuration files before migration."""
import datetime
from models import MigrationPlan
from ssh_manager import SSHManager
BACKUP_FILES = [
"/etc/network/interfaces",
"/etc/hosts",
"/etc/corosync/corosync.conf",
"/etc/ceph/ceph.conf",
]
CLUSTER_BACKUP_FILES = [
"/etc/pve/corosync.conf",
"/etc/pve/ceph.conf",
]
class Backup:
"""Creates backups of all config files on each node."""
def __init__(self, ssh: SSHManager):
self.ssh = ssh
def run(self, plan: MigrationPlan) -> bool:
"""Create backups on all reachable nodes.
Returns True if all backups succeeded.
"""
print("\n=== Phase 3: Backup ===\n")
timestamp = datetime.datetime.now().strftime("%Y%m%d_%H%M%S")
backup_dir = f"/root/network-migration-backup-{timestamp}"
all_ok = True
for node in plan.nodes:
if not node.is_reachable:
print(f" [{node.name}] Übersprungen (nicht erreichbar)")
continue
print(f" [{node.name}] Erstelle Backup in {backup_dir}/")
# Create backup directory
rc, _, err = self.ssh.run_on_node(
node.ssh_host, f"mkdir -p {backup_dir}", node.is_local
)
if rc != 0:
print(f" [!] Fehler beim Erstellen des Backup-Verzeichnisses: {err}")
all_ok = False
continue
# Backup per-node files
for filepath in BACKUP_FILES:
filename = filepath.replace("/", "_").lstrip("_")
rc, _, _ = self.ssh.run_on_node(
node.ssh_host,
f"cp {filepath} {backup_dir}/{filename} 2>/dev/null",
node.is_local,
)
if rc == 0:
print(f" OK: {filepath}")
else:
print(f" --: {filepath} (nicht vorhanden)")
# Backup cluster files (only from local node since they're shared)
if node.is_local:
for filepath in CLUSTER_BACKUP_FILES:
filename = filepath.replace("/", "_").lstrip("_")
rc, _, _ = self.ssh.run_on_node(
node.ssh_host,
f"cp {filepath} {backup_dir}/{filename} 2>/dev/null",
node.is_local,
)
if rc == 0:
print(f" OK: {filepath} (cluster)")
else:
print(f" --: {filepath} (nicht vorhanden)")
if all_ok:
print(f"\n Backup erfolgreich in {backup_dir}/")
else:
print("\n [!] Einige Backups sind fehlgeschlagen!")
return all_ok
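The backup file names listed in the README tree come from flattening each path: slashes become underscores and the leading underscore is stripped. A quick standalone check of that transformation (same expression as in `Backup.run()`):

```python
def backup_name(filepath: str) -> str:
    # "/" -> "_", then drop the leading "_": "/etc/hosts" -> "etc_hosts"
    return filepath.replace("/", "_").lstrip("_")

for f in ["/etc/network/interfaces", "/etc/hosts", "/etc/corosync/corosync.conf"]:
    print(backup_name(f))
```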

config_parser.py (new file, +271 lines)
"""Parsers for Proxmox configuration files (Corosync, Ceph, /etc/network/interfaces)."""
import re
from models import (
CorosyncConfig, CorosyncNode, CephConfig, NetworkInterface,
)
def parse_corosync_conf(content: str) -> CorosyncConfig:
"""Parse corosync.conf and extract node information."""
config = CorosyncConfig(raw_content=content)
# Extract config_version
m = re.search(r'config_version:\s*(\d+)', content)
if m:
config.config_version = int(m.group(1))
# Extract cluster_name
m = re.search(r'cluster_name:\s*(\S+)', content)
if m:
config.cluster_name = m.group(1)
# Extract transport
m = re.search(r'transport:\s*(\S+)', content)
if m:
config.transport = m.group(1)
# Extract nodes from nodelist section
nodelist_match = re.search(r'nodelist\s*\{(.*?)\n\}', content, re.DOTALL)
if nodelist_match:
nodelist_content = nodelist_match.group(1)
# Find all node blocks
node_blocks = re.findall(r'node\s*\{(.*?)\}', nodelist_content, re.DOTALL)
for block in node_blocks:
node = CorosyncNode(nodeid=0, name="", ring0_addr="")
m = re.search(r'nodeid:\s*(\d+)', block)
if m:
node.nodeid = int(m.group(1))
m = re.search(r'name:\s*(\S+)', block)
if m:
node.name = m.group(1)
m = re.search(r'ring0_addr:\s*(\S+)', block)
if m:
node.ring0_addr = m.group(1)
m = re.search(r'ring1_addr:\s*(\S+)', block)
if m:
node.ring1_addr = m.group(1)
config.nodes.append(node)
return config
def generate_corosync_conf(config: CorosyncConfig, ip_mapping: dict[str, str]) -> str:
"""Generate new corosync.conf with updated IP addresses.
ip_mapping: old_ip -> new_ip
"""
new_content = config.raw_content
for old_ip, new_ip in ip_mapping.items():
new_content = new_content.replace(old_ip, new_ip)
# Increment config_version
m = re.search(r'config_version:\s*(\d+)', new_content)
if m:
old_version = int(m.group(1))
new_content = new_content.replace(
f'config_version: {old_version}',
f'config_version: {old_version + 1}'
)
return new_content
def parse_ceph_conf(content: str) -> CephConfig:
"""Parse ceph.conf (INI-like format)."""
config = CephConfig(raw_content=content)
# Extract fsid
m = re.search(r'fsid\s*=\s*(\S+)', content)
if m:
config.fsid = m.group(1)
# Extract public_network
m = re.search(r'public.network\s*=\s*(\S+)', content)
if m:
config.public_network = m.group(1)
# Extract cluster_network
m = re.search(r'cluster.network\s*=\s*(\S+)', content)
if m:
config.cluster_network = m.group(1)
# Extract mon_host
m = re.search(r'mon.host\s*=\s*(.+)', content)
if m:
hosts_str = m.group(1).strip()
config.mon_hosts = [h.strip() for h in hosts_str.split(',') if h.strip()]
# Extract [mon.X] sections
mon_sections = re.findall(
r'\[(mon\.[\w.-]+)\]\s*\n((?:\s+\w.*\n)*)', content
)
for section_name, section_body in mon_sections:
props = {}
for line in section_body.strip().split('\n'):
line = line.strip()
if '=' in line:
key, val = line.split('=', 1)
props[key.strip()] = val.strip()
config.mon_sections[section_name] = props
return config
def generate_ceph_conf(config: CephConfig, ip_mapping: dict[str, str],
new_public_network: str, new_cluster_network: str) -> str:
"""Generate new ceph.conf with updated IPs and networks."""
new_content = config.raw_content
# Replace network definitions
if config.public_network:
new_content = new_content.replace(
config.public_network, new_public_network, 1
)
if config.cluster_network:
new_content = new_content.replace(
config.cluster_network, new_cluster_network, 1
)
# Replace all IPs in the config
for old_ip, new_ip in ip_mapping.items():
new_content = new_content.replace(old_ip, new_ip)
return new_content
def parse_network_interfaces(content: str) -> list[NetworkInterface]:
"""Parse /etc/network/interfaces and extract interface configs."""
interfaces = []
current_iface = None
current_lines = []
for line in content.split('\n'):
stripped = line.strip()
# New iface block
m = re.match(r'iface\s+(\S+)\s+inet\s+(\S+)', stripped)
if m:
# Save previous
if current_iface:
interfaces.append(_build_interface(current_iface, current_lines))
current_iface = m.group(1)
current_lines = [line]
continue
# Auto line or source line starts a new context
if stripped.startswith('auto ') or stripped.startswith('source '):
if current_iface:
interfaces.append(_build_interface(current_iface, current_lines))
current_iface = None
current_lines = []
continue
if current_iface and stripped:
current_lines.append(line)
# Don't forget the last one
if current_iface:
interfaces.append(_build_interface(current_iface, current_lines))
return interfaces
def _build_interface(name: str, lines: list[str]) -> NetworkInterface:
"""Build a NetworkInterface from parsed lines."""
raw = '\n'.join(lines)
address = ""
netmask = ""
cidr = 0
gateway = None
bridge_ports = None
for line in lines:
stripped = line.strip()
# address with CIDR notation: address 192.168.0.1/24
m = re.match(r'address\s+(\d+\.\d+\.\d+\.\d+)/(\d+)', stripped)
if m:
address = m.group(1)
cidr = int(m.group(2))
netmask = cidr_to_netmask(cidr)
continue
# address without CIDR
m = re.match(r'address\s+(\d+\.\d+\.\d+\.\d+)', stripped)
if m:
address = m.group(1)
continue
m = re.match(r'netmask\s+(\S+)', stripped)
if m:
netmask = m.group(1)
cidr = netmask_to_cidr(netmask)
continue
m = re.match(r'gateway\s+(\S+)', stripped)
if m:
gateway = m.group(1)
continue
m = re.match(r'bridge[_-]ports\s+(\S+)', stripped)
if m:
bridge_ports = m.group(1)
continue
return NetworkInterface(
name=name,
address=address,
netmask=netmask,
cidr=cidr,
gateway=gateway,
bridge_ports=bridge_ports,
raw_config=raw,
)
def generate_network_interfaces(content: str, old_ip: str, new_ip: str,
new_cidr: int, new_gateway: str | None = None,
old_gateway: str | None = None) -> str:
"""Update /etc/network/interfaces with new IP, keeping everything else."""
new_content = content
# Replace IP in address lines (with and without CIDR)
# address 192.168.0.101/24 -> address 172.0.2.101/16
new_content = re.sub(
rf'(address\s+){re.escape(old_ip)}/\d+',
rf'\g<1>{new_ip}/{new_cidr}',
new_content
)
# address 192.168.0.101 (without CIDR)
new_content = re.sub(
rf'(address\s+){re.escape(old_ip)}(\s)',
rf'\g<1>{new_ip}\2',
new_content
)
# Replace gateway if provided
if new_gateway and old_gateway:
new_content = new_content.replace(
f'gateway {old_gateway}',
f'gateway {new_gateway}'
)
return new_content
def generate_hosts(content: str, ip_mapping: dict[str, str]) -> str:
"""Update /etc/hosts with new IPs."""
new_content = content
for old_ip, new_ip in ip_mapping.items():
new_content = new_content.replace(old_ip, new_ip)
return new_content
def cidr_to_netmask(cidr: int) -> str:
"""Convert CIDR prefix length to netmask string."""
bits = (0xFFFFFFFF << (32 - cidr)) & 0xFFFFFFFF
return f"{(bits >> 24) & 0xFF}.{(bits >> 16) & 0xFF}.{(bits >> 8) & 0xFF}.{bits & 0xFF}"
def netmask_to_cidr(netmask: str) -> int:
"""Convert netmask string to CIDR prefix length."""
parts = netmask.split('.')
binary = ''.join(f'{int(p):08b}' for p in parts)
return binary.count('1')
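The two conversion helpers at the end of config_parser.py are inverses of each other. A standalone round-trip check (the functions are restated here so the snippet runs without importing the module):

```python
def cidr_to_netmask(cidr: int) -> str:
    # Build a 32-bit mask with `cidr` leading ones, then format as dotted quad
    bits = (0xFFFFFFFF << (32 - cidr)) & 0xFFFFFFFF
    return f"{(bits >> 24) & 0xFF}.{(bits >> 16) & 0xFF}.{(bits >> 8) & 0xFF}.{bits & 0xFF}"

def netmask_to_cidr(netmask: str) -> int:
    # Count the set bits in the binary representation of the dotted quad
    return ''.join(f'{int(p):08b}' for p in netmask.split('.')).count('1')

print(cidr_to_netmask(24))             # 255.255.255.0
print(netmask_to_cidr("255.255.0.0"))  # 16
```

Note that `netmask_to_cidr` assumes a contiguous mask; a non-contiguous input such as `255.0.255.0` would yield a prefix length that does not round-trip.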

discovery.py (new file, +189 lines)
"""Phase 1: Discovery - Read current cluster configuration."""
import socket
from models import NodeInfo, CorosyncConfig, CephConfig
from config_parser import parse_corosync_conf, parse_ceph_conf, parse_network_interfaces
from ssh_manager import SSHManager
class Discovery:
"""Discovers current Proxmox cluster and Ceph configuration."""
def __init__(self, ssh: SSHManager):
self.ssh = ssh
self.local_hostname = socket.gethostname()
def discover_corosync(self) -> CorosyncConfig | None:
"""Read and parse corosync.conf from the local node."""
# Try /etc/pve/corosync.conf first (cluster filesystem)
ok, content = self.ssh.read_local_file("/etc/pve/corosync.conf")
if not ok:
# Fallback to local corosync config
ok, content = self.ssh.read_local_file("/etc/corosync/corosync.conf")
if not ok:
print(f" [!] Corosync config nicht gefunden: {content}")
return None
config = parse_corosync_conf(content)
print(f" Cluster: {config.cluster_name}")
print(f" Transport: {config.transport}")
print(f" Config Version: {config.config_version}")
print(f" Nodes gefunden: {len(config.nodes)}")
for node in config.nodes:
print(f" - {node.name} (ID: {node.nodeid}) -> {node.ring0_addr}")
return config
def discover_ceph(self) -> CephConfig | None:
"""Read and parse ceph.conf."""
ok, content = self.ssh.read_local_file("/etc/pve/ceph.conf")
if not ok:
ok, content = self.ssh.read_local_file("/etc/ceph/ceph.conf")
if not ok:
print(" [!] Ceph config nicht gefunden (Ceph evtl. nicht installiert)")
return None
config = parse_ceph_conf(content)
print(f" FSID: {config.fsid}")
print(f" Public Network: {config.public_network}")
print(f" Cluster Network: {config.cluster_network}")
if config.mon_hosts:
print(f" MON Hosts: {', '.join(config.mon_hosts)}")
if config.mon_sections:
print(f" MON Sections: {', '.join(config.mon_sections.keys())}")
return config
def discover_nodes(self, corosync: CorosyncConfig) -> list[NodeInfo]:
"""Build node list from corosync config and check reachability."""
nodes = []
for cs_node in corosync.nodes:
is_local = (cs_node.name == self.local_hostname)
node = NodeInfo(
name=cs_node.name,
current_ip=cs_node.ring0_addr,
ssh_host=cs_node.ring0_addr,
is_local=is_local,
)
# Check reachability
if is_local:
node.is_reachable = True
else:
node.is_reachable = self.ssh.is_reachable(cs_node.ring0_addr)
# Try to reach by hostname if IP doesn't work
if not node.is_reachable and not is_local:
if self.ssh.is_reachable(cs_node.name):
node.is_reachable = True
node.ssh_host = cs_node.name
if node.is_reachable:
self._read_node_configs(node)
status = "erreichbar" if node.is_reachable else "NICHT ERREICHBAR"
local_tag = " (lokal)" if is_local else ""
print(f" {node.name}: {node.current_ip} - {status}{local_tag}")
nodes.append(node)
return nodes
def discover_nodes_with_overrides(self, corosync: CorosyncConfig,
override_nodes: list[NodeInfo]) -> list[NodeInfo]:
"""Re-discover nodes using override SSH hosts (e.g. rescue IPs).
Takes pre-configured nodes (with rescue IPs as ssh_host) and
reads their configs.
"""
print("\n[Nodes - via Rescue-Netzwerk]")
for node in override_nodes:
if node.is_reachable:
self._read_node_configs(node)
status = "erreichbar" if node.is_reachable else "NICHT ERREICHBAR"
local_tag = " (lokal)" if node.is_local else ""
via = f" via {node.ssh_host}" if not node.is_local else ""
print(f" {node.name}: {node.current_ip}{via} - {status}{local_tag}")
return override_nodes
def _read_node_configs(self, node: NodeInfo):
"""Read network interfaces and hosts from a node."""
# Read /etc/network/interfaces
ok, content = self.ssh.read_node_file(
node.ssh_host, "/etc/network/interfaces", node.is_local
)
if ok:
node.network_interfaces_content = content
node.interfaces = parse_network_interfaces(content)
# Read /etc/hosts
ok, content = self.ssh.read_node_file(
node.ssh_host, "/etc/hosts", node.is_local
)
if ok:
node.hosts_content = content
def check_quorum(self) -> bool:
"""Check if the cluster currently has quorum."""
rc, stdout, _ = self.ssh.execute_local("pvecm status 2>/dev/null")
if rc != 0:
print(" [!] pvecm status fehlgeschlagen - kein Quorum oder kein Cluster")
return False
if "Quorate: Yes" in stdout or "Activity blocked" not in stdout:
# Also check if /etc/pve is writable
rc2, _, _ = self.ssh.execute_local(
"touch /etc/pve/.migration_test && rm -f /etc/pve/.migration_test"
)
if rc2 == 0:
print(" Quorum: JA (/etc/pve ist beschreibbar)")
return True
print(" Quorum: NEIN (/etc/pve ist read-only!)")
return False
def check_ceph_health(self) -> str | None:
"""Get current Ceph health status."""
rc, stdout, _ = self.ssh.execute_local("ceph health 2>/dev/null")
if rc == 0:
status = stdout.strip()
print(f" Ceph Health: {status}")
return status
return None
def run(self) -> tuple[CorosyncConfig | None, CephConfig | None,
list[NodeInfo], bool]:
"""Run full discovery.
Returns: (corosync_config, ceph_config, nodes, has_quorum)
"""
print("\n=== Phase 1: Discovery ===\n")
print("[Corosync]")
corosync = self.discover_corosync()
if not corosync or not corosync.nodes:
print("FEHLER: Konnte keine Corosync-Konfiguration lesen!")
return None, None, [], False
print("\n[Ceph]")
ceph = self.discover_ceph()
print("\n[Nodes]")
nodes = self.discover_nodes(corosync)
print("\n[Cluster Status]")
has_quorum = self.check_quorum()
if ceph:
print("\n[Ceph Health]")
self.check_ceph_health()
unreachable = [n for n in nodes if not n.is_reachable]
if unreachable:
print(f"\n[!] WARNUNG: {len(unreachable)} Node(s) nicht erreichbar:")
for n in unreachable:
print(f" - {n.name} ({n.current_ip})")
print(" Diese Nodes wurden möglicherweise bereits manuell geändert.")
print(" Das Tool wird versuchen, sie über ihren Hostnamen zu erreichen.")
return corosync, ceph, nodes, has_quorum

main.py (new file, +212 lines)
#!/usr/bin/env python3
"""
Proxmox Cluster Network Changer
Migrates a Proxmox cluster (including Ceph) from one network to another.
Handles Corosync, Ceph, /etc/network/interfaces and /etc/hosts.
Can also cope with broken quorum (e.g. when a node was already
changed manually).
Must be run as root on a Proxmox node.
Usage:
    python3 main.py              # interactive mode
    python3 main.py --dry-run    # show only, change nothing
    python3 main.py --discover   # discovery only, no migration
"""
import argparse
import os
import sys
from ssh_manager import SSHManager
from discovery import Discovery
from planner import Planner
from backup import Backup
from migrator import Migrator
from verifier import Verifier
from rescue import RescueNetwork
def check_prerequisites():
"""Check that we're running as root on a Proxmox node."""
if os.geteuid() != 0:
print("FEHLER: Dieses Tool muss als root ausgeführt werden!")
print("Bitte mit 'sudo python3 main.py' starten.")
sys.exit(1)
if not os.path.exists("/etc/pve") and not os.path.exists("/etc/corosync"):
print("WARNUNG: Dies scheint kein Proxmox-Node zu sein.")
print(" /etc/pve und /etc/corosync nicht gefunden.")
answer = input("Trotzdem fortfahren? [j/N]: ").strip().lower()
if answer not in ('j', 'ja', 'y', 'yes'):
sys.exit(0)
def main():
parser = argparse.ArgumentParser(
description="Proxmox Cluster Network Changer - "
"Migriert Cluster + Ceph in ein neues Netzwerk"
)
parser.add_argument(
"--dry-run", action="store_true",
help="Nur anzeigen was geändert würde, nichts ändern"
)
parser.add_argument(
"--discover", action="store_true",
help="Nur Discovery durchführen, keine Migration"
)
parser.add_argument(
"--ssh-key", type=str, default=None,
help="Pfad zum SSH-Key (Standard: Default SSH-Key)"
)
parser.add_argument(
"--ssh-port", type=int, default=22,
help="SSH-Port (Standard: 22)"
)
parser.add_argument(
"--rescue", action="store_true",
help="Rescue-Modus: Emergency-Netzwerk einrichten wenn Nodes "
"sich nicht erreichen können"
)
parser.add_argument(
"--rescue-commands", type=str, metavar="SUBNET",
help="Nur Rescue-Befehle ausgeben ohne Migration "
"(z.B. --rescue-commands 10.99.99.0/24)"
)
args = parser.parse_args()
print("=" * 60)
print(" Proxmox Cluster Network Changer")
print("=" * 60)
check_prerequisites()
# Initialize SSH manager
ssh = SSHManager(ssh_key=args.ssh_key, ssh_port=args.ssh_port)
rescue = RescueNetwork(ssh)
# Quick mode: just print rescue commands and exit
if args.rescue_commands:
discovery = Discovery(ssh)
print("\n[Corosync]")
corosync = discovery.discover_corosync()
if not corosync:
print("\nFEHLER: Konnte Cluster-Konfiguration nicht lesen.")
sys.exit(1)
bridge_input = input(f"Bridge [{rescue.bridge}]: ").strip()
bridge = bridge_input or rescue.bridge
commands = rescue.get_rescue_commands(corosync, args.rescue_commands, bridge)
print()
print("=" * 60)
print(" RESCUE BEFEHLE")
print(f" Subnetz: {args.rescue_commands} | Bridge: {bridge}")
print("=" * 60)
print()
for cmd_info in commands:
print(f" {cmd_info['name']} ({cmd_info['current_ip']}):")
print(f" {cmd_info['command']}")
print()
print(" Zum Entfernen:")
for cmd_info in commands:
print(f" {cmd_info['remove_command']} # {cmd_info['name']}")
print()
sys.exit(0)
# Phase 1: Discovery
discovery = Discovery(ssh)
corosync, ceph, nodes, has_quorum = discovery.run()
if not corosync:
print("\nFEHLER: Konnte Cluster-Konfiguration nicht lesen. Abbruch.")
sys.exit(1)
# Check if rescue mode is needed
unreachable = [n for n in nodes if not n.is_reachable and not n.is_local]
use_rescue = args.rescue
if unreachable and not use_rescue:
print(f"\n {len(unreachable)} Node(s) nicht erreichbar.")
answer = input(" Rescue-Netzwerk einrichten? [J/n]: ").strip().lower()
if answer not in ('n', 'nein', 'no'):
use_rescue = True
if use_rescue:
rescue_nodes = rescue.setup_interactive(corosync)
if not rescue_nodes:
sys.exit(1)
# Re-run discovery with rescue IPs to read configs from all nodes
print("\n [Rescue] Lese Konfigurationen über Rescue-Netzwerk...")
nodes = discovery.discover_nodes_with_overrides(
corosync, rescue_nodes
)
# Re-check quorum
has_quorum = discovery.check_quorum()
# Re-read ceph
ceph = discovery.discover_ceph()
if args.discover:
if rescue.active:
rescue.cleanup(nodes)
print("\n--- Discovery abgeschlossen (--discover Modus) ---")
sys.exit(0)
# Phase 2: Planning
planner = Planner()
plan = planner.plan(nodes, corosync, ceph, has_quorum)
if not plan:
if rescue.active:
rescue.cleanup(nodes)
sys.exit(0)
plan.dry_run = args.dry_run
# Generate all new config files
configs = planner.generate_new_configs(plan)
# Phase 3: Backup (skip in dry-run)
if not args.dry_run:
backup = Backup(ssh)
if not backup.run(plan):
print("\nBackup fehlgeschlagen! Trotzdem fortfahren?")
answer = input("[j/N]: ").strip().lower()
if answer not in ('j', 'ja', 'y', 'yes'):
if rescue.active:
rescue.cleanup(nodes)
sys.exit(1)
else:
print("\n=== Phase 3: Backup (übersprungen im Dry-Run) ===")
# Phase 4: Migration
migrator = Migrator(ssh)
success = migrator.run(plan, configs, dry_run=args.dry_run)
if not success:
print("\n[!] Migration hatte Fehler!")
if not args.dry_run:
print(" Prüfe Backups in /root/network-migration-backup-*/")
if rescue.active:
rescue.cleanup(nodes)
sys.exit(1)
# Cleanup rescue network (before verification, so we verify real connectivity)
if rescue.active and not args.dry_run:
rescue.cleanup(nodes)
# Phase 5: Verification (skip in dry-run)
if not args.dry_run:
verifier = Verifier(ssh)
verifier.run(plan)
else:
if rescue.active:
rescue.cleanup(nodes)
print("\n=== Phase 5: Verifikation (übersprungen im Dry-Run) ===")
print("\nDry-Run abgeschlossen. Keine Änderungen vorgenommen.")
if __name__ == "__main__":
main()

migrator.py (new file, +450 lines)
"""Phase 4: Execute the network migration."""
import time
from models import MigrationPlan
from ssh_manager import SSHManager
class Migrator:
"""Executes the actual network migration across all nodes."""
def __init__(self, ssh: SSHManager):
self.ssh = ssh
def run(self, plan: MigrationPlan, configs: dict, dry_run: bool = False) -> bool:
"""Execute the migration.
Args:
plan: The migration plan
configs: Generated configs from Planner.generate_new_configs()
dry_run: If True, only show what would be done
"""
print("\n=== Phase 4: Migration ===\n")
if dry_run:
print(" *** DRY RUN - Es werden keine Änderungen vorgenommen ***\n")
ip_mapping = {n.current_ip: n.new_ip for n in plan.nodes if n.new_ip}
reachable_nodes = [n for n in plan.nodes if n.is_reachable]
if not reachable_nodes:
print(" FEHLER: Keine Nodes erreichbar!")
return False
# Step 1: Write new configs to all nodes (but don't activate yet)
print("[1/7] Neue Konfigurationen verteilen...")
if not self._distribute_configs(plan, configs, dry_run):
return False
# Step 2: Stop Corosync on all nodes
print("\n[2/7] Corosync stoppen auf allen Nodes...")
if not self._stop_corosync(reachable_nodes, dry_run):
return False
# Step 3: Stop pve-cluster (pmxcfs) to release corosync.conf
print("\n[3/7] pve-cluster stoppen...")
if not self._stop_pve_cluster(reachable_nodes, dry_run):
return False
# Step 4: Write corosync config directly
print("\n[4/7] Corosync-Konfiguration aktualisieren...")
if not self._update_corosync(reachable_nodes, configs, dry_run):
return False
# Step 5: Update /etc/hosts on all nodes
print("\n[5/7] /etc/hosts aktualisieren...")
if not self._update_hosts(plan, configs, dry_run):
return False
# Step 6: Update network interfaces and restart networking
print("\n[6/7] Netzwerk-Interfaces aktualisieren und Netzwerk neu starten...")
if not self._update_network(plan, configs, dry_run):
return False
# Step 7: Start services back up
print("\n[7/7] Services starten...")
if not self._start_services(plan, configs, dry_run):
return False
return True
def _distribute_configs(self, plan: MigrationPlan, configs: dict,
dry_run: bool) -> bool:
"""Write prepared configs as staged files (not yet active)."""
for node in plan.nodes:
if not node.is_reachable or node.name not in configs['nodes']:
continue
node_configs = configs['nodes'][node.name]
staging_dir = "/root/.network-migration-staged"
if dry_run:
print(f" [{node.name}] Würde Configs nach {staging_dir}/ schreiben")
continue
# Create staging directory
self.ssh.run_on_node(
node.ssh_host, f"mkdir -p {staging_dir}", node.is_local
)
# Stage network interfaces
ok, msg = self.ssh.write_node_file(
node.ssh_host,
f"{staging_dir}/interfaces",
node_configs['interfaces'],
node.is_local,
)
if ok:
print(f" [{node.name}] interfaces staged")
else:
print(f" [{node.name}] FEHLER interfaces: {msg}")
return False
# Stage hosts
ok, msg = self.ssh.write_node_file(
node.ssh_host,
f"{staging_dir}/hosts",
node_configs['hosts'],
node.is_local,
)
if ok:
print(f" [{node.name}] hosts staged")
else:
print(f" [{node.name}] FEHLER hosts: {msg}")
return False
# Stage corosync config
if configs['corosync']:
for node in plan.nodes:
if not node.is_reachable:
continue
staging_dir = "/root/.network-migration-staged"
if dry_run:
print(f" [{node.name}] Würde corosync.conf stagen")
continue
ok, msg = self.ssh.write_node_file(
node.ssh_host,
f"{staging_dir}/corosync.conf",
configs['corosync'],
node.is_local,
)
if ok:
print(f" [{node.name}] corosync.conf staged")
else:
print(f" [{node.name}] FEHLER corosync.conf: {msg}")
return False
# Stage ceph config
if configs['ceph']:
for node in plan.nodes:
if not node.is_reachable:
continue
staging_dir = "/root/.network-migration-staged"
if dry_run:
print(f" [{node.name}] Würde ceph.conf stagen")
continue
ok, msg = self.ssh.write_node_file(
node.ssh_host,
f"{staging_dir}/ceph.conf",
configs['ceph'],
node.is_local,
)
if ok:
print(f" [{node.name}] ceph.conf staged")
else:
print(f" [{node.name}] FEHLER ceph.conf: {msg}")
return False
return True
def _stop_corosync(self, nodes: list, dry_run: bool) -> bool:
"""Stop corosync on all nodes."""
for node in nodes:
if dry_run:
print(f" [{node.name}] Würde corosync stoppen")
continue
rc, _, err = self.ssh.run_on_node(
node.ssh_host, "systemctl stop corosync", node.is_local
)
if rc == 0:
print(f" [{node.name}] corosync gestoppt")
else:
print(f" [{node.name}] WARNUNG beim Stoppen: {err}")
return True
def _stop_pve_cluster(self, nodes: list, dry_run: bool) -> bool:
"""Stop pve-cluster service to unmount /etc/pve."""
for node in nodes:
if dry_run:
print(f" [{node.name}] Würde pve-cluster stoppen")
continue
rc, _, err = self.ssh.run_on_node(
node.ssh_host, "systemctl stop pve-cluster", node.is_local
)
if rc == 0:
print(f" [{node.name}] pve-cluster gestoppt")
else:
print(f" [{node.name}] WARNUNG: {err}")
return True
def _update_corosync(self, nodes: list, configs: dict,
dry_run: bool) -> bool:
"""Write new corosync.conf directly to /etc/corosync/."""
if not configs['corosync']:
print(" Keine Corosync-Änderungen")
return True
for node in nodes:
if dry_run:
print(f" [{node.name}] Würde /etc/corosync/corosync.conf schreiben")
continue
staging = "/root/.network-migration-staged/corosync.conf"
rc, _, err = self.ssh.run_on_node(
node.ssh_host,
f"cp {staging} /etc/corosync/corosync.conf",
node.is_local,
)
if rc == 0:
print(f" [{node.name}] corosync.conf aktualisiert")
else:
print(f" [{node.name}] FEHLER: {err}")
return False
return True
def _update_hosts(self, plan: MigrationPlan, configs: dict,
dry_run: bool) -> bool:
"""Update /etc/hosts on all nodes."""
for node in plan.nodes:
if not node.is_reachable or node.name not in configs['nodes']:
continue
if dry_run:
print(f" [{node.name}] Würde /etc/hosts aktualisieren")
continue
staging = "/root/.network-migration-staged/hosts"
rc, _, err = self.ssh.run_on_node(
node.ssh_host,
f"cp {staging} /etc/hosts",
node.is_local,
)
if rc == 0:
print(f" [{node.name}] /etc/hosts aktualisiert")
else:
print(f" [{node.name}] FEHLER: {err}")
return False
return True
def _update_network(self, plan: MigrationPlan, configs: dict,
dry_run: bool) -> bool:
"""Update /etc/network/interfaces and restart networking."""
for node in plan.nodes:
if not node.is_reachable or node.name not in configs['nodes']:
continue
if dry_run:
print(f" [{node.name}] Würde /etc/network/interfaces aktualisieren")
print(f" [{node.name}] Würde 'ifreload -a' ausführen")
continue
staging = "/root/.network-migration-staged/interfaces"
rc, _, err = self.ssh.run_on_node(
node.ssh_host,
f"cp {staging} /etc/network/interfaces",
node.is_local,
)
if rc == 0:
print(f" [{node.name}] /etc/network/interfaces aktualisiert")
else:
print(f" [{node.name}] FEHLER: {err}")
return False
# Reload network - use ifreload if available, otherwise ifdown/ifup
rc, _, _ = self.ssh.run_on_node(
node.ssh_host, "which ifreload", node.is_local
)
if rc == 0:
reload_cmd = "ifreload -a"
else:
reload_cmd = f"ifdown {plan.bridge_name} && ifup {plan.bridge_name}"
print(f" [{node.name}] Netzwerk wird neu geladen ({reload_cmd})...")
rc, _, err = self.ssh.run_on_node(
node.ssh_host, reload_cmd, node.is_local, timeout=60
)
if rc == 0:
print(f" [{node.name}] Netzwerk neu geladen")
else:
print(f" [{node.name}] WARNUNG beim Netzwerk-Reload: {err}")
# Don't fail here - the node might just be unreachable on old IP now
return True
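The reload fallback first probes for `ifreload` (provided by ifupdown2, the default on current Proxmox) and only bounces the single bridge otherwise. The decision, extracted as a pure helper for illustration (naming is ours, not the tool's):

```python
def choose_reload_cmd(has_ifreload: bool, bridge: str) -> str:
    """Pick the network reload command for a node."""
    if has_ifreload:
        return "ifreload -a"  # ifupdown2: reapply config in place
    return f"ifdown {bridge} && ifup {bridge}"  # classic ifupdown fallback

cmd_new = choose_reload_cmd(True, "vmbr0")
cmd_old = choose_reload_cmd(False, "vmbr0")
```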
def _start_services(self, plan: MigrationPlan, configs: dict,
dry_run: bool) -> bool:
"""Start pve-cluster and corosync, then handle Ceph."""
# Now we need to reach nodes on their NEW IPs
for node in plan.nodes:
if not node.is_reachable:
continue
new_host = node.new_ip if not node.is_local else node.ssh_host
is_local = node.is_local
# Start pve-cluster
if dry_run:
print(f" [{node.name}] Würde pve-cluster starten")
print(f" [{node.name}] Würde corosync starten")
continue
print(f" [{node.name}] Starte pve-cluster...")
rc, _, err = self.ssh.run_on_node(
new_host, "systemctl start pve-cluster", is_local, timeout=30
)
if rc == 0:
print(f" [{node.name}] pve-cluster gestartet")
else:
print(f" [{node.name}] WARNUNG pve-cluster: {err}")
print(f" [{node.name}] Starte corosync...")
rc, _, err = self.ssh.run_on_node(
new_host, "systemctl start corosync", is_local, timeout=30
)
if rc == 0:
print(f" [{node.name}] corosync gestartet")
else:
print(f" [{node.name}] WARNUNG corosync: {err}")
if dry_run:
print("\n Würde auf Quorum warten...")
return True
# Wait for quorum
print("\n Warte auf Quorum...")
if not self._wait_for_quorum(timeout=60):
print(" [!] Quorum nicht erreicht! Versuche 'pvecm expected 1'...")
rc, _, _ = self.ssh.execute_local("pvecm expected 1")
if rc == 0:
print(" Quorum erzwungen mit 'pvecm expected 1'")
time.sleep(5)
else:
print(" [!] Konnte Quorum nicht erzwingen!")
# Update Ceph config via cluster FS if possible
if configs.get('ceph'):
self._update_ceph(plan, configs)
# Cleanup staging directories
print("\n Staging-Verzeichnisse aufräumen...")
for node in plan.nodes:
if not node.is_reachable:
continue
new_host = node.new_ip if not node.is_local else node.ssh_host
self.ssh.run_on_node(
new_host,
"rm -rf /root/.network-migration-staged",
node.is_local,
)
return True
def _wait_for_quorum(self, timeout: int = 60) -> bool:
"""Wait for cluster quorum to be established."""
start = time.time()
while time.time() - start < timeout:
rc, stdout, _ = self.ssh.execute_local("pvecm status 2>/dev/null")
if rc == 0 and "Quorate: Yes" in stdout:
print(" Quorum erreicht!")
return True
print(" ... warte auf Quorum ...")
time.sleep(5)
return False
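The quorum wait is a plain poll-until-timeout loop. The same pattern, extracted as a generic helper with an injectable check function and sleep so it can run without a cluster (these names are ours, not part of the tool):

```python
import time

def wait_until(check, timeout=60.0, interval=5.0, sleep=time.sleep, clock=time.monotonic):
    """Poll check() until it returns True or timeout seconds elapse."""
    deadline = clock() + timeout
    while clock() < deadline:
        if check():
            return True
        sleep(interval)
    return False

# Simulated run: the "cluster" becomes quorate on the third poll.
calls = {"n": 0}
def fake_quorum_check():
    calls["n"] += 1
    return calls["n"] >= 3

ok = wait_until(fake_quorum_check, timeout=10.0, interval=0.0, sleep=lambda s: None)
```

Injecting `sleep` and `clock` keeps the timing logic testable without real delays.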
def _update_ceph(self, plan: MigrationPlan, configs: dict):
"""Update Ceph configuration after quorum is available."""
print("\n [Ceph] Konfiguration aktualisieren...")
# Try to write via /etc/pve/ceph.conf first
rc, _, _ = self.ssh.execute_local(
"touch /etc/pve/.ceph_test && rm -f /etc/pve/.ceph_test"
)
if rc == 0:
# /etc/pve is writable - use cluster filesystem
ok, msg = self.ssh.write_local_file("/etc/pve/ceph.conf", configs['ceph'])
if ok:
print(" [Ceph] /etc/pve/ceph.conf aktualisiert (via Cluster-FS)")
else:
print(f" [Ceph] FEHLER /etc/pve/ceph.conf: {msg}")
self._update_ceph_direct(plan, configs)
else:
# /etc/pve not writable - write directly on each node
print(" [Ceph] /etc/pve nicht beschreibbar, schreibe direkt...")
self._update_ceph_direct(plan, configs)
# Restart Ceph services
print(" [Ceph] Services neu starten...")
for node in plan.nodes:
if not node.is_reachable:
continue
new_host = node.new_ip if not node.is_local else node.ssh_host
# Restart MON
self.ssh.run_on_node(
new_host,
f"systemctl restart ceph-mon@{node.name} 2>/dev/null",
node.is_local, timeout=30,
)
# Restart MGR
self.ssh.run_on_node(
new_host,
f"systemctl restart ceph-mgr@{node.name} 2>/dev/null",
node.is_local, timeout=30,
)
# Restart all OSDs on this node
self.ssh.run_on_node(
new_host,
"systemctl restart ceph-osd.target 2>/dev/null",
node.is_local, timeout=60,
)
print(f" [{node.name}] Ceph-Services neu gestartet")
def _update_ceph_direct(self, plan: MigrationPlan, configs: dict):
"""Write ceph.conf directly on each node (fallback when no quorum)."""
for node in plan.nodes:
if not node.is_reachable:
continue
new_host = node.new_ip if not node.is_local else node.ssh_host
ok, msg = self.ssh.write_node_file(
new_host, "/etc/ceph/ceph.conf",
configs['ceph'], node.is_local,
)
if ok:
print(f" [{node.name}] /etc/ceph/ceph.conf direkt geschrieben")
else:
print(f" [{node.name}] FEHLER /etc/ceph/ceph.conf: {msg}")
def _update_ceph_mon_map(self, plan: MigrationPlan):
"""Update Ceph MON map with new addresses.
This is needed when MON IPs change.
"""
        # A full MON map rewrite (ceph-mon --extract-monmap, edit with
        # monmaptool, --inject-monmap) is complex and risky to automate.
        # For now we rely on the updated ceph.conf and let the MONs pick
        # up their new addresses on restart.
        for node in plan.nodes:
            if not node.is_reachable:
                continue
            print(f" [{node.name}] MON-Map wird beim Neustart aktualisiert")
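If an explicit MON map rewrite is ever needed, the usual sequence is: stop the MON, extract the map, edit it with monmaptool, re-inject, restart. A sketch that only builds the command strings; node name, IP and paths are illustrative, and the exact monmaptool invocation should be verified against the Ceph documentation for your release before running anything:

```python
def monmap_rewrite_commands(node_name: str, new_ip: str,
                            mapfile: str = "/tmp/monmap",
                            port: int = 6789) -> list[str]:
    """Stopped-MON monmap rewrite sequence for one node (sketch only)."""
    return [
        f"systemctl stop ceph-mon@{node_name}",
        f"ceph-mon -i {node_name} --extract-monmap {mapfile}",
        f"monmaptool --rm {node_name} {mapfile}",
        f"monmaptool --add {node_name} {new_ip}:{port} {mapfile}",
        f"ceph-mon -i {node_name} --inject-monmap {mapfile}",
        f"systemctl start ceph-mon@{node_name}",
    ]

cmds = monmap_rewrite_commands("pve1", "172.16.2.101")
```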

models.py Normal file
@ -0,0 +1,76 @@
"""Data models for the Proxmox Cluster Network Changer."""
from dataclasses import dataclass, field
from typing import Optional
@dataclass
class NetworkInterface:
"""Represents a network interface configuration."""
name: str # e.g. vmbr0
address: str # e.g. 192.168.0.101
netmask: str # e.g. 255.255.255.0
cidr: int # e.g. 24
gateway: Optional[str] = None
bridge_ports: Optional[str] = None
raw_config: str = ""
@dataclass
class NodeInfo:
"""Represents a single Proxmox node."""
name: str # e.g. pve1
current_ip: str # current IP address
new_ip: Optional[str] = None # planned new IP
ssh_host: Optional[str] = None # how to reach it (IP or hostname)
is_local: bool = False # is this the node we're running on
is_reachable: bool = False
interfaces: list[NetworkInterface] = field(default_factory=list)
hosts_content: str = ""
network_interfaces_content: str = ""
@dataclass
class CorosyncNode:
"""A node entry in corosync.conf."""
nodeid: int
name: str
ring0_addr: str
ring1_addr: Optional[str] = None
@dataclass
class CorosyncConfig:
"""Parsed corosync configuration."""
nodes: list[CorosyncNode] = field(default_factory=list)
config_version: int = 1
cluster_name: str = ""
transport: str = "knet"
raw_content: str = ""
@dataclass
class CephConfig:
"""Parsed Ceph configuration."""
fsid: str = ""
public_network: str = "" # e.g. 192.168.0.0/24
cluster_network: str = "" # e.g. 192.168.0.0/24
mon_hosts: list[str] = field(default_factory=list)
mon_sections: dict[str, dict[str, str]] = field(default_factory=dict) # [mon.pve1] -> {key: val}
raw_content: str = ""
@dataclass
class MigrationPlan:
"""Complete migration plan with old -> new mappings."""
nodes: list[NodeInfo] = field(default_factory=list)
old_network: str = "" # e.g. 192.168.0.0/24
new_network: str = "" # e.g. 172.0.2.0/16
new_gateway: Optional[str] = None
ceph_new_public_network: str = ""
ceph_new_cluster_network: str = ""
corosync_config: Optional[CorosyncConfig] = None
ceph_config: Optional[CephConfig] = None
dry_run: bool = False
quorum_available: bool = True
bridge_name: str = "vmbr0" # which bridge to modify
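Throughout planner and executor, the old-to-new translation is derived from this node list as a plain dict. A condensed, self-contained sketch of that derivation (the dataclass is a stripped-down stand-in for `NodeInfo`):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Node:  # stand-in for NodeInfo, reduced to the fields needed here
    name: str
    current_ip: str
    new_ip: Optional[str] = None

nodes = [
    Node("pve1", "192.168.0.101", "172.16.2.101"),
    Node("pve2", "192.168.0.102", "172.16.2.102"),
    Node("pve3", "192.168.0.103"),  # no new IP planned yet -> excluded
]

# Same expression the tool uses repeatedly:
ip_mapping = {n.current_ip: n.new_ip for n in nodes if n.new_ip}
```

Nodes without a planned `new_ip` simply drop out of the mapping, so later substitutions leave their addresses untouched.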

planner.py Normal file
@ -0,0 +1,236 @@
"""Phase 2: Plan the migration - IP mapping and config generation."""
import ipaddress
from models import NodeInfo, CorosyncConfig, CephConfig, MigrationPlan
from config_parser import (
generate_corosync_conf, generate_ceph_conf,
generate_network_interfaces, generate_hosts,
)
class Planner:
"""Plans the network migration with user input."""
def plan(self, nodes: list[NodeInfo], corosync: CorosyncConfig,
ceph: CephConfig | None, has_quorum: bool) -> MigrationPlan | None:
"""Interactive planning with the user."""
plan = MigrationPlan(
nodes=nodes,
corosync_config=corosync,
ceph_config=ceph,
quorum_available=has_quorum,
)
print("\n=== Phase 2: Migration planen ===\n")
# Get new network
plan.new_network = self._ask_new_network()
if not plan.new_network:
return None
new_net = ipaddress.ip_network(plan.new_network, strict=False)
plan.new_gateway = self._ask_gateway(new_net)
# Detect old network from first node
if nodes:
old_ip = ipaddress.ip_address(nodes[0].current_ip)
for iface in nodes[0].interfaces:
if iface.address == str(old_ip):
plan.old_network = f"{ipaddress.ip_network(f'{iface.address}/{iface.cidr}', strict=False)}"
plan.bridge_name = iface.name
break
# Generate IP mapping suggestions
print("\n[IP-Mapping]")
print("Für jeden Node wird eine neue IP benötigt.\n")
for node in nodes:
suggested_ip = self._suggest_new_ip(node.current_ip, plan.new_network)
print(f" {node.name}: {node.current_ip} -> ", end="")
user_input = input(f"[{suggested_ip}]: ").strip()
if user_input:
node.new_ip = user_input
else:
node.new_ip = suggested_ip
print(f" => {node.new_ip}")
# Ceph network planning
if ceph:
print("\n[Ceph Netzwerke]")
print(f" Aktuelles Public Network: {ceph.public_network}")
print(f" Aktuelles Cluster Network: {ceph.cluster_network}")
default_ceph_net = plan.new_network
user_input = input(
f"\n Neues Ceph Public Network [{default_ceph_net}]: "
).strip()
plan.ceph_new_public_network = user_input or default_ceph_net
user_input = input(
f" Neues Ceph Cluster Network [{plan.ceph_new_public_network}]: "
).strip()
plan.ceph_new_cluster_network = user_input or plan.ceph_new_public_network
# Which bridge to modify
print(f"\n[Bridge]")
user_input = input(
f" Welche Bridge soll geändert werden? [{plan.bridge_name}]: "
).strip()
if user_input:
plan.bridge_name = user_input
# Show preview
self._show_preview(plan)
# Confirm
confirm = input("\nMigration durchführen? [j/N]: ").strip().lower()
if confirm not in ('j', 'ja', 'y', 'yes'):
print("Abgebrochen.")
return None
return plan
def _ask_new_network(self) -> str | None:
"""Ask for the new network."""
while True:
network = input("Neues Netzwerk (z.B. 172.0.2.0/16): ").strip()
if not network:
print("Abgebrochen.")
return None
try:
ipaddress.ip_network(network, strict=False)
return network
except ValueError as e:
print(f" Ungültiges Netzwerk: {e}")
def _ask_gateway(self, network: ipaddress.IPv4Network) -> str:
"""Ask for the gateway in the new network."""
        # Suggest the first usable address as gateway without
        # materializing the full host list of a large network
        suggested = str(next(network.hosts()))
user_input = input(f"Neues Gateway [{suggested}]: ").strip()
return user_input or suggested
def _suggest_new_ip(self, old_ip: str, new_network: str) -> str:
"""Suggest a new IP by keeping the host part from the old IP."""
old = ipaddress.ip_address(old_ip)
new_net = ipaddress.ip_network(new_network, strict=False)
# Keep the last octet(s) from the old IP
old_host = int(old) & 0xFF # last octet
if new_net.prefixlen <= 16:
# For /16 or bigger, keep last two octets
old_host = int(old) & 0xFFFF
new_ip = ipaddress.ip_address(int(new_net.network_address) | old_host)
return str(new_ip)
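The suggestion logic keeps the host part of the old address: the last octet for networks narrower than /16, the last two octets for /16 and wider. A standalone copy of that logic for illustration:

```python
import ipaddress

def suggest_new_ip(old_ip: str, new_network: str) -> str:
    """Keep the last octet (or last two for /16 and wider) of old_ip."""
    old = int(ipaddress.ip_address(old_ip))
    net = ipaddress.ip_network(new_network, strict=False)
    host = old & (0xFFFF if net.prefixlen <= 16 else 0xFF)
    return str(ipaddress.ip_address(int(net.network_address) | host))

a = suggest_new_ip("192.168.0.101", "172.16.2.0/24")   # keeps .101
b = suggest_new_ip("192.168.5.101", "10.99.0.0/16")    # keeps .5.101
```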
def _show_preview(self, plan: MigrationPlan):
"""Show a preview of all planned changes."""
print("\n" + "=" * 60)
print(" MIGRATION PREVIEW")
print("=" * 60)
ip_mapping = {n.current_ip: n.new_ip for n in plan.nodes if n.new_ip}
print(f"\n Netzwerk: {plan.old_network} -> {plan.new_network}")
print(f" Gateway: {plan.new_gateway}")
print(f" Bridge: {plan.bridge_name}")
print(f" Quorum verfügbar: {'Ja' if plan.quorum_available else 'NEIN'}")
print("\n [Node IP-Mapping]")
for node in plan.nodes:
status = "erreichbar" if node.is_reachable else "NICHT ERREICHBAR"
print(f" {node.name}: {node.current_ip} -> {node.new_ip} ({status})")
if plan.ceph_config:
print("\n [Ceph Netzwerke]")
print(f" Public: {plan.ceph_config.public_network} -> {plan.ceph_new_public_network}")
print(f" Cluster: {plan.ceph_config.cluster_network} -> {plan.ceph_new_cluster_network}")
if plan.ceph_config.mon_hosts:
print(f" MON Hosts: {', '.join(plan.ceph_config.mon_hosts)}")
new_mons = [ip_mapping.get(h, h) for h in plan.ceph_config.mon_hosts]
print(f" -> {', '.join(new_mons)}")
print("\n [Dateien die geändert werden]")
print(" - /etc/network/interfaces (auf jedem Node)")
print(" - /etc/hosts (auf jedem Node)")
print(" - /etc/corosync/corosync.conf (auf jedem Node)")
if plan.ceph_config:
if plan.quorum_available:
print(" - /etc/pve/ceph.conf (über Cluster-FS)")
else:
print(" - /etc/ceph/ceph.conf (direkt, da kein Quorum)")
if not plan.quorum_available:
print("\n [!] WARNUNG: Kein Quorum verfügbar!")
print(" Es wird 'pvecm expected 1' verwendet um Quorum zu erzwingen.")
print(" Ceph-Config wird direkt auf jedem Node geschrieben.")
print("\n" + "=" * 60)
def generate_new_configs(self, plan: MigrationPlan) -> dict:
"""Generate all new configuration file contents.
Returns dict with:
'corosync': new corosync.conf content
'ceph': new ceph.conf content (or None)
'nodes': {node_name: {'interfaces': content, 'hosts': content}}
"""
ip_mapping = {n.current_ip: n.new_ip for n in plan.nodes if n.new_ip}
configs = {
'corosync': None,
'ceph': None,
'nodes': {},
}
# Generate new corosync.conf
if plan.corosync_config:
configs['corosync'] = generate_corosync_conf(
plan.corosync_config, ip_mapping
)
# Generate new ceph.conf
if plan.ceph_config:
configs['ceph'] = generate_ceph_conf(
plan.ceph_config, ip_mapping,
plan.ceph_new_public_network,
plan.ceph_new_cluster_network,
)
# Generate per-node configs
new_cidr = ipaddress.ip_network(plan.new_network, strict=False).prefixlen
# Detect old gateway from first reachable node
old_gateway = None
for node in plan.nodes:
for iface in node.interfaces:
if iface.name == plan.bridge_name and iface.gateway:
old_gateway = iface.gateway
break
if old_gateway:
break
for node in plan.nodes:
if not node.new_ip or not node.network_interfaces_content:
continue
node_configs = {}
# Network interfaces
node_configs['interfaces'] = generate_network_interfaces(
node.network_interfaces_content,
node.current_ip, node.new_ip,
new_cidr, plan.new_gateway, old_gateway,
)
# /etc/hosts
node_configs['hosts'] = generate_hosts(
node.hosts_content, ip_mapping
)
configs['nodes'][node.name] = node_configs
return configs
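`generate_hosts` itself lives in config_parser.py, outside this excerpt; conceptually it substitutes whole addresses in /etc/hosts according to `ip_mapping`. A minimal standalone sketch of that idea, under the assumption of a line-oriented "IP  names" format (our illustration, not the tool's actual code):

```python
import re

def rewrite_hosts(content: str, ip_mapping: dict[str, str]) -> str:
    """Replace whole IPs at line start; leave comments and other text alone."""
    out = []
    for line in content.splitlines():
        m = re.match(r"^(\S+)(\s+.*)$", line)
        if m and m.group(1) in ip_mapping:
            line = ip_mapping[m.group(1)] + m.group(2)
        out.append(line)
    return "\n".join(out) + "\n"

hosts = "127.0.0.1 localhost\n192.168.0.101 pve1.local pve1\n# 192.168.0.101 old note\n"
new = rewrite_hosts(hosts, {"192.168.0.101": "172.16.2.101"})
```

Matching only the first whitespace-delimited field avoids rewriting IPs that merely appear inside comments or hostnames.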

requirements.txt Normal file
@ -0,0 +1,2 @@
# Proxmox Cluster Network Changer
# Keine externen Dependencies - nutzt nur Python stdlib + system ssh

rescue.py Normal file
@ -0,0 +1,228 @@
"""Emergency/rescue network - temporary network for SSH communication.

If the nodes end up in different subnets and can no longer reach each
other, a temporary emergency network is set up:
- every node gets an additional IP on the bridge (e.g. vmbr0)
- the tool can then work over this network via SSH
- after the migration the emergency IPs are removed again
"""
import ipaddress
import time
from models import NodeInfo, CorosyncConfig
from config_parser import parse_corosync_conf
from ssh_manager import SSHManager
class RescueNetwork:
"""Manages an emergency network for broken clusters."""
def __init__(self, ssh: SSHManager):
self.ssh = ssh
self.rescue_subnet: str = ""
self.rescue_ips: dict[str, str] = {} # node_name -> rescue_ip
self.bridge: str = "vmbr0"
self.active: bool = False
def setup_interactive(self, corosync: CorosyncConfig) -> list[NodeInfo] | None:
"""Interactively set up the rescue network.
Returns updated node list with rescue IPs as ssh_host, or None on abort.
"""
print("\n" + "=" * 60)
print(" RESCUE NETZWERK")
print("=" * 60)
print()
print(" Dieses Feature richtet ein temporäres Netzwerk ein,")
print(" damit alle Nodes sich wieder per SSH erreichen können.")
print()
print(" Ablauf:")
print(" 1. Du gibst ein freies Subnetz an (z.B. 10.99.99.0/24)")
print(" 2. Das Tool zeigt dir für jeden Node den Befehl an")
print(" 3. Du führst die Befehle manuell auf jedem Node aus")
print(" (z.B. über IPMI/iLO/iDRAC/KVM-Konsole)")
print(" 4. Danach kann das Tool alle Nodes per SSH erreichen")
print()
# Ask for bridge
user_input = input(f" Bridge für Emergency-IPs [{self.bridge}]: ").strip()
if user_input:
self.bridge = user_input
# Ask for rescue subnet
while True:
subnet_input = input(" Emergency Subnetz (z.B. 10.99.99.0/24): ").strip()
if not subnet_input:
print(" Abgebrochen.")
return None
try:
subnet = ipaddress.ip_network(subnet_input, strict=False)
self.rescue_subnet = str(subnet)
break
except ValueError as e:
print(f" Ungültiges Subnetz: {e}")
# Generate IPs for all nodes
hosts = list(subnet.hosts())
print()
print(" " + "-" * 56)
print(f" Emergency Subnetz: {self.rescue_subnet}")
print(f" Bridge: {self.bridge}")
print(" " + "-" * 56)
print()
nodes = []
for i, cs_node in enumerate(corosync.nodes):
if i >= len(hosts):
print(f" [!] FEHLER: Nicht genug IPs im Subnetz für alle Nodes!")
return None
rescue_ip = str(hosts[i])
self.rescue_ips[cs_node.name] = rescue_ip
cidr = subnet.prefixlen
node = NodeInfo(
name=cs_node.name,
current_ip=cs_node.ring0_addr,
ssh_host=rescue_ip, # Use rescue IP for SSH
)
nodes.append(node)
# Show command for this node
cmd = f"ip addr add {rescue_ip}/{cidr} dev {self.bridge}"
print(f" {cs_node.name} ({cs_node.ring0_addr}):")
print(f" Rescue-IP: {rescue_ip}/{cidr}")
print(f" Befehl: {cmd}")
print()
# Apply locally
print(" " + "-" * 56)
print()
# Find local node
import socket
local_hostname = socket.gethostname()
local_node = None
for node in nodes:
if node.name == local_hostname:
local_node = node
node.is_local = True
break
if local_node and local_node.name in self.rescue_ips:
rescue_ip = self.rescue_ips[local_node.name]
cidr = ipaddress.ip_network(self.rescue_subnet, strict=False).prefixlen
print(f" Lokaler Node erkannt: {local_node.name}")
answer = input(
f" Emergency-IP {rescue_ip}/{cidr} auf {self.bridge} "
f"automatisch setzen? [J/n]: "
).strip().lower()
if answer not in ('n', 'nein', 'no'):
                rc, _, err = self.ssh.execute_local(
                    f"ip addr add {rescue_ip}/{cidr} dev {self.bridge}"
                )
                # rc != 0 with "File exists" just means the IP is already set
                if rc == 0 or "File exists" in err:
                    print(f" -> {rescue_ip}/{cidr} auf {self.bridge} gesetzt")
                else:
                    print(f" -> WARNUNG: {err}")
                local_node.is_reachable = True # It's local, always reachable
else:
local_node.is_reachable = True
# Wait for user to configure other nodes
print()
print(" " + "=" * 56)
print(" Bitte führe jetzt die oben genannten Befehle auf den")
print(" anderen Nodes aus (IPMI/iLO/iDRAC/KVM-Konsole).")
print(" " + "=" * 56)
print()
input(" Drücke ENTER wenn alle Nodes konfiguriert sind...")
# Test connectivity
print()
print(" [Verbindungstest]")
all_ok = True
for node in nodes:
if node.is_local:
print(f" {node.name}: OK (lokal)")
continue
rescue_ip = self.rescue_ips[node.name]
reachable = self.ssh.is_reachable(rescue_ip)
if reachable:
print(f" {node.name} ({rescue_ip}): OK")
node.is_reachable = True
else:
print(f" {node.name} ({rescue_ip}): NICHT ERREICHBAR")
all_ok = False
if not all_ok:
print()
print(" [!] Nicht alle Nodes erreichbar!")
answer = input(" Trotzdem fortfahren? [j/N]: ").strip().lower()
if answer not in ('j', 'ja', 'y', 'yes'):
self.cleanup(nodes)
return None
self.active = True
print()
print(" Rescue-Netzwerk aktiv. Migration kann starten.")
return nodes
def cleanup(self, nodes: list[NodeInfo]):
"""Remove emergency IPs from all nodes."""
if not self.active and not self.rescue_ips:
return
print("\n [Rescue] Emergency-IPs entfernen...")
cidr = ipaddress.ip_network(self.rescue_subnet, strict=False).prefixlen
for node in nodes:
if node.name not in self.rescue_ips:
continue
rescue_ip = self.rescue_ips[node.name]
cmd = f"ip addr del {rescue_ip}/{cidr} dev {self.bridge} 2>/dev/null"
            rc = -1
            if node.is_local:
                rc, _, _ = self.ssh.execute_local(cmd)
            elif node.is_reachable:
                # Try to reach via new IP first (after migration), then rescue IP
                if node.new_ip:
                    rc, _, _ = self.ssh.execute(node.new_ip, cmd)
                else:
                    rc, _, _ = self.ssh.execute(rescue_ip, cmd)
            status = "entfernt" if rc == 0 else "FEHLER"
            print(f" {node.name}: {rescue_ip}/{cidr} {status}")
self.active = False
print(" [Rescue] Emergency-IPs entfernt.")
def get_rescue_commands(self, corosync: CorosyncConfig,
subnet: str, bridge: str = "vmbr0") -> list[dict]:
"""Generate rescue commands without interactive prompts.
Returns list of {name, ip, cidr, command, current_ip}
"""
network = ipaddress.ip_network(subnet, strict=False)
hosts = list(network.hosts())
commands = []
for i, cs_node in enumerate(corosync.nodes):
if i >= len(hosts):
break
rescue_ip = str(hosts[i])
cidr = network.prefixlen
commands.append({
'name': cs_node.name,
'current_ip': cs_node.ring0_addr,
'ip': rescue_ip,
'cidr': cidr,
'command': f"ip addr add {rescue_ip}/{cidr} dev {bridge}",
'remove_command': f"ip addr del {rescue_ip}/{cidr} dev {bridge}",
})
return commands
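Allocation is purely positional: node i receives the i-th usable host address of the rescue subnet. That behavior can be checked with `ipaddress` alone (standalone helper mirroring `get_rescue_commands`):

```python
import ipaddress
from itertools import islice

def rescue_ips(subnet: str, node_names: list[str]) -> dict[str, str]:
    """Assign the first usable addresses of subnet to the nodes, in order."""
    net = ipaddress.ip_network(subnet, strict=False)
    hosts = islice(net.hosts(), len(node_names))  # lazy: safe for wide subnets
    return {name: str(ip) for name, ip in zip(name_list, hosts)} if (name_list := node_names) else {}

mapping = rescue_ips("10.99.99.0/24", ["pve1", "pve2", "pve3"])
```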

ssh_manager.py Normal file
@ -0,0 +1,140 @@
"""SSH connection manager for remote Proxmox nodes."""
import subprocess
from typing import Optional
class SSHManager:
"""Manages SSH connections to Proxmox nodes using system ssh."""
def __init__(self, ssh_user: str = "root", ssh_key: Optional[str] = None,
ssh_port: int = 22):
self.ssh_user = ssh_user
self.ssh_key = ssh_key
self.ssh_port = ssh_port
def _build_ssh_cmd(self, host: str, command: str) -> list[str]:
"""Build the ssh command list."""
cmd = [
"ssh",
"-o", "StrictHostKeyChecking=no",
"-o", "ConnectTimeout=10",
"-o", "BatchMode=yes",
"-p", str(self.ssh_port),
]
if self.ssh_key:
cmd.extend(["-i", self.ssh_key])
cmd.append(f"{self.ssh_user}@{host}")
cmd.append(command)
return cmd
def execute(self, host: str, command: str, timeout: int = 30) -> tuple[int, str, str]:
"""Execute a command on a remote host via SSH.
Returns: (return_code, stdout, stderr)
"""
cmd = self._build_ssh_cmd(host, command)
try:
result = subprocess.run(
cmd,
capture_output=True,
text=True,
timeout=timeout,
)
return result.returncode, result.stdout, result.stderr
except subprocess.TimeoutExpired:
return -1, "", f"SSH command timed out after {timeout}s"
except Exception as e:
return -1, "", str(e)
def read_file(self, host: str, path: str) -> tuple[bool, str]:
"""Read a file from a remote host.
Returns: (success, content)
"""
rc, stdout, stderr = self.execute(host, f"cat {path}")
if rc == 0:
return True, stdout
return False, stderr
def write_file(self, host: str, path: str, content: str) -> tuple[bool, str]:
"""Write content to a file on a remote host.
Returns: (success, message)
"""
        # Use a quoted heredoc via ssh: the quoted delimiter prevents any
        # shell expansion of the file content on the remote side
        cmd = self._build_ssh_cmd(host, f"cat > {path} << 'PROXMOX_NET_EOF'\n{content}\nPROXMOX_NET_EOF")
try:
result = subprocess.run(
cmd,
capture_output=True,
text=True,
timeout=30,
)
if result.returncode == 0:
return True, "OK"
return False, result.stderr
except Exception as e:
return False, str(e)
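The quoted heredoc delimiter (`'PROXMOX_NET_EOF'`) is what keeps `$`, backticks and quotes in the config content from being expanded by the remote shell. The same construct can be exercised locally (assuming a POSIX `sh` is available):

```python
import os
import subprocess
import tempfile

content = "gateway 172.16.2.1\n# literal $HOME and `date` must survive\n"
with tempfile.TemporaryDirectory() as d:
    path = os.path.join(d, "interfaces")
    # Quoted delimiter => heredoc body is taken verbatim, no substitution
    script = f"cat > {path} << 'PROXMOX_NET_EOF'\n{content}PROXMOX_NET_EOF\n"
    subprocess.run(["sh", "-c", script], check=True)
    written = open(path).read()
```

With an unquoted delimiter, `$HOME` would have been expanded and the backticks executed; quoting the delimiter is the standard way to transfer config files verbatim over ssh without scp.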
def is_reachable(self, host: str) -> bool:
"""Check if a host is reachable via SSH."""
rc, _, _ = self.execute(host, "echo ok", timeout=10)
return rc == 0
def execute_local(self, command: str, timeout: int = 30) -> tuple[int, str, str]:
"""Execute a command locally.
Returns: (return_code, stdout, stderr)
"""
try:
result = subprocess.run(
command,
shell=True,
capture_output=True,
text=True,
timeout=timeout,
)
return result.returncode, result.stdout, result.stderr
except subprocess.TimeoutExpired:
return -1, "", f"Command timed out after {timeout}s"
except Exception as e:
return -1, "", str(e)
def read_local_file(self, path: str) -> tuple[bool, str]:
"""Read a local file."""
try:
with open(path, 'r') as f:
return True, f.read()
except Exception as e:
return False, str(e)
def write_local_file(self, path: str, content: str) -> tuple[bool, str]:
"""Write a local file."""
try:
with open(path, 'w') as f:
f.write(content)
return True, "OK"
except Exception as e:
return False, str(e)
def run_on_node(self, host: str, command: str, is_local: bool = False,
timeout: int = 30) -> tuple[int, str, str]:
"""Run a command on a node (local or remote)."""
if is_local:
return self.execute_local(command, timeout)
return self.execute(host, command, timeout)
def read_node_file(self, host: str, path: str, is_local: bool = False) -> tuple[bool, str]:
"""Read a file from a node (local or remote)."""
if is_local:
return self.read_local_file(path)
return self.read_file(host, path)
def write_node_file(self, host: str, path: str, content: str,
is_local: bool = False) -> tuple[bool, str]:
"""Write a file to a node (local or remote)."""
if is_local:
return self.write_local_file(path, content)
return self.write_file(host, path, content)

verifier.py Normal file
@ -0,0 +1,112 @@
"""Phase 5: Verify the migration was successful."""
import time
from models import MigrationPlan
from ssh_manager import SSHManager
class Verifier:
"""Verifies the cluster state after migration."""
def __init__(self, ssh: SSHManager):
self.ssh = ssh
def run(self, plan: MigrationPlan) -> bool:
"""Run all verification checks."""
print("\n=== Phase 5: Verifikation ===\n")
all_ok = True
# Check node reachability on new IPs
print("[Node-Erreichbarkeit (neue IPs)]")
for node in plan.nodes:
if not node.new_ip:
continue
if node.is_local:
# Check local IP
rc, stdout, _ = self.ssh.execute_local(
f"ip addr show | grep -q '{node.new_ip}'"
)
reachable = rc == 0
else:
reachable = self.ssh.is_reachable(node.new_ip)
status = "OK" if reachable else "FEHLER"
print(f" {node.name} ({node.new_ip}): {status}")
if not reachable:
all_ok = False
# Check cluster status
print("\n[Cluster Status]")
rc, stdout, _ = self.ssh.execute_local("pvecm status 2>/dev/null")
if rc == 0:
# Extract relevant info
for line in stdout.split('\n'):
line = line.strip()
if any(k in line for k in ['Quorate:', 'Nodes:', 'Node name',
'Total votes', 'Expected votes']):
print(f" {line}")
            # Column-aligned output ("Quorate:          Yes") - don't
            # match on a single space
            quorate = any(
                line.split(":", 1)[1].strip() == "Yes"
                for line in stdout.splitlines()
                if line.strip().startswith("Quorate:")
            )
            if not quorate:
                print(" [!] WARNUNG: Cluster hat KEIN Quorum!")
                all_ok = False
else:
print(" [!] pvecm status fehlgeschlagen")
all_ok = False
# Check corosync members
print("\n[Corosync Members]")
rc, stdout, _ = self.ssh.execute_local("corosync-cmapctl 2>/dev/null | grep 'ip(' || true")
if rc == 0 and stdout.strip():
for line in stdout.strip().split('\n'):
print(f" {line.strip()}")
else:
print(" Keine Corosync-Member-Info verfügbar")
# Check Ceph if it was configured
if plan.ceph_config:
print("\n[Ceph Status]")
rc, stdout, _ = self.ssh.execute_local("ceph -s 2>/dev/null")
if rc == 0:
for line in stdout.split('\n'):
line = line.strip()
if line:
print(f" {line}")
else:
print(" [!] ceph -s fehlgeschlagen")
all_ok = False
print("\n[Ceph MON Status]")
rc, stdout, _ = self.ssh.execute_local("ceph mon stat 2>/dev/null")
if rc == 0:
print(f" {stdout.strip()}")
else:
print(" [!] ceph mon stat fehlgeschlagen")
print("\n[Ceph OSD Status]")
rc, stdout, _ = self.ssh.execute_local("ceph osd tree 2>/dev/null")
if rc == 0:
for line in stdout.split('\n')[:20]: # First 20 lines
if line.strip():
print(f" {line}")
# Summary
print("\n" + "=" * 60)
if all_ok:
print(" MIGRATION ERFOLGREICH!")
print(" Alle Checks bestanden.")
else:
print(" MIGRATION MIT WARNUNGEN ABGESCHLOSSEN")
print(" Einige Checks sind fehlgeschlagen. Bitte manuell prüfen!")
print("=" * 60)
# Suggest next steps
print("\n[Empfohlene nächste Schritte]")
print(" 1. VMs/CTs auf allen Nodes prüfen: qm list / pct list")
print(" 2. Live-Migration testen: qm migrate <vmid> <target>")
print(" 3. Ceph Recovery abwarten: ceph -w")
if not all_ok:
print(" 4. Bei Problemen Backup wiederherstellen:")
print(" ls /root/network-migration-backup-*/")
return all_ok
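The cluster-status check filters `pvecm status` output down to a few key lines and then tests for quorum. That extraction can be exercised against canned output (the sample below is abbreviated and illustrative; note the column-aligned value, which is why a whitespace-tolerant Quorate check is safer than matching `"Quorate: Yes"` literally):

```python
SAMPLE = """\
Quorum information
------------------
Nodes:            3
Node ID:          0x00000001
Quorate:          Yes

Votequorum information
----------------------
Expected votes:   3
Total votes:      3
"""

KEYS = ('Quorate:', 'Nodes:', 'Node name', 'Total votes', 'Expected votes')

# Keep only the lines the verifier reports
interesting = [l.strip() for l in SAMPLE.splitlines()
               if any(k in l for k in KEYS)]
# Whitespace-tolerant quorum test
quorate = any(l.startswith('Quorate:') and l.split(':', 1)[1].strip() == 'Yes'
              for l in interesting)
```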