# Feasibility Study: Vision Sensor for AMR ROS2 K4

> **Date**: 2026-03-20
> **Scope**: Color/object recognition, object counting, Blockly integration
> **Platform**: Raspberry Pi 4/5 (linux-aarch64) + Desktop (linux-64)

---
## 1. Executive Summary

Implementing a vision sensor on the Kiwi Wheel AMR is feasible using **OpenCV with HSV color thresholding** as the primary approach. This approach is computationally lightweight (15-30 FPS on a Raspberry Pi 4 at 640x480 resolution), requires no GPU or ML model, and integrates directly into the existing Blockly architecture using the same pattern as odometry (fetch once, extract many).

**Recommendation**: Start with Phase 1 (MVP) — OpenCV direct capture, HSV thresholding, 4 Blockly blocks, color profile JSON. No ROS2 image pipeline is needed at first.

---
## 2. Requirements Analysis

Based on the brief in readme.md, there are 3 main requirements:

### R1: Color and Object Recognition + Color Training Procedure

- Detect objects by color within the camera frame
- Users can define (train) new colors through a user-friendly procedure
- Return data for detected objects: color label, position in frame, bounding box

### R2: Object Counting (Ordered Left to Right)

- Count objects arranged sequentially in the camera's view
- Order by horizontal position (x-coordinate), left to right
- Return: total count and the individual position of each object

### R3: Blockly App Integration

- Vision blocks must integrate into the existing Blockly visual programming environment
- Follow the established patterns: JS block registration, handler decorator, ROS2 action
- Users can use vision blocks in a Blockly program without writing code

---
## 3. Hardware Options — Camera for Raspberry Pi

### Option A: Raspberry Pi Camera Module v2 / v3

| Aspect | Detail |
|--------|--------|
| Interface | CSI (MIPI) via ribbon cable |
| Resolution | 8 MP (v2), 12 MP (v3), autofocus on v3 |
| Pros | Native Pi support, low latency, hardware-accelerated capture via `libcamera`/`picamera2` |
| Cons | Short cable, limited mounting positions, CSI not available in all Pi configurations |
| Price | ~$25 (v2), ~$35 (v3) |

### Option B: USB Webcam (Logitech C270, C920, or similar)

| Aspect | Detail |
|--------|--------|
| Interface | USB (V4L2) |
| Resolution | 720p - 1080p |
| Pros | Plug and play, long USB cable, easy mounting, widely available, works out of the box with OpenCV `VideoCapture` |
| Cons | Higher latency than CSI, USB bandwidth contention on the Pi, USB power draw |
| Price | ~$20 (C270), ~$60 (C920) |

### Recommendation

**USB webcam for prototyping, CSI camera for production.**

Both camera types appear as `/dev/video*` on Linux through V4L2. The node should abstract camera access so either can be used — just switch the device path via a ROS2 parameter.

```
Camera (CSI or USB)
    ↓ V4L2 (/dev/video0)
OpenCV VideoCapture
    ↓
amr_vision_node
```
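The device abstraction above can be sketched as a small helper. This is a sketch, not the node's actual implementation: the names `parse_device` and `open_camera` are illustrative, and the device string is assumed to come from the `camera_device` ROS2 parameter described later.

```python
def parse_device(device: str):
    """Normalize a camera spec: a bare digit string becomes an int index
    for cv2.VideoCapture, anything else stays a V4L2 device path."""
    return int(device) if device.isdigit() else device


def open_camera(device: str = "/dev/video0", width: int = 640, height: int = 480):
    """Open either a CSI or USB camera through V4L2.

    Switching between camera types only changes the `device` string,
    e.g. "/dev/video0" vs "0".
    """
    import cv2  # imported lazily so this module loads without OpenCV installed

    cap = cv2.VideoCapture(parse_device(device))
    cap.set(cv2.CAP_PROP_FRAME_WIDTH, width)
    cap.set(cv2.CAP_PROP_FRAME_HEIGHT, height)
    return cap
```

With this in place, swapping a USB webcam for a CSI camera is a one-parameter change.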
---

## 4. Software Stack

### 4.1 OpenCV — Core Library (Recommended)

- Available on conda-forge as `py-opencv`
- Runs on `linux-64` and `linux-aarch64`
- Provides every function needed: color space conversion, thresholding, contour detection, morphological operations
- Lightweight, well supported on the Raspberry Pi
- No GPU required for basic color detection

### 4.2 ROS2 Vision Packages (Optional, Phase 2)

| Package | Purpose |
|---------|---------|
| `ros-jazzy-cv-bridge` | Conversion between ROS2 `sensor_msgs/Image` and OpenCV `cv::Mat` |
| `ros-jazzy-image-transport` | Efficient image publishing with compression |
| `ros-jazzy-camera-info-manager` | Camera calibration management |

**Note**: Availability of the packages above in the RoboStack `robostack-jazzy` channel for `linux-aarch64` still needs to be verified. If they are unavailable, use OpenCV `VideoCapture` directly (the Phase 1 approach).

### 4.3 Fallback: OpenCV Direct (Phase 1)

For Phase 1, use OpenCV `VideoCapture` directly, without a ROS2 image pipeline:

```python
import cv2

cap = cv2.VideoCapture("/dev/video0")  # or device index 0
cap.set(cv2.CAP_PROP_FRAME_WIDTH, 640)
cap.set(cv2.CAP_PROP_FRAME_HEIGHT, 480)

ret, frame = cap.read()  # BGR numpy array
```

This approach has **zero additional ROS2 dependencies** and is sufficient for everything in Phase 1.
---

## 5. Color Recognition — HSV Thresholding

### 5.1 Pipeline

The HSV (Hue-Saturation-Value) color space is more robust to lighting variation than RGB because it separates color information (Hue) from light intensity (Value).

```
Frame (BGR)
    ↓ cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
Frame (HSV)
    ↓ cv2.inRange(hsv, lower_bound, upper_bound)
Binary Mask (0/255)
    ↓ cv2.erode() + cv2.dilate()   ← morphological cleanup
Clean Mask
    ↓ cv2.findContours()
Contours
    ↓ filter by area (reject noise)
Detected Objects
```
### 5.2 Example Implementation

```python
import cv2
import numpy as np

def detect_color(frame, lower_hsv, upper_hsv, min_area=500):
    """Detect objects of a specific color in a BGR frame.

    Args:
        frame: BGR image from camera
        lower_hsv: (H, S, V) lower bound, e.g. (0, 100, 100)
        upper_hsv: (H, S, V) upper bound, e.g. (10, 255, 255)
        min_area: minimum contour area in pixels to filter noise

    Returns:
        List of detected objects: [{x, y, w, h, area, cx, cy}, ...]
    """
    hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv, np.array(lower_hsv), np.array(upper_hsv))

    # Morphological cleanup — remove small noise, fill small holes
    kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (5, 5))
    mask = cv2.erode(mask, kernel, iterations=1)
    mask = cv2.dilate(mask, kernel, iterations=2)

    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)

    objects = []
    for cnt in contours:
        area = cv2.contourArea(cnt)
        if area < min_area:
            continue
        x, y, w, h = cv2.boundingRect(cnt)
        cx, cy = x + w // 2, y + h // 2  # centroid
        objects.append({"x": x, "y": y, "w": w, "h": h, "area": int(area), "cx": cx, "cy": cy})

    return objects
```
### 5.3 Color Training Procedure

**Goal**: Users can define new colors without writing code. The procedure is run from a Blockly block.

**Steps**:

1. **Preparation**: The user places a reference object of the target color in front of the camera, under consistent lighting
2. **Capture**: The node grabs N frames (default: 10) from the camera
3. **Sampling**: From each frame, take a Region of Interest (ROI) at the center of the frame (default: 50x50 pixels)
4. **Calculation**: Compute the median HSV over all ROI samples, and set the range to `median ± tolerance`
5. **Save**: The color profile is stored as a JSON file

**Color Profile Format** (`~/.amr_vision/colors.json`):

```json
{
  "colors": {
    "red": {
      "lower_hsv": [0, 100, 100],
      "upper_hsv": [10, 255, 255],
      "trained_at": "2026-03-20T10:30:00",
      "samples": 10,
      "tolerance": 15
    },
    "blue": {
      "lower_hsv": [100, 100, 100],
      "upper_hsv": [130, 255, 255],
      "trained_at": "2026-03-20T10:35:00",
      "samples": 10,
      "tolerance": 15
    }
  }
}
```
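The schema above can be read and written with a small stdlib helper. A sketch: the function names `load_color_profiles` and `save_color_profile` are illustrative, not part of the existing codebase.

```python
import json
from pathlib import Path

DEFAULT_COLORS_FILE = Path("~/.amr_vision/colors.json").expanduser()


def load_color_profiles(path: Path = DEFAULT_COLORS_FILE) -> dict:
    """Return the {"colors": {...}} mapping, or an empty one if the file is missing."""
    if not path.exists():
        return {"colors": {}}
    return json.loads(path.read_text())


def save_color_profile(name: str, profile: dict, path: Path = DEFAULT_COLORS_FILE) -> None:
    """Insert or overwrite one named color profile and write the file back."""
    data = load_color_profiles(path)
    data["colors"][name] = profile
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(data, indent=2))
```

Keeping the file format as plain JSON means trained colors survive node restarts and can be inspected or hand-edited by the user.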
**Example Training Algorithm**:

```python
def train_color(cap, color_name, roi_size=50, num_samples=10, tolerance=15):
    """Train a color by sampling the center of the camera frame.

    Args:
        cap: OpenCV VideoCapture object
        color_name: name for the trained color (e.g. "red")
        roi_size: size of the square ROI at frame center
        num_samples: number of frames to sample
        tolerance: HSV range tolerance (+/-)

    Returns:
        Color profile dict with lower_hsv and upper_hsv
    """
    hsv_samples = []

    for _ in range(num_samples):
        ret, frame = cap.read()
        if not ret:
            continue
        hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)

        h, w = hsv.shape[:2]
        cx, cy = w // 2, h // 2
        half = roi_size // 2
        roi = hsv[cy - half:cy + half, cx - half:cx + half]

        # Median HSV of the ROI
        median_hsv = np.median(roi.reshape(-1, 3), axis=0)
        hsv_samples.append(median_hsv)

    if not hsv_samples:
        raise RuntimeError("camera returned no frames during training")

    overall_median = np.median(hsv_samples, axis=0).astype(int)

    lower = np.clip(overall_median - tolerance, [0, 0, 0], [179, 255, 255]).tolist()
    upper = np.clip(overall_median + tolerance, [0, 0, 0], [179, 255, 255]).tolist()

    return {
        "lower_hsv": lower,
        "upper_hsv": upper,
        "samples": num_samples,
        "tolerance": tolerance,
    }
```

**Note on Hue wrapping**: Red has Hue values near both ends of OpenCV's Hue range (around 0° and 180°). To handle this, the training procedure must detect whether the Hue samples cluster at both ends of the range and produce two separate ranges whose masks are combined with a bitwise OR.
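The two-range handling for red can be sketched in pure NumPy. In the real pipeline this would be two `cv2.inRange` calls combined with `cv2.bitwise_or`; the helper name `red_mask` and the fixed band edges (10 and 170) are illustrative assumptions.

```python
import numpy as np


def red_mask(hsv: np.ndarray, sv_min: int = 100) -> np.ndarray:
    """Binary mask (0/255) for red, whose Hue wraps around 0/180 in OpenCV.

    Two Hue bands are thresholded separately and OR-ed together,
    mirroring cv2.inRange(low_band) | cv2.inRange(high_band).
    """
    h, s, v = hsv[..., 0], hsv[..., 1], hsv[..., 2]
    sv_ok = (s >= sv_min) & (v >= sv_min)   # shared saturation/value gate
    low_band = (h <= 10) & sv_ok            # Hue near 0°
    high_band = (h >= 170) & sv_ok          # Hue near 180°
    return ((low_band | high_band) * 255).astype(np.uint8)
```

A single `median ± tolerance` range around Hue 0 would silently drop the pixels near Hue 179, which is exactly the failure mode the bimodal check prevents.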
---

## 6. Object Detection & Counting (Left-to-Right)

### 6.1 Algorithm

After color detection yields a list of contours per color:

1. **Compute the centroid** of each object: `cx = x + w/2`
2. **Sort by x-coordinate** (ascending) → automatically ordered left to right
3. **Assign a sequential index**: 1, 2, 3, ...
4. **Minimum separation filter**: if two objects are too close together (< `min_distance` pixels), merge them into one object — this avoids double-counting caused by mask fragmentation
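The four steps above can be sketched in plain Python. A sketch under stated assumptions: `min_distance` merging here folds a too-close detection into its left neighbor and keeps the first fragment, which is one reasonable reading of step 4.

```python
def count_left_to_right(objects, min_distance=20):
    """Sort detections left-to-right, merge near-duplicates, index them.

    Args:
        objects: list of dicts with at least "cx" (centroid x in pixels)
        min_distance: centroids closer than this count as one object

    Returns:
        {"count": N, "objects": [...]} with 1-based "index" fields added
    """
    ordered = sorted(objects, key=lambda o: o["cx"])  # step 2

    merged = []
    for obj in ordered:
        if merged and obj["cx"] - merged[-1]["cx"] < min_distance:
            # Step 4: likely a fragment of the previous mask, skip it
            continue
        merged.append(dict(obj))

    for i, obj in enumerate(merged):
        obj["index"] = i + 1  # step 3: 1-based, left to right

    return {"count": len(merged), "objects": merged}
```

Step 1 (centroid computation) already happens inside `detect_color`, so this helper only needs the `cx` field.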
### 6.2 Output Format

```json
{
  "count": 3,
  "objects": [
    {"index": 1, "cx": 120, "cy": 240, "w": 60, "h": 55, "color": "red", "area": 2850},
    {"index": 2, "cx": 320, "cy": 235, "w": 58, "h": 52, "color": "red", "area": 2640},
    {"index": 3, "cx": 510, "cy": 242, "w": 62, "h": 57, "color": "red", "area": 3020}
  ]
}
```
### 6.3 Multi-Color Detection

To detect multiple colors at once:

```python
def detect_all_colors(frame, color_profiles, min_area=500):
    all_objects = []
    for name, profile in color_profiles.items():
        objects = detect_color(frame, profile["lower_hsv"], profile["upper_hsv"], min_area)
        for obj in objects:
            obj["color"] = name
        all_objects.extend(objects)

    # Sort all objects left-to-right regardless of color
    all_objects.sort(key=lambda o: o["cx"])
    for i, obj in enumerate(all_objects):
        obj["index"] = i + 1

    return {"count": len(all_objects), "objects": all_objects}
```
---

## 7. ROS2 Node Design — `amr_vision_node`

### 7.1 Package Type

**ament_python** — consistent with `blockly_executor`, since all of the logic is OpenCV/Python.

### 7.2 Package Structure

```
src/amr_vision_node/
├── docs/
│   └── feasibility.md            # this document
├── amr_vision_node/
│   ├── __init__.py
│   ├── vision_node.py            # Main ROS2 node
│   ├── color_detector.py         # HSV thresholding + contour detection
│   ├── color_trainer.py          # Color training / calibration logic
│   └── config/
│       └── default_colors.json   # Default color definitions
├── resource/
│   └── amr_vision_node           # ament resource marker
├── package.xml
├── setup.py
└── setup.cfg
```
### 7.3 Node Architecture

```
amr_vision_node (Python, ROS2 Node)
│
├── Timer callback (configurable, default 10 Hz)
│   ├── Capture frame from camera (OpenCV VideoCapture)
│   ├── For each trained color: detect objects, compute bounding boxes
│   └── Cache detection results (thread-safe)
│
├── Subscriber: /vision/train (std_msgs/String)
│   ├── Receive JSON: {"color_name": "red", "roi_size": 50, "samples": 10}
│   └── Execute training procedure → save to colors.json
│
├── Publisher: /vision/detections (std_msgs/String)
│   └── Publish JSON detection results every cycle (consumed by the executor handler)
│
└── ROS2 Parameters:
    ├── camera_device: string = "/dev/video0"
    ├── frame_width: int = 640
    ├── frame_height: int = 480
    ├── publish_rate: double = 10.0
    ├── min_area: int = 500
    └── colors_file: string = "~/.amr_vision/colors.json"
```
### 7.4 Communication Pattern

This follows the 2 patterns already established in this project:

**Read pattern** (like `as5600_node` → the `odometry_read` handler):

```
amr_vision_node → publish /vision/detections (JSON string)
                         ↑
executor handler (vision_detect) ← lazy-subscribe, cache latest value
```

**Write pattern** (like the `gpio_node` write):

```
executor handler (vision_train_color) → publish /vision/train (JSON string)
                         ↓
amr_vision_node ← subscribe, execute training
```
### 7.5 Custom Messages — Not Needed for Phase 1

Detection results are returned as a **JSON string through the existing `BlocklyAction.action`** — identical to the odometry handler pattern. This avoids the need for new custom messages and keeps `blockly_interfaces` minimal.

```python
# handlers/vision.py
@handler("vision_detect")
def handle_vision_detect(params, hardware):
    color = params.get("color", "all")
    # Read from cache (lazy-subscribed to /vision/detections)
    return (True, json.dumps({"count": 3, "objects": [...]}))
```

If typed messages are needed later (Phase 2+), custom messages can be added to `blockly_interfaces`:

```
# msg/VisionDetection.msg
string color_name
uint16 x
uint16 y
uint16 width
uint16 height
uint32 area
```
### 7.6 pixi.toml Dependencies

```toml
# Add to [dependencies] or [target.linux-aarch64.dependencies]
py-opencv = "*"

# Build & run tasks
[tasks.build-vision]
cmd = "colcon build --packages-select amr_vision_node"
depends-on = ["build-interfaces"]

[tasks.vision-node]
cmd = "ros2 run amr_vision_node vision_node"
depends-on = ["build-vision"]
```

If `ros-jazzy-cv-bridge` and `ros-jazzy-image-transport` are available in RoboStack, add them for Phase 2. If not, OpenCV `VideoCapture` used directly (zero extra deps) is sufficient.
---

## 8. Blockly Integration Proposal

### 8.1 Overview — 4 Blocks, Following the Odometry Pattern

| Block | Type | Pattern | Description |
|-------|------|---------|-------------|
| `visionDetect` | ROS2 value block | mirrors `odometryRead.js` | Fetch all detections from the camera |
| `visionGetCount` | Client-side | mirrors `odometryGet.js` | Extract the object count |
| `visionGetObject` | Client-side | mirrors `odometryGet.js` | Extract a field of the N-th object |
| `visionTrainColor` | ROS2 statement | mirrors `digitalOut.js` | Trigger color training |
### 8.2 Block 1: `visionDetect` — Fetch Detections

```
┌────────────────────────────────────────────┐
│ getVision color: [All ▾]                   │ → output: Object (JSON)
└────────────────────────────────────────────┘
```

- **Dropdown**: `All`, or the name of any trained color
- **Category**: `Robot`
- **Command**: `vision_detect`

**Generator** (follows the `odometryRead.js` pattern):

```javascript
// blocks/visionDetect.js
BlockRegistry.register({
  name: 'visionDetect',
  category: 'Robot',
  categoryColor: '#5b80a5',
  color: '#8E24AA',
  tooltip: 'Fetch vision detection data — use with "set variable" block',

  definition: {
    init: function () {
      this.appendDummyInput()
        .appendField('getVision')
        .appendField(new Blockly.FieldDropdown([
          ['All', 'all'],
          ['Red', 'red'],
          ['Blue', 'blue'],
          ['Green', 'green']
        ]), 'COLOR');
      this.setOutput(true, null);
      this.setColour('#8E24AA');
      this.setTooltip('Fetch all vision detections (count, objects[]) from camera');
    }
  },

  generator: function (block) {
    var color = block.getFieldValue('COLOR');
    var code =
      'JSON.parse((await executeAction(\'vision_detect\', { color: \'' + color + '\' })).message)';
    return [code, Blockly.JavaScript.ORDER_AWAIT];
  }
});
```
### 8.3 Block 2: `visionGetCount` — Extract Count

```
┌───────────────────────────────────────────────┐
│ getVisionCount from [detection ▾]             │ → output: Number
└───────────────────────────────────────────────┘
```

**Generator** (follows the `odometryGet.js` pattern):

```javascript
// blocks/visionGetCount.js
BlockRegistry.register({
  name: 'visionGetCount',
  category: 'Robot',
  categoryColor: '#5b80a5',
  color: '#8E24AA',
  tooltip: 'Get the number of detected objects from vision data',

  definition: {
    init: function () {
      this.appendValueInput('VAR')
        .appendField('getVisionCount')
        .appendField('from');
      this.setOutput(true, 'Number');
      this.setColour('#8E24AA');
      this.setTooltip('Extract object count from vision data');
    }
  },

  generator: function (block) {
    var varCode = Blockly.JavaScript.valueToCode(
      block, 'VAR', Blockly.JavaScript.ORDER_MEMBER) || '{}';
    var code = '(' + varCode + '.count)';
    return [code, Blockly.JavaScript.ORDER_MEMBER];
  }
});
```
### 8.4 Block 3: `visionGetObject` — Extract Object Field

```
┌───────────────────────────────────────────────────────────┐
│ getVisionObject [■ index] [X ▾] from [detection ▾]        │ → output: Number
└───────────────────────────────────────────────────────────┘
```

**Generator**:

```javascript
// blocks/visionGetObject.js
BlockRegistry.register({
  name: 'visionGetObject',
  category: 'Robot',
  categoryColor: '#5b80a5',
  color: '#8E24AA',
  tooltip: 'Get a field from a detected object by index (0-based, left to right)',

  definition: {
    init: function () {
      this.appendValueInput('INDEX')
        .appendField('getVisionObject');
      this.appendDummyInput()
        .appendField(new Blockly.FieldDropdown([
          ['Center X', 'cx'],
          ['Center Y', 'cy'],
          ['Width', 'w'],
          ['Height', 'h'],
          ['Area', 'area'],
          ['Color', 'color']
        ]), 'FIELD')
        .appendField('from');
      this.appendValueInput('VAR');
      this.setInputsInline(true);
      this.setOutput(true, null);
      this.setColour('#8E24AA');
      this.setTooltip('Extract a field from detected object at index');
    }
  },

  generator: function (block) {
    var indexCode = Blockly.JavaScript.valueToCode(
      block, 'INDEX', Blockly.JavaScript.ORDER_MEMBER) || '0';
    var field = block.getFieldValue('FIELD');
    var varCode = Blockly.JavaScript.valueToCode(
      block, 'VAR', Blockly.JavaScript.ORDER_MEMBER) || '{}';
    var code = '(' + varCode + '.objects[' + indexCode + '].' + field + ')';
    return [code, Blockly.JavaScript.ORDER_MEMBER];
  }
});
```
### 8.5 Block 4: `visionTrainColor` — Train New Color

```
┌──────────────────────────────────────────────┐
│ Vision Train Color  name: [input]            │
│   ROI size: [50]  samples: [10]              │
└──────────────────────────────────────────────┘
```

**Generator**:

```javascript
// blocks/visionTrainColor.js
BlockRegistry.register({
  name: 'visionTrainColor',
  category: 'Robot',
  categoryColor: '#5b80a5',
  color: '#8E24AA',
  tooltip: 'Train a new color — place reference object in front of camera before running',

  definition: {
    init: function () {
      this.appendDummyInput()
        .appendField('trainColor')
        .appendField('name:')
        .appendField(new Blockly.FieldTextInput('red'), 'NAME');
      this.appendDummyInput()
        .appendField('ROI size:')
        .appendField(new Blockly.FieldNumber(50, 10, 200), 'ROI_SIZE')
        .appendField('samples:')
        .appendField(new Blockly.FieldNumber(10, 1, 50), 'SAMPLES');
      this.setPreviousStatement(true, null);
      this.setNextStatement(true, null);
      this.setColour('#8E24AA');
      this.setTooltip('Train a color by sampling the camera ROI');
    }
  },

  generator: function (block) {
    var name = block.getFieldValue('NAME');
    var roiSize = block.getFieldValue('ROI_SIZE');
    var samples = block.getFieldValue('SAMPLES');
    var code = 'await executeAction(\'vision_train_color\', ' +
      '{ name: \'' + name + '\', roi_size: \'' + roiSize + '\', samples: \'' + samples + '\' });\n';
    return code;
  }
});
```
### 8.6 Example Usage in Blockly

**Simple program — count red objects**:

```
┌─ Main Program ──────────────────────────────┐
│                                             │
│ set [det] to [getVision color: Red]         │
│ set [count] to [getVisionCount from [det]]  │
│ print ["Object count: " + count]            │
│                                             │
│ repeat [count] times with [i]:              │
│   set [x] to [getVisionObject [i]           │
│               [Center X] from [det]]        │
│   print ["Object " + (i+1) + " at x=" + x]  │
│                                             │
└─────────────────────────────────────────────┘
```

**Training a new color**:

```
┌─ Main Program ─────────────────────────────────┐
│                                                │
│ print ["Place the YELLOW object at the camera"]│
│ delay [3] seconds                              │
│ trainColor name: "yellow"                      │
│            ROI size: 50  samples: 10           │
│ print ["Training done!"]                       │
│                                                │
│ set [det] to [getVision color: yellow]         │
│ print ["Detected: " + getVisionCount [det]]    │
│                                                │
└────────────────────────────────────────────────┘
```
### 8.7 Python Handler — `handlers/vision.py`

```python
# handlers/vision.py — auto-discovered, no imports to update
import json
import threading

from . import handler
from .hardware import Hardware


def _get_vision_subscriber(hardware: Hardware):
    """Lazy-create subscriber for /vision/detections."""
    if not hasattr(hardware.node, "_vision_cache"):
        hardware.node._vision_cache = {}
        hardware.node._vision_lock = threading.Lock()
        hardware.node._vision_sub = None

    if hardware.node._vision_sub is None:
        from std_msgs.msg import String

        def _vision_cb(msg: String):
            with hardware.node._vision_lock:
                hardware.node._vision_cache = json.loads(msg.data)

        hardware.node._vision_sub = hardware.node.create_subscription(
            String, "/vision/detections", _vision_cb, 10
        )

    return hardware.node._vision_cache


def _get_vision_publisher(hardware: Hardware):
    """Lazy-create publisher for /vision/train."""
    if not hasattr(hardware.node, "_vision_train_pub"):
        from std_msgs.msg import String

        hardware.node._vision_train_pub = hardware.node.create_publisher(
            String, "/vision/train", 10
        )
    return hardware.node._vision_train_pub


@handler("vision_detect")
def handle_vision_detect(
    params: dict[str, str], hardware: Hardware
) -> tuple[bool, str]:
    color = params.get("color", "all")
    hardware.log(f"vision_detect(color={color})")

    data = {"count": 0, "objects": []}

    if hardware.is_real():
        cache = _get_vision_subscriber(hardware)
        with hardware.node._vision_lock:
            if cache:
                if color == "all":
                    data = cache
                else:
                    # Filter by color
                    filtered = [o for o in cache.get("objects", []) if o.get("color") == color]
                    data = {"count": len(filtered), "objects": filtered}

    return (True, json.dumps(data))


@handler("vision_train_color")
def handle_vision_train_color(
    params: dict[str, str], hardware: Hardware
) -> tuple[bool, str]:
    name = params.get("name", "unknown")
    roi_size = params.get("roi_size", "50")
    samples = params.get("samples", "10")
    hardware.log(f"vision_train_color(name={name}, roi_size={roi_size}, samples={samples})")

    if hardware.is_real():
        from std_msgs.msg import String

        pub = _get_vision_publisher(hardware)
        msg = String()
        msg.data = json.dumps({"color_name": name, "roi_size": int(roi_size), "samples": int(samples)})
        pub.publish(msg)

    return (True, f"Training color '{name}' initiated")
```
---

## 9. Implementation Phases

### Phase 1 — Minimum Viable Product (recommended starting point)

| Component | Detail |
|-----------|--------|
| Camera | OpenCV `VideoCapture` directly (no ROS2 image pipeline) |
| Detection | HSV thresholding + contour detection |
| Training | Capture ROI samples → compute HSV range → save JSON |
| Blockly | 4 blocks: `visionDetect`, `visionGetCount`, `visionGetObject`, `visionTrainColor` |
| Handler | `vision_detect`, `vision_train_color` (pattern identical to odometry) |
| Message | No custom message needed — JSON via `BlocklyAction.action` |
| Platform | Runs on Pi 4/5 and Desktop |

**Deliverables**:

- `src/amr_vision_node/` — complete ROS2 Python package
- 4 Blockly block files in `src/blockly_app/.../blocks/`
- 2 handler functions in `src/blockly_executor/.../handlers/vision.py`
- Updates to `manifest.js` and `pixi.toml`
- Integration tests
### Phase 2 — Enhanced (after Phase 1 is stable)

| Component | Detail |
|-----------|--------|
| ROS2 Image Pipeline | `cv_bridge`, `image_transport`, `sensor_msgs/Image` |
| HMI Camera Feed | Widget showing a live camera thumbnail in the HMI panel |
| ML Color Classifier | k-Nearest Neighbors (KNN) trained on HSV samples |
| Multi-Color | Detect several colors simultaneously |
| Custom Messages | `VisionDetection.msg`, `VisionDetections.msg` |

### Phase 3 — Advanced (future enhancement)

| Component | Detail |
|-----------|--------|
| YOLO Detection | YOLOv8-nano via ONNX Runtime (~5 FPS on Pi 5) |
| Object Tracking | Track objects across frames (persistent ID) |
| Shape Recognition | Detect shapes in addition to color (circle, square, etc.) |
---

## 10. Performance Estimates on Raspberry Pi

Based on published OpenCV benchmarks for the Raspberry Pi 4/5:

| Operation | Pi 4 | Pi 5 |
|-----------|------|------|
| HSV threshold + contour (640x480) | 15-30 FPS | 30+ FPS |
| Single color detection pipeline | ~10-20 ms/frame | ~5-10 ms/frame |
| 3 colors simultaneously | ~30-50 ms/frame | ~15-25 ms/frame |
| Memory usage (OpenCV + camera buffer) | ~50-100 MB | ~50-100 MB |
| YOLO v8-nano (ONNX Runtime) | ~2-3 FPS | ~5-7 FPS |

**Handler round-trip** (Blockly → executor → vision_node cache → result): adds ~10-100 ms, so the effective detection rate seen from Blockly is 5-15 Hz. This is adequate for sequential object counting, which does not require real-time tracking.
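The 5-15 Hz figure follows from adding the per-call latencies. A quick sanity check, using illustrative numbers taken from the table above (the helper name is hypothetical):

```python
def effective_rate_hz(detection_ms: float, roundtrip_ms: float) -> float:
    """Effective rate seen from a Blockly program: each handler call pays
    the per-frame detection time plus the executor round-trip."""
    return 1000.0 / (detection_ms + roundtrip_ms)


# Pi 4 single-color pipeline (~20 ms) with a mid-range round-trip (~50 ms):
# 1000 / 70 ≈ 14.3 Hz, inside the 5-15 Hz band quoted above.
rate = effective_rate_hz(20.0, 50.0)
```

At the pessimistic end (50 ms detection + 100 ms round-trip) the same formula gives ~6.7 Hz, still within the quoted band.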
---

## 11. Risks & Mitigations

| Risk | Impact | Likelihood | Mitigation |
|------|--------|------------|------------|
| RoboStack lacks `cv_bridge`/`image_transport` for aarch64 | Cannot use the ROS2 image pipeline | Medium | Phase 1 uses OpenCV `VideoCapture` directly — zero ROS2 image deps |
| HSV lighting sensitivity | Inaccurate detection when lighting changes | High | Training procedure, adjustable tolerance parameter, camera auto white balance |
| Pi overheating under continuous vision | Throttling, FPS drop | Medium | Lower the frame rate, add a heatsink/fan, configurable `publish_rate` |
| USB bandwidth contention | Frame drops | Low | Use a CSI camera, or reduce the resolution |
| Overlapping/occluded objects | Wrong count | Medium | Minimum separation filter, morphological operations, area filter |
| Hue wrapping for red | Red training fails | Medium | Detect bimodal Hue, use 2 ranges + bitwise OR |
---

## 12. Conclusion & Recommendation

### Feasibility

Implementing a vision sensor on the AMR ROS2 K4 is **feasible** using HSV color thresholding with OpenCV. This approach:

1. **Is computationally lightweight** — runs at 15-30 FPS on a Raspberry Pi 4 without a GPU
2. **Meets all requirements** — color recognition, color training, left-to-right object counting
3. **Integrates naturally** with the existing Blockly architecture — following the proven odometry pattern (fetch once, extract many via JSON)
4. **Needs no new custom messages** — JSON via `BlocklyAction.action` is sufficient for Phase 1
5. **Is incremental** — Phase 1 can start immediately; Phases 2/3 can be added when needed

### Recommended Next Steps

1. **Verify** availability of `py-opencv` in the RoboStack `linux-aarch64` channel
2. **Implement Phase 1** — `amr_vision_node`, 4 Blockly blocks, 2 handlers
3. **Test** — integration tests in dummy mode + manual tests with a USB camera on the Pi
4. **Iterate** — tune HSV parameters, add default color profiles, test under varied lighting conditions