Go to file

Jake 6330d452c1 added file overview to readme again		2025-10-24 21:40:26 +08:00
.gitignore	realtime.py voice-to-text added, needs to be integrated	2025-10-24 10:27:05 +00:00
README.md	added file overview to readme again	2025-10-24 21:40:26 +08:00
RetinaFace.rknn	face detect and video streaming	2025-10-24 10:11:17 +00:00
main.py	face detection and voice-to-text integrated into flask stream	2025-10-24 21:13:18 +08:00
realtime.py	realtime.py voice-to-text added, needs to be integrated	2025-10-24 10:27:05 +00:00

README.md

Little Sophia's Brain

A modular Python script designed to run on Little Sophia's Radxa ZERO 3W. It handles higher-level input processing outside the abilities of the ESP32 central controller.

Face Detection
Voice-to-Text (V2T)

Installation

Clone this repository to your Radxa ZERO 3W.
Download and place the Vosk model in the install directory:
- vosk-model-small-en-us-0.15
(Jake needs to add the rest of the prerequisites here — e.g., Python packages, environment setup, etc.)

TODO:

Tested with usb mic and camera only (Logitech C920)

File Overview

File	Description
`RetinaFace.rknn`	RKNN model for face detection and video streaming.
`main.py`	Core script. Integrates face detection and voice-to-text into Flask stream.
`realtime.py`	Voice-to-text module (Vosk). For testing separately.