Abstract: Given the scarcity of Code-Switching (CS) datasets, most researchers synthesize CS speech using multiple monolingual datasets. However, this approach presents challenges in synthesizing CS ...
An ESP32 client that captures audio over I2S and posts WAV to a server. A lightweight Flask/Gunicorn server that returns JSON transcriptions via speech_recognition. Designed for deterministic embedded ...