Speech · Audio ML · Indigenous Languages

Quechua Audio Enhancement

Speech Processing for Indigenous Language Preservation

Speech · Audio ML · Quechua · Low-Resource

WIP (In Progress) · Southern Quechua · ASR (Speech domain) · Open (Open Source)

01 · context

This project is part of a broader ecosystem of NLP infrastructure for Southern Quechua, alongside QuechuaTok. It focuses on audio enhancement and speech processing for Quechua language data, addressing the severe underrepresentation of indigenous languages in speech ML.

02 · the problem

Southern Quechua has virtually no speech data in the major public corpora used to train speech models. The challenge is not just collecting audio — it is building the tooling to clean, align, and enhance recordings from field conditions, where equipment is variable and acoustic environments are complex.

03 · approach

The audio enhancement pipeline targets noise reduction, signal-to-noise ratio (SNR) improvement, and segment alignment for downstream ASR training. It is designed to run on low-resource hardware and integrate directly with the QuechuaTok tokenization pipeline.
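
As a rough illustration of what noise reduction and SNR measurement can look like on modest hardware, here is a minimal sketch using classical spectral subtraction with NumPy only. The project does not state which enhancement method it uses, so treat this as a stand-in technique, not the pipeline's implementation. The names `snr_db`, `spectral_subtract`, and `_stft` are hypothetical, and the demo uses a synthetic tone in white noise in place of a real field recording.

```python
import numpy as np


def snr_db(clean: np.ndarray, estimate: np.ndarray) -> float:
    """SNR in dB of `estimate` measured against a known clean reference."""
    residual = estimate - clean
    return float(10.0 * np.log10(np.sum(clean ** 2) / np.sum(residual ** 2)))


def _stft(x: np.ndarray, window: np.ndarray, hop: int) -> np.ndarray:
    """Slice `x` into overlapping windowed frames and return their spectra."""
    frame_len = len(window)
    n_frames = 1 + (len(x) - frame_len) // hop
    starts = hop * np.arange(n_frames)[:, None]
    frames = x[starts + np.arange(frame_len)[None, :]] * window
    return np.fft.rfft(frames, axis=1)


def spectral_subtract(noisy, noise_clip, frame_len=512, hop=128, floor=0.05):
    """Subtract an average noise magnitude spectrum from each frame.

    `noise_clip` is assumed to be a speech-free excerpt (for example, room
    tone at the start of a recording) of at least `frame_len` samples.
    """
    window = np.hanning(frame_len)
    # Average magnitude spectrum of the noise-only excerpt.
    noise_mag = np.abs(_stft(noise_clip, window, hop)).mean(axis=0)

    # Zero-pad so every original sample is covered by a full set of frames.
    padded = np.pad(noisy, (frame_len, frame_len + hop))
    spec = _stft(padded, window, hop)
    mag = np.abs(spec)
    # Keep a small spectral floor so no bin is zeroed outright.
    cleaned_mag = np.maximum(mag - noise_mag, floor * mag)
    # Reuse the noisy phase; only the magnitudes are modified.
    cleaned = cleaned_mag * np.exp(1j * np.angle(spec))
    frames = np.fft.irfft(cleaned, n=frame_len, axis=1) * window

    # Weighted overlap-add, normalised by the summed squared window.
    out = np.zeros(len(padded))
    norm = np.zeros(len(padded))
    for i, frame in enumerate(frames):
        start = i * hop
        out[start:start + frame_len] += frame
        norm[start:start + frame_len] += window ** 2
    out /= np.maximum(norm, 1e-8)
    return out[frame_len:frame_len + len(noisy)]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sr = 16_000  # assumed sample rate for the synthetic demo
    t = np.arange(sr) / sr
    clean = 0.5 * np.sin(2 * np.pi * 220.0 * t)
    clean[: sr // 2] = 0.0  # first half second is noise only
    noisy = clean + 0.1 * rng.standard_normal(sr)
    enhanced = spectral_subtract(noisy, noisy[: sr // 2])
    print("SNR before (dB):", snr_db(clean, noisy))
    print("SNR after  (dB):", snr_db(clean, enhanced))
```

Measuring SNR this way needs a clean reference, which real field recordings do not have. In practice, a synthetic mix like the one above is mainly useful for checking that an enhancement step helps before applying it to actual data.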

Currently in development — check GitHub for updates

View on GitHub → · See QuechuaTok →