Tasks¶
Overview¶
The articubench benchmark provides three distinct methods for initializing the PAULE control model and generating Control Parameters (CPs).
Task Types¶
Acoustic-only (Copy-Synthesis)¶
Input: Target audio recording (human speech) and sample rate
- Purpose:
Test model’s ability to mimic human speech acoustics
Focus on articulatory and acoustic quality
No semantic information provided
Semantic-only¶
Input: Target semantic embedding vector, desired duration
- Purpose:
Generate speech from meaning alone
Test semantic-to-articulation mapping
Handle one-to-many relationship (multiple valid pronunciations)
Also known as “full generation task”
Semantic-Acoustic¶
Input: Target audio recording with sample rate AND semantic embedding vector with a desired duration
- Purpose:
Joint optimization of acoustics and meaning
Most complete evaluation scenario
Balance multiple constraints