Skip to content

Vowel FX confusion, or, the uncanny valley of speech synthesis #1992

@Phen-Ro

Description

@Phen-Ro

The built-in documentation for the Vowel FX reads, "Human singing voice sounds are easily achieved with a source of a saw wave with a little vibrato."
I have not been able to achieve anything remotely "human" sounding, easily or otherwise. Is this FX broken, or am I using it the wrong way? Or perhaps the documentation exaggerates. Or perhaps we hang out with very different kinds of "humans".

By passing in random arguments to the FX and a synth, I can occasionally, briefly, generate something somewhat similar to Pink Trombone. Low notes seem to work best, for a very creepy value of best.

Here's an example of such a horror show. I don't recommend playing it out loud if children are present.

with_fx :vowel do |vowel|
  live_loop :test do
    #s = synth (ring :saw, :dsaw, :mod_saw, :supersaw).tick,
    s = synth :saw,
      note: (rrand 20, 70),
      release: 4
    
    8.times do
      control s, note_slide: 0.125, note: (rrand 20, 70)
      control vowel, vowel_sound: [1,2,3,4,5].choose, voice: [0,1,2,3,4].choose
      sleep 0.5
    end
    
  end
end

Is this what it's supposed to sound like?
I'm using Sonic Pi 3.1.0 on Windows 10.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions