/plaza/ - General/Random

The place to post!



Screen_Shot_2026-03-02_at_4.18.40_PM.png
(443.4KB, 2003x1640)
So I've been working on a project to re-implement the VOCALOID1 engine. 
I'm basing it on the description in Jordi Bonada's PhD thesis "Voice 
Processing and Synthesis by Performance Sampling and Spectral Models" 
rather than on the original papers, since the thesis is more detailed, 
easier to follow, and also describes the VOCALOID2 engine.

After a lot of trouble getting TWM (two-way mismatch) f0 estimation to 
work, I've finally gotten to implementing MFPA (maximally flat phase 
alignment). And amazingly, it seems to have worked on the first try.

Compare my results: 
https://i.ibb.co/dsvgv0fd/Screen-Shot-2026-03-02-at-3-54-48-PM.png

To the results in the thesis: 
https://i.ibb.co/C3fjdWVd/Screen-Shot-2026-03-02-at-3-55-09-PM.png