#speechllm
Cascading models (which first turn your speech into text and then convert it back into speech) have some problems. They can lose important information along the way, which leads to errors.
For example, ‘The weather is cold today’ can be processed as ‘The weather is gold today’ because the two words are pronounced so similarly.
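To make the cold/gold example concrete, here is a toy sketch of a cascade in Python. Nothing here is a real speech model: `toy_asr`, `toy_tts`, and `cascade` are hypothetical stand-ins, and the "audio" is just a list of words. The point is only that once the first stage writes the wrong word into the transcript, the later stage has no way to recover what was actually said.

```python
# Toy illustration only: these functions are hypothetical stand-ins,
# not real speech recognition or synthesis components.

def toy_asr(spoken_words: list[str]) -> str:
    """Pretend recognizer that mishears 'cold' as the similar-sounding 'gold'."""
    confusions = {"cold": "gold"}
    return " ".join(confusions.get(word, word) for word in spoken_words)

def toy_tts(text: str) -> str:
    """Pretend synthesizer: it only sees the transcript, never the original audio."""
    return f"<speech>{text}</speech>"

def cascade(spoken_words: list[str]) -> str:
    """Speech -> text -> speech, with each stage trusting the previous one."""
    transcript = toy_asr(spoken_words)
    return toy_tts(transcript)

spoken = ["The", "weather", "is", "cold", "today"]
output = cascade(spoken)
print(output)  # <speech>The weather is gold today</speech>
```

The text transcript is the only thing passed between stages, so the mistake made in the first step is carried straight through to the final speech.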