#utau tutorial
Explore tagged Tumblr posts
katzenklavierr · 3 months ago
Text
Have you 🫵 ever wanted to create your own UTAU voicebank🎵? Not sure where to start?
Well, good news! I've finally gotten around to revising and finishing my tutorial series aimed at absolute beginners!
These are text-based tutorials hosted on my UTAU website with audio and visual aids provided throughout.
If you're completely new to the software and want to learn more about it, check out Introduction to UTAU. This covers what UTAU is, how to install it (and OpenUTAU), how to find and install voicebanks, and how to set up UTAU project files.
If you want to jump into making your own VB and want an in-depth guide to walk you through creating one from start to finish, then check out Creating Your First Voicebank. This guide is a little different than other beginner tutorials, but I feel it will better prepare you for VB development by teaching you with modern tools and methods.
The website also has a handful of tutorials aimed at intermediate users, plus all of my voicebanks and reclists. I hope you find it helpful!
62 notes · View notes
epicdogymoment · 2 months ago
Text
twisting ft. @miodiodavinci's SALVADOR Auto Recovery
Tumblr media
credits under the cut
original, instrumental by They Might Be Giants
UST, tuning, mix, art by @epicdogymoment
42 notes · View notes
nitw · 25 days ago
Text
shoutout to that one time i tried to make an utau for the first time while i had a terrible cold. i still have no clue how it works lol i didn't get anywhere
8 notes · View notes
liure00 · 2 years ago
Text
Mixing Stuff Masterpost for Vocal Synth Users
i'll say a few things here and there on how i approach mixing based on a set of guidelines i've been giving thru learning. i won't go 100% and i encourage you research further on your own as everyone has a different perspective of certain concepts. whats important is that you understand the concept so that you are able to interpolate on it with your own liberties. yeah. please read the links before looking at my commentary or you won't understand what im saying.
Some DAWs, Their Guides, & Some Freebies: One of the first things you should do is pick a DAW and learn how to use it and its functions to streamline your mixing process.
Free DAWs: The Best Available in 2023 by Produce Like A Pro
Audacity / DarkAudacity (i like darkaudacity): has a section of the site dedicated to tutorials on using Audacity!
Reaper: has a 3 hour course FREE course on mixing!
FL Studio: has a demo version you can pretty much use forever with a few.........exceptions. I won't be linking any cracked versions though. Here's a manual for this program since many people use it!
Free VST Plugins by Bedroom Producers Blog
37 Best Free Mixing VST Plugins by hiphopmakers
ORDER IN THE COURT!: The order of plugins is more important than you think. These links should also introduce some terms we use in the audio production world (like "gain staging" or "EQing")
WHAT'S THE BEST EFFECTS CHAIN ORDER FOR MIXING? by Icon Collective:
The Order Of Things: Audio Plug-ins by AskAudio
Plugin order is viewed from "top to bottom". BASICALLY... most like to gain stage -> EQ -> compress -> saturate -> MORE EQing -> whatever else at this point, but i do my process a bit differently. don't be afraid to bend the rules a little bit. but the guidelines are there for a reason.....based on what they do
Basics: I'll link to some tutorials to elaborate on what was listed by Icon Collective's list.
Gain Staging: Gain Staging Like a Pro by Sweetwater
Saturation: Saturation in Mixing – Instant Warmth, Glue and Fullness with One Plugin by Tough Tones (soundgoodizer fans make some fucking noise i guess)
EQ: SUBTRACTIVE VS ADDITIVE EQ (WHEN TO USE EACH & WHY) by Producer Hive
Compression: THE COMPLETE GUIDE TO AUDIO COMPRESSION by Icon Collective + Audio Compression Basics by Universal Audio
Modulation: Modulation Effects: Flanging, Phase Shifting, and More by Universal Audio
Time Based Effects: Reverb Vs. Delay: Complete Guide To 3D Mixing by Mastering.com
Audio Busing/Routing/Sending Tracks: Your guide to busing and routing audio tracks like a pro by Splice
Limiters: 10 BEST LIMITER PLUGINS FOR MIXING AND MASTERING by Icon Collective
Sidechaining: Sidechain compression demystified: what it is and how to use it by Native Instruments (i dont know anything about this lol)
Automation: Mix Automation 101: How to Automate Your Sound For a Better Mix by Landr (p.s learn how to write automation in your respective programs)
Last note: great. these are the main things you should focus on understanding in mixing. now you are FREE my friend!
youtube
Bonus: Tempo Mapping in Reaper (if you want to learn how to midi songs with bpm changes!!!)
134 notes · View notes
bluu3berry · 11 months ago
Text
Tumblr media
Bleh!!! Ehehe I genuinely liked doodling this I love this pose am!! So fun to draw!!!
Don't repost my art without creds
Tumblr media
@anon-coke @scramble-eg @borisboring @thelunarsystemwrites @the-second-reason
20 notes · View notes
damage-incorp0rated · 1 month ago
Text
upon remembering denki sai existed i did the next most logical thing.
you may say: "oh, so you just looked up some of his covers? his original tracks?"
no!
i installed utau and his voicebank and made this complete and utter nightmare of an experiment
this is a lot harder than it sounds. i know it's rough around the edges and all but it's seriously the best i could do lmao
and yes i did make the image that was paired with the audio. yes i'm a photoshop wizard. i know.
6 notes · View notes
shimmerloid-ai · 1 year ago
Text
Introduction - Vocal Synth Terminology - Part 1
This post will be split into multiple parts due to Tumblr's character limit.
If you are new to the Vocal Synth community, you may encounter some words and phrases you don’t understand. For instance, someone may tell you about Rin and Len’s appends, and you may confuse that term for the difficulty in Project Sekai! Colorful Stage! Or may have heard someone discussing USTs, but can not find its definition anywhere nor figure out what the hell they are talking about.
Well, I made a dictionary of sorts to help newbie fans get used to Vocal Synth jargon. The keyword is “Vocal Synth” as these apply to other software as well. These definitions have a greater focus on the programs themselves than the characters.
Credits to Vocaloid Wiki and Minnemi on YouTube for some of these definitions.
Vocal Synthesizer: A digital instrument that creates tracks like any other DAW, but instead of piano notes, guitar strums, or drum beats, you compose vocals! Also known as “vocal synths”. Examples of vocal synthesizers include VOCALOID, UTAU, SynthesizerV, CeVIO, and Piapro Studio.
Voicebank: A collection of recordings of the sounds that make up a language. These sounds are typically vowels and constants, but depending on the voice bank, you may also get breath notes and pronunciation effects. Or, in simpler terms, the singers that are used in vocal synths! There are ton of voicebanks in the vocal synth community, with some of the popular ones being Hatsune Miku (VOCALOID + Piapro Studio), Kagamine Rin and Len (VOCALOID + Piapro Studio), Megurine Luka (VOCALOID + Piapro Studio), Kasane Teto (UTAU + SynthesizerV), Megpoid Gumi (VOCALOID + SynthesizerV + A.I. VOICE, FineSpeech Ver3), flower (VOCALOID + Gynoid Talk + CeVIO), IA (VOCALOID + CeVIO), and KAFU (CeVIO + SynthesizerV)! Individual vocal synth characters can also have different versions of their voice, such as Yuzuki Yukari’s Onn (soft) and Lin (power) voicebanks!
Voice Provider: The person whose voice that a voicebank is created. Voice providers record samples of their voice (specifically vowels and constants) at a certain key (for instance A3), which are turned into a voicebank with the company’s black magic (I’m kidding, I don’t know how they process and put the vocals together). For instance, PIKO is Utatane Piko’s voice provider, Satoshi Fukase is Fukase’s voice provider, and Naoto Fuga (shown below) is KAITO’s voice provider!
Tumblr media
Crypton Future Media: The brains behind some of the most popular VOCALOIDs, which are Hatsune Miku, Kagamine Rin, Kagamine Len, Megurine Luka, KAITO, and MEIKO. Aside from voicebanks, they created games, concerts, merchandise, and much more relating to these beloved VOCALOIDS! Cryptonloids are… VOCALOIDS created by Crypton. Soon, Crypton departed from Yamaha and made its own vocal synthesizer in affiliation with another company called Piapro named Piapro Studio. There are two versions of this software; Piapro Studio NT and Piapro Studio V4x.
UTAU: A vocal synthesizer that is considered the “sister” software to VOCALOID. Unlike VOCALOID, this software is 100% free and you can create your own voicebank. There are thousands of UTAUloids at this point in time, giving you a huge selection of different ranges and strengths. Popular UTAUloids include Utatane “Defoko” Uta, Kasane Teto, Namine Ritsu, Momo Momone, Yowane Ruko, Sukone Tei, Rook, Gahata Meiji (shown below), Yamane Renri, Matsudappoiyo, Keine Ron, Kohaku Merry, Gekiyaku, Kazehiki, Adachi Rei, Ooka Mika, and so many others! There is also an open-source version of UTAU called Open UTAU, which is much easier to install and use (it has a dark mode!). Vipperloids are the classic UTAUloids that share surnames ending with “-ne” and their VOCALOIDish designs. These include Utatane “Defoko” Uta, Kasane Teto, Namine Ritsu, Momo Momone, Yowane Ruko, Sukone Tei, and many others.
Tumblr media
SynthesizerV Studio: Also known as SynthV, this is a vocal synthesizer made by Dreamtonics that is well-known for its AI voicebanks. For a software that is smaller than VOCALOID, they are extremely advanced with realistic-sounding voicebanks, piano-roll tuning, rap vocals, and so many other features. It’s also much cheaper (thank you, Yamaha money sharks). In addition, Dreamtonics has two free versions; SynthesizerV Studio R1, and SynthesizerV Studio Basic R2. Popular SynthV voicebanks include Eleanor Forte, Kaorou Rikka, GENBU, Tsurumaki Maki, SAKI, SOLARIA, KEVIN (fan design by ivylare shown below), Stardust, ROSE, POPPY, and Kasane Teto Ai!
Tumblr media
CeVIO Project: A collection of voice synthesizers created in collaboration with five different companies including Techno Speech and Frontier Works. Not only do they make vocal synthesizers, but their softwares have speech interfaces as well. As of now, their most popular program is CeVIO AI, a next-generation vocal synthesizer that uses AI technology to create powerful vocals as seen in SynthesizerV. Popular voicebanks include Chis-A (shown below), KAFU, Sato Sasara, IA AI, ONE, Yuzuki Yukari Rei, CiFlower, POPPY, ROSE, and many others.
Tumblr media
Tuning: Essentially how you want a song or cover to sound. By editing the parameters of the individual notes and that of the voicebank itself (including the pitch, volume, strength, sharpness, and breaths), you can obtain an entirely different result of how the singer sings the encoded notes through different methods. This blog is dedicated to teaching people how to tune, so I’ll show a variety of tuning styles in the software.
V_: The VOCALOID software edition. As of now, there are six editions of the software, which are VOCALOID, VOCALOID2, VOCALOID3, VOCALOID4, VOCALOID5, and VOCALOID6. A lot of VOCALOID voicebanks would be named after the edition they were designed for, such as Gackpoid V4.
VSQ/VSQx/VPR/UST/SVP: The different vocal file formats through which the note, lyric, and tuning data are saved in different vocal synthesizers. These files are not exactly specific to a single editor as they can be converted to the appropriate formats: 
VSQ: VOCALOID2 and VOCALOID3
VSQx: VOCALOID4
VPR: VOCALOID5 and VOCALOID6
UST: UTAU and OPENUTAU
SVP: SynthesizerV Studio
Phonemes: In linguistics and developmental psychology, phenomes are the smallest sounds of speech that distinguish one word from another. Similarly, in vocal synths, these are the building blocks of the individual lyrics that are read by the voicebank. Phonemes differ from the lyrics in a vocal synth file as the lyrics are the actual syllables in language while the phonemes are based on the X-SAMPA system. For instance, let’s examine and compare lyrics from “The Lost One’s Weeping” by neru to the phonemes that would be written in a vocal synth. Romaji lyrics (Source - Vocaloid Lyric Wiki): kokuban no kono kanji ga yomemasu ka? Romanji lyrics in VOCALOID4: [ko] [ku] [ba] [n] [no] [ko] [no] [ka] [n] [ji] [ga] [yo] [me] [ma] [su [ka] Phonemes in a vocal synthesizer VOCALOID4: [k o] [k M] [b a] [n] [n o] [k o] [k a] [n] [dZ i] [g a] [j o] [me] [m a] [s M] [k a] As we can see here, the phonemes of a song can differ significantly from the lyrics that are entered into a program. You can also edit the phonemes of a lyric for better pronunciation (for instance, for the word “you’d”, you can try [y M d]), or split them up into vowels and constants in notebending. In addition, there are entirely different phonemes for voicebanks designed for different languages; for instance, VOCALOID has Japanese, English, Chinese, Korean, and Spanish voicebanks. However, it is possible to make voicebanks sing in different languages, like how Utsu-P makes Miku V4 English sing in fluent Japanese. There are also phonemes for breaths, and glottal stops, as well as pronunciation effects that are exclusive to some voicebanks, like Enhanced Voice Expression Control (E.V.E.C.) in the V4x Cryptonloids. I will go into greater depth on phonemes in a future post.
Pitch bending: The effect where one note slides to another in a clean fashion without sounding flat. When people usually mention pitch bending in a vocal synth, they are referring to the tuning style where you alter the pitch using the “pitch bend” and “pitch bend sensitivity” parameters. If you have seen tuning streams or covers where people show their editors, you may have noticed dynamic and sometimes dramatic lines either on top of the notes or in a box beneath the piano roll. These are pitch bends! By drawing pitch curves in different ways, you can acquire different ways the notes are sung. You can then increase or decrease the pitch bend sensitivity of certain notes to change the factor of how many semitones the pitch curves will jump or fall by when the pitch bend parameter is brought to the maximum or minimum values. To paint a better picture of this concept, I made a quick VSQx of the "watashi" ([w a] [t a] [S i]). The curves on cutting through the green box are my pitch bends, and the thin red line running through the notes is the result. The transparent box behind it is my pitch bend sensitivity, which I increased for more sensitive in the [w a] and [t a] notes, and decreased for less for the [S i] phoneme.
Tumblr media
Note bending: A tuning style where you manipulate the pitch by splitting notes into smaller notes. You can move the notes up and down or edit the phonemes to obtain different effects in notes. If you would like to breakdown the phrase [w a] [t a] [S i], you can write the notes out as [w a] [a] [a] [a] [a] [t a] [a] [S i [i] [i]! This is my preferred method of tuning as I do not enjoy drawing lines and like the nostalgic effect of the clean, slightly robotic sounds.
Tumblr media
Portamento Timing: This term can have multiple definitions, but the general meaning is a slide from one note to the next. Do not confuse this for pitch bending as the way that notes transition in portamento is different from the former. In Vocaloid, portamento is a parameter that allows you to alter the timing of the pitch. Increasing the value would result in the pitch being more delayed, and decreasing it will cause the pitch to be sung earlier. In UTAU and SynthesizerV, portamento refers to the editable points in a pitch curve. Adding more points allows you to have more freedom in creating pitch bends.
Pitchsnap Mode: A setting in vocal synthesizers that causes the pitch curves to “snap” from one note to another. This setting yields a more autotuney and robotic tone in tuning. While I prefer to tune with this feature shut off, I have heard that the pitchsnap function makes pitch-bending much easier. Remember our "The Lost One's Weeping" example? Here is an amazing cover of it by our lord and saviour Jade S. with Fukase and Miku V3 Solid that showcases how beautiful the pitchsnap function can make the vocals sound when used correctly!
youtube
Mixing: A process of blending vocals with an off-vocal or instrumental so the singing fits in the environment of the vocal's music. It's more than just plugging in an audio track, you need to ensure that the vocals are cleaned up, are at an appropriate volume, and do not sound out of place. People can get super creative with mixing by adding reverb, radio-like effects, growls, and “adlibs” during instrumental breaks! All in all, the mixing of vocals is just as important as the tuning.
Producer: Anyone who makes music using vocal synths. This title was initially reserved for people who make original songs but can be used to describe cover artists like myself as well. Popular producers include ryo(supercell), kzlivetune, wowaka(shown below; Rest in Peace), neru, Deco* 27, and many others!
Tumblr media
“-P”: Standing for “producer title”, this suffix originated from the IDOLM@STER fandom and refers to anyone who makes music with vocal synths, or in other words, vocal synth producers! For instance, why do we call Circus-P by his name with the "-P" suffix? Because that is what he is, a producer! You can also use the title “vocalo-p” to address synth users.
Tumblr media
32 notes · View notes
bmpmp3 · 1 year ago
Text
Tumblr media
HOMURANE RAYYYYYYYYYYYYY
9 notes · View notes
dead-byte · 2 years ago
Text
Question for Y'all UTAU Users
I kinda want to make a series on developing UTAU voicebanks. And I'm thinking about _maybe_ offering free 1-pitch commissions, on the condition that I use it as an example in the tutorial videos. Would any of y'all be interested in donating a voicebank if I did that?
Only thing is that I might critique the voicebank some, but it should hopefully go without saying that none of it would be mean-spirited, and purely with the intent of helping people make better voicebanks.
I might just record my own voicebanks for said videos, but I'd also maybe like to try a voicebank with less predictable traits for when users might encounter stuff that I might not initially think about.
I'm very undecided, but I would be curious as to gauge interest.
14 notes · View notes
ofpearlesce · 2 years ago
Text
SOMEONE WHO KNOWS HOW TO OTO. FOR THE LOVE OF GOD PLEASE HELP ME
14 notes · View notes
onyx-p · 2 years ago
Text
the disappearance of silly from pocket singer 😔😔😭😭
Tumblr media
6 notes · View notes
bluberimufim · 1 year ago
Text
I sat down to write and got distracted for ~3 hours bc I decided to teach myself how to use OpenUTAU
5 notes · View notes
leo-doesnt-know · 2 months ago
Text
Tumblr media
Followed a rendering tutorial video thingy and drew Nora, this was my result :3
1 note · View note
f4ilure-g1rl-fuyu · 2 months ago
Note
17, 30, 31 /nf :3
17 - want any piercings? where?
a tongue piercing or tongue split and an eyebrow piercing ^^
30 - what are you looking forward to in the near future?
making the in iolite inspired song (might get utau just for defoko vocals) and a family gathering from our father's side of the family
31 - what are you looking forward to in the distant future?
getting into the school we want, graduating, going to university and dying getting to be a scientist
1 note · View note
bluu3berry · 1 year ago
Text
Artrades are open
looking for someone around my skill level!! if you have a better skill level (in my opinion) i will offer more art however !! Turn around time: 1 hour - 1 week (Depending on motivation, free time, and amount of art)
Will do: Skeletons Furries Humans Sfw cartoony gore !! Couples/Romance Oc x Canon/Selfship
Will NOT. Do: NSFW Proship + Darkship + comship Overly detailed characters meca/robots (semi-robots are okay) Ferals + On 4 legs, or Digi grade legs suggestive content
examples under cut!!!!
Fully rendered
Tumblr media Tumblr media
Flat color
Tumblr media Tumblr media
Oc (not mine)
Sketch
Tumblr media Tumblr media
Tradtionall:
Tumblr media Tumblr media
23 notes · View notes
skelevision · 11 months ago
Text
i have piko now...... trying to make him sound clear is tricky but im making progress i think. also a lot of his D sounds are like. off-time for some reason. vowel starts too early. i dont know why, you can mitigate some of it with cranking the velocity to Fuck. i would make the note start a little later but i want to put the vpr file up when im done with this cover and id feel bad for whoever would have to deal with that over the velocities. also, feels very evil clearing out all the effects and slapping on a different vocal and its immediately super clear. literally a skill issue on my part though being unable to fully clear out muffledness of some vocals
0 notes