AiVoiceGuy (@aivoicetutor) Twitter Tweets • TwiCopy

AiVoiceGuy

2 years ago

Facepoke is so awesome! Haven’t had this much fun clicking around on photos since Kai’s Power Goo. But unlike Kai’s, Facepoke isn’t just fun. It’s actually very helpful for finetuning faces.

As a gamer, I am absolutely loving this! Managed to finish a level in Ms. Pac-Man. Hard to believe, but the controls and game mechanics are working great. And it runs just as smoothly on an M1 as it does on a 4090.

cocktail peanut

@cocktailpeanut

2 years ago

Major E2-F5-TTS Update! Now the 1-click launcher uses the OFFICIAL GitHub repo (originally it was using an unofficial fork). Notable updates: 1. Podcast generation (multi-voice) 2. Multiple speech type generation (multi-emotion) 3. Improved batching 4. Audio crossfading

Chubby♨️

@kimmonismus

2 years ago

This is how I imagine the future with AR/VR

cocktail peanut

@cocktailpeanut

2 years ago

F5-TTS Finetune Web UI: The E2-F5-TTS project has a huge update. Now anyone can easily finetune the model with their own audio using a Gradio app! It automatically transcribes the audio clips using Whisper, and seems to work for most languages. Here's a Korean example.

cocktail peanut

@cocktailpeanut

2 years ago

Some tips on doing your own E2/F5 TTS finetune. Has anyone succeeded in training a working finetune yet? I've not been able to keep track of everything going on in the original training thread.

AiVoiceGuy

@aivoicetutor

2 years ago

I’m very impressed by what you can do by combining F5 TTS with FaceFusion 3. Made a video about what I learned so far (using only F5 for the voiceover). youtube.com/watch?v=-brbxJ…

cocktail peanut

@cocktailpeanut

2 years ago

WOW, The Rock and Lex Fridman cloned using 100% local and open source AI tools. Completely for free. - F5-TTS for cloning voice - Facefusion for cloning face Check out the full video to learn how you can do it too on your computer.

cocktail peanut

@cocktailpeanut

a year ago

Pyramid Flow over FLUX Pyramid flow is a video gen AI that's been around a bit, but the original version was bad (based on SD3). But they retrained the model using FLUX. Now it's REALLY good. Both txt2vid and img2vid. Even works on Macs! Gradio App 1-Click Launcher is here.

cocktail peanut

@cocktailpeanut

a year ago

Don't make plans for tomorrow.

cocktail peanut

@cocktailpeanut

a year ago

Custom Avatar with Echomimic 2 Wow I actually tried to do something like this with no success. I am guessing the trick here is to use a photo that has the same pose? Would appreciate if you could briefly explain how you did this! Mikerhinos

cocktail peanut

@cocktailpeanut

a year ago

The Ultimate Tutorial for Mastering Voice Cloning with Fish Audio and E2-F5-TTS I can't believe this is just 8 minutes long but teaches everything you need to know about Fish and F5. Even for me, 90% of the content was new and I learned A LOT. A STRONG recommend!

AiVoiceGuy

@aivoicetutor

a year ago

Here’s a 100% free workflow for making your own AI influencers talk, so that they can be used as virtual video presenters, for example. youtu.be/-5fWC7RYXYk

cocktail peanut

@cocktailpeanut

a year ago

1-Click HunyuanVideo ComfyUI Launcher This week we have tons of Pinokio updates coming up. Let's start the week with the best open source video AI we've ever seen: Hunyuan Video. A 1 Click launcher for ComfyUI-HunyuanVideoWrapper by Jukka Seppänen - run locally, for free.

camenduru

@camenduru

a year ago

THE BELOW VIDEOS ARE 100% AI-GENERATED WITH AN OPEN-SOURCE MODEL, A COMPETITOR TO SORA! Marques Brownlee You can generate videos with it using a consumer-level GPU right now. Contact me, and I will provide you with a user interface and free unlimited generation to try open-source models.

camenduru

@camenduru

a year ago

💃 StableAnimator: High-Quality Identity-Preserving Human Image Animation 🕺 template with @gradio 🥳 Thanks to Shuyuan Tu ❤ Zhen Xing ❤ Xintong Han ❤ Zhi-Qi Cheng ❤ Qi Dai ❤ Chong Luo ❤ Zuxuan Wu ❤ 🌐page: francis-rings.github.io/StableAnimator/ 🧬code:
