FUN FACT : ANYONE CAN NOW train audio , make Audio deepfakes, and apply it to a still image AND ANIMATE the head and lips using AI = How to normally make these 2023.05 video + audio deepfakes. EASY : =====
- first make audio - ElevenLabs free demo for the voice cloning, or 80 competitors - then make audio into a video with forced face animation - SadTalker
= = = =
SadTalker is now trivially easy to use inside AUTOMATIC1111 ====
The code repos used are listed in their credits section, and it looks like a mixture of (customised?) Tacotron2, Glow-TTS, HifGan, and others. Videos are generated using Wav2Lip.
Text-To-Speech (TTS) has improved greatly over the past several years, but there's still a lot of metallic sounds in "pure" TTS implementations. I've started exploring voice style conversion, otherwise known as "voice cloning", and there are some interesting repos out there with decent results. These work differently from TTS, in that you don't type out the text to be spoken, but rather pass in an audio file of what you want the cloned speaker to say, and the system outputs an audio file with the same sounds (words, intonation) but with a different speaker identity.
This may be easier to get the right cadence and emotion in the generated audio, as text doesn't capture proper emotion and intonation. I suspect game character audio will use more of voice-style conversion instead of pure TTS simply to get the right emotional cadence of the lines being delivered.
Tortoise TTS in inference mode - Strong multi-voice capabilities. Highly realistic prosody and intonation
- Tortoise TTS in inference mode : - https://github.com/neonbjb/tortoise-tts (decent but remains slow - 2022.05.17) - (does a perfect Joe Rogan, probably input model bias, 2 minutes per sentence) - paper - 2022.05.17 - TorToiSe - Spending Compute for High Quality TTS - uses UnivNet at end : https://files.catbox.moe/yph8v2.pdf - paper - 2020 - wav2vec2-large model - wav2vec 2.0-+ A Framework for Self-Supervised Learning of Speech Representations : https://files.catbox.moe/p193m3.pdf - paper - 2017 - famous - Neural Discrete Representation Learning by Aaron van den Oord : https://files.catbox.moe/2kml42.pdf
(humor) π¨π’π BEST CLIPS of the year!!! π AI Generated Celebrity voices, most using FREE FIVE VIDEOS TRIALπ₯ INFAMOUS today Feb 1 2023!! Many sites DELETING these! π¨
AI Generated Celebrity voices, most using FREE FIVE VIDEOS TRIAL (tied to per ethernet controller MAC chip address) https://beta.elevenlabs.io/
Many more similar companies and free open source code for AI SV2TTS exist, but the ADL/SPLC want the open source code to be somehow controlled and criminalized.
Some chaos lords, perhaps me, made some of these! So Funny.
Spread, enjoy. I might post more of my collection in a few salvos.
[ + ] bonghits4jeebus
[ - ] bonghits4jeebus 4 points 2.1 yearsApr 30, 2023 21:59:56 ago (+5/-1)
[ + ] root
[ - ] root 3 points 2.1 yearsMay 1, 2023 02:55:29 ago (+3/-0)*
=====
- first make audio - ElevenLabs free demo for the voice cloning, or 80 competitors
- then make audio into a video with forced face animation - SadTalker
= = = =
SadTalker is now trivially easy to use inside AUTOMATIC1111
====
- SadTalker - https://github.com/OpenTalker/SadTalker
- paper - 2023 - CVPR 2023 - SadTalker - Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation : https://files.catbox.moe/zj9yrt.pdf
- https://sadtalker.github.io/
- https://huggingface.co/spaces/vinthony/SadTalker
- engineering demo : https://colab.research.google.com/github/Winfredy/SadTalker/blob/main/quick_demo.ipynb
- AUTOMATIC1111!!! 2023.04.15 INTEGRATED plugin for https://github.com/AUTOMATIC1111/stable-diffusion-webui !!!!
- DEMO of lip sync onto a single STATIC IMAGE input ! https://files.catbox.moe/kio1vb.mp4
- https://www.reddit.com/r/StableDiffusion/comments/12h210y/stable_diffusion_tutorial_install_sadtalker/
- Stable diffusion tutorial install Sadtalker (AUTOMATIC1111): New Extension Create TALKING AI AVATAR:
- https://www.youtube.com/watch?v=0hO-NrnthXk
= = = =
Eleven Labs And 46 Other competing AI Tools For Text to speech:
====
https://theresanaiforthat.com/ai/eleven-labs/
https://sourceforge.net/software/product/ElevenLabs/alternatives
https://slashdot.org/software/p/ElevenLabs/alternatives
https://news.ycombinator.com/item?id=29688048
This looks like it was previously known as Vocodes, made by echelon who is here on HN:
https://news.ycombinator.com/item?id=23965787
The code repos used are listed in their credits section, and it looks like a mixture of (customised?) Tacotron2, Glow-TTS, HifGan, and others. Videos are generated using Wav2Lip.
Text-To-Speech (TTS) has improved greatly over the past several years, but there's still a lot of metallic sounds in "pure" TTS implementations. I've started exploring voice style conversion, otherwise known as "voice cloning", and there are some interesting repos out there with decent results. These work differently from TTS, in that you don't type out the text to be spoken, but rather pass in an audio file of what you want the cloned speaker to say, and the system outputs an audio file with the same sounds (words, intonation) but with a different speaker identity.
This may be easier to get the right cadence and emotion in the generated audio, as text doesn't capture proper emotion and intonation. I suspect game character audio will use more of voice-style conversion instead of pure TTS simply to get the right emotional cadence of the lines being delivered.
= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
Some interesting voice style conversion repos (in no order, just a random selection if anyone is interested in exploring):
=========
= = = =
SV2TTS : 2021 version - EVERYONE copied this and made it faster , or slower and better :
- https://github.com/CorentinJ/Real-Time-Voice-Cloning ( SV2TTS )
- paper - Master thesis 2019 SV2TTS - Automatic Multispeaker Voice Cloning - https://files.catbox.moe/4g2mzl.pdf
- paper - famous SV2TTS- NeurIPS 2018 - Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis : https://files.catbox.moe/i4qpt7.pdf
- paper 2018 vocoder - Efficient Neural Audio Synthesis - fatchord/WaveRNN - https://files.catbox.moe/jedy8w.pdf
- paper 2017.04 - Tacotron synthesizer - Towards End-to-End Speech Synthesis - fatchord/WaveRNN - https://files.catbox.moe/1d4tmo.pdf
- https://github.com/fatchord/WaveRNN
- 2021 encoder GE2E - Generalized End-To-End Loss for Speaker Verification in SV2TTS : https://files.catbox.moe/gymju3.pdf
- LibriSpeech ( For the synthesizer and the vocoder) - 5.9 gigabytes of a pre-trained dataset input : LibriSpeech/train-clean-100 : https://www.openslr.org/resources/12/train-clean-100.tar.gz
- To re-train (rare) you need 500 gigabytes of these files and 1 terabyte free space :
- https://github.com/CorentinJ/Real-Time-Voice-Cloning/wiki/Training#datasets
= = = =
Tortoise TTS in inference mode - Strong multi-voice capabilities. Highly realistic prosody and intonation
- Tortoise TTS in inference mode :
- https://github.com/neonbjb/tortoise-tts (decent but remains slow - 2022.05.17)
- (does a perfect Joe Rogan, probably input model bias, 2 minutes per sentence)
- paper - 2022.05.17 - TorToiSe - Spending Compute for High Quality TTS - uses UnivNet at end : https://files.catbox.moe/yph8v2.pdf
- paper - 2020 - wav2vec2-large model - wav2vec 2.0-+ A Framework for Self-Supervised Learning of Speech Representations : https://files.catbox.moe/p193m3.pdf
- paper - 2017 - famous - Neural Discrete Representation Learning by Aaron van den Oord : https://files.catbox.moe/2kml42.pdf
= = = =
= = = =
https://github.com/MycroftAI (https://mycroft.ai/)
https://github.com/MycroftAI/mimic3 (A fast local neural text to speech engine for Mycroft)
https://github.com/MycroftAI/mimic-recording-studio (inputs audio input into mimic2)
https://github.com/VOICEVOX/voicevox_engine (2023.04.27 amazing, but model is japanese speech only)
uses https://github.com/VOICEVOX/voicevox_core/
https://github.com/yl4579/StarGANv2-VC
https://github.com/ebadawy/voice_conversion
https://github.com/RussellSB/tt-vae-gan
https://github.com/auspicious3000/autovc
https://github.com/edresson/yourtts
https://github.com/coqui-ai/TTS
Coqui example: https://www.youtube.com/watch?v=6QAGk_rHipE
https://github.com/NVIDIA/NeMo (Nemo from Nvidia)
https://github.com/neonbjb/tts-scores (open source tool to TEST output quality of models)
Papers With Code has interesting repos there as well: https://paperswithcode.com/task/voice-conversion/latest
= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
(humor) π¨π’π BEST CLIPS of the year!!! π AI Generated Celebrity voices, most using FREE FIVE VIDEOS TRIALπ₯ INFAMOUS today Feb 1 2023!! Many sites DELETING these! π¨
AI Generated Celebrity voices, most using FREE FIVE VIDEOS TRIAL (tied to per ethernet controller MAC chip address) https://beta.elevenlabs.io/
Many more similar companies and free open source code for AI SV2TTS exist, but the ADL/SPLC want the open source code to be somehow controlled and criminalized.
Some chaos lords, perhaps me, made some of these! So Funny.
Spread, enjoy. I might post more of my collection in a few salvos.
= = = = =
DeepFake Audio - Humor Quality A:
===
- Ben Shapiro reads a note from a listener.mp3 https://files.catbox.moe/ykes5m.mp3
- Ben Shapiros sister abbie and her khazar milkers.mp3 == https://files.catbox.moe/6r6y7i.mp3
- Better Call Saul talks about the dangers of the vaccines.mp3 == https://files.catbox.moe/hdpmlm.mp3
- Biden Internet tough guy has 300 confirmed kills and is top sniper.mp3 == https://files.catbox.moe/ic07wu.mp3
- Biden loved Epstein island.mp3 == https://files.catbox.moe/sjj38b.mp3
- Biden opinions on Trannies - DeepFake video.mp4 == https://files.catbox.moe/fzng9z.mp4
- Biden wants to bang the Animal Crossing dog.mp3 == https://files.catbox.moe/oov1v6.mp3
- Bill Gates - Gloating over vaxxtard VAIDs and going to Epstein Island - best.mp3 == https://files.catbox.moe/q9ty5i.mp3
- Bill Gates - Gloating over vaxxtard VAIDs and going to Epstein Island - original.mp3 == https://files.catbox.moe/fnb9g8.mp3
- darth vader actor James Earl Jones talks africa.mp3 == https://files.catbox.moe/yamp7u.mp3
- David Attenborough observes the globohomo parasite.mp3 == https://files.catbox.moe/69w0zf.mp3
- David Attenborough pavement apes.mp3 == https://files.catbox.moe/fc7y17.mp3
- elon musk has plans for Bulgaria.mp3 == https://files.catbox.moe/a6t06l.mp3
- emma watson reads Hitler.mp4 == https://files.catbox.moe/szdo01.mp4
- Gordon Ramsay - Auschwitz meal grossly undercooked == https://files.catbox.moe/0iwby9.mp3
- HARRY POTTER Gas Chamber.mp3 == https://files.catbox.moe/kk68ui.mp3
- Hillary opinions on trannies.mp3 == https://files.catbox.moe/6frldi.mp3
- Joe Biden declares nuclear war on Russia.mp3 == https://files.catbox.moe/xx223a.mp3
- Joe Biden declares war on Russia.mp3 == https://files.catbox.moe/42yjlx.mp3
- Joe Biden explains Sneed's Feed and Seed.mp3 == https://files.catbox.moe/gklk6p.mp3
- Joe Biden loves the Latino Taco Truck girl.mp3 == https://files.catbox.moe/08ci1q.mp3
- Joe Biden thinks America has no culture.mp3 == https://files.catbox.moe/7cjtto.mp3
- Joe Biden's wife didn't prepare his dinner.mp3 == https://files.catbox.moe/z5uldw.mp3
- joe rogan -talks about seeing bitchute based videos on the JQ.mp3 == https://files.catbox.moe/krv0jk.mp3
- John Lennon beats his wife == https://files.catbox.moe/qxtads.mp3
- Jordan Peterson gets 3 packets for McChicken.mp3 == https://files.catbox.moe/kf9e88.mp3
- Jordan Peterson talks racial crime statistics.mp3 == https://files.catbox.moe/x182e7.mp3
- Jordan Peterson thinks black neighborhoods are chaos.mp3 == https://files.catbox.moe/tzae3e.mp3
- morgan freeman on chimpouts - ai.mp4 == https://files.catbox.moe/mc427n.mp4
- Neil DeGrase Tyson wants to cover the Earth in shit.mp3 == https://files.catbox.moe/4r1rvn.mp3
- PICARD disgusted by degenerates.mp3 == https://files.catbox.moe/9roig9.mp3
- PICARD wants race war.mp3 == https://files.catbox.moe/ybaomw.mp3
- Scott Adams Names The Jews.mp4 == https://files.catbox.moe/3pcei1.mp4
- Tom Cruise thinks Top Gun is great and homosexual.mp3 == https://files.catbox.moe/0q3ji7.mp3
- TRUDEAU seething.mp3 == https://files.catbox.moe/pwpyad.mp3
- Tucker Carlson - final broadcast total negro death.mp3 == https://files.catbox.moe/3tdcjf.mp3
- Tucker Carlson - finally blames the J.mp3 == https://files.catbox.moe/i8jrfi.mp3
- Tucker Carlson - interviews scientist DEGRASE.mp3 == https://files.catbox.moe/zloowf.mp3
DeepFake Audio - Humor Quality B:
===
- Agent 47 (Hitman videogame) is assigned George Floyd as a target.mp3 https://files.catbox.moe/3ozwjn.mp3
- alex the rambler - video reviewer twitch gamer.mp3 == https://files.catbox.moe/e95r9o.mp3
- AOC Jan 6.mp3 == https://files.catbox.moe/8wt72i.mp3
- barclay star trek holodeck - Call of Duty 2009 configuration for xbox lobby.mp4 == https://files.catbox.moe/eilnlt.mp4
- barclay star trek holodeck - cannot use cyanide gas.mp3 == https://files.catbox.moe/s1sngs.mp3
- barclay star trek holodeck - commits catapult suicide in the Holodeck.mp3 == https://files.catbox.moe/ri58w2.mp3
- barclay star trek holodeck - holodeck 2 - Troy lactose intolerance.mp3 == https://files.catbox.moe/cf140l.mp3
- Ben Shapiro discusses the logic behind giving AOC cunnilingus.mp4 == https://files.catbox.moe/7m5w7d.mp4
- Ben Shapiro discusses his love of AOC big butt.mp3 == https://files.catbox.moe/13p7di.mp3
- Biden reads forchan.mp3 == https://files.catbox.moe/afyc2j.mp3
- David Attenborough - Dangerous wild Negro male.mp3 == https://files.catbox.moe/luyjai.mp3
- George Bush Jr. blames Israel for 9/11.mp3 == https://files.catbox.moe/gs22gw.mp3
- hannity on biden photos.mp3 == https://files.catbox.moe/hba0ge.mp3
- JFK (KENNEDY) fakes first moon landing.mp3 == https://files.catbox.moe/38mz09.mp3
- JFK opinions on trannies.mp3 == https://files.catbox.moe/k44fqa.mp3
- JFK talks conspiracies.mp3 == https://files.catbox.moe/eg1pqj.mp3
- Joe Rogan interviews Obama about Big Mike == https://files.catbox.moe/zxqbv4.mp3
- Jordan Peterson - video -This Will Change Your Life. Uncensored Speech.mp4 https://files.catbox.moe/0dukoo.mp4
- Jordan Peterson 13% of lobsters.mp3 == https://files.catbox.moe/depydz.mp3
- Jordan Peterson doesn't like the vaccine.mp3 == https://files.catbox.moe/nusvgn.mp3
- Jordan Peterson loves the Israelis.mp3 == https://files.catbox.moe/4xingx.mp3
- kevin spacey and epstein.mp3 == https://files.catbox.moe/rrw3ly.mp3
- KIRK vs redditors.mp3 == https://files.catbox.moe/lyjsz9.mp3
- Klaus Schwab tells you to eat the bugs.mp3 == https://files.catbox.moe/9aa833.mp3
- Louis CK knows a MILF that fucks retards.mp3 == https://files.catbox.moe/yvlfe3.mp3
- norm macdonald jewish crab.mp3 == https://files.catbox.moe/yqu1av.mp3
- Rod Serling - punching door.mp3 == https://files.catbox.moe/5w7wye.mp3
- Rod Serling asks you to imagine the best Twilight Zone blowjob.mp3 == https://files.catbox.moe/rxgvqu.mp3
- Saruman (lord of the rings) - Degenerate American Amerimutt Orcs.mp3 == https://files.catbox.moe/6ye4tg.mp3
- Scott Adams hypnotizes you into getting a vaccine.mp3 == https://files.catbox.moe/atqluq.mp3
- Scott Adams is angry about his circular thinking.mp3 == https://files.catbox.moe/241wxq.mp3
- Scott Adams teaches you a helpful trick.mp3 == https://files.catbox.moe/j6ic0x.mp3
- spacey reads I've never been in this neighborhood before.mp3 == https://files.catbox.moe/37jpbp.mp3
- Stanley Parable videogame - ni- - rs into acid.mp3 == https://files.catbox.moe/oe44b2.mp3
- tate brags about his bugati.mp3 == https://files.catbox.moe/u1mr74.mp3
- Tate in romania is prison bitch.mp3 == https://files.catbox.moe/y5ekul.mp3
- taylor swift reads uncle ted K.mp3 == https://files.catbox.moe/z8p5uu.mp3
- Tony Soprano talks about the logistic problems with the Holocaust.mp3 == https://files.catbox.moe/02vnwf.mp3
- troy mclure tranny.mp3 == https://files.catbox.moe/ikbiww.mp3
= = = = =
TL/DR: OP meme is a current popular fad this week. You can make your own by telling StableDiffusion to race swap and use image2Image assist
[ + ] Empire_of_the_Mind
[ - ] Empire_of_the_Mind 1 point 2.1 yearsMay 1, 2023 01:50:42 ago (+1/-0)
[ + ] AngryWhiteKeyboardWarrior
[ - ] AngryWhiteKeyboardWarrior 3 points 2.1 yearsMay 1, 2023 03:55:57 ago (+3/-0)
[ + ] Fascinus
[ - ] Fascinus 2 points 2.1 yearsMay 1, 2023 10:49:21 ago (+2/-0)
https://files.catbox.moe/7toe3k.jpg
[ + ] BadBoyBubby
[ - ] BadBoyBubby 1 point 2.1 yearsMay 1, 2023 04:10:39 ago (+1/-0)