Projects I've worked on (or contributed to)
mrfakename PRO
AI & ML interests
LLMs, TTS, & Open Source
Organizations
SAM Audio
The SAM Audio model licenses allow for redistribution so long as the original license files are included
OpenF5 TTS
The OpenF5 TTS model series (currently OpenF5 TTS Base - more variants coming soon ๐)
Zero-Shot Voice Cloning
TTS models that support zero-shot voice cloning
-
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Paper โข 2502.18924 โข Published โข 16 -
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Paper โข 2409.00750 โข Published โข 6 -
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper โข 2410.06885 โข Published โข 48 -
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
Paper โข 2409.10058 โข Published โข 2
Spaces of the Week
My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom ๐ค
- Running on ZeroAgentsFeatured731
StyleTTS 2
๐ฃ731Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on ZeroAgentsFeatured416
OpenDalle V1.1 GPU Demo
๐ผ416A demo of OpenDalle V1.1 on a ZERO GPU.
- Runtime errorAgentsFeatured74
RWKV Music
๐ต74Generate MIDI music using RWKV v4!
- Build errorFeatured144
MetaVoice 1B
๐ฃ144A demo of MetaVoice 1B, a new TTS model by MetaVoice.
Voice Acting Models
With LAION
Ministral 3 Llamafied
Ministral 3 models converted to the Llama format (without the vision encoder)
Podcast Pile
EmoAct
Llamafied Models
Models converted to the Llama format
-
mrfakename/Apriel-5B-Instruct-llamafied
Text Generation โข 5B โข Updated โข 10 โข 4 -
mrfakename/Apriel-5B-Base-llamafied
Text Generation โข 5B โข Updated โข 2 -
llamafy/Qwen-Qwen2.5-1.5B-llamafied
Text Generation โข 2B โข Updated โข 3 -
llamafy/Qwen-Qwen2.5-1.5B-Instruct-llamafied
Text Generation โข 2B โข Updated โข 3
Failed Experiments
Experiments that didn't work out.
Projects
Projects I've worked on (or contributed to)
-
laion/Emolia
Viewer โข Updated โข 71.8M โข 8.2k โข 12 -
mrfakename/OpenF5-TTS-Base
Text-to-Speech โข Updated โข 163 โข 86 - Running on ZeroAgentsFeatured2.87k
F5-TTS
๐ฃ2.87kF5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
mrfakename/EmoAct-MiMo
Text Generation โข Updated โข 2 โข 12
Ministral 3 Llamafied
Ministral 3 models converted to the Llama format (without the vision encoder)
SAM Audio
The SAM Audio model licenses allow for redistribution so long as the original license files are included
Podcast Pile
OpenF5 TTS
The OpenF5 TTS model series (currently OpenF5 TTS Base - more variants coming soon ๐)
EmoAct
Zero-Shot Voice Cloning
TTS models that support zero-shot voice cloning
-
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Paper โข 2502.18924 โข Published โข 16 -
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Paper โข 2409.00750 โข Published โข 6 -
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper โข 2410.06885 โข Published โข 48 -
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
Paper โข 2409.10058 โข Published โข 2
Llamafied Models
Models converted to the Llama format
-
mrfakename/Apriel-5B-Instruct-llamafied
Text Generation โข 5B โข Updated โข 10 โข 4 -
mrfakename/Apriel-5B-Base-llamafied
Text Generation โข 5B โข Updated โข 2 -
llamafy/Qwen-Qwen2.5-1.5B-llamafied
Text Generation โข 2B โข Updated โข 3 -
llamafy/Qwen-Qwen2.5-1.5B-Instruct-llamafied
Text Generation โข 2B โข Updated โข 3
Spaces of the Week
My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom ๐ค
- Running on ZeroAgentsFeatured731
StyleTTS 2
๐ฃ731Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on ZeroAgentsFeatured416
OpenDalle V1.1 GPU Demo
๐ผ416A demo of OpenDalle V1.1 on a ZERO GPU.
- Runtime errorAgentsFeatured74
RWKV Music
๐ต74Generate MIDI music using RWKV v4!
- Build errorFeatured144
MetaVoice 1B
๐ฃ144A demo of MetaVoice 1B, a new TTS model by MetaVoice.
Failed Experiments
Experiments that didn't work out.
Voice Acting Models
With LAION