A Spark Between Voice and Text
24
Public repositories
41,163
Total stars
3,457
Followers
Fish Audio has a significant public presence on GitHub, showcasing a wide range of repositories primarily in Python, TypeScript, and C#. Notable projects include fish-speech, a state-of-the-art open-source TTS system, and Bert-VITS2, which integrates multilingual capabilities. Their repositories focus on audio processing, voice synthesis, and related frameworks.
SOTA Open Source TTS
vits2 backbone with multilingual-bert
An easy to understand TTS / SVS / SVC framework
Preprocess Audio for training
The official Python library for the Fish Audio API.
No description provided for this repository.
RTVC: Real-Time Voice Conversion GUI
OpenUTAU renderer for diffsinger / 适用于diffsinger的OpenUTAU渲染器,使用方法:https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95%EF%BC%88%E4%B8%AD%E6%96%87%EF%BC%89
No description provided for this repository.
A simple svs labeling tool
No description provided for this repository.
No description provided for this repository.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Official documentation for products, services, and projects by Fish Audio
The official n8n node for the Fish Audio API.
The official Go SDK for the Fish Audio API.
No description provided for this repository.
⚡ A Simple / Speedy / Secure Link Shortener with Analytics, 100% run on Cloudflare.
Actix Web is a powerful, pragmatic, and extremely fast web framework for Rust.
Build cross-platform Native Progressive Web Apps for iOS, Android, and the Web ⚡️
No description provided for this repository.
An open framework and intermediary model for converters among project files of various singing voice synthesizers
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).
Hanabi interview demo
Fish Audio builds various tools related to voice and audio processing on GitHub. Their repositories include frameworks for text-to-speech and voice synthesis, such as fish-speech and fish-diffusion, which cater to developers in the audio technology space.
Fish Audio primarily uses Python, TypeScript, and C#. Their public repositories reflect this, featuring several projects that leverage these languages for developing audio processing tools and libraries, including fish-audio-python and OpenUtau.
Yes, Fish Audio's repositories are all public on GitHub. This openness allows developers and researchers to access and contribute to their projects, enhancing collaboration in the fields of voice synthesis and audio technology.
Monitor Fish Audio with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account