narusemioshirakana / dragonianvoice Goto Github PK

View Code? Open in Web Editor NEW

974.0 17.0 115.0 103.98 MB

多个SVC/TTS的C++推理库

License: GNU Affero General Public License v3.0

C++ 42.89% C 56.52% C# 0.52% CMake 0.06% Python 0.01%

pytorch hifigan onnxruntime tacotron2 tts vits sovits diffsvc diffsinger rvc

dragonianvoice's Introduction

这里是纳鲁赛繆 · 尤梅米 · 希娜卡纳

❤️ 这里是 MoeSS/MoeVoiceStudio/LibDlVoiceCodec(LibSvc & LibTTS & LibSvs) 的开发者。其中LibDlVoiceCodec为多个 "Svc/TTS/Svs" 仓库的 C/Cpp/C# Onnx 推理程序仓库以及自研张量库和基于自研张量库的"Svc/TTS/Svs"推理仓库，目前后者正处于开发阶段，而前者已经可以稳定使用；而MoeSS/MoeVoiceStudio则为基于LibDlVoiceCodec开发的UI应用软件，可以比较方便的编辑LibDlVoiceCodec中的参数，以及实现专业歌声合成软件的部分功能。
✉️ 我的邮箱：[email protected]
⚡ 爱发电地址：纳鲁赛繆 · 尤梅米 · 希娜卡纳

My Skill Set

使用的语言

AI相关

杂项

GitHub状态

dragonianvoice's People

Contributors

Stargazers

Watchers

Forkers

wiinew shmassociation forlorn233 clocke181 sdlibowen boltzmannentropy jellybrick yuquan-zuo lucifere02 332plim ikuseiso nakatsusizuru kotori05 jaychan001 miuzarte 97001 metaalms isgasho qiaolinwang f-tang xiyang233 jackylee1 ningpengtao-coder jsionr avcssef flyingwince tdroseval qianniaoofficial behpozl ezhangle vikingmew pofengzhiyi freekatz hhh546 tankini miosavart 33646341 dingguoli ustr0ate xyzjm abdm357 uzstudio nbblscott yyxcvc cyborgparadisum yes2eyes arwin-cc yida-9527 panhiuchuen scottsln mimix1i0 turbobos zwluoqi yinjake panerure jovidong hongwen-sun zhqnbq ricecakey06 sunsetmkt 1357565032 road2018 spiderbord cuphead2006 jwjohns rashtug2 abdhussin123 marlonsagui liucr bobqiu lcsouzamenezes percychai axfox airadsk zyfily bewitching-coder shosseini811 duelqiuqiu zzmaze duhuasong bigdatasciencegroup otosaka02 cherub0526 2660890854 iopav lemon22333 gvs xiguazhiprince ototao pzx-star ellyiiioo sakurasblossoms nekrogeddon bsdmylove suryatmodulus zejay sanyaade-teachings zhixingheyixsh wyd520520 l9961chn

dragonianvoice's Issues

Open command line or API interface

Enable MoeSS to easily collaborate with other open source projects.

VITS模型放在mods目录下没反应

Describe the bug
如题所述，按照格式写了json文件，打开exe没报错了，也没反应。

GPU版本无法使用，打开直接闪退

已安装全部依赖+所有插件，CPU版本正常运行，GPU版本正常打开，选择模型后几秒钟闪退

同一个RVC的ONNX模型在之前的moess-win7里面可以用，换到这个里面报错了

模型配置:

    "Folder" : "march7",
    "Name" : "march7",
    "Type" : "RVC",
    "Rate" : 48000,
    "Hop" : 320,
    "Cleaner" : "",
    "Hubert": "hubert4.0",
    "Diffusion": false,
    "CharaMix": true,
    "Volume": false,
    "HiddenSize": 256,
    "Characters" : ["march7"]

模型地址：https://huggingface.co/spaces/qinzhu/girlfriend/resolve/main/march7_RVC.onnx
错误信息:

[ERROR][In "GPT-SoVits.cpp" Line 594] Locate: SoVits Non-zero status code returned while running Reshape node. Name:'/vq_model/enc_p/encoder_ssl/attn_layers.0/Reshape_7' Status Message:

我用的GPT-SOVITS的onnx_export.py导出的模型,报这个错，怎么改导出的设置啊

[ERROR][In "GPT-SoVits.cpp" Line 594] Locate: SoVits
Non-zero status code returned while running Reshape node. Name:'/vq_model/enc_p/encoder_ssl/attn_layers.0/Reshape_7' Status Message:

RVC 的json模板有误

如图，这个字段为true时，根据代码，软件会将其识别成SoVits模型（在加载模型的时候提示找不到 xxx_SoVits.onnx）

[求助]关于集成libsvc并推理RVC-ONNX的一些问题

你好，我集成了libsvc至我的工程中，但在使用NativeAPI.h提供的接口进行推理时，遇到了一个棘手的问题，请问能否帮忙解答一下呢？

我之前使用pytorch和pth的时候，pcm的输入个数与输出相同。
但是在使用LibSvcInferSlice接口时，发现pcm的输入和输出无法应对上。
跟进函数内部后，发现推理前会先进行hubert推理，此时得到的hubert输出大小与我预期的onnx-rvc模型的pcm输入个数也有出入。
即便我调整hubert输入大小，使得hubert输出大小刚好匹配上，使之能够得到finaOut时，却发现finaOut固定输出大小为20000，即便最后通过采样率计算resize之后，也与我实际的输入pcm个数不匹配。最终导致我输出的音频播放要么断断续续，要么声音不正确。
请问，我该如何确保LibSvcInferSlice接口的输入大小与输出大小匹配呢？

在加载模型时闪退

正在使用CPU版本；Models使用的模型来源为他人分享；目录结构如下：

点击该“加载模型”按钮时，GUI将无征兆地闪退。且内存、显存并无明显波动。

CUDA版本为12.1，不符合GPU版本使用条件；但是考虑到使用的是CPU，理应没有影响。

log内容如下：

[Info] Removing Env & Release Memory
[Info] Complete!
[Info] Creating Env
[Info] Env Created

烦请大佬解疑释惑，提前感谢。

新版仍然无法使用

前置包已经下完按说明解压。

CPU版加载模型后出现错误提示。

GPU版加载模型后没有错误提示，几秒后自己闪退。

啥时候搞一下GPT-soVITS

Model not recognized, So Vits 4.0

MoeSS not recognizing model:

Hubert4.0 Model file directory:

Mods folder directory:

Model folder directory:

Config json file:

{
    "Folder" : "hapiv2",
    "Name" : "hapiv2",
    "Type" : "SoVits",
    "Rate" : 44100,
    "Hop" : 512,
    "Cleaner" : "",
    "Hubert": "hubert4.0",
    "SoVits4": true,
    "Characters" : [""]
}
//Hop：HopLength of the model, if you don't know what it is you are advised to look up the information on the internet. This must be filled in the configuration file of the SoVits model.（The value must be the one you set during training and can be seen in the configuration file you used to train the model）
//Cleaner：The name of the plugin,can be left blank, but if it is filled in, the corresponding CleanerDll must be placed in the Cleaner folder, if the Dll does not exist or if there is an internal error in the Dll, it will report an error when loading the model
//Hubert：Hubert model name, required and must be placed in the "Hubert" folder for Hubert models downloaded from the sub-model repository
//Characters：For multi-speaker model this must be filled in as a list of your speakers' names, for single-speaker model it can be left out

Am I missing something here? It doesnt seem to be recognizing the model.

Failed to Inference

弹窗

推理切片或用旧版方式推理均有该弹窗，

RVC的json文件：

{
    "Folder" : "c_RVC",
    "Name" : "c",
    "Type" : "RVC",
    "Rate" : 40000,
    "Hop" : 400,
    "Cleaner" : "",
    "Hubert": "hubert4.0",
    "Diffusion": false,
    "CharaMix": true,
    "Volume": false,
    "HiddenSize": 256,
    "Characters" : []
}

设置

Is this compatible with so-vits-svc-fork?

https://github.com/voicepaw/so-vits-svc-fork
If yes, please point me to how to use it,
Thanks

bert-vits2 中文特化版支持吗？

请问如果支持的话该如何使用呢？
我看了中文特化用到的二郎神bert，好像不太好导出onnx？我导出时产生了300多个文件而不是一个单独的onnx文件。

载入时提示加载模型失败

载入模型后提示加载模型失败

不考虑支持一下4.0-Vec768-Layer12吗似乎只支持到V2

Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Describe the solution you'd like
A clear and concise description of what you want to happen.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

为什么要用Python，为什么不保持C/C++的纯洁性

readme里面描述的一些支持的模型在源代码里面没有。

能正常使用exe支持RVC模型的转换，但是我发现最新的代码里面没有相关的代码。

exe运行问题

当我这样放置json文件时，点击exe无法出现ui界面，

当我把json文件放入模型文件夹中，可以运行exe，但是无法载入模型。

我的模型是通过ddsp 5.0训练的，这是我的json文件内容，
请问问题出在哪里呢？

关于新版的一些问题

新版tts支持生成特殊语气的语音么，比如特别愤怒，特别开心，等不同的语气和情绪
新版tts支持中文生成不，是只要模型支持就可以是吧

请问`转换ONNX的程序`在哪？

模型需要转换为ONNX模型，转换ONNX的程序我已经pull到每个项目的源仓库了，PTH不能直接用！！！！！！！！！！！！！

请问ReadMe中提到的pth转换ONNX的程序在哪里？

onnx 模型

您好，我想用onnx推理，但是项目里用python推得脚本我不确定输入输出是什么。换个说法就是我该用哪个模型转的onnx来作为model.onnx呢？我用sovits-4.1训好了一个，但是转onnx后不可以直接用该项目的python脚本推理，非常感谢！

无法加载任何模型

当前的Release版本，下载后同时去下载了三个模型，并解压到了Mods文件夹

打开exe不弹窗报错，但是模型列表为空，仅显示未选中

// ====
经查看.mod文件，发现与readme不符，

将 Mdid 修改为 Folder 后重试，依旧无法加载，请问怎么才能运行呢

error when use cuda

环境创建失败!
OrtException:
[ln "EnwManager.cpp" Line 68]
D: a work)1s onnxruntime core session(provider bridge ort.cc:1209
onnxruntimeProviderLibrary..Get [ONNXRuntimeError] : 1 : FAIL :
loadLibrary failed with error 126 " when trying to load
CUsers Administrator Downloads MoeVoicestudio.-1TS MoeVoicest
udio - T1s onnxruntime providers cuda.dll

加载插件失败，可能是插件文件不存在（没有更多的报错信息）

我已经安装了所有的前置模型还有两个预制模型。但是当我选择这个模型的时候会有这个报错。
我找不到LowerCharacters这个cleaner。请问在哪里可以获取？

vits 转 onnx能提供下思路吗？

官方vits pth转换成onnx只能固定维度使用，设置dynamic_axes也不行，请问任意音频长度
你是怎么转换的？谢谢。

ShirohaSoftVits.7z解压需要密码

你好，感谢你的工作，我下载了[ShirohaSoftVits.7z模型但解压需要密码，能告诉我密码是多少吗

MoeSS3.0使用SoVits模型转换语音时出现如下错误

$1S9}9DZ$2 H_1}{}SE0WHKE$

加载插件失败，可能是插件文件不存在

失败

加载插件失败，可能是插件文件不存在

确定

错误 C2039 "MemberBegin": 不是 "MJsonValue" 的成员

严重性代码说明项目文件行禁止显示状态详细信息
错误 C2039 "MemberBegin": 不是 "MJsonValue" 的成员

求arm linux 开发板能跑

预训练模型

请问预训练模型是不是没有公开

没有调用GPU推理

首先感谢作者，总体使用很方便，效果不错

但是我在运行时默认调用的是CPU，而不是GPU，请问这可能是什么原因导致的？
我使用的是SoVITS的模型，之前用Python推理时没有这个问题

2.5.0版本，使用SoVits模型，转换文本时报无法创建ffmpeg任务

距离构建成功还差一步

问下作者用的VS那个版本构建的。
我用VS2019编译过了，链接出错：

Linked 错误 C1007 无法识别的标志“-Zc:nrvo”(在“p2”中)

可能是lib链接的问题。

SoVits Error: cannot open audio file

When I try to use SoVits, it worked well when I converted my first .wav file. However, after I saved the first converted file, it reports the error that it cannot open the audio file when I try to convert the second .wav file.

my system is Windows 11, and I used the cuda version of Moess.

I'm sure the file address is right, when I close the software and restart it. I can convert my sound file. The problem still happens when I try to convert the second file.

By the way, is this software support f0 inference of SoVits? How to set f0? In Sovits command it is used as an option parameter "-a".

Thanks for your attention!

遇到的问题

我使用RVC的cmd推导onnx在输入完所有信息（模型到音频路径）之后，在运行时，会直接闪退，并且没有音频文件输出出来。

然后我在使用moevs的ui界面时，怎么导入diffusion模型（需要傻瓜式教程，我用rvc模型的）

我不导入diffusion模型，在加载模型会闪退，在导入音频无法导入（能对音频切片页面，但是没有导入进来）
log：
[Info] Removing Env & Release Memory
[Info] Complete!
[Info] Creating Env
[Info] Env Created