torch 2.1.2
torchaudio 2.1.2
torchmetrics 1.3.0.post0
torchvision 0.16.2
cuda_12.2.r12.2
Driver Version: 535.54.03
从小到大,都是别人教你:该做什么,不该做什么,其实,人生,这么复杂,哪里是一句:一份耕耘、一份收获,就可以讲的清楚的呢?遵从内心一次。
/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/utils/weight_norm.py:30: UserWarning: torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.
warnings.warn("torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.")
2024-01-25 04:19:39.209 | INFO | __main__:main:51 - Restored model from checkpoint
2024-01-25 04:19:39.210 | INFO | __main__:main:54 - Processing in-place reconstruction of lzl1.wav
2024-01-25 04:19:40.343 | INFO | __main__:main:62 - Loaded audio with 17.06 seconds
2024-01-25 04:19:41.554 | INFO | __main__:main:102 - Generated indices of shape torch.Size([4, 368])
2024-01-25 04:19:42.025 | INFO | __main__:main:126 - VQ Encoded, indices: torch.Size([4, 1, 368, 1]) equivalent to 21.53 Hz
Traceback (most recent call last):
File "/path/to/fish-speech/tools/vqgan/inference.py", line 147, in <module>
main()
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/amp/autocast_mode.py", line 16, in decorate_autocast
return func(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/click/core.py", line 1157, in __call__
return self.main(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/click/core.py", line 1078, in main
rv = self.invoke(ctx)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/click/core.py", line 1434, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/click/core.py", line 783, in invoke
return __callback(*args, **kwargs)
File "/path/to/fish-speech/tools/vqgan/inference.py", line 135, in main
fake_audios = model.generator(decoded_mels)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
File "/path/to/fish-speech/fish_speech/models/vqgan/modules/decoder.py", line 76, in forward
xs += self.resblocks[i * self.num_kernels + j](x)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
File "/path/to/fish-speech/fish_speech/models/vqgan/modules/decoder.py", line 172, in forward
xt = c1(xt)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/conv.py", line 310, in forward
return self._conv_forward(input, self.weight, self.bias)
File "/root/anaconda3/envs/fish-speech/lib/python3.10/site-packages/torch/nn/modules/conv.py", line 306, in _conv_forward
return F.conv1d(input, weight, bias, self.stride,
RuntimeError: Invalid argument