xbsdsongnan's Projects

textaudit

Implementation approach and demo of a text moderation module for a short-video app.

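text_review

This project aims to detect sensitive words in long and short texts and to semantically classify whole paragraphs or sentences, for the purpose of text moderation.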

transformertts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

transnet

This is the PyTorch implementation of the paper "TransNet: Full Attention Network for CSI Feedback in FDD Massive MIMO System". Please read the README.md for help reproducing the results.

tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

upscayl

🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.

v-express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

video-subtitle-remover

AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos. It outputs the cleaned image or video files with no loss of resolution, runs locally, and requires no third-party API.

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

visualglm-6b

Chinese and English multimodal conversational language model.

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

vpn

VPN software (小三VPN): completely free, with no registration, no speed limits, no traffic limits, no ads, and no intrusive behavior.

waifu2x

Image Super-Resolution for Anime-Style Art

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

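wenda

Wenda (闻达): an LLM invocation platform. It targets efficient content generation for specific environments, while taking into account the limited computing resources of individuals and small and medium-sized enterprises, as well as knowledge security and privacy.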

wenet

Transformer based ASR Engine.

wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

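wordscheck

Sensitive-word detection, prohibited-word filtering, sensitive-word filtering, and a sensitive-word lexicon. One-click startup, runs locally, supports private deployment, integrates in one minute, works out of the box, supports Docker, and offers an online API.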

zamia-speech

Open tools and data for cloudless automatic speech recognition

zhrtvc

Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system comprising a speech encoder, a speech synthesizer, a vocoder, and a visualization module.

zhvoice

Chinese voice corpus with clearer, more natural speech. It includes 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.
