techthiyanes
Name: Thiya
Type: User
Bio: Data Scientist
Location: Bengaluru
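Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text error correction, text summarization, topic keywords, synonyms and near-synonyms, and event triple extraction)
Text classification using RNN, LSTM, GRU, fastText, TextCNN, DPCNN, RNN-Att, and LSTM-Att; compatible with huggingface/transformers, and also supports using transformers as the word-embedding model followed by CNN, RNN, attention, etc. Includes a comparison of the models.
Chinese text rewriting, adapted from the LaserTagger model. Text paraphrasing: Chinese text data augmentation based on LaserTagger.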
Notebooks to better understand text generation
Utility for Text Normalisation or Inverse Normalisation
Text similarity using BERT sentence embeddings
A PyTorch-based knowledge distillation toolkit for natural language processing
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Text classifier for Hierarchical Attention Networks for Document Classification
A Python library for calculating a large variety of statistics from text
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
A Model for Natural Language Attack on Text Classification and Inference
textgen, Text Generation models. Implementations of text generation models including UDA, GPT2, Seq2Seq, BART, and T5, ready to use out of the box.
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
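Natural language processing experiments (Sogou dataset): TF-IDF, text classification, clustering, word vectors, sentiment recognition, relation extraction, etc.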
Library for Textless Spoken Language Processing
PyTorch-based Chinese semantic similarity matching models (ABCNN, Albert, Bert, BIMPM, DecomposableAttention, DistilBert, ESIM, RE2, Roberta, SiaGRU, XlNet)
QAmatch (qa_match): text matching, text classification, text embedding, text clustering, and text retrieval (bow/tfidf/ngramtf-df/bert/albert/bm25/…/nn/gbdt/xgb/kmeans/dbscan/faiss/….)
Text analysis with networks.
TEXTOIR is a flexible toolkit for open intent detection and discovery. (ACL 2021)
A PyTorch-based model pruning toolkit for pre-trained language models
A synthetic data generator for text recognition
Reinforcement learning in text generation with transformers
Short text similarity
Textual is a TUI (Text User Interface) framework for Python inspired by modern web development.
Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep learning projects in PyTorch.