Comments (4)
So is the accent on particular characters? You could define tags at character levels and basically work with a character level Transformer Encoder.
from torchnlp.
Yes, the accent is on specific letter.
Does Transform need a dictionary for character level taggng?
What should my next steps be in order to train Transformer accentation on Lithuanina language. I have a dataset of ~13 K sentences with accentation. I'm suspicious it may not be enough to train Transformer though, but I'm very keen to try...
from torchnlp.
I think you can map the input directly to the unicode character values. The infrastructure around the Tagger classes currently works at a word (+char) level. We'll have to make it more generic to handle character only input (An incentive for me to work on this!).
But the Transformer module is independent of the input (check this file).
13K sentences should be more than enough if you're working at a character level. Do you have tags for each character (including none)?
from torchnlp.
I do have tags for each char.
from torchnlp.
Related Issues (15)
- import error HOT 1
- Batch size stuck at 100 HOT 2
- Add & Norm HOT 1
- Question: time series
- How to mask <pad> in sentence?
- Training killed HOT 1
- Issue with installation of torchnlp per instructions here
- kernel size in position-wise ffn
- is there a way to port it to 0.3.0? HOT 3
- F score on NER task HOT 4
- . HOT 1
- POS Tagging HOT 1
- python setup.py error: no commands supplied
- Can't get to run torchnlp.ner properly HOT 11
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from torchnlp.