Comments (5)
Oh, ok, perfect then!
I noticed multiple improvements that were needed on that part and I see that rules are quite easy to write. That should make some rules way more accurate. But I understand the concern about the fact that the sentence can contain mistakes and, thus, the disambiguation could lead to misinterpretation. However, I'm quite confident that I can get good results. I'm gonna try and open a few MRs if you're ok. I will probably need a few trial and error to get something consistent, though.
from languagetool.
I noticed a few more examples:
"Il prend café"
Token | Lemma | Part-of-speech |
---|---|---|
Il | il | R pers suj 3 m s |
prend | prendre | V ind pres 3 s |
café | café | J e sp / N m s |
There is no way that café could be an adjective. There is not even a single name in the sentence...
"Il prend pelle"
Token | Lemma | Part-of-speech |
---|---|---|
Il | il | R pers suj 3 m s |
prend | prendre | V ind pres 3 s |
pelle | pelle / peller | N f s / V imp pres 2 s / V ind pres 1 s / V ind pres 3 s / V sub pres 1 s / V sub pres 3 s |
How can a conjugated verb follow a conjugated verb that is not an auxiliary? What could be the subject of "V sub pres 1 s"? That doesn't seem to make any sense.
from languagetool.
In the first sentence, équipes
is disambiguated correctly as a noun:
<S> Je[je/R pers suj 1 s] veux[vouloir/V ind pres 1 s] mieux[mieux/A] faire[faire/V inf] travailler[travailler/V inf] les[le/D e p,les/_GN_FP] équipes[équipe/N f p,équipes/_GN_FP] de[de/P] développement[développement/N m s] et[et/C coor] de[de/P] production[production/N f s,</S>]<P/>
from languagetool.
Interesting. It probably evolved recently I guess. The other 2 are still valid concerns, though.
from languagetool.
The words in the sentences start with all the tags present in the dictionary. Then, in disambiguation.xml
, we select some tags. But this is a difficult process. It is even more difficult with sentences that can contain errors.
In general, we do the minimal disambiguation necessary to make the grammar rules work well.
If you really need to disambiguate those cases for some grammar rules, we can try to improve the disambiguation rules. But if it is not needed, we will not invest time on this.
from languagetool.
Related Issues (20)
- Global spelling doesn't work 100% in LibreOffice? — 2024-04-03 HOT 11
- A bug in parsing dates in `UkrainianWordTokenizer` HOT 3
- [en] False warning EN_UNPAIRED_QUOTES
- Add “sanitorium” HOT 1
- Add Portuguese words
- [pt] “Elevar a escrita” — rule set - 2024-04-06 HOT 2
- [pt] Main suggestion not appearing HOT 8
- Use latest version of Indriya
- Weird behaviour with LT 6.4 and pipelinePrewarming=true
- [PT] “Etc.” and comma HOT 1
- Abbreviations issue
- Please add words
- Libreoffice addon doesn't work, clicking buttons does nothing HOT 2
- LT is not enabled on the GitHub web editor HOT 2
- [pt] Rule for "don't separate subject and verb with a comma" HOT 3
- [ca] bug in the last version of the add-on with l·l HOT 1
- [ca] deixar de marcar «baixar» com a no pronominal
- [pt] Number of examples in antipatterns HOT 2
- [DE] Fehlalarm mit "Mutter Theresa" HOT 4
- Potential replacements for fasttext?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from languagetool.