Code Monkey home page Code Monkey logo

Comments (5)

Sharcoux avatar Sharcoux commented on June 4, 2024 1

Oh, ok, perfect then!

I noticed multiple improvements that were needed on that part and I see that rules are quite easy to write. That should make some rules way more accurate. But I understand the concern about the fact that the sentence can contain mistakes and, thus, the disambiguation could lead to misinterpretation. However, I'm quite confident that I can get good results. I'm gonna try and open a few MRs if you're ok. I will probably need a few trial and error to get something consistent, though.

from languagetool.

Sharcoux avatar Sharcoux commented on June 4, 2024

I noticed a few more examples:


"Il prend café"

Token Lemma Part-of-speech
Il il R pers suj 3 m s
prend prendre V ind pres 3 s
café café J e sp / N m s

There is no way that café could be an adjective. There is not even a single name in the sentence...


"Il prend pelle"

Token Lemma Part-of-speech
Il il R pers suj 3 m s
prend prendre V ind pres 3 s
pelle pelle / peller N f s / V imp pres 2 s / V ind pres 1 s / V ind pres 3 s / V sub pres 1 s / V sub pres 3 s

How can a conjugated verb follow a conjugated verb that is not an auxiliary? What could be the subject of "V sub pres 1 s"? That doesn't seem to make any sense.

from languagetool.

jaumeortola avatar jaumeortola commented on June 4, 2024

In the first sentence, équipes is disambiguated correctly as a noun:

<S> Je[je/R pers suj 1 s] veux[vouloir/V ind pres 1 s] mieux[mieux/A] faire[faire/V inf] travailler[travailler/V inf] les[le/D e p,les/_GN_FP] équipes[équipe/N f p,équipes/_GN_FP] de[de/P] développement[développement/N m s] et[et/C coor] de[de/P] production[production/N f s,</S>]<P/>

from languagetool.

Sharcoux avatar Sharcoux commented on June 4, 2024

Interesting. It probably evolved recently I guess. The other 2 are still valid concerns, though.

from languagetool.

jaumeortola avatar jaumeortola commented on June 4, 2024

The words in the sentences start with all the tags present in the dictionary. Then, in disambiguation.xml, we select some tags. But this is a difficult process. It is even more difficult with sentences that can contain errors.
In general, we do the minimal disambiguation necessary to make the grammar rules work well.
If you really need to disambiguate those cases for some grammar rules, we can try to improve the disambiguation rules. But if it is not needed, we will not invest time on this.

from languagetool.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.