Code Monkey home page Code Monkey logo

ntextcat's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

ntextcat's Issues

its appear to not recognize hebrew right

so i used your package here and i copy-pasted your code and added the xml and the txt and lm files from those folders into my app
and its appears to recognize hebrew as danish for some reason
you can try this to check it yourself:
var languages = identifier.Identify("קדימה");
and when i checked xml hebrew is apparent there

Turkish language not detected properly?

I have used documents of varying sizes in Turkish but it always gets detected as Swedish or Norwegian. Is there a known issue with detecting Turkish? Thanks.

Online demo update

The online demo webpage needs to be updated.
Requirements:

  • should have a header
  • should have a text area
  • should have button "Get language" which identifies the language of the text in the text area (POST to a remote API).
    • (nice to have) if 3 seconds have passed after the last time the user has finished typing the last character and if this text has not been sent to the identification API yet, then start the identification process automatically (send to the API and update the status accordingly).
      • the visual effect of the automatic identification is the same as when hitting the button.
  • should have a status label:
    • it is blank at the start
    • when the identification process has started, it shows "Identifying the language of the text...".
      • (nice to have) the label would shows a spinner ("washing machine") icon which indicates the request is in progress.
    • when there is a result, it says "The language of the text is eng" (or another 3-letter code).
    • when there is an error, it renders the text of the error from the JSON in red.

Is this project alive?

Hi Ivan,

Are you accepting any PR's or plan to do a .net core/standard version?

Would you consider allowing other contributors on this repo?

Thanks and well cone on your port, it is a great accomplishment.

How to use your library?

Could you give a small example of using your library?

win 7x64
vs - 2017

Installed "ntextcat" through "nuget"
I need to determine the language of the text that is entered in "textBox2.Text".
Result output in "textBox1.Text"
It is supposed to enter the text: European languages, languages ​​with hieroglyphs (Chinese, Japanese) and others

Found sample code.
I get a string error
var identifier = factory.Load("NTextCat 0.2.1.1\\LanguageModels\\Core14.profile.xml");

cod

using NTextCat;

namespace rsh
{
    public partial class Form2 : Form
    {
        public Form2()
        {
            InitializeComponent();
        }

        private void button1_Click(object sender, EventArgs e)
        {
            var factory = new RankedLanguageIdentifierFactory();
            var identifier = factory.Load("NTextCat 0.2.1.1\\LanguageModels\\Core14.profile.xml");
            var languages = identifier.Identify(textBox2.Text);
            var mostCertainLanguage = languages.FirstOrDefault();

            textBox1.Text = mostCertainLanguage.Item1.Iso639_3;
        }
    }
}

How to solve the problem?

2018-10-14_18-48-10

Несколько языков в тексте

Иван добрый день,
какая логика определения языка, если в тексте встречаются фразы на 2х или 3х языках сразу?
Спасибо за вашу работу!

thread safe?

is the RankedLanguageIdentifierFactory and RankedLanguageIdentifier thread safe?

Undefined license

Thanks for making your library available. Could you state the license/terms of use?

.NetStandard Compatibility

Looking at your library, it has no dependencies, and so should be very simple (less than a couple of hours work) to convert to .Net Standard.
Do you have any plans to do this?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.