Code Monkey home page Code Monkey logo

konlp's People

Contributors

choraeng avatar hagazzusa avatar hyunyoung2 avatar konltk7 avatar luaperl avatar simple1007 avatar sonsunghwan avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

konlp's Issues

JVM init problem

jvm 을 사용하는 형태소 분석기들을 동시에 사용할때,
각자 종속적인 라이브러리 파일을 가지고 초기화하므로
별도의 파일에서 jvm 초기화 함수를 선언하고 그것을 사용할 필요가 있음

Version checking

Auto spacing's static library version checking.

Index's performance and library version checking.

So, check the status of dynamic library two ways.

Knu word2vec docstring modify

doctest 부분 import 경로 수정

  • before
from konlp.embedding import knu_word2vec
test = knu_word2vec()
  • after
from konlp.embedding import KnuWord2Vec
test = KnuWord2Vec()

kconv test

  • scan() function update
  • utf16 LE <-> utf16 BE
  • file, scan test

Using User Dictonary

How can we use user dictionary

KLT2000.ini File 에 UserDicName 에 euckr로 인코딩한 사용자사전 경로를 추가 했지만 동작하지 않습니다.
OutputLevel 에 1이 아닌 2를 할당 했을 때 분석결과가 2개나 오는것으로 보아 KLT2000.ini은 동작합니다.

비표준문자 분석이슈

'햇볓' 이라는 글자가 문장에 포함되어있으면 아예 분석결과가 나오지 않습니다.
'핱' 이라는 글자가 문장에 포함되어있으면 아예 분석결과가 나오지 않습니다.
k.analyze(simple_txt)

[]

Wen directory's error handling

reading sources... [100%] news                                                                                                                                                                
/home/hyunyoung2/KoNLTK/konlp/konlp/misc/kconv/kconv.py:docstring of konlp.misc.kconv.kconv:17: WARNING: Unexpected indentation.
/home/hyunyoung2/KoNLTK/konlp/konlp/misc/kconv/kconv.py:docstring of konlp.misc.kconv.kconv.convert:3: WARNING: Inline interpreted text or phrase reference start-string without end-string.
WARNING: autodoc: failed to import module 'konlp.test.test_kconv'; the following exception was raised:
No module named 'pytest'
WARNING: autodoc: failed to import module 'konlp.test.test_klt_kma'; the following exception was raised:
No module named 'pytest'
WARNING: autodoc: failed to import module 'konlp.test.test_tokenize_klt_autospacing'; the following exception was raised:
No module named 'pytest'
/home/hyunyoung2/KoNLTK/konlp/konlp/tokenize/klt/autospace.py:docstring of konlp.tokenize.klt.autospace.KltAsp.asp:12: WARNING: Inline interpreted text or phrase reference start-string without end-string.
looking for now-outdated files... none found
pickling environment... done
checking consistency... /home/hyunyoung2/KoNLTK/konlp/web/api/modules.rst: WARNING: document isn't included in any toctree

We got a error when we made website of Kontlk.

From now on we have to handle this error

We have to reduce the width of a line

>>> from konlp.kma.klt import klt
>>> k = klt.KltKma()
>>> simple_txt = "안녕하세요. 국민대학교 자연어처리 연구실입니다."
>>> k.analyze(simple_txt)
[('안녕하세요', [('안녕', 'N'), ('하', 't'), ('세요', 'e')]), ('.', [('.', 'q')]),
('국민대학교', [('국민대학교', 'N')]), ('자연어처리', [('자연어처리', 'N')]), ('연구실입니다',
[('연구실', 'N'), ('이', 'c'), ('습니다', 'e')]), ('.', [('.', 'q')])]
>>> k.morphs(simple_txt)
['안녕', '하', '세요', '.', '국민대학교', '자연어처리', '연구실', '이', '습니다', '.']
>>> k.nouns(simple_txt)
['안녕', '국민대학교', '자연어처리', '연구실']

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.