dkpro-keyphrases's Issues

Remove KeyphraseCandidate type

The KeyphraseCandidate type is not necessary, since the candidate annotator and 
the filters can work directly on the Keyphrase type without any problem.

Original issue reported on code.google.com by [email protected] on 31 Jan 2014 at 1:45

Implement a POS filter

Implement a filter that removes candidates depending on whether they match a 
given part-of-speech (POS) pattern: either drop the candidates that match, or 
drop the ones that do not.

Original issue reported on code.google.com by [email protected] on 29 Jan 2014 at 7:06
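
The filter requested above could look roughly like the following. This is a minimal sketch, not DKPro Keyphrases code: PosCandidate and PosPatternFilter are hypothetical names, the POS pattern is assumed to be a regular expression over space-separated tags, and the tag names (ADJ, N, ADV, V) are only illustrative.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical candidate: surface text plus the POS tags of its tokens. */
class PosCandidate {
    final String text;
    final List<String> posTags;

    PosCandidate(String text, List<String> posTags) {
        this.text = text;
        this.posTags = posTags;
    }

    /** POS tags joined by single spaces, e.g. "ADJ N". */
    String posSequence() {
        return String.join(" ", posTags);
    }
}

/** Hypothetical filter that keeps candidates by POS pattern (assumption: regex over tags). */
class PosPatternFilter {
    private final Pattern pattern;
    private final boolean keepMatching;

    /**
     * @param regex        pattern that must match the whole tag sequence
     * @param keepMatching true keeps matching candidates, false keeps the others
     */
    PosPatternFilter(String regex, boolean keepMatching) {
        this.pattern = Pattern.compile(regex);
        this.keepMatching = keepMatching;
    }

    /** Returns a new list; the input list is not modified. */
    List<PosCandidate> filter(List<PosCandidate> candidates) {
        List<PosCandidate> kept = new ArrayList<>();
        for (PosCandidate c : candidates) {
            boolean matches = pattern.matcher(c.posSequence()).matches();
            if (matches == keepMatching) {
                kept.add(c);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<PosCandidate> candidates = Arrays.asList(
                new PosCandidate("keyphrase extraction", Arrays.asList("N", "N")),
                new PosCandidate("quickly run", Arrays.asList("ADV", "V")));
        // Keep only noun phrases: optional adjectives followed by one or more nouns.
        PosPatternFilter nounPhrases = new PosPatternFilter("(ADJ )*N( N)*", true);
        for (PosCandidate c : nounPhrases.filter(candidates)) {
            System.out.println(c.text);
        }
    }
}
```

A single boolean flag covers both behaviours asked for in the issue. In the real project the filter would presumably operate on annotations in the CAS rather than on a plain list.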

Tree-tagger Processes don't terminate

Hi,

I'm using DKPro Keyphrases' CooccurrenceGraphExtractor to extract keyphrases 
from various texts. The keyphrase extraction for the texts is
performed sequentially. However, my Windows Task-Manager reports that the 
tree-tagger processes do not terminate. So although I process the texts 
sequentially, a growing number of tree-tagger processes accumulates in my RAM 
until my RAM is used up completely.

My code that invokes the keyphrase extraction looks like this:


for (String text : allTexts) {
    CooccurrenceGraphExtractor extractor = new CooccurrenceGraphExtractor();
    extractor.setMinKeyphraseLength(2);
    extractor.setCandidate(new Candidate(CandidateType.Token, PosType.N));
    List<Keyphrase> keyphrases = extractor.extract(text);
    keyphrases = getTopRankedUniqueKeyphrases(keyphrases, keyphrases.size());

    // save text 
    ...
}

Is there a way to avoid this accumulation of tree-tagger processes?
Thanks in advance.

Sincerely yours,
Laura





Original issue reported on code.google.com by [email protected] on 8 Jul 2014 at 1:01

KeyphraseWriter should also output the keyphrases to a file

KeyphraseWriter is intended for evaluation. However, if we run a keyphrase 
extraction pipeline for several configurations using DKPro Lab, it is not easy 
to analyze the results when they are only printed to the console. 
KeyphraseWriter should offer the option of writing the extracted keyphrases to 
a file.

Original issue reported on code.google.com by [email protected] on 5 May 2014 at 8:02
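
One way the requested option could behave is sketched below. This is only an illustration under assumptions: KeyphraseOutput and its methods are hypothetical names, keyphrases are represented as plain strings, and the actual KeyphraseWriter is a pipeline component whose configuration is not shown here.

```java
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

/** Hypothetical helper: prints keyphrases to the console or to a file. */
class KeyphraseOutput {

    /** Writes one keyphrase per line to the given writer and flushes it. */
    static void writeTo(List<String> keyphrases, Writer out) {
        PrintWriter printer = new PrintWriter(out);
        for (String keyphrase : keyphrases) {
            printer.println(keyphrase);
        }
        printer.flush();
    }

    /**
     * Writes to the given file, or to the console when no file is configured.
     *
     * @param file target file, or null to keep the console behaviour
     */
    static void writeToFileOrConsole(List<String> keyphrases, Path file) throws IOException {
        if (file == null) {
            // System.out is deliberately left open.
            writeTo(keyphrases, new OutputStreamWriter(System.out, StandardCharsets.UTF_8));
        } else {
            try (Writer writer = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
                writeTo(keyphrases, writer);
            }
        }
    }
}
```

Keeping the console as the default when no file is given would leave existing pipelines unchanged, while each DKPro Lab configuration could point to its own output file.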

Drop kea wrapper

The Kea dependency is not available on Maven Central, and the release should 
not be blocked by that.


Original issue reported on code.google.com by [email protected] on 23 Jun 2014 at 3:17

Remove the Keyphrase mode from StructureFilter

Only keyphrase candidates should be removed from the indexes, not the 
keyphrases themselves. A keyphrase candidate is only a candidate, so it may be 
removed from the indexes after the filtering phase. Once a candidate has become 
a keyphrase, it no longer makes sense to remove it through a filter. The 
StructureFilter mode that filters out Keyphrase types should therefore be 
removed.

Original issue reported on code.google.com by [email protected] on 29 Jan 2014 at 2:35
