Code Monkey home page Code Monkey logo

naturallanguage's People

Contributors

parkerburchett avatar

Watchers

 avatar

naturallanguage's Issues

Add a method that unpickles all the relevant things

Right now you have

classifierList = open("C:/Users/parke/Documents/GitHub/NaturalLanguage/NaturalLanguage/OwnPrograms/pickled_TrainedClassifierList.pickle", "rb")
classifiers = pickle.load(classifierList)
classifierList.close()

testingSetIn = open("C:/Users/parke/Documents/GitHub/NaturalLanguage/NaturalLanguage/OwnPrograms/pickled_TestingData.pickle", "rb")
TestingSet = pickle.load(testingSetIn)
testingSetIn.close()

word_featuresIN = open("C:/Users/parke/Documents/GitHub/NaturalLanguage/NaturalLanguage/OwnPrograms/pickled_word_features.pickle", "rb")
word_features = pickle.load(word_featuresIN)
word_featuresIN.close()

documentsIN = open("C:/Users/parke/Documents/GitHub/NaturalLanguage/NaturalLanguage/OwnPrograms/pickled_documents.pickle","rb")
documents = pickle.load(documentsIN)
documentsIN.close()

You should put that in a method in general so that it is more clear

Replace NuSVM_Classifier

I don't know why this classifier takes forever but it is not loading. Replace this classifier with a different classifier.

You want to have 7 different classifiers so that there is an odd number in the vote classifier

POS tag is broken

Look at POS3.

The word features are all the same. actually Identical. when they should be different based on the rt of speech. But the accuracy is different for the classifiers.

I have no idea why this is the case. The differences might be just random variation.

Refactor PickleEveryThing.py

Need to rewrite this to send all the pickled info into a single folder, and write a method that will unpickle all the stuff in that folder as is.

Bug Fix sentiment(text)

Right now there is an error deep in numpy. I should make sure that the pickled algos work and are accurate, before I try to make sentiment work.

Write a method that takes in a tweet and spits out a prediction.

Pseudocode:
You need to have word_features here as well.

Convert Tweet via word_tokenize -> call this 'words'

Convert 'words' to a long boolean vector -> call this tweet_as_vector

Pass 'tweet_as_vector' to the VoteClassifier -> return positive or negative

You might want to create 9 different slight variations of the classifier (eg different cost functions or initial parameters). Then when there is 7,8 or 9 votes for one of the modes return 'Positive" or "Negative" or "Unsure"

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.