gouwsmeister / textcleanser
Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".
License: GNU General Public License v3.0
I cannot find the tweet-lm file in the repo. Could you add it, or explain how it can be generated? Thanks.
There seems to be an error with this line:
command = "%slattice-tool -in-lattice - -read-mesh -posterior-decode -zeroprob-word blemish -use-server %s " % (LATTICE_TOOL_DIR, NGRAM_SERVER_IP)
Hello Sir,
I'm trying to run your program, but it fails with an AssertionError. What does that mean?
I'm planning to use this for my research work on normalizing tweet data. The message looks like this:
python cleanser.py
Traceback (most recent call last):
  File "cleanser.py", line 85, in <module>
    tc = TextCleanser()
  File "cleanser.py", line 19, in __init__
    self.decoder = Decoder()
  File "/Users/cliefsengkey/Dropbox/RESEARCH-LDA/LDA-GENSIM/TextCleanser/decoder.py", line 27, in __init__
    assert os.path.exists(LATTICE_TOOL_DIR + "lattice-tool")
AssertionError
Thank You,
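The assertion in the traceback fails when `LATTICE_TOOL_DIR + "lattice-tool"` does not name an existing file. Because the path is built by plain string concatenation, a `LATTICE_TOOL_DIR` without a trailing slash would also produce a wrong path. A minimal sketch of a more forgiving check follows; `find_lattice_tool` is a hypothetical helper, not part of the repository, and it uses `os.path.join` in place of the concatenation.

```python
import os


def find_lattice_tool(tool_dir):
    """Return the full path to lattice-tool inside tool_dir, or None if absent.

    Hypothetical helper for illustration; os.path.join gives the same path
    whether or not tool_dir ends with a path separator.
    """
    candidate = os.path.join(tool_dir, "lattice-tool")
    return candidate if os.path.exists(candidate) else None
```

If this returns `None` for the directory configured as `LATTICE_TOOL_DIR`, the likely explanation is that the `lattice-tool` binary is not installed there or the setting points at the wrong directory.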
On running python generator.py, I receive the following error:
Traceback (most recent call last):
  File "generator.py", line 557, in <module>
    print gen.word_generate_candidates(w)
TypeError: word_generate_candidates() takes at least 3 arguments (2 given)
What am I missing? Kindly help.
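In Python 2 this message counts `self`, so "at least 3 arguments (2 given)" means the method expects at least two arguments besides `self` and the call passed only `w`. The sketch below shows one way to list the required arguments of a method. It is written for Python 3 (`inspect.signature` is not available in Python 2), and the `Generator` class with its `context` and `limit` parameters is a hypothetical stand-in; the real signature is whatever generator.py defines.

```python
import inspect


class Generator(object):
    # Hypothetical stand-in: two required arguments plus one optional one.
    def word_generate_candidates(self, word, context, limit=10):
        return [word]


def required_args(func):
    """Return the names of positional parameters that have no default value."""
    positional = (inspect.Parameter.POSITIONAL_ONLY,
                  inspect.Parameter.POSITIONAL_OR_KEYWORD)
    return [p.name
            for p in inspect.signature(func).parameters.values()
            if p.kind in positional and p.default is inspect.Parameter.empty]
```

Called on a bound method such as `Generator().word_generate_candidates`, the helper omits `self`, so its result is the list of arguments the caller still has to supply.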
When running ./start_ngram_server.sh, it shows the following message:
Starting prob server on port 12345
could not bind socket:
How can I solve this problem?
Thank you!
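One common cause of a "could not bind socket" message is that another process already holds the port, for example an earlier server instance that is still running. That is an assumption here, since the message gives no further detail. A small sketch to test it, using only the standard `socket` module (`port_is_free` is a hypothetical helper name):

```python
import socket


def port_is_free(port, host="127.0.0.1"):
    """Return True if a TCP socket can currently be bound to (host, port)."""
    probe = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        probe.bind((host, port))
        return True
    except OSError:
        # Typically "address already in use" or a permissions problem.
        return False
    finally:
        probe.close()
```

Running `port_is_free(12345)` before starting the script shows whether the port from the message above is available on the local machine.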
Hi,
I'm getting this error while running the start server script:
/home/chandresh/ckm/code/lexicalNormalization/TextCleanser/data/latimes-lm.gz.part0: line 115443: reached EOF before \end\
format error in mix-lm file /home/chandresh/ckm/code/lexicalNormalization/TextCleanser/data/latimes-lm.gz.part0
Can you look into it?