Comments (3)
Thanks. I am not familiar with libchardet, but it seems like a new project. uchardet is relatively older and not yet up to date. I will consider collaborate with libchardet.
from uchardet.
Hi,
I had a look, and though the main page has a 2015 copyright, the code seems to not have had any update for nearly 2 years (last commit's age is "641d 09h" at the time of writing, which — I will assume — means 641 days old). So that's not such a new project actually.
Apart from this, looking at the main page and the header (http://svn.oops.org/wsvn/OOPS.libchardet/trunk/src/chardet.h), they seem to have 2 logics to detect encoding, one with a single call (though it still needs init() and free() calls which goes against the whole point), one by feeding data with potentially several calls (same as us), though I don't really understand why they create 2 internal objects for this.
They also have API to return the version of the library.
In the end, it seems similar to us, except that they don't use a clear namespaced naming (they don't prefix their functions with some recognizable pattern like chardet_ or whatever), which can be quite a problem on big problem because of name clashing.
I've added libchardet in our list of "Related Projects".
from uchardet.
Also I've checked their list of commits and they don't seem to have any significant fix (there was one commit saying it fixes TIS-620 but checking it contents, it just seems it was not activated on their code, that's all. Also I've created a TIS-620 test file and it works well with uchardet), or feature that we could use in uchardet.
This concludes this ticket that I will now close.
from uchardet.
Related Issues (20)
- Windows-1251 detection failed on a file in Russian. HOT 1
- Invalid WINDOWS-1255 file detected as WINDOWS-1255 HOT 10
- Detect files whose encoded has been corrupted by a text editor ? HOT 1
- PACKAGE_NAME opencc??? HOT 1
- Add a dbus service HOT 1
- Transferring to uchardet organization? HOT 17
- GB18030 file detected as WINDOWS-1252 HOT 6
- Can't detect GBK. HOT 2
- Next release HOT 4
- WINDOWS-1253 file detected as ISO-8859-7 HOT 10
- Possibly incomplete project license HOT 2
- LangModels refs error HOT 21
- Cast unsigned int HOT 1
- The code make me egg pain. HOT 1
- Can this code be used to make a Windows DLL? How? HOT 6
- UTF-8 with right single quote (U+2019) mistaken as Windows-1250 HOT 2
- lower case german umlauts in utf-8 are detected incorrectly HOT 1
- uchardet wrongly determines the text as WINDOWS-1252 HOT 2
- Make a portable executable
- libuchardet-ios.a能不能支持下iOS Simulator~~~~ HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from uchardet.