zimreader-java's Introduction

ZIMReader in Java

This is a Java port of ZIMReader. One of the aims of this project is to enable developers targeting Android, J2ME and other platforms to use ZIM files and build offline Wikipedia readers.

I'll add Javadoc soon; in the meantime, you can go through the comments that I have provided in the source code. Also, try running the example ZIMTest.java.

This code was built on Java 1.6 and has not been tested on earlier versions; I plan to test those soon. In the next release, I intend to provide an Ant build file.

If you find any bugs, please report them to or visit the IRC channel #openzim on Freenode and ping 'gremmachook'.

This library is licensed under the LGPL v3.0. However, I understand that licensing can sometimes be a problem for you. I would be happy to provide an alternative, less restrictive license if need be.

Found this library useful? Drop me a mail; I love to hear feedback.

Before this ends, I'd like to thank Lasse Collin <Lasse.collin at tukaani.org>, who maintains the Tukaani project, for his port of XZ in Java, without which it wouldn't have been possible for me to write this library.

-- Arunesh Mathur

zimreader-java's Issues

Trouble parsing wikipedia zim file with zimreader-java

Hi everyone,

For some reason, I see a parse error when I use this zim reader https://github.com/wikimedia/openzim/tree/master/zimreader-java with the latest wikipedia zim file (http://download.kiwix.org/zim/wikipedia_en_all.zim).

I'm searching for an article called 'Chocolate', and the compression type of the cluster (i.e., the first byte of the cluster) ends up being something odd like '189', which is not one of the acceptable types (0, 1 or 4). Is the offset incorrect? The md5 checksum of the file matches, so I know the file isn't corrupt. What could it be? The file header seems to be OK, as far as I can tell. Any ideas?

Magic number: 72173914
Version: 5
UUID: -4097
Article Count: 17454230
Cluster Count: 90296
URL pointer position: 236
Title position: 139634076
Cluster position: 1237308322
Mime list position: 80
Main page: -1
Layout page: -1
Checksum position: -1728689809
Geo position: 2087809897
mime types:
ion/javascript
application/ogg
application/pdf
image/gif
image/jpeg
image/png
image/svg+xml
image/tiff
inode/symlink
text/css
text/html
text/plain
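
One thing that may be worth ruling out, given the negative checksum position in the dump above: the pointer fields in the ZIM header are 64-bit little-endian values, and a reader that narrows them to a 32-bit int will produce wrong (possibly negative) offsets on files larger than 2 GB. The sketch below is only an illustration of that idea, not the code of zimreader-java; the class name `ZimHeaderSketch`, its fields, and the byte offsets (taken from the 80-byte openZIM header layout as I understand it) are assumptions. The accepted compression values 0, 1 and 4 come from the report above.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

/** Hypothetical helper, not part of zimreader-java: decodes the fixed-size ZIM header. */
class ZimHeaderSketch {
    static final int HEADER_SIZE = 80;          // assumed header length (mime list starts at 80)
    static final long ZIM_MAGIC = 72173914L;    // same magic number as in the dump above

    // 32-bit counts are widened to long so they are never negative.
    final long magic, articleCount, clusterCount;
    // 64-bit file offsets: these must stay long on files larger than 2 GB.
    final long urlPtrPos, titlePtrPos, clusterPtrPos, mimeListPos, checksumPos;

    ZimHeaderSketch(byte[] raw) {
        ByteBuffer buf = ByteBuffer.wrap(raw, 0, HEADER_SIZE).order(ByteOrder.LITTLE_ENDIAN);
        magic         = buf.getInt(0)  & 0xFFFFFFFFL;
        // bytes 4..7 hold the version, bytes 8..23 the UUID (skipped here)
        articleCount  = buf.getInt(24) & 0xFFFFFFFFL;
        clusterCount  = buf.getInt(28) & 0xFFFFFFFFL;
        urlPtrPos     = buf.getLong(32);
        titlePtrPos   = buf.getLong(40);
        clusterPtrPos = buf.getLong(48);
        mimeListPos   = buf.getLong(56);
        // bytes 64..71 hold the main page and layout page indexes (skipped here)
        checksumPos   = buf.getLong(72);
    }

    /** Compression types named as acceptable in the report: 0, 1 or 4. */
    static boolean isKnownCompression(int infoByte) {
        return infoByte == 0 || infoByte == 1 || infoByte == 4;
    }

    public static void main(String[] args) {
        // Synthetic header for demonstration only; no real ZIM file is read.
        byte[] raw = new byte[HEADER_SIZE];
        ByteBuffer buf = ByteBuffer.wrap(raw).order(ByteOrder.LITTLE_ENDIAN);
        buf.putInt(0, (int) ZIM_MAGIC);
        buf.putLong(72, 2566277487L);            // an example offset beyond 2^31 - 1

        ZimHeaderSketch h = new ZimHeaderSketch(raw);
        System.out.println("checksumPos read as long: " + h.checksumPos);
        System.out.println("same value cast to int:   " + (int) h.checksumPos);
        System.out.println("189 is a known type:      " + isKnownCompression(189));
    }
}
```

If the reader stores cluster offsets in an int in the same way, a seek to a cluster past the 2 GB mark would land on arbitrary bytes, which could explain an unexpected first byte such as 189. That is a guess based on the dump rather than a confirmed diagnosis.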

Reported first here https://sourceforge.net/p/kiwix/discussion/604122/thread/617f227b
