Code Monkey home page Code Monkey logo

doc-scan's Introduction

Doc-Scan

Purpose:-

The use of this tool is to obtain any differences between files so as to make sure there are no errors while copying the files or for checking scripts to see if they are the exact same (Used to prevent direct copying of answers and documents).

Libraries needed for the program are:-

  1. pip install PyPDF2

Additional features to be added:-

  • Add a GUI.
  • Allow for different file formats using.
  • Find more efficient and accurate way to detect differences between pdf files rather than finding mistakes.
  • Add a function to tell how similiar the documents are, using probability and other tools which can be used for this purpose.

Maintained by: Mohamed Ayaan

doc-scan's People

Contributors

mohamed-ayaan358 avatar samarth777 avatar sarthakskumar avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

doc-scan's Issues

Allow for different formats of files to be used.

What do we want you to do?

Pdf files and word files at the moment can be used in the script but more files like .pptx, .jpg(Compare the binary encoding of the jpg so as to see if they differ), and .mp4

Possible solutions

Look into other libraries that can also compare for example these libraries were taken from elsewhere and after sufficient tweaking, they were made to fit in. You can perform audio analysis on mp4 files to compare the two of them.

Logs:

1. Open Access time:

[1 hour] if each block time has passed

2. Block time:

[1 hour], to show significant progress.

3. Brownie points: 50 Points

Find a more efficient and accurate way to find errors in a pdf

What do we want you to do?

At the moment in the pdf, the method to find errors between the pdfs we have a very simplistic piece of code which doesn't account for edge cases.

Possible solutions

Potentially applying conditions relating with spaces and what not.

Logs:

1. Open Access time:

[30 minutes] if each block time has passed

2. Block time:

[30 minutes], to show significant progress.

3. Brownie points: 30 Points

Add a GUI

What do we want you to do?

Currently, we have to enter the file name, into the command line for us to compare the two files. Perhaps enter an upload tool where we can upload the files.

Possible solutions

Anything is appreciated, as any improvements over the command line is quite nice :)

Logs:

1. Open Access time:

[30 minutes] if each block time has passed

2. Block time:

[1 hour], to show significant progress.

3. Bounty points: 50 Points

4. Brownie points: 5-30 points based on the extent to which you have gone

Remember that adding more components will only help.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.