Code Monkey home page Code Monkey logo

Comments (7)

eikek avatar eikek commented on August 22, 2024 1

Now that 0.7.0 is out, I'll be working on adding fulltext search. I like the idea to make it optional.

from docspell.

eikek avatar eikek commented on August 22, 2024

Thanks for sharing your thoughts! I know the content search feature cannot be replaced by auto-tagging. Even if it is going further (which is my plan in the long run). For the SSN example I could imagine to be included in auto-tagging some day. But, of course, this will only make people want to search for something else :-).

Maybe it can be implemented as you said on a first step. The results are currently already limited to some fixed number (I think 100 or 200) – which is also something to improve. We could simply go through all and do a substring search in the extracted texts. For a better impl, I guess something like elasticsearch or solr is required to make it work in a distributed environment. This would complicate the setup with another component; while it may be possible to provide a simpler solution for deployments with one rest server.

from docspell.

BradleyShaner avatar BradleyShaner commented on August 22, 2024

from docspell.

eikek avatar eikek commented on August 22, 2024

Thanks for the hint. I tried lodestone the other day, and I found it is really nice! It uses elasticsearch.

It's true that docker-compose can hide a lot of complexity, but only for those that use it. Others need to deal with it, too. An additional rather resource hungry component will make it harder to run on platforms like the pi. Otoh I think that something like elasticsearch or solr is the way to go when making a serious attempt on implementing this. Maybe one can provdie an embedded version for small setups….

from docspell.

rklueber avatar rklueber commented on August 22, 2024

+1 for the full text search. With this the syste would be fullfilling my dreams to get rid of paper. Have you had a look into this "walk-through"?

https://blog.logrocket.com/full-text-search-with-node-js-and-elasticsearch-on-docker/

from docspell.

eikek avatar eikek commented on August 22, 2024

@rklueber thank you for the link. At least I can say that fulltext search is now definitely on my list for the "next big thing" to work at :-). But it could take a while until there is something to show. The problem is not so much to start elasticsearch using docker, but more to efficiently integrate it into the application (how to query, what to store at the index, how to have both dbs in sync etc). I'm currently more in favor to use solr. It would also be possible to use postgres, but then docspell would require postgres and looses support for mariadb and h2. I also want it to work using the "download, unpack, run" model, i.e. without docker and also without postgres….

from docspell.

Skyr avatar Skyr commented on August 22, 2024

I'd second the wish for full-text search.
What about making an external full-text search engine like solr optional? If not configured (for those who don't want to go through the hassle of installing it), full-text search is not available.

from docspell.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.