Comments (2)
Hi,
As you see I put a negative value to content size, because I just want to index file names and nothing else
That is not really what the --content-size
argument is meant to do. It's meant to limit the number of characters of 'content' (in text files/documents). Even if set to a negative number, it will still attempt to get the metadata of media files, generate thumbnails etc
I just want to index file names and nothing else
If you wish, I can attempt to create a --name-only
flag that only saves the file names.
Thanks!
from sist2.
Hi
Thanks for the reply. I think that name-only
flag would be excellent since not everyone might need to index contents. Or even some kind of settings for which files should be content indexed would be nice.
thanks
from sist2.
Related Issues (20)
- [SIST2-Admin] Task History and Log for frozen jobs.
- The text preview when clicking on the additional information icon does not load HOT 1
- Toggle or configure minimum characters limit for file HOT 1
- Support sqlite spellfix for 'fuzzy' search HOT 1
- Support RAG with HOT 3
- sist2-admin should handle auth argument errors HOT 2
- allow multiple selection filter path
- Docker cant find the Doc path? HOT 3
- CUDA support
- Support PostgreSQL (pg_bm25+pgvector) as an alternative to elastic search
- PDF viewer resets to top of the document always :( HOT 2
- Can not search in whisper transcript when using sqlite index HOT 2
- Indexing error: Invalid PATH argument. File not found HOT 5
- custom web interface HOT 7
- EML format support HOT 1
- Models are not downloaded in the browser HOT 4
- [FEAT SUGGEST] Configure an 'absoloute path' for each indicies, for convenience HOT 5
- Scan task once halted never continues
- Add speech to text functionality HOT 1
- "sist2 web module encountered an error while connecting to Elasticsearch. See server logs for more information." after relaunch/serve HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sist2.