Comments (6)
@stchris Says we're done here. Thanks for your help @friendly-wolfbat much appreciated
from aleph.
Hi @friendly-wolfbat, thanks for opening this issue. You are correct, the documentation isnβt up to date in this regard. We do plan to address this as part of a larger overhaul of the technical documentation. But this may take some time, so we should probably address the issue you brought up earlier than that.
from aleph.
Hi @tillprochaska, thanks for responding. If you can clarify on points 2 and 3 briefly (or link me to some code where I can find the answer), I can put together a pull request for this particular issue to correct the documentation.
from aleph.
Hi @friendly-wolfbat , thanks for noticing and for offering to fix this.
- You're totally right, we missed a few spots where
convert-document
is still mentioned. - We have added a section on scaling workers in general here, so it'd be best to link to that.
- That's right, that section should stay, because it provides a useful workaround, but the references to the "convert-document service" should probably should just mention "document convert operations" (in
ingest-file
).
We're very happy to review your contribution as a PR, but otherwise I can take care of these changes, just let me know. Thanks either way!
from aleph.
I tried my hand at the PR. I think one piece that I'm missing is, if a user wishes to disable threading, how many workers should they have relative to the number of cores, and how many ingest-file containers should they have relative to the number of cores?
On a separate note, I see that the installation instructions also say, "[f]or the purpose of scaling workers and getting more predictable performance [...]"--does this mean that disabling threading and having more containers is "better" in terms of performance, or just more predictable?
from aleph.
Thanks @friendly-wolfbat , I'll reply in the PR.
from aleph.
Related Issues (20)
- FEATURE: Allow/improve partial search HOT 3
- FEATURE: Grafana dashboard for Prometheus metrics
- FEATURE: Add Document Tagging and Display Lists in Search Results
- Adding a new entity from a custom processor HOT 1
- Remove deleted data from ES Cluster
- High resource usage problem. Memory leak HOT 1
- BUG: Names not extracted as mentions HOT 4
- BUG: link to localhost on sign up HOT 1
- BUG: Information in JSON for an ingesting dataset does not seem to be entirely correct HOT 1
- BUG: Problem when running aleph on raspberry pi 5 8gb HOT 1
- BUG: Error during OAuth callback / missing constraints for `role_membership` table HOT 3
- BUG: Metadata API Request Error Handling HOT 1
- FEATURE: Split xref into separate sub-tasks HOT 3
- BUG: Can't preview image and pdf also can't converte to PDF HOT 4
- BUG: Freeze upload at 67% on Mac M1 HOT 1
- BUG: UI constructs excessively long URLs
- FEATURE: opensearch support HOT 1
- FEATURE: add an API endpoint to touch a dataset, thereby updating its content updated date to the current time HOT 1
- BUG: Datasets disappear from the status page before they are completed`
- BUG: If maintenance mode is enabled, UI sends requests in infinite loop
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from aleph.