Comments (3)
@wmburke @giuseppetotaro, by default we plan to provide crawling in batches of 10 iterations at a time (with 12 groups in one iteration) before the script would detect a CTRL+C.
This means that if a user presses CTRL+C while iteration 1 is going on, the script would complete all remaining and safely exit, not killing any crawls mid way. But this may mean the user would have to wait for longer. To reduce wait times we could decrease the number for 10 to something smaller.
Note : reducing to something smaller incurs an overhead of starting sparkler everytime (reserving JVM resources and spark, etc)
from sce.
from sce.
Closed by commit c8fc460 so that when CRTL-C is hit the user can force to stop the crawl job instead of waiting the job ends gracefully.
Furthermore, the user can now use the kill
sub-command (e.g., ./sce.sh kill
) to force the stop of a running crawl job (commit 61b0141).
from sce.
Related Issues (20)
- Develop release plan that includes solid testing
- pagination on search results
- Add documentation into the wiki
- Fix link to crawl dashboard for local install HOT 2
- fix line wrap when triple digits are reached under Generate a Model
- Banana dashboard needs tweaking
- Retreived 12 webpages in UI are already colored before voting for relevancy HOT 3
- allow search operators in search box
- Need visual indication that a choice has been made for a given URL.
- Documentation on how to construct a keyword list needs to be added to wiki or somewhere accessible HOT 1
- Anyone seen this error before? HOT 3
- The crawl dashboard is not coming up HOT 3
- And now ./dumper.sh is not working either
- Update wiki to include instructions on GUI capabilities currently not mentioned
- create travis build for sce
- separate the build components out of the docker file
- tidy up url paths
- remove flask and switch to gunicorn or similar
- create k8s compatible deployment
- Solr keeps shutting down
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sce.