storj-system-health.sh's People

Contributors

bjoerrrn, f-systemes

storj-system-health.sh's Issues

New version notification

Add a delay of 10 days before announcing a new Storj version.

After that, display the message only once a day.
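The two rules above could be combined into a single check. The following is a minimal sketch, not the script's actual code: the function name, the variable names, and the interface based on Unix epoch seconds are all assumptions made for illustration, and "once a day" is read as "at most once per 24 hours".

```bash
# Sketch only: decide whether a new Storj version should be announced.
# All names here are hypothetical; times are Unix epoch seconds.

RELEASE_DELAY_SECONDS=$((10 * 24 * 60 * 60))   # 10-day delay from the issue
REMINDER_INTERVAL_SECONDS=$((24 * 60 * 60))    # at most one message per day

# Args: release time, time of the last notification (0 = never), current time.
# Returns 0 when a message should be shown, 1 otherwise.
should_notify_new_version() {
  local released_at="$1" last_notified_at="$2" now="$3"
  # Stay silent until the release is at least 10 days old.
  [ $((now - released_at)) -ge "$RELEASE_DELAY_SECONDS" ] || return 1
  # After that, repeat the message no more than once every 24 hours.
  [ $((now - last_notified_at)) -ge "$REMINDER_INTERVAL_SECONDS" ]
}
```

A caller would store the time of the last notification somewhere persistent (for example in a small state file) and pass it in on the next run.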

Pending audits: run script again automatically

If audits are pending, run the script again automatically to verify that audits are working properly.

Send a warning message only if this verification run still reports pending audits, rather than immediately on the first run.
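One possible shape for this "verify before warning" flow is sketched below. The function name, the retry delay parameter, and the convention that the check command exits 0 while audits are pending are assumptions for illustration; the issue does not specify how pending audits are detected.

```bash
# Sketch only: warn about pending audits only if a second check confirms them.
# check_cmd is a hypothetical command or function that exits 0 while audits
# are pending and non-zero otherwise.

warn_if_audits_still_pending() {
  local check_cmd="$1" retry_delay_seconds="$2"
  # First run: nothing pending, so nothing to report.
  "$check_cmd" || return 0
  # Give the node some time, then run the verification check.
  sleep "$retry_delay_seconds"
  if "$check_cmd"; then
    echo "WARNING: audits still pending after verification run"
  fi
  return 0
}
```

The retry delay is left to the caller because the issue does not say how long to wait between the two runs.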

Alert on a) low success rates and b) storage node status

  • Low audit success rate (<95%)
  • Low repair success rate (<95%). Risk of getting disqualified.
  • Low customer download success rate (<90%). No disqualification risk.
  • Low upload success rate (<90%). No disqualification risk.
  • No upload or download activity for quite some time.
  • Storagenode not running
  • Storagenode unable to check in, including pingback errors.
  • ...

This guide may help: https://forum.storj.io/t/guide-to-debug-my-storage-node-uplink-s3-gateway-satellite/1372
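The success-rate thresholds in the list above could be checked with one small helper. This is a sketch under assumptions: the function name and the alert wording are hypothetical, and the rates are expected as plain percentages that the script has already computed. awk is used because the shell's own arithmetic cannot compare decimal values.

```bash
# Sketch only: print an alert when a success rate (in percent) is below its
# minimum. The function name and message format are hypothetical.

check_success_rate() {
  local label="$1" rate="$2" minimum="$3"
  awk -v label="$label" -v rate="$rate" -v min="$minimum" 'BEGIN {
    # Adding 0 forces a numeric comparison instead of a string comparison.
    if (rate + 0 < min + 0)
      printf "ALERT: %s success rate %.2f%% is below %d%%\n", label, rate, min
  }'
}
```

With the limits from the list, audit and repair rates would be checked against 95 and download and upload rates against 90, for example `check_success_rate "audit" "$audit_rate" 95`, where `$audit_rate` is a placeholder for a value computed elsewhere.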

URGENT: Prevent mail alerts from spamming the mail account

There needs to be logic that stops the same error log extract from being sent again and again, or at least sends it far less often.

Usually, older error messages should disappear after 24 hours because the log selection is limited. In any case, one e-mail per newly found error should be enough.
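A common way to get "one e-mail per new error" is to remember a fingerprint of each error that has already been reported. The sketch below assumes a hypothetical state file and function name, and uses the POSIX `cksum` utility as a simple fingerprint; none of this is taken from the script itself.

```bash
# Sketch only: succeed (exit 0) the first time an error text is seen and
# record it; fail (exit 1) for every repeat. The state file and the
# function name are hypothetical.

is_new_error() {
  local state_file="$1" error_text="$2" fingerprint
  # cksum prints "<checksum> <byte count>"; the checksum is enough here.
  fingerprint=$(printf '%s' "$error_text" | cksum | cut -d ' ' -f 1)
  touch "$state_file"
  if grep -qxF -- "$fingerprint" "$state_file"; then
    return 1    # already reported, do not send another e-mail
  fi
  echo "$fingerprint" >> "$state_file"
}
```

Because log lines start with a timestamp, the caller would need to strip it before fingerprinting, otherwise every occurrence of the same error looks new. Clearing the state file once a day would match the 24-hour log window mentioned above.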

Add an analysis and warning mechanism for excessive time lags between audit requests

Is your feature request related to a problem? Please describe.
A time lag between audit requests is not logged in the storage node log file.

Such lags can lead fairly quickly to disqualification without any warning to the storage node operator (SNO).

Describe the solution you'd like
Analyse the log extract selected by minlog and check for long time lags between audit requests.

Example select statement:

cat /mnt/WD1003/logs/sn1.log | grep 1wFTAgs9DP5RSnCqKV1eLf6N9wtk4EAtmN5DpSxcs8EjT69tGE | grep -E "GET_AUDIT" | jq -R '. | split("\t") | (.[4] | fromjson) as $body | {SatelliteID: $body."Satellite ID", ($body."Piece ID"): {(.[0]): .[3]}}' | jq -s 'reduce .[] as $item ({}; . * $item)'

Example result from the command:

{
  "SatelliteID": "1wFTAgs9DP5RSnCqKV1eLf6N9wtk4EAtmN5DpSxcs8EjT69tGE",
  "NG6KAUMU7TP22DNGROKBU2MRRNV675QYEOJC3X2BXH4OCML6BPNQ": {
    "2022-06-28T21:24:39.646Z": "download started",
    "2022-06-28T21:24:40.002Z": "downloaded"
  },
  "IADTQX62PCZQEJRRYPCKNWX3QSPG7A3U53IBWPQRSX6ZMH6I45UQ": {
    "2022-06-28T21:32:40.597Z": "download started",
    "2022-06-28T21:32:40.893Z": "downloaded",
    "2022-07-09T20:00:10.698Z": "download started",
    "2022-07-09T20:00:10.995Z": "downloaded"
  },
  "MZEPH4JSGSAJZ72QQV4YOYYVGLER7KOQPBUB2VEANL4MPNSZDBTA": {
    "2022-06-28T21:58:56.184Z": "download started",
    "2022-06-28T21:58:56.454Z": "downloaded"
  },
  "GFATHGO2WFBZNAOQJKXYNHTFKH2T5T4OXK3BEL7U62FNK5ZRR6OQ": {
    "2022-06-28T22:08:49.765Z": "download started",
    "2022-06-28T22:08:50.089Z": "downloaded"
  },
...
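The lag check itself could work directly on the log lines instead of the merged JSON. The following sketch assumes the tab-separated line layout implied by the select statement above (timestamp in the first field, JSON body in the last). The function name and the maximum-gap parameter are hypothetical, and timestamps are compared at whole-second resolution. The date arithmetic is done in plain awk so that no particular `date` implementation is required.

```bash
# Sketch only: list gaps between consecutive audit requests of one satellite
# that exceed a limit. Reads storage node log lines from standard input.
# Function name and max_gap_seconds are hypothetical.

audit_request_gaps() {
  local satellite_id="$1" max_gap_seconds="$2"
  grep -F -- "$satellite_id" | grep -F 'GET_AUDIT' | grep -F 'download started' |
    awk -F '\t' -v max="$max_gap_seconds" '
      # Convert "YYYY-MM-DDThh:mm:ss.fffZ" to seconds since 1970-01-01 UTC
      # (fractional seconds are ignored).
      function to_epoch(ts,    y, m, d, era, yoe, doy, doe, days) {
        y = substr(ts, 1, 4) + 0; m = substr(ts, 6, 2) + 0; d = substr(ts, 9, 2) + 0
        if (m <= 2) y--
        era = int(y / 400)
        yoe = y - era * 400
        doy = int((153 * (m > 2 ? m - 3 : m + 9) + 2) / 5) + d - 1
        doe = yoe * 365 + int(yoe / 4) - int(yoe / 100) + doy
        days = era * 146097 + doe - 719468
        return days * 86400 + substr(ts, 12, 2) * 3600 + substr(ts, 15, 2) * 60 + substr(ts, 18, 2)
      }
      {
        now = to_epoch($1)
        # Report the gap to the previous audit request when it is too long.
        if (NR > 1 && now - prev > max)
          printf "%s -> %s: %d s\n", prev_ts, $1, now - prev
        prev = now; prev_ts = $1
      }'
}
```

It could be called as `audit_request_gaps "$satellite_id" 3600 < /mnt/WD1003/logs/sn1.log`, where the 3600-second limit is only a placeholder; a sensible value would have to be derived from how often audits normally arrive on the node.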

Additional context
