Code Monkey home page Code Monkey logo

Comments (2)

consideRatio avatar consideRatio commented on July 26, 2024 1

For QCL, I think they merit from having all culling logic disabled to avoid issues - but warn them that they need to shut down their own servers if they aren't using them.

If they have very expensive machines running long duration, and they incorrectly fail along the way due to culling, that is the far bigger cost I expect.

Related

Action points

  • Investigate basehub's jupyterhub-idle-culling configuration in jupyterhub
    jupyterhub-idle-culler is not configured in basehub, but it is by default in z2jh to cull servers with no activity reported in the last hour
  • Investigate basehub's kernel culling configuration in user servers
    Kernels are not culled if they are busy by the kernel culling, but idle kernels are after one hour of idling
  • Investigate if kernel culling is something you opt-in or opt-out of, so that we understand the consequences of removing config via basehub
    It seems that cull_idle_timeout is defaulting to 0, with culling of idle kernels disabled.
    We are setting it to 3600, which means that a server with a long running job will loose its state after 3600 seconds.
  • Read up on what Min helped me understand once in jupyterhub/jupyterhub-idle-culler#55
  • Make a decision on how to best help QCL avoid possible disruption of long running jobs
    • Idea 1: disable kernel culling to avoid loosing state after a long computation completes
    • Idea 2: disable jupyterhub-idle-culler to avoid loosing server after a period of inactivity
  • Consider if and how we want to update our docs and default config for basehub
    Yes. But I'll open a separate issue about it.

from docs.

consideRatio avatar consideRatio commented on July 26, 2024 1

Advice provided, I'll probably reconfigure something for QCL as a followup so I re-assigned myself to the support ticket.

from docs.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.