Code Monkey home page Code Monkey logo

Comments (4)

TheRealHaoLiu avatar TheRealHaoLiu commented on July 23, 2024

Random crashes with message The running ansible process received a shutdown signal.

where are u seeing this? please provide some context

currently we do not have enough information to understand what is happening here.

from awx.

Peter1295 avatar Peter1295 commented on July 23, 2024

AWX Template fails with that message. Time is random, mostly between 7-12mins of job running, I can see it happens with a jobs what are doing changes on multiple hosts (patching, VM customization etc.).

Unfortunately awx-task logs do not show anything helpful, just a message job/workflow failed.
Workflow job 18542 failed due to reason: No error handling path for workflow job node(s) [(26156,failed)]. Workflow job node(s) missing unified job template and error handling path [].

Cluster is running on k8s with setup of 2 Control planes and 4 worker nodes, where maximum CPU and Memory usage based on command kubectl top node is around 20%/80% (CPU/MEM) and all nodes have at least 40% free disk space.

AWX database is running on external Postgres server.

from awx.

Peter1295 avatar Peter1295 commented on July 23, 2024

Attaching logs from automation pod what failed in the middle. Absolutely no info what is happening, not from awx-task, awx-web nor automation pod. Any suggestion what to look for?
task3.log

AWX is really needed for us, we are using it for managing, deploying, patching etc. on daily basis, it is running at least 50 templates a day and I cannot be permanently connected on it to check if it's still working.
We have another instance on production environment, where we still run 23.3.1 what is running properly, but unfortunately downgrade is not working anymore, it cannot use upgraded database.

from awx.

Peter1295 avatar Peter1295 commented on July 23, 2024

Another update, issue is not version related, I was able to downgrade AWX with to version 23.8.1 (what should not have such problems).
Issue is not even with database, where I used both actual and older postgres from before migration.
Sometimes it fails in 5min, sometimes job run for almost 1h.

from awx.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.