Comments (9)
The master_minion is the salt master node, but the name of the salt master might not be the minion name. For me,
salt 'admin.ceph' grains.get master
returns salt. If the master_minion does not match the actual minion name of the master, then salt will complain about no target matching. That's why I added master_minion.sls since there's several reasons that the names might not match.
from deepsea.
@swiftgist right, but that was not my point.
The problem that I faced was: I was running an ssh session to the salt master node, and ran stage.0, which at some point tries to restart the salt master node, and kills my ssh session along with the salt-runner execution.
from deepsea.
oh... sorry, working as intended. Your master updated its kernel and needs to reboot. As far as restarting the stage, that requires the /etc/salt/master.d/reactor.conf to do it automatically. Manually, you have to kick off the stage 0 again. I went with the reactor as the solution to this issue.
from deepsea.
But the reactor.conf present in the "master" branch has all lines commented, can you update the version you are using?
Moreover, even if the reactor starts again stage.0, it does not prevent the user from seeing his ssh connection to close without any notice.
This is the kind of issues that I think that can be better handled with the "frontend" application, by alerting the user that the system requires reboot, and ask for the user to run the deployment again.
from deepsea.
I'm also kind of not sure whether we should always upgrade to the latest kernel, I assume this is something that we should discuss.. what about cases when there is a ptf/ fix released for a particular kernel and a newer kernel isn't what we want in the final product?
from deepsea.
@rfd I left it commented out so that things wouldn't be too "magical". Also, I had been testing and modifying it separately so I didn't want to inflict untested versions. As far as the default, I think we should discuss it.
The issue with a separate frontend application is "where would it run?" Some POCs are combining as many roles as possible. Even with a separate admin host, how do you plan to update its kernel when necessary and not reboot?
@theanalyst I think it's a decent default for us though and should be delegated to SUMA since they have the kernel selection functionality already. In other words, we need to make the update step optional, but I will need to know the workflow for the alternate path. I expect Ceph admins using SUMA will still want some level of automation and not have to visit every node.
from deepsea.
@swiftgist, we cannot avoid the reboot. What we can do is alert the user that the current node, where he is running the deployment, needs to reboot. The frontend application should always run in the salt master node.
from deepsea.
I have been leaving this issue open to add some sort of nice messaging prior to the reboot. We need nice messages elsewhere as well. Undecided on the best approach for Salt.
from deepsea.
With 0.5.2, minions will now issue a wall command prior to the reboot.
from deepsea.
Related Issues (20)
- make install fails on archlinux HOT 1
- cephdisks.unused falsely returns mounted disks
- Typo - should be c_v_commands instead of d_v_commands
- Example code for return structures of modules HOT 6
- rebuild runner needs to read error messages from osd.py(runner) HOT 2
- osd.remove fails to zap devices on ceph version 14.2.3-349 HOT 10
- functests.3nodes fails with IndexError HOT 4
- How to deploy /dev/sdb ,/dev/sdc only use /dev/nvme0n1 as db device ? HOT 5
- Can not remove cluster node HOT 2
- [SES5] remove.osd functest fails HOT 9
- No role for rbd-client (e.g. mapping images) HOT 1
- Stage.3 Hang in Disks.Deploy HOT 6
- IGW lrbd support
- Device discovery not working in KVM environment HOT 3
- SES5: "time_init: ntp" does not work as expected HOT 1
- SES5: osd.redeploy fails for filestore -> bluestore
- deepsea hangs in stage 3 @ ceph.mgr.auth on
- deepsea monitor not working with gitfs
- stage.0 Exception: 'getpwuid(): uid not found:
- Deepsea development status after cephadm release HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deepsea.