I have gotten several more reports of login problems (presumably all related to https://github.com/SSSD/sssd/issues/5672). It would be nice if the infrastructure status page reflected the fact that we have an ongoing (if intermittent) disruptive problem, and provided links to any known information — possible workarounds, fixes being tested, etc.
Soon, unless the problem can be fixed sooner :)
Yeah, a good idea. I was thinking we might try to write some kind of Nagios check that detects when the problem happens, so we can restart it too...
Will look at this Monday.
FWIW, the last time it hit the failed state was Thursday at around 20:00 UTC. It's been OK since then, so if folks are having problems after that, it might be something else.
We should perhaps make a 'debug login problems' page.
Metadata Update from @humaton: - Issue tagged with: low-trouble, medium-gain, ops
Metadata Update from @kevin: - Issue priority set to: Waiting on Assignee (was: Needs Review)
Added a note, hopefully it's somewhat clear.
Improvements welcome!
Metadata Update from @kevin: - Issue close_status updated to: Fixed - Issue status updated to: Closed (was: Open)
Looks great -- thanks!