Visit https://fedoraplanet.org and observe it is dead, displaying a message:
Application is not available :( The application is currently not serving requests at this endpoint. It may not have been started or is still starting. Possible reasons you are seeing this page: * Scheduled maintenance. Check if there is a maintenance planned or known outage on the Fedora Infrastructure Status page. * Service outage. If you see this message for an extended time, it might indicate a more serious problem with this application. Contact the oncall person for more information on the Fedora Infrastructure Matrix channel by using the !oncall command. Alternatively, open a ticket on the Fedora Infrastructure issue tracker. * Configuration issue. If you are a developer working on this application, make sure that the resources exposed by this route (pods, services, deployments, etc) exist and have at least one pod running.
Meanwhile https://status.fedoraproject.org/ is green, saying "all systems operational", indicating a secondary bug, in that the status app is not monitoring fedoraplanet.org sufficiently well.
The status service is a manually set item and not meant to be a monitoring service. This is a design choice of balancing the amount of work to set up a monitoring service sufficiently independent from Fedora Infrastructure to not also be broken when Fedora is broke.. and the amount of time trying to keep Fedora going.
The monitoring currently would be done either in nagios.fedoraproject.org or zabbix.fedoraproject.org(? not sure about that url) but it also needs a third component.. people to have the time and energy to deal with the constant noise of broken things.
/facepalm looking at the status.fp.o site again, it even says right there that the status is manually update.
It happens to a LOT of people who are used to a site using a third party which does that for whatever project. Trying to find one which was open-source was a problem in the far past. Not sure about now.
Metadata Update from @zlopez: - Issue priority set to: Waiting on Assignee (was: Needs Review) - Issue tagged with: Needs investigation, high-gain, ops
I don't see any error on https://nagios.fedoraproject.org/nagios/, so let me check what is wrong with that.
The problem is that one of the config volumes wasn't set up properly. @phsmoura already working on it.
Metadata Update from @zlopez: - Issue untagged with: Needs investigation - Issue assigned to phsmoura - Issue tagged with: medium-trouble
Duplicate by me btw::https://pagure.io/fedora-infrastructure/issue/12172 (so issue still happens)
Planet is up and running again
Metadata Update from @phsmoura: - Issue close_status updated to: Fixed - Issue status updated to: Closed (was: Open)
Log in to comment on this ticket.