[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[AWIPS #RDH-492687]: EDEX Outage



Hi there,

> Thanks for your reply. I had subscribed to the Allison House EDEX server for 
> the past few years, but have unfortunately
> discontinued this service a week ago. This leaves the Unidata server as the 
> only feed for this critical data.
>
> Would there be any avenues that the Unidata AWIPS team could muster to 
> provided a backup EDEX server or marshal other
> local entities to provide this only as a back up ? Sadly, we do not have to 
> IT expertise to spin up a local EDEX server.
>
> Thanks for any ideas you may have

Sorry for the delayed response.  
That is unfortunate to hear about AllisonHouse.

We do have a backup server, but it is internal only and we use it for 
development as well.  That is how we were able to 
get services back up and running so quickly on Monday -- our main production 
server is actually still down from another
outage on Wednesday. The main problem with this particular outage was the cloud 
provided did hardware migrations on a 
Friday, and we weren't aware of the outage until Monday. Why they decided to do 
this kind of potentially disruptive
work on a Friday is beyond me.  We did not have any staff available over the 
weekend to correct the problem, and we 
technically don't actually work on weekends.  If we are available and able, we 
try to address issues as quickly as possible.

We need our backup server for development and testing, so we can troubleshoot 
and fix problems without disrupting our live
server.  Our EDEX servers consist of 3 machines and are using over a combined 
100 CPUs, and 100s of GB of RAM and TBs of 
storage.  We don't have the resources to be running multiple production, public 
EDEX servers for our users.

We do our best to provide near 100% uptime, but unfortunately since our cloud 
provider transitioned to version 2 of their 
cloud platform almost two years ago, we have had continuous issues with 
reliability.  We are doing our best to provide a
reliable service, but we only have so many resources (human power, money, 
hardware, etc) available to us.

Again, your only other option that I can think of, is to run your own 
production server.  This is something some of our 
users are doing successfully, and we have offered support and troubleshooting 
to help them establish their systems.

--Shay Carter

She/Her/Hers
AWIPS Software Engineer
NSF Unidata

If you're interested, please feel free to fill out a survey about the support 
you receive: 
https://docs.google.com/forms/d/e/1FAIpQLSeDIkdk8qUMgq8ZdM4jhP-ubJPUOr-mJMQgxInwoAWoV5QcOw/viewform

Ticket Details
===================
Ticket ID: RDH-492687
Department: Support AWIPS
Priority: Low
Status: Open
===================
NOTE: All email exchanges with NSF Unidata User Support are recorded in the 
Unidata inquiry tracking system and then made publicly available through the 
web.  If you do not want to have your interactions made available in this way, 
you must let us know in each email you send to us.