Short downtime

Message boards : News : Short downtime

Kevin
Project administrator
Project developer
Project tester
Project scientist
Joined: 4 Feb 12
Posts: 147
Credit: 2,427,829
RAC: 0
Message 427 - Posted: 12 Nov 2012, 12:41:49 UTC
Last modified: 12 Nov 2012, 19:50:36 UTC

Hi Guys,
There will be a short downtime today due to a hardware fault. All currently running tasks will be completed but new tasks will not be issued. I hope to have everything running again in the next few hours.
Thanks for your patience,
Kevin

Edit:
Unfortunately it's going to take a little longer than expected. We will be back up and running in 24 hours.
Kevin

jovada
Joined: 31 Jul 12
Posts: 42
Credit: 907,705
RAC: 0
Message 428 - Posted: 13 Nov 2012, 18:51:55 UTC - in response to Message 427.
Last modified: 13 Nov 2012, 18:52:12 UTC

Edit:
Unfortunately it's going to take a little longer than expected. We will be back up and running in 24 hours.
Kevin


http://en.wikipedia.org/wiki/Hofstadter's_law
:)

Jerry
Joined: 30 Sep 12
Posts: 2
Credit: 629,963
RAC: 0
Message 429 - Posted: 14 Nov 2012, 16:36:51 UTC - in response to Message 428.

What is your best guess as to when your system will be back to normal?

Jerry

Kevin
Project administrator
Project developer
Project tester
Project scientist
Joined: 4 Feb 12
Posts: 147
Credit: 2,427,829
RAC: 0
Message 430 - Posted: 14 Nov 2012, 20:34:32 UTC

We're back online now. We're now using an extra server for the results database which should improve the performance of the main server.
Thanks,
Kevin

Saenger
Joined: 23 Jul 12
Posts: 24
Credit: 141,695
RAC: 99
Message 515 - Posted: 30 Jan 2013, 17:45:47 UTC

There was quite a severe downtime today, which has just ended. The whole server, including this forum, was unavailable.
What happened? Hardware or software problems?
____________
Greetings from Sänger

Ant Chubb
Project administrator
Project developer
Project tester
Project scientist
Joined: 18 Jul 12
Posts: 118
Credit: 1,019,875
RAC: 0
Message 518 - Posted: 31 Jan 2013, 9:52:28 UTC - in response to Message 515.

To be honest, I don't know. Kevin may be able to figure it out. The server shut itself off! No one was around, and it was working fine in the morning, so there is no obvious reason why. Restarting seemed to work fine. It does highlight the need for an alarm system of sorts. We'll figure it out and add an alert so that this kind of issue does not affect users again.
In future we'll also move the forum to a separate server, but we're using donated hand-me-down hardware (read: we have no funds for this project), so options are limited.
Thanks for your patience.
ciao,
Ant

(retired account)
Joined: 14 Dec 12
Posts: 7
Credit: 174,229
RAC: 0
Message 519 - Posted: 31 Jan 2013, 20:44:36 UTC - in response to Message 518.

Hello Ant,

Thanks for your and Kevin's support during the Charity event. IMHO this is outstanding. Taking the servers to the limit has always been a feature (or should I say objective *g*) of such events.

Suggestion: If you need additional hardware, why not add an item to the "Make A Donation" section on the front page asking for specific hardware donations? The crunching community includes many IT pros and nerds (I am at most the latter) who have access to decommissioned or new hardware at a reasonable cost or for free, and who might be willing to donate it.

Cheers

Ant Chubb
Project administrator
Project developer
Project tester
Project scientist
Joined: 18 Jul 12
Posts: 118
Credit: 1,019,875
RAC: 0
Message 522 - Posted: 1 Feb 2013, 10:06:51 UTC - in response to Message 519.

Hi Beorn,

Thanks for that. Great idea. We're always open to donated blade servers less than 5 years old!

Thanks,
Ant


Copyright © 2017 Dr Anthony Chubb