

Datacenter Crash (Server Unreachable for About 17 minutes)


 
# 1  

The datacenter is trying to determine why the server went down.

None of the log files show errors, including syslog and dmesg, and there are no core files.

Dashboard and stats show no unusual activity and light CPU load prior to crash.

I suspect a power issue in the datacenter. The datacenter team is looking into it.

Thank you for your patience and sorry for the short outage.


Code:
reboot   system boot  4.15.0-33-generi Mon Feb 24 07:00   still running
root     pts/0        159.192.217.25   Mon Feb 24 06:37 - crash  (00:22)

I'll post back if they come up with a reason on the datacenter end.

My searches of the file system and logs show no errors or internal server issues.
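
For anyone who wants to pick the "crash" marker out of login records like the ones above automatically, here is a minimal sketch. It assumes the text comes from the `last` command and that the duration sits in a trailing "(HH:MM)" or "(D+HH:MM)" field; the function name `crashed_sessions` is made up for illustration.

```python
import re

# Trailing "(HH:MM)" or "(D+HH:MM)" duration field, as seen in the output above.
DURATION = re.compile(r"\((?:(\d+)\+)?(\d{1,2}):(\d{2})\)\s*$")


def crashed_sessions(last_output):
    """Return (user, tty, minutes) for every session marked as ended by a crash.

    Hypothetical helper: it only looks for the literal word "crash" as a
    whitespace-separated field, which is an assumption about the format.
    """
    found = []
    for line in last_output.splitlines():
        fields = line.split()
        if "crash" not in fields:
            continue
        match = DURATION.search(line)
        minutes = None
        if match:
            days, hours, mins = match.groups()
            minutes = (int(days or 0) * 24 + int(hours)) * 60 + int(mins)
        found.append((fields[0], fields[1], minutes))
    return found


# The two lines posted above, used as sample input.
sample = """\
reboot   system boot  4.15.0-33-generi Mon Feb 24 07:00   still running
root     pts/0        159.192.217.25   Mon Feb 24 06:37 - crash  (00:22)
"""

print(crashed_sessions(sample))
```

A session marked this way only tells you the login never got a clean logout record before the next boot. It does not say whether the cause was the server, the power, or the network.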
# 2  
Brief Update from Datacenter support:

Update 1:


Quote:
Thank you for your request. The servers are back up, we are still checking
what caused the issue. We are sorry for the inconvenience caused.
Update 2:

Quote:
Thank you for your request. The technicians are still checking what may have
caused it, but at the moment we do not have a feedback still.

Best regards
The plural "servers" in the first update suggests a datacenter-wide issue with either power or network connectivity.
# 3  
Searched all the log files again. Nothing indicates a crash except the active ssh session that ended in "crash".

My "uptime bot" shows the following 17-minute downtime window (GMT+7):

[Attached image: uptime bot screenshot of the downtime window (img_0482.jpg)]
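
As a rough illustration of how a monitor can turn its probe results into a downtime window like this one, here is a small sketch. It is not how my uptime bot works internally (I have no idea what it does); the function name `downtime_windows` and the probe timestamps are hypothetical and are not the real monitoring data.

```python
from datetime import datetime, timedelta


def downtime_windows(samples):
    """Collapse time-ordered (timestamp, is_up) probes into outage windows.

    Each window is (first failed probe, first successful probe after it,
    duration). An outage still open at the last sample is not reported.
    """
    windows = []
    down_since = None
    for ts, is_up in samples:
        if not is_up and down_since is None:
            down_since = ts  # outage starts at the first failed probe
        elif is_up and down_since is not None:
            windows.append((down_since, ts, ts - down_since))
            down_since = None
    return windows


# Hypothetical probes at one-minute intervals, for illustration only.
t0 = datetime(2000, 1, 1, 0, 0)
probes = [
    (t0, True),
    (t0 + timedelta(minutes=1), False),
    (t0 + timedelta(minutes=2), False),
    (t0 + timedelta(minutes=3), True),
]

for start, end, length in downtime_windows(probes):
    print(start, end, length)
```

Note that the measured window can only be as precise as the probe interval, so an external monitor and the server's own records will rarely agree to the minute.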
# 4  
Final update from data center:

Quote:
There has been a network outage in the room, it has been fixed and the servers brought back online.
Hmmmm.

The good news is that it was not a server error.

The bad news is that it was a data center network error.
# 5  
Possible Attack on Global DNS Today or DNS Outage?

Is anyone else noticing that DNS is not working normally and globally?

I am seeing an unusual situation where DNS names in many domains are not resolving.
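
A quick way to check from the affected host whether names in several domains fail to resolve is sketched below. It uses the system resolver through Python's standard `socket.getaddrinfo`; the helper name `unresolved` and the `resolve` parameter are assumptions added so the check can be exercised without a network.

```python
import socket


def unresolved(names, resolve=socket.getaddrinfo):
    """Return the names that the resolver fails to look up.

    `resolve` defaults to the system resolver; passing another callable
    (hypothetical hook) makes it possible to test without network access.
    """
    failed = []
    for name in names:
        try:
            resolve(name, None)  # port is not needed for a plain lookup
        except socket.gaierror:
            failed.append(name)
    return failed


# Example call (needs working network access to be meaningful):
# print(unresolved(["example.com", "example.org", "example.net"]))
```

If the same names resolve fine from a machine on a different network, the problem is more likely local to the provider than a global DNS event.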
# 6  
Ah .....

Turns out the datacenter is at fault again.

Quote:
Thank you for your request.

We had a emergency maintenance.
Problem seems to be solved at the moment.

Please accept our apologies for the inconvenience this is causing you.


Time to move to another provider?
# 7  
Quote:
emergency maintenance
If we can believe that we'll believe anything.

Anyway, what the h*ll does that mean??

An emergency is......e.g. a fire........maintenance is........er!!......should be planned? How do you get a mix of those two???
What is "emergency maintenance"? Answers on a postcard please!!

The only excuse is a power failure and any decent datacenter should have a backup power strategy for that.

All the pros on here know that systems should be designed clustered, high availability, etc. (typical outage 20 seconds). Who are they trying to kid?