Maint, resyncing and last-erred notifications


 
Thread Tools Search this Thread
Operating Systems Solaris Maint, resyncing and last-erred notifications
# 1  
Old 02-12-2018
Maint, resyncing and last-erred notifications

Hi fellow members!

I have a oracle solaris server with two internal disks, that acts as an authentication server only, and for now the server seems to be doing its job, but when typing metastat -c I get some notifications.
I am not familiar with SVM, I wonder if someone can help me on this:
Code:
metastat -c
d50              m  8.0GB d51 d52
    d51          s  8.0GB c0t0d0s6
    d52          s  8.0GB c0t1d0s6
d30              m   89GB d31 (maint) d32 (maint)
    d31          s   89GB c0t0d0s5 (resyncing)
    d32          s   89GB c0t1d0s5 (last-erred)
d20              m   16GB d21 d22
    d21          s   16GB c0t0d0s1
    d22          s   16GB c0t1d0s1
d10              m   15GB d11
    d11          s   15GB c0t0d0s0
d60              m  8.0GB d61 d62
    d61          s  8.0GB c0t0d0s7
    d62          s  8.0GB c0t1d0s7
d12              s   15GB c0t1d0s0

and the output of iostat -E shows me:

Code:
 iostat -E
sd2       Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: FUJITSU  Product: MBD2147RC        Revision: 3702 Serial No:
Size: 146.81GB <146810536448 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd3       Soft Errors: 26 Hard Errors: 28 Transport Errors: 8
Vendor: FUJITSU  Product: MBD2147RC        Revision: 3702 Serial No:
Size: 146.81GB <146810536448 bytes>
Media Error: 24 Device Not Ready: 0 No Device: 4 Recoverable: 26
Illegal Request: 0 Predictive Failure Analysis: 4
sd4       Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: TEAC     Product: DV-W28S-V        Revision: J.0B Serial No:
Size: 0.00GB <0 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0

The logs show media errors:

Code:
Feb  9 16:23:39 maphmi  Error for Command: read(10)                Error Level: Retryable
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    Requested Block: 72021888                  Error Block: 72021888
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    Vendor: FUJITSU                            Serial Number: D0C5PA800SFH
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    Sense Key: Unit_Attention
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    ASC: 0x29 (scsi bus reset occurred), ASCQ: 0x2, FRU: 0x0
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@0/scsi@0/sd@1,0 (sd3):
Feb  9 16:23:39 maphmi  Error for Command: write(10)               Error Level: Informational
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    Requested Block: 65024710                  Error Block: 65024710
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    Vendor: FUJITSU                            Serial Number: D0C5PA800SFH
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    Sense Key: Soft_Error
Feb  9 16:23:39 maphmi scsi: [ID 107833 kern.notice]    ASC: 0x5d (firmware impending failure too many block reassigns), ASCQ: 0x64, FRU: 0x0
Feb  9 16:23:42 maphmi scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@0/scsi@0/sd@1,0 (sd3):
Feb  9 16:23:42 maphmi  Error for Command: read(10)                Error Level: Retryable
Feb  9 16:23:42 maphmi scsi: [ID 107833 kern.notice]    Requested Block: 72021888                  Error Block: 72022006
Feb  9 16:23:42 maphmi scsi: [ID 107833 kern.notice]    Vendor: FUJITSU                            Serial Number: D0C5PA800SFH
Feb  9 16:23:42 maphmi scsi: [ID 107833 kern.notice]    Sense Key: Media_Error
Feb  9 16:23:42 maphmi scsi: [ID 107833 kern.notice]    ASC: 0x11 (read retries exhausted), ASCQ: 0x1, FRU: 0x0
Feb  9 16:23:46 maphmi scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@0/scsi@0/sd@1,0 (sd3):
Feb  9 16:23:46 maphmi  Error for Command: read(10)                Error Level: Retryable
Feb  9 16:23:46 maphmi scsi: [ID 107833 kern.notice]    Requested Block: 72021888                  Error Block: 72022006
Feb  9 16:23:46 maphmi scsi: [ID 107833 kern.notice]    Vendor: FUJITSU                            Serial Number: D0C5PA800SFH
Feb  9 16:23:46 maphmi scsi: [ID 107833 kern.notice]    Sense Key: Media_Error
Feb  9 16:23:46 maphmi scsi: [ID 107833 kern.notice]    ASC: 0x11 (read retries exhausted), ASCQ: 0x1, FRU: 0x0
Feb  9 16:23:53 maphmi scsi: [ID 107833 kern.warning] WARNING: /pci@0,600000/pci@0/pci@0/scsi@0/sd@1,0 (sd3):
Feb  9 16:23:53 maphmi  Error for Command: read(10)                Error Level: Fatal
Feb  9 16:23:53 maphmi scsi: [ID 107833 kern.notice]    Requested Block: 72021888                  Error Block: 72022010
Feb  9 16:23:53 maphmi scsi: [ID 107833 kern.notice]    Vendor: FUJITSU                            Serial Number: D0C5PA800SFH
Feb  9 16:23:53 maphmi scsi: [ID 107833 kern.notice]    Sense Key: Media_Error
Feb  9 16:23:53 maphmi scsi: [ID 107833 kern.notice]    ASC: 0x11 (read retries exhausted), ASCQ: 0x1, FRU: 0x0
Feb  9 16:23:53 maphmi md_stripe: [ID 641072 kern.warning] WARNING: md: d32: read error on /dev/dsk/c0t1d0s5
Feb  9 20:10:37 maphmi dtlogin[23097]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentials
Feb  9 22:34:24 maphmi dtlogin[10415]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentials
Feb 10 00:04:10 maphmi dtlogin[15976]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentials
Feb 10 00:04:26 maphmi last message repeated 1 time
Feb 10 03:34:01 maphmi syslogd: going down on signal 15
Feb 10 12:55:57 maphmi dtlogin[2513]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentials
Feb 10 15:34:01 maphmi syslogd: going down on signal 15
Feb 10 17:08:18 maphmi dtlogin[19265]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentials
Feb 11 03:34:01 maphmi syslogd: going down on signal 15
Feb 11 07:24:50 maphmi dtlogin[27628]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentials
Feb 11 15:34:01 maphmi syslogd: going down on signal 15
Feb 12 03:34:01 maphmi syslogd: going down on signal 15
Feb 12 07:42:35 maphmi dtlogin[23240]: [ID 293258 user.error] libsldap: Status: 49  Mesg: openConnection: simple bind failed - Invalid credentia

Can you help
# 2  
Old 02-12-2018
This disk c0t1d0 is experiencing read errors.

Currently it is only evident on slice 5 to the SVM.
iostat is showing errors for entire device.

Have a replacement disk ready.

Detach all slices from metadevices involving failed disk (c0t1d0..)
You will need to identify the disk physically and unconfigure it using cfgadm

The blue led should indicate the disk is safe to remove.

Replace of faulty disk in the server.

After the new working disk is inserted and visible to the operating system, proceed and follow the docs regarding svm replace faulty disk (or root disk) and its slices.

You might also want to considering making backup asap.

This is no easy task, and should be performed with care.
Verify each step taken and follow the documentation for the exact version of SVM and operating system you are using.

Hope that helps.
Regards
Peasant.
This User Gave Thanks to Peasant For This Post:
# 3  
Old 02-12-2018
thanks for your help, but I can only proceed if I have a spare/replacement disk, but for the moment I should backup all data on disks right? Would a backup be reliable when there is media or read errors?
# 4  
Old 02-12-2018
Since other device in the mirror is ok, the answer is yes, the backup should be reliable.

Be vary tho, the d12 metadevice is not mirrored (and its slice belongs to faulty disk).
Is that swap ?

Regards
Peasant.
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. Infrastructure Monitoring

Notifications not coming through

Issue: I'm not receiving notifications I can succesfully receive an e-mail if I do this on the command line: /usr/bin/mail -s "NAGIOS HOST ALERT on $HOSTNAME$" rgouette@butlerbros.com but, my command.cfg configuration below, refuses to send an e-mail when I set a service to a critical... (3 Replies)
Discussion started by: rgouette
3 Replies

2. Solaris

Expiring password notifications

I have a Solaris 10 server that notified one of my users/database administrators of another users account expiring within X amount of days. It attempted to email the user@ server.com, which failed and was forwarded from the mailer-daemon@ server.com, to the DBA. She's not listed as an alias for... (2 Replies)
Discussion started by: Nvizn
2 Replies

3. Emergency UNIX and Linux Support

mdadm unable to fail a resyncing drive?

Hi All I have a RAID 5 array consisting of 4 drives that had a partial drive failure in one of the drives. Rebooting shows the faulty drive as background rebuilding and mdadm /dev/ARRAYID shows three drives as in sync with the fourth drive as spare rebuilding. However the array won't come... (9 Replies)
Discussion started by: Bashingaway
9 Replies

4. Solaris

SVM - metastat - Last Erred

My company is running a solaris 2.7 machine. The machine is getting slow recently. I have no expert in solaris. Please help. I checked the log in /var/adm/message: I also checked with the command iostat -nE. It returns: Metastat returns the followings: What should i do now?... (10 Replies)
Discussion started by: nickychung
10 Replies

5. Solaris

Resyncing Progress of hardware mirror

Hi, I've recently mirrored the 4 disks in a V440. Disks 0 + 1 have been mirrored with hardware mirroring using the command raidctl -c c1t0d0 c1t2d0, the other 2 disks have been mirrored using Soltisce Disk Suite. I know how to check the progress for the SDS mirroring but how can I find the... (2 Replies)
Discussion started by: Chains
2 Replies

6. Post Here to Contact Site Administrators and Moderators

Servers, email notifications

Hi. On 09-05-2007 Neo posted that new servers were on their way. 1) It would be nice to hear of occasional progress. As far as I can tell, there has not been much of a change. The notice that is posted every now and then talks about a new server "this week". (In fact as I was previewing... (1 Reply)
Discussion started by: drl
1 Replies

7. UNIX for Dummies Questions & Answers

Maint user cannot execute ping command

I want give rights for the maint user to execute the "ping" command. Currently root user can execute the "ping" command, but the maint user is not able to execute the command. (3 Replies)
Discussion started by: kabeer_n
3 Replies

8. Post Here to Contact Site Administrators and Moderators

trying to turn email notifications off

I'm trying to disable the email notifications; despite going into my control panel and selecting NO next to Use 'Email Notification' by default? I keep getting email notifications anyway! And I know I've never selected the checkbox next to Email Notification: emails sent to you whenever someone... (2 Replies)
Discussion started by: oombera
2 Replies

9. HP-UX

HP-UX LVM Maint. Mode

What exactly happens when you boot HP-UX with the -lm flag? We tried this the other night, with some wacky results... We poked around, and found that / was mounted to /dev/rroot (I think... maybe it was just /dev/root). So, (still playing) we activated the volume group (lvchange -ay), and... (2 Replies)
Discussion started by: LivinFree
2 Replies
Login or Register to Ask a Question