Reboot VIO - OK?


 
Thread Tools Search this Thread
Operating Systems AIX Reboot VIO - OK?
# 1  
Old 01-15-2016
IBM Reboot VIO - OK?

We've got two datacenters and in every datacenter 2 VIOs.
The VIO manages the I/O of the LPARs. So: Is it possible to reboot the VIO without shuting down an LPARSmilie
# 2  
Old 01-15-2016
yes, it is possible. but you will get after it, depends on your LPAR configuration.
# 3  
Old 01-15-2016
Tools

Hi agent.kgb

Thanks for your answer!

What do you mean with:
Quote:
but you will get after it
How can I check the config of the LPAR to determine if a reboot would work without any trouble?
# 4  
Old 01-15-2016
Let's say you have a managed system with two VIOs (vio1 and vio2) and a lot of fully-virtualized LPARs (lpar1, lpar2, ...).

If an LPAR has VSCSI connection, it must be connected to both VIOs.
Code:
$ lsdev -l vscsi*
vscsi2 Available  Virtual SCSI Client Adapter
vscsi3 Available  Virtual SCSI Client Adapter
$ lsattr -El vscsi2
rw_timeout      0         Virtual SCSI Read/Write Command Timeout True
vscsi_err_recov fast_fail N/A                                     True
vscsi_path_to   30        Virtual SCSI Path Timeout               True
$ lsattr -El vscsi3
rw_timeout      0         Virtual SCSI Read/Write Command Timeout True
vscsi_err_recov fast_fail N/A                                     True
vscsi_path_to   30        Virtual SCSI Path Timeout               True

VIO mappings:
Code:
$ echo "cvai" | kdb
NAME       STATE    CMDS_ACTIVE  ACTIVE_QUEUE       HOST
vscsi2     0x000007 0x0000000000 0x0                vio1->vhost1
vscsi3     0x000007 0x0000000000 0x0                vio2->vhost1

If you have VSCSI disks:
Code:
$ lsdev -Cc disk -s vscsi
hdisk6 Available  Virtual SCSI Disk Drive

check their healthcheck parameters:
Code:
$ lsattr -El hdisk6
PCM             PCM/friend/vscsi                 Path Control Module        False
algorithm       fail_over                        Algorithm                  True
hcheck_cmd      test_unit_rdy                    Health Check Command       True+
hcheck_interval 60                               Health Check Interval      True+
hcheck_mode     nonactive <-- VERY BAD!                        Health Check Mode          True+
max_transfer    0x40000                          Maximum TRANSFER Size      True
pvid            00000000000000000000000000000000 Physical volume identifier False
queue_depth     32                               Queue DEPTH                True
reserve_policy  no_reserve                       Reserve Policy             True+

and that the disks are available through both vscsi adapters:
Code:
$ lspath -l hdisk6
Enabled hdisk6 vscsi2
Enabled hdisk6 vscsi3

Check priorities of the paths:
Code:
$ lspath -AHE -l hdisk6 -p vscsi2
attribute value description user_settable

priority  2     Priority    True
$ lspath -AHE -l hdisk6 -p vscsi3
attribute value description user_settable

priority  1     Priority    True

If you use VFC (NPIV), check that you have at least 2 VFC adapters from both of VIOs:
Code:
$ lsdev -Cc adapter -t IBM,vfc-client
fcs2 Available C4-T1 Virtual Fibre Channel Client Adapter
fcs3 Available C5-T1 Virtual Fibre Channel Client Adapter

Check their mappings:
Code:
$ echo vfcs | kdb
NAME      ADDRESS             STATE   HOST      HOST_ADAP  OPENED NUM_ACTIVE
fcs2      0xF1000A00001E8000  0x0008  vio1 vfchost14 0x01    0x0000
fcs3      0xF1000A00001EA000  0x0008  vio2 vfchost14 0x01    0x0000

Check that dyntrk and fast_fail for fscsi devices set to yes and fast_fail:
Code:
$ lsattr -El fscsi2
attach       switch    How this adapter is CONNECTED         False
dyntrk       yes       Dynamic Tracking of FC Devices        True
fc_err_recov fast_fail FC Fabric Event Error RECOVERY Policy True
scsi_id      0x340a01  Adapter SCSI ID                       False
sw_fc_class  3         FC Class for Fabric                   True
$ lsattr -El fscsi3
attach       switch    How this adapter is CONNECTED         False
dyntrk       yes       Dynamic Tracking of FC Devices        True
fc_err_recov fast_fail FC Fabric Event Error RECOVERY Policy True
scsi_id      0x330b01  Adapter SCSI ID                       False
sw_fc_class  3         FC Class for Fabric                   True

Then check that every disk device is available through both of the adapters. This check depends on your drivers. If you use AIX MPIO, you can look this information using lspath command. For EMC - powermt, for Hitachi - dlnkmgr.

Then, when you are sure, that all of your disks are available through both VIOs, you must check network connection.
Code:
$ lsdev -Cc adapter -t IBM,l-lan -s vdevice
ent3 Available  Virtual I/O Ethernet Adapter (l-lan)

You have to go on your VIO servers and check their, if you have SEA (shared ethernet adapter) or some other type of configuration. With SEA Fail-over configuration, you have to check which VIO server is active:
Code:
vio1$ lsdev -type sea
name             status      description
ent20            Available   Shared Ethernet Adapter
ent21            Available   Shared Ethernet Adapter
ent22            Available   Shared Ethernet Adapter
ent23            Available   Shared Ethernet Adapter
vio1$ entstat -all entSEA | grep Active

After you've checked everything, you can rely that there are no known bugs in your AIX and VIO versions, which could prevent switching from one VIO to another, but you can also switch everything manual - change priorities for every disk, or switch off devices (make them "defined") which use vio1 resources, and then reboot vio1. After it comes up, check everything again. If you changed priorities or removed some devices, restore the configuration before rebooting vio2.

I hope I didn't forget something very critical.
These 4 Users Gave Thanks to agent.kgb For This Post:
# 5  
Old 01-15-2016
A very concise description by agent.kgb, thank you for that!

@thread-o/p: notice that what agent.kgb described is in fact "best practice" and even if you would not have the need to reboot the VIOS right now you should set things up like agent.kgb described it anyway!

If you find out that this is not the case it would be a perfect occasion to get some downtime URGENTLY and correct this! Otherwise these problems will come back to haunt you and most probably will raise their ugly heads in the most inconvenient moment possible.

I hope this helps.

bakunin
This User Gave Thanks to bakunin For This Post:
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. AIX

Problems with VIO O/S 2.2.3

I hope someone has experienced this problem before and can help. I have a P730 (8231-E2D) which is fitted with 8 IBM SAS disks on 2 separate SISSAS controllers. The disk type is ST9146853SS (Seagate SAS 146Gb). The problem occurs when I try to install the VIO server software at 2.2.3. It... (3 Replies)
Discussion started by: johnf
3 Replies

2. AIX

VIO Server

Hi, I am facing an issue in vio server. When I run bosboot -ad /dev/hdisk0 I am getting an error trustchk: Verification of attributes failed: /usr/sbin/bootinfo : accessauths regards, vjm Please use code tags next time for your code and data. (8 Replies)
Discussion started by: vjm
8 Replies

3. AIX

vio server ethernet to vio client ethernet(concepts confusing)

Hi In the vio server when I do # lsattr -El hdisk*, I get a PVID. The same PVID is also seen when I put the lspv command on the vio client partition. This way Im able to confirm the lun using the PVID. Similarly how does the vio client partition gets the virtual ethernet scsi client adapter... (1 Reply)
Discussion started by: newtoaixos
1 Replies

4. Red Hat

vio in linux

I am working in AIX. In a P series machine both aix and linux os can install.. specific command is there for aix to list out the vio servers details mapped in a frame(ie. lssrc "IBM.ManagementServer", like that..) is there any command is der to list out vio servers details in linux or how to find... (1 Reply)
Discussion started by: muralikrish
1 Replies

5. AIX

sddpcm on vio

I have same lun (hdisk3) assigned to 2 vios server (dual vio server) and this is mapped to the client lpar. I have sddpcm on the VIO. On the VIOA, the attributes for fscsi1 and fscsi3 are set to fast_fail and dyntrk is set to yes Now, if I do a lqueryvg -Atp on the hdisk3, I get a "DISK... (4 Replies)
Discussion started by: mk8570
4 Replies

6. AIX

Vio Build

I am planning carving out dual VIO servers on a P750 based and would like to know how to allocate internal disks to the VIOS. This has 4 internal disks and I would to assign 2 disks to each VIO. This is what I was thinking of assigning . Any advise? vio1 U5802.001.9K8H757-P1-C1 PCI-E x8... (17 Replies)
Discussion started by: mk8570
17 Replies

7. AIX

VIO Redundancy

Hi Guys, I'm working on building a new 595 machine.. I'm try to achieve high availability for the VIO clients using 2 VIO servers, The ethernet part is satisfied using SEA failover, Now how do I achieve redundancy for the disks, its the servers internal disk attached to the scsi cards... ... (2 Replies)
Discussion started by: kkeng808
2 Replies

8. AIX

vio related

In a dual vio, how will u know one lpar is in which vio? (1 Reply)
Discussion started by: shobhan.murthy
1 Replies

9. AIX

vio server and vio client

Hi, I want to know wheather partition size for installation of vio client can be specified on vio server example If I am installing vio server on blade with 2*300gb hard disk,after that I want to create 2 vio client (AIX Operating system) wheather I can specify hard disk size while... (1 Reply)
Discussion started by: manoj.solaris
1 Replies

10. Solaris

different between soft reboot and hard reboot

Hi Guru's Can any want here could explain to me the different between soft reboot and hard reboot . Best Regards Seelan (3 Replies)
Discussion started by: seelan3
3 Replies
Login or Register to Ask a Question