DLPAR Operation Fails
One cause of failed Dynamic LPAR (DLPAR) operations is a duplicate ct_node_id, where two or more nodes share the same value. From the HMC, this appears as a communications failure when attempting DLPAR operations.
Duplicates are usually caused by cloning AIX systems via alt_disk_install or by more obscure means (e.g. moving one half of a mirrored rootvg between nodes).
To check if this is the case, compare the 16-digit hexadecimal number on the first line of /etc/ct_node_id on each node. No two nodes should have the same value.
The ct_node_id is used by the following:
- LPARs
- Dynamic LPARs
- HACMP-ES
- HACMP-ES-CRM
- PSSP
- CSM
- GPFS
- VSD
- RVSD
- Oracle Parallel Server
- Oracle 9i RAC
To assign a new ct_node_id, perform the following:
    # stopsrc -g rsct
    0513-044 The ctrmc Subsystem was requested to stop.
    # /usr/sbin/rsct/install/bin/uncfgct -n
    # /usr/sbin/rsct/install/bin/cfgct
    0513-071 The ctcas Subsystem has been added.
    0513-071 The ctrmc Subsystem has been added.
    0513-059 The ctrmc Subsystem has been started. Subsystem PID is 233648.
The following may be needed to reconfigure RSCT, although in tests it has not been required.
    # /usr/sbin/rsct/bin/rmcctrl -z
    # /usr/sbin/rsct/bin/rmcctrl -A
    0513-071 The ctrmc Subsystem has been added.
    0513-059 The ctrmc Subsystem has been started. Subsystem PID is 237814.
    # /usr/sbin/rsct/bin/rmcctrl -p
After assigning a new ct_node_id, wait several minutes before trying the DLPAR operation. The HMC must re-synchronize its state before it will work.
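Rather than guessing how long to wait, the wait can be expressed as a bounded polling loop. This is only a sketch of the general idea: wait_until is a hypothetical helper, and the command it polls must be supplied by you, since this page does not prescribe a specific readiness check.

```shell
# Run a check command repeatedly until it succeeds or the attempts run out.
# Usage: wait_until ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# Returns 0 as soon as COMMAND succeeds, 1 if every attempt fails.
wait_until() {
    attempts=$1
    delay=$2
    shift 2
    i=1
    while [ "$i" -le "$attempts" ]; do
        if "$@" >/dev/null 2>&1; then
            return 0
        fi
        # Do not sleep after the final failed attempt.
        if [ "$i" -lt "$attempts" ]; then
            sleep "$delay"
        fi
        i=$((i + 1))
    done
    return 1
}
```

For example, wait_until 10 60 check_hmc_link would try once a minute for up to ten attempts, where check_hmc_link is a placeholder for whatever command you use to confirm that the HMC can reach the partition again.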
If this is a new system install, and DLPAR operations fail, make sure that the csm.client fileset is installed:
    ksh$ lslpp -L csm.client | head -4
      Fileset                      Level  State  Type  Description (Uninstaller)
      ----------------------------------------------------------------------------
      csm.client                 1.4.1.0    C     F    Cluster Systems Management Client
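If the check needs to run on many systems, the State column can be pulled out of the lslpp -L listing instead of being read by eye. The sketch below assumes the column layout shown above (fileset name first, state third); fileset_state is a hypothetical helper name.

```shell
# Print the State column for one fileset from `lslpp -L` output on stdin.
# Assumes the layout shown above: fileset name in column 1, state in column 3.
# Exits non-zero when the fileset does not appear in the listing.
fileset_state() {
    awk -v name="$1" '
        $1 == name { print $3; found = 1 }
        END { exit (found ? 0 : 1) }
    '
}
```

A typical call would be lslpp -L csm.client 2>/dev/null | fileset_state csm.client. A state of C means the fileset is committed; a non-zero exit status means it was not listed at all.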
See Also
- Dynamic LPAR tips and checklists for RMC authentication and authorization archived at scribd.