星期三, 6月 26, 2013

HP UX syslog "Rebooting for cluster integrity" powerpath

安裝Oracle RAC 之前, 如果沒先處理好powerpath 等等的multi-path 軟體問題,
可能在dbca 之後一小時內會發生node reboot情形


以下連結供參考Ref:
https://forums.oracle.com/thread/927929
https://forums.oracle.com/thread/993066
https://forums.oracle.com/thread/581397

最後查到這篇才證實了我的猜測
==> 294430.1
LONG LATENCIES TO THE VOTING DISKS : EMC PowerPath path error detection and I/O repost and redirect greater than default misscount 

The most common problems relate to multi-path IO software drivers, and the reconfiguration times resulting from a failure in the IO path.
Hardware and (re)configuration issues that introduce these latencies should be corrected.


Reboot原因說明如下

Disk LUN I/O重新導向的狀況下...只要符合以下任一條件, 就會node reboot
Takes more than Disktimeout seconds (200 sec) or
Takes more than Misscount Seconds (30 sec)

* By default Misscount is less than Disktimeout seconds
只有在網路互ping超過misscountVoting disk超過disktimeout時機器才會reboot.

misscount :   Default Value is 60 Sec (Linux) and 30 Sec in Unix platform
disktimeout : Default Value is 200. (Disk IO)


沒有留言:

LinkWithin-相關文件

Related Posts Plugin for WordPress, Blogger...