本版版主招募中

 
标题: 帮忙看看disk还有没有救?
  本主题由 老农 于 2008-9-5 23:31 移动 
xuchen1982
LU幼天使
Rank: 2



UID 44330
精华 0
积分 37
帖子 68
活跃指数 32
LU金币 100 个
LU金条 0 个
阅读权限 20
注册 2006-4-7
 
发表于 2008-9-5 22:15  资料  个人空间  短消息  加为好友 
帮忙看看disk还有没有救?

mail里每天报一次错:
如下:
Notification Time: Fri Sep  5 10:24:48 2008

guigang1 sent Event Monitor notification information:

/storage/events/disks/default/0_5_1_0.8.0.255.0.3.0
is >= 3.
Its current value is CRITICAL(5).



Event data from monitor:

Event Time..........: Fri Sep  5 10:24:48 2008
Severity............: CRITICAL
Monitor.............: disk_em
Event #.............: 100142
System..............: guigang1

Summary:
     Disk at hardware path 0/5/1/0.8.0.255.0.3.0 : Media failure


Description of Error:

     While attempting to spare a block, the device failed to do so because no
     defect spare location was available. The I/O request that necessitated
     this action may have been processed in a way which could cause damage to
     or loss of data.

Probable Cause / Recommended Action:

     Reformatting the medium may fix the problem.

     Alternatively, the medium in the device is flawed. If the medium is
     removable, replace the medium with a fresh one.

     Alternatively, if the medium is not removable, the device has experienced
     a hardware failure. Contact your HP support representative to have the
     device checked.

Additional Event Data:
     System IP Address...: 192.168.99.200
     Event Id............: 0x48c0987000000000
     Monitor Version.....: B.01.01
     Event Class.........: I/O
     Client Configuration File...........:
     /var/stm/config/tools/monitor/default_disk_em.clcfg
     Client Configuration File Version...: A.01.00
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          0x48c0987000000000
     Additional System Data:
          System Model Number.............: 9000/800/rp4440
          OS Version......................: B.11.11
          STM Version.....................: A.45.00
          EMS Version.....................: A.04.00
     Latest information on this event:
          http://docs.hp.com/hpux/content/hardware/ems/scsi.htm#100142

v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v



Component Data:
     Physical Device Path...: 0/5/1/0.8.0.255.0.3.0
     Device Class...........: Disk
     Inquiry Vendor ID......: HP 73.4G
     Inquiry Product ID.....: ST373307FC
     Firmware Version.......: HP05
     Serial Number..........: 3HZ9A7L8

Product/Device Identification Information:

     Logger ID.........: sdisk
     Product Identifier: SCSI Disk
     Product Qualifier.: HP73.4GST373307FC
     SCSI Target ID....: 0x03
     SCSI LUN..........: 0x00

I/O Log Event Data:

     Driver Status Code..................: 0x0000007E
     Length of Logged Hardware Status....: 22 bytes.
     Offset to Logged Manager Information: 24 bytes.
     Length of Logged Manager Information: 34 bytes.

Hardware Status:

     Raw H/W Status:
          0x0000: 00 00 00 02   F0 00 04 00   A6 1A 90 0A   00 00 00 00
          0x0010: 32 00 04 00   00 00

     SCSI Status...: CHECK CONDITION (0x02)
          Indicates that a contingent allegiance condition has occurred.  Any
          error, exception, or abnormal condition that causes sense data to be
          set will produce the CHECK CONDITION status.

SCSI Sense Data:

     Undecoded Sense Data:
          0x0000: F0 00 04 00   A6 1A 90 0A   00 00 00 00   32 00 04 00
          0x0010: 00 00

     SCSI Sense Data Fields:
          Error Code                      : 0x70
          Segment Number                  : 0x00
          Bit Fields:
               Filemark                   : 0
               End-of-Medium              : 0
               Incorrect Length Indicator : 0
          Sense Key                       : 0x04
          Information Field Valid         : TRUE
          Information Field               : 0x00A61A90
          Additional Sense Length         : 10
          Command Specific                : 0x00000000
          Additional Sense Code           : 0x32
          Additional Sense Qualifier      : 0x00
          Field Replaceable Unit          : 0x04
          Sense Key Specific Data Valid   : FALSE
          Sense Key Specific Data         : 0x00 0x00 0x00

          Sense Key 0x04, HARDWARE ERROR, indicates that the device detected a
          nonrecoverable hardware failure (for example, controller failure,
          device failure, parity error, etc.) while performing the command or
          during a self test.

          The combination of Additional Sense Code and Sense Qualifier (0x3200)
          indicates: No defect spare location available.

SCSI Command Data Block:

     Command Data Block Contents:
          0x0000: 28 00 00 A6   1A 90 00 01   00 00

     Command Data Block Fields (10-byte fmt):
          Command Operation Code...(0x28)..: READ
          Logical Unit Number..............: 0
          DPO Bit..........................: 0
          FUA Bit..........................: 0
          Relative Address Bit.............: 0
          Logical Block Address............: 10885776 (0x00A61A90)
          Transfer Length..................: 256 (0x0100)

Manager-Specific Data Fields:
     Request ID.............: 0x117AD343
     Data Residue...........: 0x0001FE00
     CDB status.............: 0x00000002
     Sense Status...........: 0x00000000
     Bus ID.................: 0x11
     Target ID..............: 0x03
     LUN ID.................: 0x00
     Sense Data Length......: 0x12
     Q Tag..................: 0xD0
     Retry Count............: 13


>---------- End Event Monitoring Service Event Notification ----------<

顶部
bennial
技术专家
Rank: 14Rank: 14Rank: 14Rank: 14



UID 235
精华 6
积分 477
帖子 831
活跃指数 66
LU金币 3558 个
LU金条 0 个
阅读权限 200
注册 2003-9-29
 
发表于 2008-9-5 22:49  资料  个人空间  短消息  加为好友 
发错区了吧,你这是HP的机器





承接各种IBM DS3000/DS4000/DS5000/DS6000/DS8000/ESS800/SVC, Brocade SAN Switch收费维护服务
DS4000系列的数据恢复业务
AIX LVM数据恢复业务
顶部
mali8507 (路遥芝麻粒)
LU幼天使
Rank: 2



UID 98532
精华 1
积分 58
帖子 90
活跃指数 24
LU金币 210 个
LU金条 0 个
阅读权限 20
注册 2007-11-10
 
发表于 2008-9-6 14:33  资料  个人空间  短消息  加为好友  添加 mali8507 为MSN好友 通过MSN和 mali8507 交谈
把结构描述清楚。

先检查存储。没问题的话,检查主机卡、链路。

如果是多路径的  单个路径报错问题不大,可以试图删除该路径重新认。之前做好数据备份。

顶部
 



当前时区 GMT+8, 现在时间是 2008-11-21 20:48
乐悠LoveUnix论坛-京ICP备05005823号

Thanks to Discuz!  © 2001-2007    Power by LoveUnix.net
Processed in 0.067590 second(s), 7 queries , Gzip enabled

清除 Cookies - 联系我们 - 乐悠LoveUnix - Archiver