
S.M.A.R.T. - disk errors, software RAID1

Post by: poldas »

Hello.

I run a software RAID-1 (mdadm) on my computer. I recently decided to check the condition of the disks with S.M.A.R.T. and noticed errors. I would appreciate any information on what the problem is and whether the disks need to be replaced.
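
Since both disks are members of an mdadm array, the array state is worth looking at next to the SMART data. Below is a minimal sketch, assuming the usual /proc/mdstat layout in which a status field such as [UU] marks all members as up and an underscore (for example [U_]) marks a missing or failed member. The function name md_degraded is hypothetical and the array names on this machine are not known from the post.

```shell
#!/bin/sh
# Hypothetical helper: reads /proc/mdstat-style text on stdin and prints
# the name of every md array whose status field contains "_" (a missing
# or failed member), e.g. "[U_]" instead of "[UU]".
md_degraded() {
    awk '
        /^md[0-9]+ *:/ { name = $1 }          # remember the current array name
        match($0, /\[[U_]+\]/) {              # status field such as [UU] or [U_]
            status = substr($0, RSTART, RLENGTH)
            if (index(status, "_") > 0) print name
        }
    '
}

# Usage on a live system:
#   md_degraded < /proc/mdstat
```

If nothing is printed, every array found in the input reported all members as up.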

Code:

smartctl -a /dev/hda

Code:

smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     SAMSUNG SP0411N
Serial Number:    S01JJ20WB76790
Firmware Version: TW100-08
User Capacity:    40,060,403,712 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 0
Local Time is:    Thu Oct  6 11:53:18 2011 CEST

==> WARNING: May need -F samsung or -F samsung2 enabled; see manual for details.

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      ( 112)    The previous self-test completed having
                    the read element of the test failed.
Total time to complete Offline 
data collection:          ( 900) seconds.
Offline data collection
capabilities:              (0x1b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    No Conveyance Self-test supported.
                    No Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    No General Purpose Logging support.
Short self-test routine 
recommended polling time:      (   1) minutes.
Extended self-test routine
recommended polling time:      (  15) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   051    Pre-fail  Always       -       11
  3 Spin_Up_Time            0x0007   063   063   000    Pre-fail  Always       -       6336
  4 Start_Stop_Count        0x0032   094   094   000    Old_age   Always       -       6535
  5 Reallocated_Sector_Ct   0x0033   099   099   010    Pre-fail  Always       -       3
  7 Seek_Error_Rate         0x000b   253   253   051    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0024   253   253   000    Old_age   Offline      -       0
  9 Power_On_Hours          0x0032   097   097   000    Old_age   Always       -       1846290
 10 Spin_Retry_Count        0x0013   253   253   049    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2531
194 Temperature_Celsius     0x0022   133   118   000    Old_age   Always       -       35
195 Hardware_ECC_Recovered  0x000a   100   100   000    Old_age   Always       -       74417408
196 Reallocated_Event_Count 0x0012   099   099   000    Old_age   Always       -       4
197 Current_Pending_Sector  0x0033   253   253   010    Pre-fail  Always       -       0
198 Offline_Uncorrectable   0x0031   100   100   010    Pre-fail  Offline      -       1
199 UDMA_CRC_Error_Count    0x000b   100   100   051    Pre-fail  Always       -       0
200 Multi_Zone_Error_Rate   0x000b   100   100   051    Pre-fail  Always       -       0
201 Soft_Read_Error_Rate    0x000b   100   100   051    Pre-fail  Always       -       1

SMART Error Log Version: 1
Warning: ATA error count 4864 inconsistent with error log pointer 5

ATA Error Count: 4864 (device log contains only the most recent five errors)
    CR = Command Register [HEX]
    FR = Features Register [HEX]
    SC = Sector Count Register [HEX]
    SN = Sector Number Register [HEX]
    CL = Cylinder Low Register [HEX]
    CH = Cylinder High Register [HEX]
    DH = Device/Head Register [HEX]
    DC = Device Command Register [HEX]
    ER = Error register [HEX]
    ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 4864 occurred at disk power-on lifetime: 8947 hours (372 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      03:16:46.938  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      03:16:46.938  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      03:16:46.875  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      03:16:46.875  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      03:16:46.875  IDENTIFY DEVICE

Error 4863 occurred at disk power-on lifetime: 7915 hours (329 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      01:11:17.438  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      01:11:17.438  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      01:11:17.375  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      01:11:17.375  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      01:11:17.188  IDENTIFY DEVICE

Error 4862 occurred at disk power-on lifetime: 7915 hours (329 days + 19 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      00:52:32.625  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      00:52:32.625  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      00:52:32.625  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      00:52:32.625  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      00:52:32.500  IDENTIFY DEVICE

Error 4861 occurred at disk power-on lifetime: 4604 hours (191 days + 20 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      00:00:06.625  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      00:00:06.625  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      00:00:06.625  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      00:00:06.625  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      00:00:06.563  IDENTIFY DEVICE

Error 4860 occurred at disk power-on lifetime: 1210 hours (50 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      00:00:05.688  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      00:00:05.688  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      00:00:05.625  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      00:00:05.625  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      00:00:05.563  IDENTIFY DEVICE

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       00%     15319         76596352
# 2  Short offline       Completed without error       00%     15318         -

Device does not support Selective Self Tests/Logging
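
The self-test log for /dev/hda above gives 76596352 as the LBA of the first read error. Here is a minimal sketch of how that number relates to the reported capacity of 40,060,403,712 bytes. It assumes 512-byte logical sectors, which the output does not state, and a shell with 64-bit integer arithmetic. The function name lba_position is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: lba_position LBA CAPACITY_BYTES [SECTOR_SIZE]
# Prints the byte offset of the LBA and its position as a whole percentage
# of the disk. SECTOR_SIZE defaults to 512 (an assumption, see above).
lba_position() {
    lba=$1
    capacity=$2
    sector=${3:-512}
    total_sectors=$(( capacity / sector ))      # number of addressable sectors
    offset=$(( lba * sector ))                  # byte offset of the failing LBA
    percent=$(( lba * 100 / total_sectors ))    # integer division, rounds down
    echo "offset=${offset} total_sectors=${total_sectors} percent=${percent}"
}

# Values taken from the /dev/hda output above
lba_position 76596352 40060403712
```

Under those assumptions this prints offset=39217332224 total_sectors=78242976 percent=97, so the reported sector lies near the end of the disk. Which partition or array member it falls into depends on the partition table, which is not shown here.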

Code:

smartctl -a /dev/hdb

Code:

smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     SAMSUNG SV0411N
Serial Number:    S01RJ20WB91051
Firmware Version: UA100-08
User Capacity:    40,060,403,712 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 0
Local Time is:    Thu Oct  6 12:05:59 2011 CEST

==> WARNING: May need -F samsung or -F samsung2 enabled; see manual for details.

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)    The previous self-test routine completed
                    without error or no self-test has ever 
                    been run.
Total time to complete Offline 
data collection:          (1200) seconds.
Offline data collection
capabilities:              (0x1b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    No Conveyance Self-test supported.
                    No Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    No General Purpose Logging support.
Short self-test routine 
recommended polling time:      (   1) minutes.
Extended self-test routine
recommended polling time:      (  20) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0007   079   042   000    Pre-fail  Always       -       4032
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2952
  5 Reallocated_Sector_Ct   0x0033   253   253   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   253   253   051    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0024   253   253   000    Old_age   Offline      -       0
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       1709141
 10 Spin_Retry_Count        0x0013   253   253   049    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       1488
194 Temperature_Celsius     0x0022   133   124   000    Old_age   Always       -       35
195 Hardware_ECC_Recovered  0x000a   100   100   000    Old_age   Always       -       53090052
196 Reallocated_Event_Count 0x0012   253   253   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0033   253   253   010    Pre-fail  Always       -       0
198 Offline_Uncorrectable   0x0031   253   253   010    Pre-fail  Offline      -       0
199 UDMA_CRC_Error_Count    0x000b   100   100   051    Pre-fail  Always       -       0
200 Multi_Zone_Error_Rate   0x000b   100   100   051    Pre-fail  Always       -       0
201 Soft_Read_Error_Rate    0x000b   100   100   051    Pre-fail  Always       -       0

SMART Error Log Version: 1
Warning: ATA error count 9984 inconsistent with error log pointer 5

ATA Error Count: 9984 (device log contains only the most recent five errors)
    CR = Command Register [HEX]
    FR = Features Register [HEX]
    SC = Sector Count Register [HEX]
    SN = Sector Number Register [HEX]
    CL = Cylinder Low Register [HEX]
    CH = Cylinder High Register [HEX]
    DH = Device/Head Register [HEX]
    DC = Device Command Register [HEX]
    ER = Error register [HEX]
    ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 9984 occurred at disk power-on lifetime: 8671 hours (361 days + 7 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      07:02:29.938  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      07:02:29.938  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      07:02:29.875  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      07:02:29.875  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      07:02:25.813  IDENTIFY DEVICE

Error 9983 occurred at disk power-on lifetime: 8671 hours (361 days + 7 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      06:45:01.750  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      06:45:01.750  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      06:45:01.750  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      06:45:01.750  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      06:44:57.688  IDENTIFY DEVICE

Error 9982 occurred at disk power-on lifetime: 8671 hours (361 days + 7 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      06:29:51.563  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      06:29:51.563  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      06:29:51.563  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      06:29:51.563  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      06:29:47.125  IDENTIFY DEVICE

Error 9981 occurred at disk power-on lifetime: 8665 hours (361 days + 1 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      00:00:33.563  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      00:00:33.563  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      00:00:33.500  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      00:00:33.500  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      00:00:28.500  IDENTIFY DEVICE

Error 9980 occurred at disk power-on lifetime: 8665 hours (361 days + 1 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  04 51 00 01 c2 4f a0  Error: ABRT

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  b0 d9 00 01 c2 4f a0 00      00:00:17.563  SMART DISABLE OPERATIONS
  ec 00 ff 01 00 00 a0 00      00:00:17.563  IDENTIFY DEVICE
  10 00 ff 01 01 00 a0 00      00:00:12.813  RECALIBRATE [OBS-4]
  91 00 ff 01 01 00 af 00      00:00:12.813  INITIALIZE DEVICE PARAMETERS [OBS-6]
  ec 00 ff 01 00 00 a0 00      00:00:08.688  IDENTIFY DEVICE

Warning! SMART Self-Test Log Structure error: invalid SMART checksum.
SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


Device does not support Selective Self Tests/Logging
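
To compare the two outputs above more easily, the attribute tables can be filtered down to the sector-related counters. This is a minimal sketch, assuming the ten-column table layout shown above. The choice of attributes (5, 196, 197, 198) is my own shortlist, not an official smartmontools classification, and the function name smart_sector_flags is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: reads `smartctl -a` (or `smartctl -A`) output on stdin
# and prints "ID NAME RAW" for a hand-picked set of sector-health attributes
# whose raw value is non-zero.
smart_sector_flags() {
    awk '
        # Attribute rows: numeric ID in column 1, TYPE in column 7, RAW_VALUE in column 10.
        # Checking the TYPE column keeps error-log register dumps from matching.
        $1 ~ /^[0-9]+$/ && NF >= 10 && ($7 == "Pre-fail" || $7 == "Old_age") {
            id = $1 + 0
            if ((id == 5 || id == 196 || id == 197 || id == 198) && $10 + 0 > 0)
                print id, $2, $10
        }
    '
}

# Usage on a live system (needs root):
#   smartctl -A /dev/hda | smart_sector_flags
```

Fed with the two tables above, this would list attributes 5, 196 and 198 for /dev/hda (raw values 3, 4 and 1) and nothing for /dev/hdb.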
