Jump to content

Recommended Posts

  • Administrators
Posted

It would be great if we get the full S.M.A.R.T. information for each hard drive. This way we can monitor the RAID status too :lol: .

Cheers.

  • 5 months later...
  • 3 months later...
Posted

Hi All,

In the meantime I've set a notification to come up if the following happens:

Event Log: System

Level: Error

Keywords: bad,block

Event IDs: 7

Sources: Disk

This happened on a work PC but I noticed it, but handy to have it tell you.

  • Administrators
Posted

I used the following keywords: bluescreen,crash,dump,controller,raid,disk

But it was designed to cover a lot more scenarios.

  • 3 months later...
Posted

I used the following keywords: bluescreen,crash,dump,controller,raid,disk

But it was designed to cover a lot more scenarios.

Thanks for the post about what you do. I thought I would add to the community and say what's happened at our place. I've just had a failure on our server RAID and it has had to be rebuilt. I copied the keywords you suggested "bluescreen,crash,dump,controller,raid,disk" and i didnt get a warning. The events created in event viewer are list below. I was hoping of using PCMonitor to warn me when hard drives failures, any ideas how we can capture the failures?

Log Name: System

Source: HpCISSs2

Date: 15/10/2012 15:50:41

Event ID: 129

Task Category: None

Level: Warning

Keywords: Classic

User: N/A

Computer: computername

Description:

Reset to device, \Device\RaidPort2, was issued.

Event Xml:

<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">

<System>

<Provider Name="HpCISSs2" />

<EventID Qualifiers="32772">129</EventID>

<Level>3</Level>

<Task>0</Task>

<Keywords>0x80000000000000</Keywords>

<TimeCreated SystemTime="2012-10-15T14:50:41.904Z" />

<EventRecordID>3509648</EventRecordID>

<Channel>System</Channel>

<Computer>computername</Computer>

<Security />

</System>

Log Name: System

Source: disk

Date: 15/10/2012 15:57:21

Event ID: 11

Task Category: None

Level: Error

Keywords: Classic

User: N/A

Computer: ?????????

Description:

The driver detected a controller error on \Device\Harddisk0\DR0.

Event Xml:

<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">

<System>

<Provider Name="disk" />

<EventID Qualifiers="49156">11</EventID>

<Level>2</Level>

<Task>0</Task>

<Keywords>0x80000000000000</Keywords>

<TimeCreated SystemTime="2012-10-15T14:57:21.530Z" />

<EventRecordID>3509670</EventRecordID>

<Channel>System</Channel>

<Computer>???????</Computer>

<Security />

</System>

  • Administrators
Posted

Hello

In this case I would create a second filter along with the ones with keywords that will only include the following source filters: HpCISSs2,disk

Paul.

Posted

HpCISSs2 is a specific device controller. We dont have that one in our other servers. I was hoping there would be a catch all set of keywords that would notify me of RAID issues. Am i looking for somethings thats not possible with PC Monitor?

  • Administrators
Posted

You can catch all RAID issues by monitoring the disk source, by default Windows logs all raid events to disk source events.

  • Administrators
Posted (edited)

Viewing the S.M.A.R.T. data of a device can give you insight of hardware statistics and status, for instance my SSD gives me two counters for the amount of GB read and written and remaining life of the drive. On my HP SmartArray 430 I get RAID status and Raid health. Just google for smart values and I'm sure you will understand.

The problem with SMART is that each manufacturer uses it's own set of value types and it's almost impossible of knowing each value id what it means, for instance SMART value id 55 can mean three different things if read on three different manufacturers.

Also if SMART get's tripped you know if the drive is faulty and requires replacement.

Edited by Paul
Posted

Although not automatic, you can you use the powershell line below which will show true or false. If True, anticipate disk failure at some point.

gwmi -Namespace root\wmi -Class MSStorageDriver_FailurePredictStatus -property PredictFailure

Posted

Another thought - create a scheduled task to run powershell script and look for the word True and if found, send an email using powershell

  • Administrators
Posted

Or have a plugin send a notification for that -regarding my current project- B) .

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...