Solved

Sharepoint Environment, getting Event 823/824 on the SQL Server

Posted on 2013-06-25
1
1,020 Views
Last Modified: 2013-07-09
SharePoint DB Environment issues
VM Environment
2 Servers
Web Front End
Windows 2008 R2
SharePoint 2010

Database Back End
Windows 2008 R2
MS SQL Server 2008 x64

Going over the SQL Server logs we have seen a high occurrence of Error 823 and 824.  They also show up in the Windows Application Event Viewer.

This is the general description we have been getting:
823) The operating system returned error incorrect checksum (expected: 0xb016ce52; actual: 0xb016ce52) to SQL Server during a read at offset 0x00000000fc6000 in file 'E:\Program Files\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQL\DATA\tempdb.mdf'. Additional messages in the SQL Server error log and system event log may provide more detail. This is a severe system-level error condition that threatens database integrity and must be corrected immediately. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.

824) SQL Server detected a logical consistency-based I/O error: incorrect checksum (expected: 0x686d8353; actual: 0x8ab05a4c). It occurred during a read of page (1:204495) in database ID 2 at offset 0x00000063d9e000 in file 'E:\Program Files\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQL\DATA\tempdb.mdf'.  Additional messages in the SQL Server error log or system event log may provide more detail. This is a severe error condition that threatens database integrity and must be corrected immediately. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.


We restarted the SQL Server service so that tempdb would be recreated.  We also ran DBCC CHECKDB on all the databases and found no issues.

We have also used SQLIOSim to stress test the database server.  The stress test gives an error of:
Error: 0x80070467
Error Text: While accessing the hard disk, a disk operation failed even after retries.
Description: Buffer validation failed on C:\sqliosim.mdx Page: 398593, offset 0x8

We also get the following warnings
Error: 0x00000000
Error Text:
Description: 296 IO requests are outstanding for more than 15 sec.

Our system administrator performed a diagnostic on both the hard drive and memory for the environments.  No issues were found.

Our question is why would this happen to only temp.mdf?  And what are the chances that other databases could start getting checksum errors?  Any suggestions on what we could test next?
0
Comment
Question by:rmuyargas
1 Comment
 
LVL 20

Accepted Solution

by:
Marten Rune earned 500 total points
ID: 39277102
Your harddisk subsystem, or your SAN disks are not healthy. This error is an indication of failing disks. If you use tempDB on it's own drive. Then you are lucky since this is the error you disclosed to us. Either way.

1. Replace the drives as fast as posssible.
2. Run consistency checks on all your user databases.
3. Make sure your backups are solid.

A disk that finda a faulty block, marks is as suspect, and stops using it. Therefore the databases with the most I/O is the most lightly to report these errors. I have never heard of a red herring error regarding Error 823 and 824

More info on:
http://support.microsoft.com/kb/2015756
https://www.simple-talk.com/sql/performance/reliable-storage-systems-for-sql-server/

You could look at SQLStress remotely. But it needs to be run as a local admin for the first time. I wouldn't run it locally.

Best regards, and good luck, Marten
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

I have put this article together as i needed to get all the information that might be available already into one general document that could be referenced once without searching the Internet for the different pieces. I have had a few issues where…
This article shows gives you an overview on SQL Server 2016 row level security. You will also get to know the usages of row-level-security and how it works
This video shows, step by step, how to configure Oracle Heterogeneous Services via the Generic Gateway Agent in order to make a connection from an Oracle session and access a remote SQL Server database table.
Via a live example, show how to extract information from SQL Server on Database, Connection and Server properties

919 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

14 Experts available now in Live!

Get 1:1 Help Now