Solved

Sharepoint Environment, getting Event 823/824 on the SQL Server

Posted on 2013-06-25
1
1,031 Views
Last Modified: 2013-07-09
SharePoint DB Environment issues
VM Environment
2 Servers
Web Front End
Windows 2008 R2
SharePoint 2010

Database Back End
Windows 2008 R2
MS SQL Server 2008 x64

Going over the SQL Server logs we have seen a high occurrence of Error 823 and 824.  They also show up in the Windows Application Event Viewer.

This is the general description we have been getting:
823) The operating system returned error incorrect checksum (expected: 0xb016ce52; actual: 0xb016ce52) to SQL Server during a read at offset 0x00000000fc6000 in file 'E:\Program Files\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQL\DATA\tempdb.mdf'. Additional messages in the SQL Server error log and system event log may provide more detail. This is a severe system-level error condition that threatens database integrity and must be corrected immediately. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.

824) SQL Server detected a logical consistency-based I/O error: incorrect checksum (expected: 0x686d8353; actual: 0x8ab05a4c). It occurred during a read of page (1:204495) in database ID 2 at offset 0x00000063d9e000 in file 'E:\Program Files\Microsoft SQL Server\MSSQL10_50.MSSQLSERVER\MSSQL\DATA\tempdb.mdf'.  Additional messages in the SQL Server error log or system event log may provide more detail. This is a severe error condition that threatens database integrity and must be corrected immediately. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.


We restarted the SQL Server service so that tempdb would be recreated.  We also ran DBCC CHECKDB on all the databases and found no issues.

We have also used SQLIOSim to stress test the database server.  The stress test gives an error of:
Error: 0x80070467
Error Text: While accessing the hard disk, a disk operation failed even after retries.
Description: Buffer validation failed on C:\sqliosim.mdx Page: 398593, offset 0x8

We also get the following warnings
Error: 0x00000000
Error Text:
Description: 296 IO requests are outstanding for more than 15 sec.

Our system administrator performed a diagnostic on both the hard drive and memory for the environments.  No issues were found.

Our question is why would this happen to only temp.mdf?  And what are the chances that other databases could start getting checksum errors?  Any suggestions on what we could test next?
0
Comment
Question by:rmuyargas
1 Comment
 
LVL 20

Accepted Solution

by:
Marten Rune earned 500 total points
ID: 39277102
Your harddisk subsystem, or your SAN disks are not healthy. This error is an indication of failing disks. If you use tempDB on it's own drive. Then you are lucky since this is the error you disclosed to us. Either way.

1. Replace the drives as fast as posssible.
2. Run consistency checks on all your user databases.
3. Make sure your backups are solid.

A disk that finda a faulty block, marks is as suspect, and stops using it. Therefore the databases with the most I/O is the most lightly to report these errors. I have never heard of a red herring error regarding Error 823 and 824

More info on:
http://support.microsoft.com/kb/2015756
https://www.simple-talk.com/sql/performance/reliable-storage-systems-for-sql-server/

You could look at SQLStress remotely. But it needs to be run as a local admin for the first time. I wouldn't run it locally.

Best regards, and good luck, Marten
0

Featured Post

Ransomware: The New Cyber Threat & How to Stop It

This infographic explains ransomware, type of malware that blocks access to your files or your systems and holds them hostage until a ransom is paid. It also examines the different types of ransomware and explains what you can do to thwart this sinister online threat.  

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Experts-Exchange users below are the steps you can follow to upgrade your Lync server to latest CU's or cumulative updates. Note: Perform it during non-production hours.   Step 1: Backup your lync and SQL server database. Follow below article: h…
I thought I'd write this up for anyone who has a request to create an anonymous whistle-blower-type submission form created using SharePoint 2010 (this would probably work the same for 2013). It's not 100% fool-proof but it's as close as you can get…
Familiarize people with the process of utilizing SQL Server functions from within Microsoft Access. Microsoft Access is a very powerful client/server development tool. One of the SQL Server objects that you can interact with from within Microsoft Ac…
Via a live example, show how to setup several different housekeeping processes for a SQL Server.

830 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question