Avatar of marrowyung
marrowyung
 asked on

monitoring configuration for SQL server DB

hi,

right now we are reviewing SQL monitor parameters, we have this but please share what you think the correct value you guys use, we are receiving more and more alert on

1) Page Splits / Sec   >2000  will have warning.  > 10000 will have Critical message.
2) Index searches /sec  >1000 will have warning. >2000 will have critical message.
3) SQL Compilations/Sec >100 will have warning ,>300 will have critical message
4) Page Faults/sec,  >800 will have warning.  >1000 will have critical message.

for monitoring parameters like that, any suggested better value ?
SQLMicrosoft SQL ServerMicrosoft SQL Server 2008

Avatar of undefined
Last Comment
marrowyung

8/22/2022 - Mon
Vitor Montalvão

Did you check Microsoft's recommendations for those counters?
If not, you can check them here.
marrowyung

ASKER
hi,

do you know if there are part 1 ? I assume part 1 is also only for SQL server?
ASKER CERTIFIED SOLUTION
Vitor Montalvão

THIS SOLUTION ONLY AVAILABLE TO MEMBERS.
View this solution by signing up for a free trial.
Members can start a 7-Day free trial and enjoy unlimited access to the platform.
See Pricing Options
Start Free Trial
GET A PERSONALIZED SOLUTION
Ask your own question & get feedback from real experts
Find out why thousands trust the EE community with their toughest problems.
marrowyung

ASKER
oh, that one seems more on server side, right?
Experts Exchange is like having an extremely knowledgeable team sitting and waiting for your call. Couldn't do my job half as well as I do without it!
James Murphy
Vitor Montalvão

SCOM monitors everything from Microsoft so it will always be my top choice for monitor SQL Server instances.
BCM Patrol is quite well known in very large companies as it can monitor almost everything, from Windows to Unix, SQL Server to Oracle, storage, network, etc.
marrowyung

ASKER
hi,

I keep seeing this kind of meansurment:

These should be below the 20% of Batch Requests/sec.

usually monitoring tools can't take this, any link with specific value ?

the second link shown this:

 Available MB must not drop below 100 MB, otherwise you will see severe performance issues due to Windows memory pressure.

– The average of %Processor Time must be less than 70%, otherwise you have a bottleneck at the CPU.

– These two counters need to be constantly under 20 msec to have good disk performance, otherwise you possibly have a bottleneck at the disk subsystem.

– %Usage must be less than 30%, otherwise you will see performance issues due to paging. Install more memory to mitigate this issue.

then we can see a figure which can be setup in any of monitoring tools in the value column.
Vitor Montalvão

Monitoring tools doesn't make any kind of calculation. It only gives you the thresholds.
What the above statement says is that counter is dependent of another counter and this is not so unusual. Some counters work on pairs.
⚡ FREE TRIAL OFFER
Try out a week of full access for free.
Find out why thousands trust the EE community with their toughest problems.
marrowyung

ASKER
"What the above statement says is that counter is dependent of another counter and this is not so unusual. Some counters work on pairs."

yes I knew, but it is hard to put it to monitoring tools to check it, right?
Vitor Montalvão

It may be hard, yes but I can't confirm that. Like I said, I'm used to work with the monitoring team and I only give them my requirements and they assure that can be done. I think it depends on the monitoring tool.
To be honest we don't monitor those counters. We use them for troubleshooting only and for that we don't need any especial monitoring tool since we can query them from the SQL Server instance or even use the local Performance Monitor tool.
marrowyung

ASKER
hi,

I am sorry that, I can see that at the end of that article,  I see this:


" Next week I will discuss how to collect useful data with aProfiler trace!"

ahhaa, I want to read  that too, can you suggest the page?
I started with Experts Exchange in 2004 and it's been a mainstay of my professional computing life since. It helped me launch a career as a programmer / Oracle data analyst
William Peck
Vitor Montalvão

I want to read  that too, can you suggest the page?
Well, that was what the author wrote in the end of the Part 1 and I've sent you first the Part 2, so is the article that he mentioned, meaning that you already read it.
marrowyung

ASKER
hi,

that one:

https://blogs.msdn.microsoft.com/john_daskalakis/2013/10/21/how-to-troubleshoot-sql-server-performance-issues-with-simple-tools-part-2-how-to-analyze-the-perfmon-trace-and-detect-sql-server-performance-issues/

 is part 2.

and it said at the end of that link.

sorry I think it is  that one:

https://blogs.msdn.microsoft.com/john_daskalakis/2013/10/30/how-to-troubleshoot-sql-server-performance-issues-with-simple-tools-part-3-the-profiler/

I search by typing this: 'How to troubleshoot SQL Server performance issues with simple tools (Part 3'

I am sorry for that man..

this might be the automated  checking tools,.
Vitor Montalvão

The Part 3 isn't helpful also but give it a read.
⚡ FREE TRIAL OFFER
Try out a week of full access for free.
Find out why thousands trust the EE community with their toughest problems.
marrowyung

ASKER
you mean the use of SQL profiler can't do much to detect problem ?
Vitor Montalvão

No. The Part 3 of the article. It doesn't have more performance counters for you.
marrowyung

ASKER
yes, tks.

I will add one more post to see any counter to detect index issue, please comment.
This is the best money I have ever spent. I cannot not tell you how many times these folks have saved my bacon. I learn so much from the contributors.
rwheeler23
marrowyung

ASKER
hi,


for SQL server database mirroring and replication monitoring, any parameter and counter you can suggest with suggested value ?
Vitor Montalvão

As far as I know there's no counters for that.
marrowyung

ASKER
hi,

someone might suggest:

SQLServer: Replication Dist      Dist:Delivery Latency
SQLServer: Database Mirroring      Bytes Received/Sec
                                                                     Bytes Sent/Sec
⚡ FREE TRIAL OFFER
Try out a week of full access for free.
Find out why thousands trust the EE community with their toughest problems.
Vitor Montalvão

Dist:Delivery Latency
That's the latency between Publisher and Subscriber. It can be interesting to have this information.

Bytes Received/Sec
Bytes Sent/Sec
Those are only information about how many bytes was sent and received. I don't think you can use those values to monitor the state of a mirroring.
marrowyung

ASKER
"That's the latency between Publisher and Subscriber"

any value can be suggested for the threshold ?
Vitor Montalvão

I guess it depends on the application requirements.
That value is in seconds so I would say that everything that is higher than 1 second will be bad but that's me that thinks that everything in a database need to perform below 1 second :)
If you have a slow network maybe start with a 2 seconds thresholds and see how many alerts it raises. If none during a long period (1 week, 1 month) then you can reduce it to 1 second.
Experts Exchange has (a) saved my job multiple times, (b) saved me hours, days, and even weeks of work, and often (c) makes me look like a superhero! This place is MAGIC!
Walt Forbes
marrowyung

ASKER
"I guess it depends on the application requirements. "

I also think that replication issue more related to RAM and tempdb contention as per subscription can take on the distributor/publisher 32GB of RAM and all data received in tempdb before merge to the target user database.

agree on that?
Vitor Montalvão

Not really. For me, replication has more to do with network than RAM.
marrowyung

ASKER
I tried one time that in my previous company , they chagned to CDN network and they thought it helps, finally it won't help at all.

previous we have publisher and distributor on the same box and whenever we add RAM, number of replication agent long running alert lesser.

if we add one more replication, these alert come again.
⚡ FREE TRIAL OFFER
Try out a week of full access for free.
Find out why thousands trust the EE community with their toughest problems.
Vitor Montalvão

previous we have publisher and distributor on the same box and whenever we add RAM, number of replication agent long running alert lesser.
If they are in the same box of course they will never use network and as they share the same memory they can use it easily. You can't compare things just like that because they've different configurations.
marrowyung

ASKER
I mean when adding replication all these resource contention rise.  so it did need more RAM. and tempdb contention will be show up in rpelication log.
Vitor Montalvão

How about this question?
Do you still need help with it?
Your help has saved me hundreds of hours of internet surfing.
fblack61
marrowyung

ASKER
"If they are in the same box of course they will never use network and as they share the same memory they can use it easily. Y"

so therefore more RAM.

but actually if they are not on the same box, they publsiher or distributor use more RAM, I think is should be the publisher.

for monitor parameter like this:

1)Page Faults
2) Page life expectancy
3) Page writes/sec
4) Pages Per Second

has a thershold, then we should add more RAM ? for me it is a yes but I would like to hear what you conclude too.
Vitor Montalvão

so therefore more RAM.
Basically, yes.

for monitor parameter like this:

 1)Page Faults
 2) Page life expectancy
 3) Page writes/sec
 4) Pages Per Second

 has a thershold, then we should add more RAM ?
What does counters have to have with Replication?
marrowyung

ASKER
sorry but my question at the beginning is not only replication but monitoring counter.

I think we can close this ticket now.
⚡ FREE TRIAL OFFER
Try out a week of full access for free.
Find out why thousands trust the EE community with their toughest problems.
marrowyung

ASKER
tks.