Channel: THWACK: All Content - Server & Application Monitor
Viewing all 12281 articles

Find Applications & Components that are NOT Polling

Want to know which Applications are not polling in your environment? Try adding this to a custom table report on your summary view. The query below targets any components that have not polled in the last hour. I excluded AppInsight applications from the results for several reasons.

 

 

1. Click 'customize page' in the top right, or click the pencil in the upper left if you're on Orion Platform 2017.3 or later.

2. Add a new resource window called "Custom Table"

3. Hit submit/done to go back to the summary view.

4. Click Edit in the upper right of the custom table resource window you just added.

5. Click "New Datasource" and select "Advanced Database Query" for the selection method.

6. Click the SQL radio button and put the following query in the text field.

SELECT TOP 20
    n.Caption AS [Node Name],
    ccs.TimeStamp,
    c.Name AS [Component Name],
    a.Name AS [Application Name],
    n.NodeID,
    ccs.ApplicationID,
    ccs.ComponentID
FROM [dbo].[APM_CurrentComponentStatus] ccs
JOIN APM_Application a ON a.ID = ccs.ApplicationID
JOIN APM_Component c ON c.ID = ccs.ComponentID
JOIN APM_ApplicationTemplate at ON at.ID = a.TemplateID
JOIN NodesData n ON n.NodeID = a.NodeID
/* components whose last poll is more than one hour old */
WHERE ccs.[TimeStamp] < DATEADD(hour, -1, SYSUTCDATETIME())
/* leave out AppInsight applications */
AND at.Name NOT LIKE '%AppInsight%'
ORDER BY ccs.TimeStamp DESC

7. Click 'Update Datasource' to submit the query.

8. Click on 'Add columns' on this page and select any columns you would like to see.

9. Click 'Submit' to save changes to the resource.


Tableau Server Services and Processes (Windows)

Slack Alerts Integration Issue

Hello,

 

Props to chad.every for making these great Slack integration alerts. Everyone should definitely have a look at his scripts:

Slack Alert Integration - Overview

I am having an issue with one of the scripts, though. For some reason, the Memory module does not seem to send the data over to Slack properly.

 

It should look something like this (props to chad.every):

 

[Screenshot: 2016-05-05 14_30_05-Slack - LehiSalesEngineering.png]

 

Here is the snippet below of the settings

 

Here is the snippet of the alert settings. For some reason it looks like it is not sending the $ORIONTIME. All his other scripts work excellently; only the memory PowerShell script has this problem.

 

Here is the link to the script: https://thwack.solarwinds.com/servlet/JiveServlet/downloadBody/188402-102-4-29570/

Another pair of eyes would be appreciated

 

 

 

 

 

Path I used from your snippet

C:\windows\system32\windowspowershell\v1.0\powershell.exe -ExecutionPolicy unrestricted -command "C:\curl\ps\Slack_Alert_Trigger_Action_Node_Memory.ps1 '${N=SwisEntity;M=DisplayName}' '${N=SwisEntity;M=Stats.PercentMemoryUsed}' '${N=SwisEntity;M=MemoryUsed}' '${N=SwisEntity;M=TotalMemory}' '${N=SwisEntity;M=PercentMemoryUsedThreshold.IsLevel1State}' '${N=SwisEntity;M=PercentMemoryUsedThreshold.IsLevel2State}' '${N=Alerting;M=AlertDetailsUrl}' '${N=SwisEntity;M=DetailsUrl}' '${N=Alerting;M=AcknowledgeUrl}' '${N=Generic;M=DateTime;F=OriginalValue}'"

 

From the PowerShell script: I simply downloaded the file from the link in your original post. The only parts I modified were the token and the channel.

Best Geek Movie Ever!

Would you watch a SolarWinds reality TV show?

Before you answer, consider the reality shows out there already:

  • Duck Dynasty
  • Honey Boo-Boo
  • Storage Wars
  • Swamp People
  • Ice Road Truckers
  • Finding Bigfoot
  • Ax Men

After interacting with all of my co-workers and product users, I realized that the software development process can be...entertaining, to say the least. Trust me, if you're a geek, there are some strange characters here, especially me. I would watch religiously. Be great marketing too. We can name it something slick like, "Developers," or "Thwack Men."


At least you wouldn't lose IQ points by watching our show. (Although, I must confess that I have watched the other shows I've listed...and screamed in agony from time to time.)

(Hey, my thwack snuggie went from dream to reality!)

Greatest Invention Ever!

There are countless important inventions. Only ten are allowed here, so...I know you lot are creative, and I did only play the basics so... (I voted for cuticle remover. Why? Because the cuticle is that piece of skin that starts at the base of your fingernail and you then tear back until it reaches your elbow. Yuck!) GAME ON!

Variable Question - SAM And NPM

Hello,

 

Trying to use the following variable, from a custom configuration file.

 

<OID OID="1.3.6.1.4.1.11307.10.1006" MIB="SNMPv2-SMI" Name="enterprises.11307.10.1006" Value="Application.Node.Caption: ${N=SwisEntity;M=Application.Node.Caption} Node.Caption: ${N=SwisEntity;M=Node.Caption} Caption: ${N=SwisEntity;M=Caption} NodeName: ${NodeName}" DataType="64" ValueName="Application.Node.Caption: ${N=SwisEntity;M=Application.Node.Caption} Node.Caption: ${N=SwisEntity;M=Node.Caption} Caption: ${N=SwisEntity;M=Caption} NodeName: ${NodeName}" HexValue=""></OID>

 

My issue is this: when an alert is generated and that information gets passed to our CA Spectrum system, anything for the Application Monitor will not populate the node name. Node alerts, however, pass the Node variable over with no issues. In the first case (application alerts), our CA Spectrum system does not seem to recognize anything with APM.Application as critical, and an SDR (ticket) does not get generated. Is there something in the variable information causing this? Is it possible that we are passing too much information for the system to understand? Or are those variables actually wrong for the S.A.M. side? If so, where do we find the correct list of variables for application monitors?

 

Is it possible to run two different template files, one for NPM-related events and one for SAM-related events? Or, because some SAM events are node related, could that cause a conflict in the information passed through the SNMP trap file?

 

I have attached two pictures, as examples of what (as a team) we are seeing for the alert information.

 

If anyone has any insight into this issue and can provide some feedback, that would be greatly appreciated.

 

Connection timeout. A timeout occurred during execution which resulted in the job being canceled. HELP :(

Hi,

 

Has anyone come across this error on application templates before?

 

Connection timeout. A timeout occurred during execution which resulted in the job being canceled.

 

We have 8 polling engines and this is only happening on one of them. Some templates work fine on this polling engine as well; however, a few were green yesterday and have now gone to unknown and are displaying this message.

 

I keep tabs on all unknown templates, and we have very few that are actually unknown, so I don't think it's a stress issue. Plus, SAM polling is at less than 6%.

 

I have seen similar known issues, but none that seem to display this exact error message.

 

The node is up and I have tested the credentials which all come back as working.

 

 


Monitor if a directory folder is accessible or not???? Help Please!

Hello fellow Thwack Heads.

 

 

I am hoping someone could help me here. I have been tasked with seeing if there is a way to monitor if a directory folder is accessible or not.

 

 

I have looked over Thwack for suggestions and have found some but nothing that is clean and will give me what I actually need. I am hoping someone has an idea or a way of accomplishing what I'm looking to do.

 

 

 

We own basically all SW products with the exception of LEM, but I do not think an event log entry is created when a directory becomes inaccessible. Maybe I'm wrong on that.

 

 

 

To clarify what I'm trying to do: I want to monitor, hopefully with S.A.M. or another SW product, whether a specific folder that an application resides in, and that Citrix pulls the application from, is accessible. I have already set up monitors for whether the services and processes are running, but they would also like to see whether the folder can be accessed. T:\APPS\RAIVLiq\rightangle.exe

 

I would then do this for other directories as well.

 

 

 

Any thoughts or ideas on how to accomplish this would be greatly appreciated.
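
One way to picture the check being asked for is a small script that simply tries to list the folder and reports the result. The sketch below is only an illustration, not a SolarWinds-provided monitor: the function names are hypothetical, and the "Statistic:" / "Message:" output lines are an assumption about what a SAM script monitor expects, so verify them against the documentation for your SAM version.

```python
import os


def folder_accessible(path):
    """Return True if path is a directory whose contents can be listed."""
    try:
        os.listdir(path)  # fails if the path is missing, not a folder, or denied
        return True
    except OSError:
        return False


def report(path):
    """Build the two output lines (assumed script-monitor format)."""
    ok = folder_accessible(path)
    state = "accessible" if ok else "NOT accessible"
    return "Statistic: {}\nMessage: {} is {}".format(1 if ok else 0, path, state)


if __name__ == "__main__":
    # Folder taken from the post; replace with the directory you need to watch.
    print(report(r"T:\APPS\RAIVLiq"))
```

The check has to run under an account, and on a machine, that sees the drive the same way Citrix does; otherwise a mapped drive such as T: may look inaccessible even when it is fine for users.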

Network Discovery and Application Monitor Question

Hello All,

 

I have an issue that I was hoping to get some assistance with.  I was able to set up my network discovery.  I scheduled the discovery to automatically import the discovered nodes and automatically set up monitoring.  The job worked fine with one exception.  In the Custom Properties section (when you click on "edit node" of a specific server), there is a field labeled "Environment".  We have the following 3 environments set up in SAM:

 

KADC

KSDC

KLDC

 

Our server names are structured in such a way, that they will be added to one of these 3 environments.  For example, server name hostAVname will be added to the KADC environment due to the letters AV in the server name.  hostSVname will be added to the KSDC environment due to the letters SV in the server name.  hostLVname will be added to the KLDC environment due to the letters LV in the server name.

 

What happens when I run the discovery is that all of the servers are placed in an environment named "unknown" and I have to manually place each server in the correct environment.  Is there a way in SAM, via the Network Discovery and subsequent adding of the monitor for the applications, that each server can be automatically added to the appropriate environment based on the server name?
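
The naming rule described above can be written down as a small lookup. This is a minimal sketch of the mapping logic only; the function and dictionary names are hypothetical, and writing the result into the "Environment" custom property would still need to be done separately (for example through the Orion SDK, which is an assumption here, not something covered in the post).

```python
# Letter codes from the post mapped to their environments.
ENVIRONMENT_BY_CODE = {"AV": "KADC", "SV": "KSDC", "LV": "KLDC"}


def environment_for(server_name, default="unknown"):
    """Pick the environment whose letter code appears in the server name.

    Matching is case-sensitive, as in the examples (hostAVname, hostSVname,
    hostLVname). Names with no code, or with more than one, get the default.
    """
    matches = {env for code, env in ENVIRONMENT_BY_CODE.items()
               if code in server_name}
    return matches.pop() if len(matches) == 1 else default


if __name__ == "__main__":
    for name in ("hostAVname", "hostSVname", "hostLVname", "hostname"):
        print(name, "->", environment_for(name))
```

A plain substring match is only as reliable as the naming convention. If the code always sits at a fixed position in the name, matching on that position would be stricter.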

Alert cleanup - node down and high packet loss

We are trying to work on alert clean up in the system and set our alerts so that all configured alerts will integrate with WHD and create a ticket. 

 

I'm currently having issues with the node down alerts and the high packet loss alerts.  From reading online, the system determines a node's status by pinging it every 120 seconds (we left the default here).  Once the first ping is missed, the node status is set to warning and the system goes into "fast ping", where it pings the node every 10 seconds for 120 seconds.  If all pings are missed, it sets the status to down.

 

How does Node Down Work?

 

Now for % packet loss: this is calculated by looking at the last 10 pings in memory and, from what I have read, includes the "fast ping" responses.

 

How is % Packet Loss calculated?

 

 

This means that we will get an alert triggered on % packet loss just before the node is considered down.  Now I have two separate tickets for what really should be just a down node.  I am trying to figure out what I can do to fix this logic and still get notified of issues as soon as possible.  Our site uses NPM and SAM, so any recommendations on this would be appreciated.
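
The timing described above can be checked with a rough simulation. It is a sketch built only on the numbers in this post (10-second fast pings for 120 seconds, packet loss over the last 10 pings), assumes every ping before the outage succeeded, and ignores how often alerts are actually evaluated. The function name and the 50% threshold are hypothetical.

```python
from collections import deque


def simulate_outage(loss_threshold=50, window=10,
                    fast_interval=10, fast_duration=120):
    """Return (seconds until loss threshold is crossed, seconds until down).

    Time 0 is the first missed regular ping; fast pings follow from there.
    """
    recent = deque([True] * window, maxlen=window)  # healthy ping history
    loss_alert_at = None
    for t in range(0, fast_duration + 1, fast_interval):
        recent.append(False)  # every ping during the outage is missed
        loss = 100 * recent.count(False) // window
        if loss_alert_at is None and loss >= loss_threshold:
            loss_alert_at = t
    return loss_alert_at, fast_duration


if __name__ == "__main__":
    print(simulate_outage())                    # 50% loss threshold
    print(simulate_outage(loss_threshold=100))  # 100% loss threshold
```

Under these assumptions even a 100% packet loss threshold is crossed before the node is marked down, which matches the duplicate tickets described above. That suggests it may be worth testing a delay on the packet loss alert that is longer than the fast-ping window, rather than only raising the threshold.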

Network Sonar Discovery - Exclude Devices

We have multiple locations and configure each location to be on a weekly discovery schedule. Every job is configured to scan a dedicated subnet that we have defined for non-client machines. Our smaller sites use that same range for other devices like UPSs, network switches, etc. The issue we are having is that the auto-discovery scans everything within that subnet. SolarWinds tries to scan devices that do not have SNMP set up, and because of access limitations those devices send out emails for unauthorized access every time the scan runs. We added those devices to the Ignore list, but it looks like all that does is stop the devices from showing up in the import list. In some cases we are able to set up SNMP credentials, but there are cases where it is not possible.

 

Is there a way for SolarWinds to not scan IPs that we have defined during discovery?
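
One possible workaround, assuming the discovery job can be given several IP ranges instead of one whole subnet, is to split the subnet into ranges that step around the addresses to be skipped. The sketch below only computes those ranges; the function name and the example addresses are hypothetical, and it does not talk to SolarWinds at all.

```python
import ipaddress


def ranges_excluding(subnet, excluded):
    """Return (first, last) address ranges covering the subnet's hosts
    while leaving out every address listed in excluded."""
    skip = {ipaddress.ip_address(ip) for ip in excluded}
    ranges = []
    start = prev = None
    for host in ipaddress.ip_network(subnet).hosts():
        if host in skip:
            if start is not None:  # close the range just before the skipped IP
                ranges.append((str(start), str(prev)))
                start = None
            continue
        if start is None:
            start = host
        prev = host
    if start is not None:  # close the final open range
        ranges.append((str(start), str(prev)))
    return ranges


if __name__ == "__main__":
    # Example subnet and excluded device address are made up for illustration.
    for first, last in ranges_excluding("10.0.0.0/29", ["10.0.0.3"]):
        print(first, "-", last)
```

This gets unwieldy when many addresses are scattered across a subnet, so it fits best when the devices to skip are few or sit next to each other.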

Server Clock Drift (PowerShell)

adding additional tabs for apps

Hi Solarwinds

 

I want to ask if it's possible to add additional tabs for applications to the SAM summary view bar. Kindly see the attached image for a detailed view.

 

thank you

Hardware Health monitor issue

Hi

I have a problem with hardware health monitor.

My vCenter is added to my NPM 11.5 and SAM 6.2.

One of my ESXi server's HDDs failed. I received an error and changed the HDD.

But the problem started after that. The error is gone from the server iLO and the vCenter health monitor, but in SolarWinds the HDD sensor still shows as failed.

Disabling and enabling the sensor from SAM was useless.

I even removed and re-added the specific server.

There is no error log even in the server iLO.

So what should I do?


Network Discovery....

Hi everybody. In looking at the various ways to add nodes to SAM for monitoring, I was wondering if there is a way to not just import the node but also bring along some detailed information (i.e., contacts, location, etc.) and populate the corresponding fields in the custom property section. Hopefully I made sense. Thanks for any response.

Can we schedule a monitoring component to run twice and compare the results?

Hi all,

 

I want to run a monitoring component (SQL Server User Experience Monitor) twice.

The result will be a number and I want to alert when the result of second run is different than the first run.

Is there a way to do this from SAM?

I can schedule the monitor to run every 10 minutes but can't think of a way to save and compare the result of two runs.

 

Thanks in advance!
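
The "save and compare" part can be sketched outside of SAM as a script that remembers the last value in a small state file. This is only an illustration of the idea under that assumption: the function name and the state-file approach are hypothetical, and how the value is obtained (the SQL query itself) is left out.

```python
import json
import os


def changed_since_last_run(value, state_file):
    """Store value in state_file and report whether it differs from the
    value stored by the previous run. The first run never counts as changed."""
    previous = None
    if os.path.exists(state_file):
        with open(state_file) as fh:
            previous = json.load(fh)["value"]
    with open(state_file, "w") as fh:
        json.dump({"value": value}, fh)
    return previous is not None and previous != value


if __name__ == "__main__":
    # 42 stands in for the number returned by the current run of the query.
    print(changed_since_last_run(42, "last_result.json"))
```

Run on the same 10-minute schedule, a script like this could report 1 when the value changed and 0 otherwise, and the alert would then trigger on that number.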

jobengineworker2 High CPU utilization

I have high CPU utilization on my additional polling engines (even the primary at times), with jobengineworker2 processes being the top contributor(s). In SolarWinds.APM.Probes_[PID].log, I see the entries below fill up:

 

2018-03-27 08:34:49,574 [317] [A(null)] [C(null)] WARN  SolarWinds.APM.Common.Diagnostics.PerformanceCounters - Initialization of performance counters SolarWinds: SAM Jobs(_Total) failed

System.InvalidOperationException: The requested Performance Counter is not a custom counter, it has to be initialized as ReadOnly.

   at System.Diagnostics.PerformanceCounter.InitializeImpl()

   at System.Diagnostics.PerformanceCounter.get_RawValue()

   at SolarWinds.APM.Common.Diagnostics.PerformanceCounters.GetPerformanceCounter(String category, String counterName, String instanceName, PerformanceCounterInstanceLifetime instanceLifetime)

   at SolarWinds.APM.Common.Diagnostics.PerformanceCounters.CreateInstancePerformanceCounters(String category, CounterCreationDataCollection counterDefs, String instanceName, PerformanceCounterInstanceLifetime lifeTime)

   at SolarWinds.APM.Common.Diagnostics.PerformanceCounters.InitializeCounters(Object state)

 

2018-03-26 16:31:31,040 [206] [A(null)] [C(null)] WARN  SolarWinds.APM.Common.Diagnostics.PerformanceCounters - Error creating counter

System.InvalidOperationException: The requested Performance Counter is not a custom counter, it has to be initialized as ReadOnly.

   at System.Diagnostics.PerformanceCounter.InitializeImpl()

   at System.Diagnostics.PerformanceCounter.get_RawValue()

   at SolarWinds.APM.Common.Diagnostics.PerformanceCounters.GetPerformanceCounter(String category, String counterName, String instanceName, PerformanceCounterInstanceLifetime instanceLifetime)

 

I have rebuilt the performance counters on the affected polling engines per Performance counters are not working or missing in PerfMon - SolarWinds Worldwide, LLC. Help and Support, but that did not help. I do not have many unknowns in my environment either, if that is a suspicion.

 

If anyone else has seen this too, please help.

SQL Job Threshold Alert

I am trying to set up a SQL Job Threshold Alert. We have a job in SQL that gets stuck in a loop, and we need to get notified if it is running longer than normal.

Attached is my setup for the alert. I added the SQL Server and the SQL job to monitor. For the next step, I am not 100% sure where exactly you set the threshold alert.

 

I used the condition must exist for 25 minutes but I am not getting any alerts.

 

Thank you

Bind (Linux - v9.6 and higher) Update? or alt monitoring package?

Hello all -

 

I just wanted to ask whether the Bind package has been updated since 2012, and/or whether there is an alternative to this package that may be a later release. I don't know much about the Linux world, only that some of them are failing under this application monitor.

 

Thanks
