[NSClient++] Topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/topic/357 <p> Stops sending checks </p> en-us NSClient++ /trac/nswide.png http://nsclient.org/nscp/discussion/topic/357 Trac 1.0beta1 - DiscussionPlugin David Mon, 02 Mar 2009 13:14:40 GMT Topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/topic/357#topic http://nsclient.org/nscp/discussion/topic/357#topic <p> I am using 3.6 on a Vista machine. I have it configured to send checks via NSCA. It works for a while and then stops sending. The service is running. If I restart the service on the vista box it starts working again and then fails. The log shows the following: </p> <p> 2009-03-02 08:02:04: debug:NSClient++.cpp:736: No shared session: ignoring change event! </p> <p> Thanks in advance, </p> <p> David </p> Topic Enrico Fri, 24 Apr 2009 15:13:32 GMT Reply #1160 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1160#message1160 http://nsclient.org/nscp/discussion/message/1160#message1160 <p> Hello, </p> <p> I have your some problem, I tryed also the rc8 but nothing to do.. I am experiment nagios in my farm, I have more then 100 windows machine and about 900 linux servers. For this reason I would like to use nsca instead nrpe. Nsca for linux servers work fine but for the windows servers, with nsclientpp (i tryed rc2 and rc8), after a while I didn't receive the passive checks anymore. For this reason today I switched to NRPE for the win servers. </p> <p> Please if you found a solution, write it here. </p> <p> Thank you -- </p> <blockquote> <p> Enrico </p> </blockquote> <p> [NSCA Agent] interval=120 encryption_method=1 password=blablabla nsca_host=nagios nsca_port=5667 hostname=ts02 </p> <p> [NSCA Commands] cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 mem_check=checkMem <a class="wiki" href="/nscp/wiki/MaxWarn">MaxWarn</a>=80% <a class="wiki" href="/nscp/wiki/MaxCrit">MaxCrit</a>=90% <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> type=page svc_check=checkServiceState <a class="missing wiki">CheckAll?</a> exclude=wampmysqld exclude=<a class="missing wiki">MpfService?</a> c_check=<a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> <a class="missing wiki">MinWarnFree?</a>=10% <a class="missing wiki">MinCritFree?</a>=5% Drive=c:\ l_check=<a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> <a class="missing wiki">MinWarnFree?</a>=10% <a class="missing wiki">MinCritFree?</a>=5% Drive=l:\ uptime=checkUpTime <a class="wiki" href="/nscp/wiki/MinWarn">MinWarn</a>=2h <a class="wiki" href="/nscp/wiki/MinCrit">MinCrit</a>=1h </p> <p> [NRPE Handlers] check_cpu=inject checkCPU warn=80 crit=90 5 10 15 check_eventlog=inject <a class="wiki" href="/nscp/wiki/CheckEventLog">CheckEventLog</a> Application warn.require.eventType=error warn.require.eventType=warning critical.require.eventType=error critical.exclude.eventType=info truncate=1024 descriptions </p> <p> check_disk_c=inject <a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> <a class="missing wiki">MinWarnFree?</a>=10% <a class="missing wiki">MinCritFree?</a>=5% Drive=c:\ check_disk_d=inject <a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> <a class="missing wiki">MinWarnFree?</a>=10% <a class="missing wiki">MinCritFree?</a>=5% Drive=d:\ check_disk_l=inject <a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> <a class="missing wiki">MinWarnFree?</a>=10% <a class="missing wiki">MinCritFree?</a>=5% Drive=l:\ check_services=inject checkServiceState <a class="missing wiki">CheckAll?</a> exclude=wampmysqld exclude=<a class="missing wiki">MpfService?</a> </p> <p> check_drivesize=inject <a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/MinWarn">MinWarn</a>=25% <a class="wiki" href="/nscp/wiki/MinCrit">MinCrit</a>=15% <a class="missing wiki">CheckAll?</a> <a class="missing wiki">FilterType?</a>=FIXED </p> <p> check_pagefile=inject checkMem <a class="wiki" href="/nscp/wiki/MaxWarn">MaxWarn</a>=80% <a class="wiki" href="/nscp/wiki/MaxCrit">MaxCrit</a>=90% <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> type=page check_mem=inject checkMem <a class="wiki" href="/nscp/wiki/MaxWarn">MaxWarn</a>=80% <a class="wiki" href="/nscp/wiki/MaxCrit">MaxCrit</a>=90% <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> check_uptime=inject checkUpTime <a class="wiki" href="/nscp/wiki/MinWarn">MinWarn</a>=2h <a class="wiki" href="/nscp/wiki/MinCrit">MinCrit</a>=1h </p> Message Mikko Tue, 14 Apr 2009 08:46:11 GMT Reply #1147 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1147#message1147 http://nsclient.org/nscp/discussion/message/1147#message1147 <p> No help with RC8 version. At this moment I have about 20 of 30 servers not sending results. </p> <p> No errors in windows event logs or nagios server logs. nsclient++ log has last lines: 2009-04-08 16:40:37: debug:modules\NSCAAgent\NSCAThread.cpp:182: Sending to server... 2009-04-08 16:40:37: debug:modules\NSCAAgent\NSCAThread.cpp:189: Looked up 192.168.xxx.xxx to 192.168.xxx.xxx </p> <p> And when I log on to the server to see what's going on, it logged these two lines: </p> <p> 2009-04-14 11:26:00: debug:NSClient++.cpp:745: No shared session: ignoring change event! 2009-04-14 11:26:01: debug:NSClient++.cpp:745: No shared session: ignoring change event! </p> <p> If I restart nsclientpp everything runs fine again for some time... </p> <p> Here is my nsc.ini: </p> <p> [modules] <a class="wiki" href="/nscp/wiki/FileLogger">FileLogger</a>.dll <a class="wiki" href="/nscp/wiki/CheckSystem">CheckSystem</a>.dll <a class="wiki" href="/nscp/wiki/CheckDisk">CheckDisk</a>.dll <a class="wiki" href="/nscp/wiki/CheckHelpers">CheckHelpers</a>.dll NSCAAgent.dll </p> <p> [Settings] use_file=1 </p> <p> [log] debug=1 file=nsclient.log root_folder=exe </p> <p> [NSCA Agent] interval=300 encryption_method=3 password=xxxxxxxxx </p> <p> hostname=HOSTNAME.domain.local nsca_host=192.168.xxx.xxx nsca_port=5667 </p> <p> [NSCA Commands] host_check=CheckOK uptimecheck=<a class="wiki" href="/nscp/wiki/CheckUpTime">CheckUpTime</a> <a class="wiki" href="/nscp/wiki/MinWarn">MinWarn</a>=8h <a class="wiki" href="/nscp/wiki/MinCrit">MinCrit</a>=1h hdcheck=<a class="wiki" href="/nscp/wiki/CheckDriveSize">CheckDriveSize</a> <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> <a class="missing wiki">CheckAll?</a> <a class="missing wiki">FilterType?</a>=FIXED <a class="missing wiki">MinWarnFree?</a>=10% <a class="missing wiki">MinCritFree?</a>=5% cpucheck=checkCPU warn=80 crit=90 time=20m time=10s time=4 memorycheck=checkMem <a class="wiki" href="/nscp/wiki/MaxWarn">MaxWarn</a>=90% <a class="wiki" href="/nscp/wiki/MaxCrit">MaxCrit</a>=95% <a class="wiki" href="/nscp/wiki/ShowAll">ShowAll</a> type=physical runningservices=checkServiceState <a class="missing wiki">CheckAll?</a> exclude=<a class="missing wiki">MpfService?</a> exclude=<a class="missing wiki">SysmonLog?</a> </p> Message Mikko Tue, 07 Apr 2009 11:23:23 GMT Reply #1142 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1142#message1142 http://nsclient.org/nscp/discussion/message/1142#message1142 <blockquote class="citation"> <p> Should I try the RC8 version? </p> </blockquote> <p> I installed RC8 on about 10 servers, and I'll keep you informed </p> Message Mikko Mon, 06 Apr 2009 07:07:10 GMT Reply #1138 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1138#message1138 http://nsclient.org/nscp/discussion/message/1138#message1138 <p> I have this same problem on multiple Win 2003 servers. I am only using passive checks on hosts, and they randomly just stop sending. I changed debug=1 on few servers, and I get this error message in log: cpp:736: No shared session: ignoring change event! , but only after I remote desktop to the server, before that nothing for like two days. </p> <p> When I restart the nsclientpp service everything starts to run normally. It can run 2 hours or 2 weeks just fine, and then stop again. </p> <p> Should I try the RC8 version? </p> Message David Fri, 06 Mar 2009 00:31:51 GMT Reply #1118 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1118#message1118 http://nsclient.org/nscp/discussion/message/1118#message1118 <p> Did some further investigation. Every time this stops working there is an error in event viewer: </p> <p> TCP/IP has reached the security limit imposed on the number of concurrent TCP connect attempts. M$ as a limit on vista ultimate of 25 connection. There are methods to get around this but would rather not. </p> <p> I have it set to send out every 5 seconds and using encryption. </p> <p> I turned it down to every 30 seconds and will see if the error returns. </p> <p> Regards, </p> <p> David </p> Message David Thu, 05 Mar 2009 13:00:59 GMT Reply #1115 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1115#message1115 http://nsclient.org/nscp/discussion/message/1115#message1115 <p> Prior to the RC7 version I did when it failed that would be the last entry. Before failing the "shared session" is not listed in the log. After moving to the RC7 release it just stops and requires a restart of the service. Here is the last entry in the log when it stopped last. There does not seem to be a pattern in the time frame that it stops. Also the service shows that it is running but the process does not consume and CPU time. When it is functioning properly I see it use CPU time in the task manager. </p> <pre class="wiki"> 2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: CheckOK: Host is alive 2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Host is alive' 2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: '' 2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK CPU Load ok.' 2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''20m'=5%;80;90; '10s'=6%;80;90; '4'=5%;80;90; ' 2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: CheckDriveSize: CheckAll, ShowAll=long, MaxWarnUsed=90%, MaxCritUsed=95%, FilterType=FIXED 2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: C:\: Total: 149G - Used: 95.4G (64%) - Free: 53.5G (36%)' 2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''C:\'=64%;90;95; ' 2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: CheckEventLog: filter=in, file=application, MaxWarn=1, MaxCrit=1, filter=in, filter-generated=&gt;1h, filter.eventID==1017, filter.eventID==35 2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Eventlog check ok' 2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''eventlog'=0;1;1; ' 2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=physical 2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: physical memory: 2.53G' 2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''physical memory'=72%;80;90; ' 2009-03-04 15:00:02: debug:modules\NSCAAgent\NSCAThread.cpp:187: Looked xx.xx.xx.xx to xx.xx.xx.xx </pre><p> Thanks for your help. </p> <p> David </p> Message mickem Thu, 05 Mar 2009 05:39:29 GMT Reply #1114 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1114#message1114 http://nsclient.org/nscp/discussion/message/1114#message1114 <p> do you always get the same "sahred session" message in your log? </p> Message David Wed, 04 Mar 2009 21:16:00 GMT Reply #1113 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1113#message1113 http://nsclient.org/nscp/discussion/message/1113#message1113 <p> Pulled down the 0.3.6-RC7. Process ran fine for 5 hours exactly. Then just stopped. No errors noted it just quit running. If I stop and restart the process it will continue to run fine and then quit after an undetermined amount of time. </p> Message David Wed, 04 Mar 2009 14:10:34 GMT Reply #1111 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1111#message1111 http://nsclient.org/nscp/discussion/message/1111#message1111 <p> Used the build and got the exact same message as noted above. Ran fine for about 19 hours and then stopped. </p> Message David Mon, 02 Mar 2009 21:11:03 GMT Reply #1102 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1102#message1102 http://nsclient.org/nscp/discussion/message/1102#message1102 <p> I will try that and let you know. Thanks. </p> Message mickem Mon, 02 Mar 2009 19:36:01 GMT Reply #1101 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1101#message1101 http://nsclient.org/nscp/discussion/message/1101#message1101 <p> I have uploaded a new nightly build which adds some more logging as well as catches around the NSCA thread so feel free to see if this sheds more light on the problem. </p> <p> <em> Michael Medin </em></p> Message David Mon, 02 Mar 2009 18:16:30 GMT Reply #1100 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1100#message1100 http://nsclient.org/nscp/discussion/message/1100#message1100 <p> Thanks for your assistance. Here is what is in the log when it stops sending. </p> <pre class="wiki">2009-03-01 19:13:57: debug:modules\NSCAAgent\NSCAThread.cpp:174: Looked up xx.xx.xx.xx to xx.xx.xx.xx 2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: CheckOK: Host is alive 2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Host is alive' 2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: '' 2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK CPU Load ok.' 2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''20m'=0%;80;90; '10s'=0%;80;90; '4'=2%;80;90; ' 2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: CheckDriveSize: CheckAll, ShowAll=long, MinWarnUsed=10%, MinCritUsed=5%, FilterType=FIXED 2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: C:\: Total: 149G - Used: 96.2G (64%) - Free: 52.7G (36%)' 2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''C:\'=64%;10;5; ' 2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: CheckEventLog: filter=in, file=application, MaxWarn=1, MaxCrit=1, filter=in, filter-generated=&gt;1h, filter.eventID==1017, filter.eventID==35 2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Eventlog check ok' 2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''eventlog'=0;1;1; ' 2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: page file: 2.84G' 2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''page file'=39%;80;90; ' 2009-03-01 19:14:02: debug:modules\NSCAAgent\NSCAThread.cpp:174: Looked up xx.xx.xx.xx to xx.xx.xx.xx 2009-03-01 19:14:07: debug:NSClient++.cpp:736: No shared session: ignoring change event! </pre> Message mickem Mon, 02 Mar 2009 17:04:40 GMT Reply #1098 to topic #357 - Stops sending checks http://nsclient.org/nscp/discussion/message/1098#message1098 http://nsclient.org/nscp/discussion/message/1098#message1098 <p> Is that the only thing in the log? (if so enable debug logging and post full (or more detailed) log) </p> <p> <em> MickeM </em></p> Message