NSClient++ Help (#1) - Stops sending checks (#357) - Message List

Stops sending checks

I am using 3.6 on a Vista machine. I have it configured to send checks via NSCA. It works for a while and then stops sending. The service is running. If I restart the service on the vista box it starts working again and then fails. The log shows the following:

2009-03-02 08:02:04: debug:NSClient++.cpp:736: No shared session: ignoring change event!

Thanks in advance,

David

  • Message #1098

    Is that the only thing in the log? (if so enable debug logging and post full (or more detailed) log)

    MickeM

    • Message #1100

      Thanks for your assistance. Here is what is in the log when it stops sending.

      2009-03-01 19:13:57: debug:modules\NSCAAgent\NSCAThread.cpp:174: Looked up xx.xx.xx.xx to xx.xx.xx.xx
      2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: CheckOK: Host is alive
      2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Host is alive'
      2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''
      2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4
      2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK CPU Load ok.'
      2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''20m'=0%;80;90; '10s'=0%;80;90; '4'=2%;80;90; '
      2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: CheckDriveSize: CheckAll, ShowAll=long, MinWarnUsed=10%, MinCritUsed=5%, FilterType=FIXED
      2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: C:\: Total: 149G - Used: 96.2G (64%) - Free: 52.7G (36%)'
      2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''C:\'=64%;10;5; '
      2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: CheckEventLog: filter=in, file=application, MaxWarn=1, MaxCrit=1, filter=in, filter-generated=>1h, filter.eventID==1017, filter.eventID==35
      2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Eventlog check ok'
      2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''eventlog'=0;1;1; '
      2009-03-01 19:14:02: debug:NSClient++.cpp:1017: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page
      2009-03-01 19:14:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: page file: 2.84G'
      2009-03-01 19:14:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''page file'=39%;80;90; '
      2009-03-01 19:14:02: debug:modules\NSCAAgent\NSCAThread.cpp:174: Looked up xx.xx.xx.xx to xx.xx.xx.xx
      2009-03-01 19:14:07: debug:NSClient++.cpp:736: No shared session: ignoring change event!
      
      • Message #1101

        I have uploaded a new nightly build which adds some more logging as well as catches around the NSCA thread so feel free to see if this sheds more light on the problem.

        Michael Medin

        • Message #1102

          I will try that and let you know. Thanks.

          • Message #1111

            Used the build and got the exact same message as noted above. Ran fine for about 19 hours and then stopped.

            • Message #1113

              Pulled down the 0.3.6-RC7. Process ran fine for 5 hours exactly. Then just stopped. No errors noted it just quit running. If I stop and restart the process it will continue to run fine and then quit after an undetermined amount of time.

              • Message #1114

                do you always get the same "sahred session" message in your log?

                • Message #1115

                  Prior to the RC7 version I did when it failed that would be the last entry. Before failing the "shared session" is not listed in the log. After moving to the RC7 release it just stops and requires a restart of the service. Here is the last entry in the log when it stopped last. There does not seem to be a pattern in the time frame that it stops. Also the service shows that it is running but the process does not consume and CPU time. When it is functioning properly I see it use CPU time in the task manager.

                   2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: CheckOK: Host is alive
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Host is alive'
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK CPU Load ok.'
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''20m'=5%;80;90; '10s'=6%;80;90; '4'=5%;80;90; '
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: CheckDriveSize: CheckAll, ShowAll=long, MaxWarnUsed=90%, MaxCritUsed=95%, FilterType=FIXED
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: C:\: Total: 149G - Used: 95.4G (64%) - Free: 53.5G (36%)'
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''C:\'=64%;90;95; '
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: CheckEventLog: filter=in, file=application, MaxWarn=1, MaxCrit=1, filter=in, filter-generated=>1h, filter.eventID==1017, filter.eventID==35
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'Eventlog check ok'
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''eventlog'=0;1;1; '
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1017: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=physical
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1053: Injected Result: OK 'OK: physical memory: 2.53G'
                  2009-03-04 15:00:02: debug:NSClient++.cpp:1054: Injected Performance Result: ''physical memory'=72%;80;90; '
                  2009-03-04 15:00:02: debug:modules\NSCAAgent\NSCAThread.cpp:187: Looked xx.xx.xx.xx to xx.xx.xx.xx
                  

                  Thanks for your help.

                  David

                  • Message #1118

                    Did some further investigation. Every time this stops working there is an error in event viewer:

                    TCP/IP has reached the security limit imposed on the number of concurrent TCP connect attempts. M$ as a limit on vista ultimate of 25 connection. There are methods to get around this but would rather not.

                    I have it set to send out every 5 seconds and using encryption.

                    I turned it down to every 30 seconds and will see if the error returns.

                    Regards,

                    David

  • Message #1138

    I have this same problem on multiple Win 2003 servers. I am only using passive checks on hosts, and they randomly just stop sending. I changed debug=1 on few servers, and I get this error message in log: cpp:736: No shared session: ignoring change event! , but only after I remote desktop to the server, before that nothing for like two days.

    When I restart the nsclientpp service everything starts to run normally. It can run 2 hours or 2 weeks just fine, and then stop again.

    Should I try the RC8 version?

    • Message #1147

      No help with RC8 version. At this moment I have about 20 of 30 servers not sending results.

      No errors in windows event logs or nagios server logs. nsclient++ log has last lines: 2009-04-08 16:40:37: debug:modules\NSCAAgent\NSCAThread.cpp:182: Sending to server... 2009-04-08 16:40:37: debug:modules\NSCAAgent\NSCAThread.cpp:189: Looked up 192.168.xxx.xxx to 192.168.xxx.xxx

      And when I log on to the server to see what's going on, it logged these two lines:

      2009-04-14 11:26:00: debug:NSClient++.cpp:745: No shared session: ignoring change event! 2009-04-14 11:26:01: debug:NSClient++.cpp:745: No shared session: ignoring change event!

      If I restart nsclientpp everything runs fine again for some time...

      Here is my nsc.ini:

      [modules] FileLogger.dll CheckSystem.dll CheckDisk.dll CheckHelpers.dll NSCAAgent.dll

      [Settings] use_file=1

      [log] debug=1 file=nsclient.log root_folder=exe

      [NSCA Agent] interval=300 encryption_method=3 password=xxxxxxxxx

      hostname=HOSTNAME.domain.local nsca_host=192.168.xxx.xxx nsca_port=5667

      [NSCA Commands] host_check=CheckOK uptimecheck=CheckUpTime MinWarn=8h MinCrit=1h hdcheck=CheckDriveSize ShowAll CheckAll? FilterType?=FIXED MinWarnFree?=10% MinCritFree?=5% cpucheck=checkCPU warn=80 crit=90 time=20m time=10s time=4 memorycheck=checkMem MaxWarn=90% MaxCrit=95% ShowAll type=physical runningservices=checkServiceState CheckAll? exclude=MpfService? exclude=SysmonLog?

    • Message #1142

      Should I try the RC8 version?

      I installed RC8 on about 10 servers, and I'll keep you informed

Subscriptions