As said earlier, the log you posted has no logging of ZWave messages. There are only logs from a couple of classes (no command classes, no ZWave messages).
Maybe there’s another exception somewhere that is bumping out the receive thread. This could then cause the receive buffer overflow and also the lack of logging. It would be well before the log full exception though.
Ok, so maybe the lack of messages is something else. As I suggested, I wonder if there’s another exception somewhere that is causing the receive thread to exit. Another suggestion is that something bad is happening earlier so that the binding stops processing received data, and then at some point later, you get the queue full.
I can’t tie up the exceptions with errors in the logs as the different logs don’t tie up for me.
The error you show above at 09:00:15 is this one -:
This is on the 6 May. If I look in the OH2 log, there is no error at this time on the 6th May. There are 3 exceptions in the OH2 log -:
2017-04-17 05:43:28.805 [ERROR] [WaveSerialHandler$ZWaveReceiveThread] - Exception during ZWave thread.
2017-04-29 02:02:28.219 [ERROR] [WaveSerialHandler$ZWaveReceiveThread] - Exception during ZWave thread.
2017-05-04 06:36:31.001 [ERROR] [WaveSerialHandler$ZWaveReceiveThread] - Exception during ZWave thread.
None of these tie up with either of the logs that you show in note1 or note2 (I’m assuming that’s what you are trying to say when you said it ties up with “this” - “this” being the exceptions?).
Is that what you are trying to show with the log images above?
Let me start from fresh and collect new data. But before that I want to make sure that I have the correct setting to collect DEBUG information. Can you take a look at the post above and tell me
should I run oh2 from the command line with start_debug, or is it ok to run oh2 as a service with the logging settings that I mentioned?
should I collect debug info from zwave binding only or from other bindings / modules too (and if yes, what is the easiest way to turn DEBUG for everything)?
should I change the ConverssionPattern format in the file org.ops4j.pax.logging.cfg, so that it is easier to see the DEBUG messages?
The debug settings you posted looked ok, so that’s why I think the lack of debug information must be due to an exception (ie stopping the logging, rather than the logging not being enabled).
I don’t use start_debug normally - this enables something else that is not related to logging (it provides a debug interface for Eclipse). I’m not sure how you have moved all ZWave logging into another file - I guess you’ve edited the logback.xml file - I don’t tend to do that - I tend to just log everything into a single file. That way everything is roughly in chronological order and we’re not trying to tie up dates from different files and getting into the confusion we have above.
I wouldn’t however enable debug for “everything” - you’ll end up swamping the system with all sorts of stuff from libraries like Jetty. Just enable debug for ZWave, and any other bindings you want - if it’s a reasonable amount, then in my opinion it’s best to have everything together so we see any other strange things happening in the system that might be linked.
Ok, the redirection to different file comes from the file org.ops4j.pax.logging.cfg (see above). I will unite openhab.log, events.log, zwave.log into a single file.
Hi chris, i noticed something strange, the “Node is not communicating with controller” thing only happens to the EZMotion Express sensor, not to my other battery operated devices.
When this happens it blocks my entire z-wave network.
Could it be that it has something to do with the device-list entry in the database or perhaps something else?
I was not able to reproduce the problem so far. I think that the events stopping after some time, was just an artifact of using the command tail -f /var/log/openhab2/events.log the and the log rotating.
@RayBe I think I saw the temporary Node is not communicating with the Controller message, but I did not experience any problem as a result. I didn’t find enough time to experiment with the EZMotion Express sensor extensively since it was fixed. I’ll keep my eyes open when I get a chance to play with it more.
It’s nothing to do with the database - this information only comes from the controller. I’ll likely change the logic around DEAD devices again before this is released…
it is about the zwave serial handler with queue full, causing the zwave controller to stop communicating with the network. Someone mentioned that this might be caused by dead nodes that can not be removed from the network. In my case node 35 is one of those. Only a reboot can restore normal operation.
I’m not sure this is related - but Chris - you may remember the issues I had with the new binding, where no devices would be identified, and that I restored a backup of the stick to get it going. The ONLY difference, that I am aware of, between the backup and the problematic stick was that I had also included the HSM100 in the non-working stick.
I have not yet re-added the hsm100 to the now working network, due to not having had the time.
This may not be related at all, but, I thought I should highlight. However, this does not explain why my system worked fine with the mainline 2.0 binding.
@chris
Hi Chris, since there are no reports anymore that indicate the hsm100 sensor isn’t working for somebody, I assume it’s working perfectly for everybody
Could you please send back the device and wrap it as I did? Thanks a million for all your hard work.
I just wanted to confirm that now, the HSM100 works perfectly for me. @RayBe thanks for sending the device @chris, thanks for putting all the work into fixing the problem.