openHAB 3 runs out of memory / java heap space errors, CPU 100%+ after a few hours

spy0r · December 24, 2020, 10:44pm

Are those DSL scripts created via the UI or from files? I seem to have similar problems with those from the UI…

Pedals2Paddles · December 25, 2020, 1:24am

The UI. And I am quickly realizing there is a potential huge problem with this. These two DSL scripts, composed in the UI, took 4 to 7 seconds to execute. During those 4-7 seconds, the CPU would be over 100%. And after several hours of normal operation (scripts running every 5-10 minutes), the whole thing would just keel over dead out of memory and CPU pegged.

The exact same operation performed by an ECMA script composed in the UI executes so fast the indicator doesn’t even change to running. This has been running for hours now honestly this is the best I’ve ever seen my install of OH3 operate. Everything it is does is instant and nothing is bogging down.

What or why this problem exists is well over my head, but process of elimination here seems to suggest it exists.

opus · December 25, 2020, 7:33am

You posted only parts of the rules, which triggers are you using and with what settings?

spy0r · December 25, 2020, 8:24am

That’s what i also saw. In my case the simplest rules from the UI in DSL take up to 10s with high CPU load, but not neccessarely high memory usage. I reduced how often these rules run but i think i only increased the time between the crashes.

Pedals2Paddles · December 25, 2020, 1:28pm

Those are scripts, in their entirety. There are separate rules that trigger them. Those triggers are either item state changes, or an every 5 minutes cron.

ljsquare · December 26, 2020, 1:50pm

I experience the same problem. I’ve created a new topic, but I think I can let it merge with this topic.
[OH3] high cpu load, unresponsive OH

spy0r · December 26, 2020, 6:37pm

Since i now moved all my DSL rules in files again, my cpu load looks stable and extremely low! I think that did the trick in my case!

(same rules, same triggers)

Lolodomo · December 26, 2020, 6:57pm

One git issue has to be declared if not existing.

BobMiles · December 27, 2020, 10:31am

I though I have my problems tackled but I still have a high CPU load.
Can someone run a

shell:threads - - list

in the karaf console and see what are the top CPU time consumers? For me DirWatcher and RuleRefresher are suspiciously high!
I use file-only DSL rules, but I also use jython with the (fixed) helper libraries…
When watching the event stream in the debug sidebar, I see sometimes that rules get reloaded even though I did not touch the configuration…
I also still see some org.eclipse.x bundles when I do bundle:list -s. Shouldn’t they be gone?

I tried one more thing and it looks like it helped:

sudo apt purge openhab2

Followed by a reboot.
I upgraded with the openhabian config tool, but somehow the were still leftovers… Now it’s running like a charm.

ljsquare · December 27, 2020, 12:08pm

For me this doesn’t apply unfortunately, because I run OH3 in a docker container, so I have a “clean” environment

spy0r · December 27, 2020, 12:22pm

in my case the RXTXPortMonitor is the highest, but none of the threads are suspiciously high.

spy0r · December 29, 2020, 9:28am

@Pedals2Paddles: Did you already create a git issue?

dominik_helleberg · December 29, 2020, 1:58pm

I’m observing a similar issue: round about 24 hours of running openhab3 in a docker container (clean install, setup from scratch) the VM throws an out-of-memory error and becomes slow beforehand. I suspect heavy rule-execution or exec-binding at the moment and will disable those one by one and hopefully narrow it down a bit…

spy0r · December 29, 2020, 2:14pm

Same question, do you have set up DSL rules via the UI? If so, put them in files like you did in OH2. That did the trick at my case and since then i have extremely low CPU and everything is stable

Andrew_Rowe · December 29, 2020, 2:20pm

please post link to git issue so we can play along at home
thanks

spy0r · December 29, 2020, 2:53pm

Wikibear · December 30, 2020, 9:32am

I don’t get memory leaks but i have a similar situation with long waiting issues of dispatch events:

dispatching event to subscriber ‘org.openhab.core.internal.items.ItemUpdater@1e6b6a0’ takes more than 5000ms

I will update all items to model simantics and it will go slower and slower…

Openhab will lost all network connections. No network things will be reached. After reboot everything is fine.

dominik_helleberg · January 3, 2021, 4:24pm

That actually fixed it for me. Up and running for 4 days now without issues. I only moved 2 highly-frequented rules back to files, that seems to do the trick.

spy0r · January 8, 2021, 6:01pm

That problem is solved in the current snapshots!

Pedals2Paddles · January 8, 2021, 6:24pm

What specifically has been fixed?