I’m sick and jet lagged so not exactly thinking my best today. I’m hoping someone can see what I’ve done or messed up.
My before state:
- Ubuntu 16.04 host
- Latest Docker
- Official Docker image from Docker Hub
I noticed that the base OS had a bunch of security updates so I ran my update Ansible playbook which updates the host and all of my Docker images. The pull of the openHAB image failed because the tags have changed, unsurprising.
So I changed my tag to use the 2.1.0-snapshot tag. OK would not restart. Rather than try to figure out what was wrong I just wiped out my userdata folder and tried to let ti recreate it. Didn’t work and did not attempt to recreate the userdata contents.
So I changed to the 2.0.0 tag and restored the userdata folder. Still wouldn’t start. Tried the delete of the userdata again and it still doesn’t start.
OH doesn’t start up enough to generate anything into the logs or connect to the console.
The logs I see when userdata is there (I have docker configured to send sysout to syslog):
Feb 3 12:09:51 chimera kernel: [835898.429815] aufs au_opts_verify:1597:dockerd[1688]: dirperm1 breaks the protection by the permission bits on the lower branch
Feb 3 12:09:51 chimera dockerd[1221]: time="2017-02-03T12:09:51.282544498-07:00" level=warning msg="Your kernel does not support swap memory limit."
Feb 3 12:09:51 chimera dockerd[1221]: time="2017-02-03T12:09:51.282629113-07:00" level=warning msg="Your kernel does not support cgroup rt period"
Feb 3 12:09:51 chimera dockerd[1221]: time="2017-02-03T12:09:51.282654112-07:00" level=warning msg="Your kernel does not support cgroup rt runtime"
Feb 3 12:09:51 chimera 04ce4fc366e1[1221]: Launching the openHAB runtime...
Feb 3 12:09:51 chimera 04ce4fc366e1[1221]: karaf: KARAF_ETC is not valid: /openhab/userdata/etc
Feb 3 12:09:51 chimera dockerd[1221]: time="2017-02-03T12:09:51.586181097-07:00" level=error msg="containerd: deleting container" error="exit status 1: \"container 04ce4fc366e19d60789aabbc4e63ec5b61d882d41c292677b3109617d23bf308 does not exist\\none or more of the container deletions failed\\n\""
When my old and previously working userdata is present I see the following in the logs:
Feb 3 12:12:23 chimera kernel: [836050.310213] aufs au_opts_verify:1597:dockerd[6814]: dirperm1 breaks the protection by the permission bits on the lower branch
Feb 3 12:12:23 chimera dockerd[1221]: time="2017-02-03T12:12:23.141370527-07:00" level=warning msg="Your kernel does not support swap memory limit."
Feb 3 12:12:23 chimera dockerd[1221]: time="2017-02-03T12:12:23.141816366-07:00" level=warning msg="Your kernel does not support cgroup rt period"
Feb 3 12:12:23 chimera dockerd[1221]: time="2017-02-03T12:12:23.141848277-07:00" level=warning msg="Your kernel does not support cgroup rt runtime"
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: Launching the openHAB runtime...
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: WARN: file:/openhab/userdata/etc/config.properties is not found, so not loaded
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: null
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: WARN: file:/openhab/userdata/etc/config.properties is not found, so not loaded
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: Error occurred shutting down framework: java.lang.NumberFormatException: null
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: java.lang.NumberFormatException: null
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: #011at java.lang.Integer.parseInt(Integer.java:542)
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: #011at java.lang.Integer.parseInt(Integer.java:615)
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: #011at org.apache.karaf.main.ConfigProperties.<init>(ConfigProperties.java:208)
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: #011at org.apache.karaf.main.Main.updateInstancePidAfterShutdown(Main.java:226)
Feb 3 12:12:23 chimera 04ce4fc366e1[1221]: #011at org.apache.karaf.main.Main.main(Main.java:191)
Feb 3 12:12:23 chimera dockerd[1221]: time="2017-02-03T12:12:23.87504043-07:00" level=error msg="containerd: deleting container" error="exit status 1: \"container 04ce4fc366e19d60789aabbc4e63ec5b61d882d41c292677b3109617d23bf308 does not exist\\none or more of the container deletions failed\\n\""
Note the warning about userdata/etc/config.properties not existing. This file does in fact exist. The NumberFormatException appears to be happening after some other non-reported error which causes karaf to fail to start or decide to shut down.
Unfortunately I can’t get the image to come up and run long enough to attach to the container and explore the file system to see if something else weird is going on.
I’d really like to keep running OH in Docker as it will make my future plans for backup, deployment, and eventual migration from an old laptop to a home lab type configuration easier. For the short term I’m going back to apt-get though as right now nothing works at all.
I welcome any insights.