There is a known intermittent bug in docker that breaks some of its
functions, such as downloading images [1].

The details are still being investigated, but the most likely cause is
that the docker.service start occasionally fails to create all the
subfolders required in /var/lib/docker.

The workaround is a service restart.

With this change, docker health is checked after a short wait, and the
service is restarted if the check fails.

Note that the required subfolders are created almost immediately, so
the wait can be short. Still, the pmon tolerance is slightly increased
to allow the repair mechanism a couple of retries before pmon steps in.

[1] https://bugs.launchpad.net/ubuntu/+source/docker.io/+bug/1950751

Test Plan:
PASS With pmon turned off and a sleep time of 10 secs,
     deleted /var/lib/docker/tmp and restarted docker.
     Then deleted the /var/lib/docker/tmp dir during the 'sleep 10' and
     observed that an automatic '/bin/systemctl restart docker.service'
     is triggered, docker is restarted and the tmp dir is recreated
     successfully.
PASS With the pmon service up and using the proposed time intervals,
     restarted the docker service successfully without interference
     between the two mechanisms.
PASS Completed the following operations:
     - AIO-SX install/bootstrap/unlock
     - lock/unlock
     - sudo reboot
     with the following results:
     - /var/lib/docker has all sub-directories
     - applications applied
     - docker service running
     - pulled hello-world image
     - no alarms
     - no 'download failed' error messages in daemon.log

Partial-Bug: 1999182

Signed-off-by: Leonardo Fagundes Luz Serrano <Leonardo.FagundesLuzSerrano@windriver.com>
Change-Id: Ide2d214ea3c7efb3f2a24327c11ae55f90d5a9ce

changes/65/867565/7
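For illustration only, the wait-check-restart flow described in the commit message could be sketched roughly as below. This is not the script shipped with the change. The function names, the environment variables, and the use of directory existence as the health check are all assumptions. The subfolder list is also hypothetical: the message names only 'tmp' explicitly. The 10 second wait is the value used in the test plan, not necessarily the proposed interval.

```shell
#!/bin/sh
# Sketch of a post-start repair check for docker.service.
# ASSUMPTIONS (not taken from the change itself): variable names,
# function names, the subfolder list, and checking directory
# existence as the health test.

DOCKER_ROOT="${DOCKER_ROOT:-/var/lib/docker}"
# Hypothetical list; only 'tmp' is named in the commit message.
REQUIRED_SUBDIRS="${REQUIRED_SUBDIRS:-tmp}"
# 10 secs is the test-plan value, not necessarily the shipped one.
WAIT_SECS="${WAIT_SECS:-10}"
RESTART_CMD="${RESTART_CMD:-/bin/systemctl restart docker.service}"

# Return 0 if every required subfolder exists under DOCKER_ROOT.
docker_dirs_ok() {
    for d in $REQUIRED_SUBDIRS; do
        [ -d "$DOCKER_ROOT/$d" ] || return 1
    done
    return 0
}

# Wait briefly, run the check, and restart the service if it fails.
check_and_repair() {
    sleep "$WAIT_SECS"
    if docker_dirs_ok; then
        echo "docker health check passed"
    else
        echo "docker health check failed, restarting"
        # Left unquoted on purpose so the command and its arguments split.
        $RESTART_CMD
    fi
}
```

Calling check_and_repair once after the service starts would mirror the first test case: if /var/lib/docker/tmp is deleted during the wait, the restart command runs. Keeping the restart command in a variable lets the flow be exercised without touching the real service.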
parent a8357be883
commit e78e42fb38