Hi Payne,
So the message that worries me is the "Caught SIGTERM" message near in time to the batch update. This implies to me that something outside of Octavia is initiating a graceful shutdown (Great that it's graceful, so often we see people with misconfigured systemd or k8s doing SIGKILLS).
There is a known issue I have called the "rabbit hole" caused by Oslo Messaging. Unfortunately, the current design of Oslo Messaging (which Octavia uses for the rabbitmq RPC calls) auto-ACKs the messages when it pulls them from the queue. If something goes wrong between the message being pulled off the queue and before we load the flow into Taskflow, there is a small chance the RPC message could be lost. Ideally we wanted to ACK the message once the TaskFlow is launched, but we found we cannot do that with the current Oslo Messaging design. This is outstanding tech debt in Oslo Messaging that has not yet been resolved. If you are interested in working on it, I can give you code pointers to where the issue is occurring.
That said, with a graceful shutdown, the controller worker should complete the processing and load the job into Taskflow before shutting the thread down. For this reason I am not 100% convinced this is the issue, but it is something to investigate more.
Michael