Hello folks:

I replied in [1]. I think we could have a potential problem when executing a DB change the impies a OF controller re-initialization. Most of the time we don't have problems but as we see in the CI, we could sometimes.

I'll push a patch to add a retry decorator on the methods that trigger this OF controller restart.

Regards.

[1]https://bugs.launchpad.net/neutron/+bug/1944201/comments/4


On Wed, Sep 22, 2021 at 4:36 PM Slawek Kaplonski <skaplons@redhat.com> wrote:
Hi

On Wed, Sep 22, 2021 at 03:30:08PM +0200, Lajos Katona wrote:
> Hi Slawek,
> Thanks for the summary.
> Regarding https://bugs.launchpad.net/neutron/+bug/1944201 not sure about if
> it is related to the number of hosts, there's some failure in
> singlenode jobs as well:
> http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22error%20Datapath%20Invalid%5C%22
>
> example:
> https://9f672e0630f459ee81cb-e4093b1756a9a5a7c7d28e6575b4af7f.ssl.cf5.rackcdn.com/805849/10/check/neutron-ovs-rally-task/5981df8/controller/logs/screen-q-agt.txt

Ok. So it happens on all types of jobs where neutron-ovs-agent is used :/

>
> Lajos (lajoskatona)
>
> Slawek Kaplonski <skaplons@redhat.com> ezt írta (időpont: 2021. szept. 22.,
> Sze, 12:10):
>
> > Hi,
> >
> > Due to very urgent things which I had yesterday, I wasn't able to run
> > Neutron
> > CI meeting as usually. Fortunatelly we don't have many new issues in our
> > CI.
> > There is only one new issue in our scenario jobs which I wanted to discuss
> > [1]. It's impacting Ironic gates but I noticed it also in the Neutron CI
> > as
> > well. See [2] or [3] for example. I'm not sure about Ironic jobs but in
> > Neutron I saw it mostly (or only, I'm not sure) in the multinode jobs.
> >
> > [1] https://bugs.launchpad.net/neutron/+bug/1944201
> > [2] https://
> > e36beaa2ff297ebe7d5f-5944c3d62ed334b8cdf50b534c246731.ssl.cf5.rackcdn.com/
> > 805849/9/check/neutron-ovs-tempest-dvr-ha-multinode-full/f83fa96/compute1/
> > logs/screen-q-agt.txt
> > <http://e36beaa2ff297ebe7d5f-5944c3d62ed334b8cdf50b534c246731.ssl.cf5.rackcdn.com/805849/9/check/neutron-ovs-tempest-dvr-ha-multinode-full/f83fa96/compute1/logs/screen-q-agt.txt>
> > [3] https://storage.bhs.cloud.ovh.net/v1/
> >
> > AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_88f/803045/12/check/
> > neutron-ovs-tempest-slow/88f8bb7/job-output.txt
> >
> > --
> > Slawek Kaplonski
> > Principal Software Engineer
> > Red Hat

--
Slawek Kaplonski
Principal Software Engineer
Red Hat