On 11/10/19 12:41, Matt Riedemann wrote:
> On 11/10/2019 10:29 AM, Balázs Gibizer wrote:
>> - Also there could be two cells running this command at the same time
>> fighting for the API db lock,
>
> In Train the --all-cells option was added to the CLI so that should resolve this issue. I think Mel said she backported those changes internally so I'm not sure how hard it would be for those to go back to Stein or Rocky or whatever release CERN is using now.

That's correct, I backported --all-cells [1][2][3][4] to Stein, Rocky, and Queens downstream. I did not find it easy, but YMMV.
The primary conflicts in Stein were with --before, so I went ahead and brought those patches back as well [5][6][7] since we also needed --before to help people avoid the "orphaned virt guests if archive runs while nova-compute is down" problem.
Same deal for Rocky.
And finally with Queens, there's an additional conflict around deleting instance group members [8], so I also brought that back because it's related to all of the database cleanup issues that support has repeatedly faced with customers.
Hope this helps anyone considering backporting --all-cells.
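
For anyone who has not used the two options together, here is a minimal sketch of how they combine on the command line, assuming the subcommand under discussion is `nova-manage db archive_deleted_rows` as shipped in Train. The helper name `build_archive_command` and the date value are hypothetical and only for illustration; the script builds the argument list and prints it, it does not run anything.

```python
import shlex


def build_archive_command(before=None, all_cells=True, until_complete=True):
    """Build the argv list for an archive run (hypothetical helper).

    Assumes the Train-era options --all-cells, --before and
    --until-complete of `nova-manage db archive_deleted_rows`.
    """
    cmd = ["nova-manage", "db", "archive_deleted_rows"]
    if all_cells:
        # One invocation walks every cell, so separate per-cell runs
        # do not compete for the API database.
        cmd.append("--all-cells")
    if before is not None:
        # Only archive rows deleted before this date, leaving recent
        # deletions alone in case nova-compute was down at the time.
        cmd += ["--before", before]
    if until_complete:
        cmd.append("--until-complete")
    return cmd


if __name__ == "__main__":
    # "2019-10-01" is an illustrative date, not a recommendation.
    print(shlex.join(build_archive_command(before="2019-10-01")))
```

Whether every option is present on a given release depends on which of the patches below have been backported, so check `nova-manage db archive_deleted_rows --help` on the target deployment first.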
Cheers, -melanie
[1] https://review.opendev.org/675218
[2] https://review.opendev.org/675209
[3] https://review.opendev.org/675205
[4] https://review.opendev.org/507486
[5] https://review.opendev.org/661289
[6] https://review.opendev.org/556751
[7] https://review.opendev.org/643779
[8] https://review.opendev.org/598953