By observer I mean process which will actually notify about stack timeout. Maybe it was poor choice of words. Anyway, something will need to check what stacks are timed out, and that's new single point of failure. > -----Original Message----- > From: Zane Bitter [mailto:zbitter at redhat.com] > Sent: Thursday, November 13, 2014 3:49 PM > To: openstack-dev at lists.openstack.org > Subject: Re: [openstack-dev] [Heat] Using Job Queues for timeout ops > > On 13/11/14 09:31, Jastrzebski, Michal wrote: > > Guys, I don't think we want to get into this cluster management mud. > > You say let's make observer...and what if observer dies? Do we do > > observer to observer? And then there is split brain. I'm observer, I've lost > connection to worker. Should I restart a worker? > > Maybe I'm one who lost connection to the rest of the world? Should I > > resume task and risk duplicate workload? > > I think you're misinterpreting what we mean by "observer". See > https://wiki.openstack.org/wiki/Heat/ConvergenceDesign > > - ZB > > _______________________________________________ > OpenStack-dev mailing list > OpenStack-dev at lists.openstack.org > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev