<div dir="ltr"><div>Hi,</div><div><br></div><div>There are a number of additional network verifications that could improve the troubleshooting experience or even cluster performance, such as:<br></div><div><br></div><div> 1. multicast group verification for corosync messaging</div>
<div> 2. network connectivity with jumbo packets</div><div> 3. L3 connectivity verification</div><div> 4. some fencing verification</div><div> 5. allocated IP verification <a href="https://bugs.launchpad.net/fuel/+bug/1275641">https://bugs.launchpad.net/fuel/+bug/1275641</a></div>
<div> 6. measuring network performance with iperf</div><div><br></div><div>Adding all of this to the fuel-web network tab would significantly worsen the UX, and the current model is not easy to extend with additional verifications.<br>
</div><div><br></div><div>The whole approach looks like a networking health check for the deployment, so in my opinion it should be a separate tab, similar to the ostf health check.</div><div><br></div><div>fuel-ostf already has the necessary db and rest-api code to support such extensions, and with some work this could be used as a diagnostic tool not only for fuel, but for tripleo as well.</div>
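<div><br></div><div>To make one of the checks listed above concrete (item 2, jumbo packets), here is a rough Python sketch. It assumes Linux iputils ping, where "-M do" forbids fragmentation and "-s" sets the ICMP payload size, and IPv4 without IP options. The function names, the default MTU of 9000 and the example address are only my choices for illustration, not existing fuel code.</div>
<pre>
```python
import subprocess

IPV4_HEADER = 20  # bytes, assuming an IPv4 header without options
ICMP_HEADER = 8   # bytes, ICMP echo header


def icmp_payload_size(mtu):
    """Largest ICMP echo payload that fits in one unfragmented IPv4 packet."""
    payload = mtu - IPV4_HEADER - ICMP_HEADER
    if payload < 1:
        raise ValueError("MTU %d is too small for an ICMP echo" % mtu)
    return payload


def jumbo_ping_command(host, mtu=9000, count=3):
    """Build the ping command line (assumes Linux iputils ping flags)."""
    return ["ping", "-c", str(count), "-M", "do",
            "-s", str(icmp_payload_size(mtu)), host]


def check_jumbo(host, mtu=9000):
    """Return True if the host answers unfragmented pings sized for the MTU."""
    result = subprocess.run(jumbo_ping_command(host, mtu),
                            stdout=subprocess.DEVNULL,
                            stderr=subprocess.DEVNULL)
    return result.returncode == 0
```
</pre>
<div>For example, check_jumbo("192.0.2.10") would run the ping from the local node; in the design discussed here, something like it would be executed on every node and the results collected centrally.</div>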
<div><br></div><div>In my opinion this feature should be split into two main parts:</div><div><br></div><div>PART 1 - a new plugin-executor for ostf, a ui tab in fuel-web, and extending this plugin with the existing verifications</div>
<div><br></div><div> 1. for now ostf has one plugin-executor, which uses nose to run tests; add a new executor, named something like "distributed", with astute still acting as the orchestrator</div><div>
<br></div><div> 2. add a new reporter to astute that will publish messages to the ostf queue</div><div><br></div><div> 3. add an ostf amqp receiver</div><div><br></div><div> 4. extend the current plugin with the verifications listed above</div>
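<div><br></div><div>The receiving side of steps 2-4 could look roughly like the Python sketch below. Everything in it is hypothetical: the message layout (node, heard_from), the names MulticastReport, drain_reports and find_unreachable, and the standard-library queue.Queue used as a stand-in for the amqp queue are my assumptions, not existing ostf or astute code. It only shows how per-node reports from a multicast check could be aggregated into a single result.</div>
<pre>
```python
import json
import queue
from dataclasses import dataclass


@dataclass(frozen=True)
class MulticastReport:
    """Hypothetical per-node report: which peers this node heard on the group."""
    node: str
    heard_from: frozenset


def drain_reports(q):
    """Read all pending JSON messages from the queue.

    queue.Queue is only a stand-in here for a real amqp consumer.
    """
    reports = []
    while True:
        try:
            raw = q.get_nowait()
        except queue.Empty:
            return reports
        msg = json.loads(raw)
        reports.append(MulticastReport(msg["node"], frozenset(msg["heard_from"])))


def find_unreachable(expected_nodes, reports):
    """Return sorted (listener, sender) pairs where listener did not hear sender.

    A node that sent no report at all is treated as having heard nobody.
    """
    heard = {r.node: r.heard_from for r in reports}
    missing = []
    for listener in sorted(expected_nodes):
        for sender in sorted(expected_nodes):
            if sender != listener and sender not in heard.get(listener, frozenset()):
                missing.append((listener, sender))
    return missing


def run_demo():
    """Simulate three nodes reporting; node-3's traffic never reaches node-1."""
    q = queue.Queue()
    q.put(json.dumps({"node": "node-1", "heard_from": ["node-2"]}))
    q.put(json.dumps({"node": "node-2", "heard_from": ["node-1", "node-3"]}))
    q.put(json.dumps({"node": "node-3", "heard_from": ["node-1", "node-2"]}))
    return find_unreachable({"node-1", "node-2", "node-3"}, drain_reports(q))
```
</pre>
<div>Keeping the aggregation as a plain function over reports means the same code could be reused for other checks from the list, with only the per-node agent and the message contents changing.</div>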
<div><br></div><div> After this part of the refactoring it should be possible to rapidly extend the distributed cluster diagnostics.</div><div><br></div><div>PART 2 - make the integration with fuel pluggable, which means:<br>
</div><div><br></div><div> 1. remove the proxy dependency from ostf; this can be done with the SOCKS protocol, which provides an http proxy over ssh (it is supported by the openssh server)</div><div><br></div><div> 2. make the integration with nailgun pluggable</div>
<div><br></div><div> 3. replace astute/mcollective with a custom agent or some community solution</div><div><br></div><div><br></div><div>I would appreciate any comments or suggestions, so don't hesitate to share your thoughts.</div>
</div>