<div dir="ltr"><div>There has been some talk in cinder meetings about making cinder<->glance interactions more efficient. They are already optimised in some deployments, e.g. ceph glance and ceph cinder, and some backends cache glance images so that many volumes created from the same image becomes very efficient. (search the meeting logs or channel logs for 'public snapshot' to get some entry points into the discussions) <br><br>I'd like to see more work done on this, and perhaps re-examine a cinder backend to glance. This would give some of what you're suggesting (particularly fast, low traffic un-shelve), and there is more that can be done to improve that performance, particularly if we can find a better performing generic CoW technology than QCOW2.<br><br></div>As suggested in the review, in the short term you might be better experimenting with moving to boot-from-volume instances if you have a suitable cinder deployed, since that gives you some of the performance improvements already.<br></div><div class="gmail_extra"><br><div class="gmail_quote">On 16 February 2015 at 12:10, Kekane, Abhishek <span dir="ltr"><<a href="mailto:Abhishek.Kekane@nttdata.com" target="_blank">Abhishek.Kekane@nttdata.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div link="blue" vlink="purple" lang="EN-IN">
<div>
<p class="MsoNormal">Hi Devs,<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Problem Statement: Performance and storage efficiency of shelving/unshelving instance booted from image is far worse than instance booted from volume.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">When you unshelve hundreds of instances at the same time, instance spawning time varies and it mainly depends on the size of the instance snapshot and<u></u><u></u></p>
<p class="MsoNormal">the network speed between glance and nova servers.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">If you have configured file store (shared storage) as a backend in Glance for storing images/snapshots, then it's possible to improve the performance of<u></u><u></u></p>
<p class="MsoNormal">unshelve instance dramatically by configuring nova.image.download.FileTransfer in nova. In this case, it simply copies the instance snapshot as if it is<u></u><u></u></p>
<p class="MsoNormal">stored on the local filesystem of the compute node. But then again in this case, it is observed the network traffic between shared storage servers and<u></u><u></u></p>
<p class="MsoNormal">nova increases enormously resulting in slow spawning of the instances.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">I would like to gather some thoughts about how we can improve the performance of unshelve api (booted from image only) in terms of downloading large<u></u><u></u></p>
<p class="MsoNormal">size instance snapshots from glance.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">I have proposed a nova-specs [1] to address this performance issue. Please take a look at it.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">During the last nova mid-cycle summit, <a href="https://review.openstack.org/#/q/owner:mikal%2540stillhq.com+status:open,n,z" title="Michael Still <mikal@stillhq.com>" target="_blank">
<span style="color:windowtext;text-decoration:none">Michael Still</span></a> has suggested alternative solutions to tackle this issue.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Storage solutions like ceph (Software based) and NetApp (Hardare based) support exposing images from glance to nova-compute and cinder-volume with<u></u><u></u></p>
<p class="MsoNormal">copy in write feature. This way there will be no need to download the instance snapshot and unshelve api will be pretty faster than getting it<u></u><u></u></p>
<p class="MsoNormal">from glance.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Do you think the above performance issue should be handled in the OpenStack software as described in nova-specs [1] or storage solutions like<u></u><u></u></p>
<p class="MsoNormal">ceph/NetApp should be used in production environment? Apart from ceph/NetApp solutions, are there any other options available in the market.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">[1] <a href="https://review.openstack.org/#/c/135387/" target="_blank">https://review.openstack.org/#/c/135387/</a><u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Thank You,<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Abhishek Kekane<u></u><u></u></p>
</div>
<br clear="all">
______________________________________________________________________<br>
Disclaimer: This email and any attachments are sent in strictest confidence<br>
for the sole use of the addressee and may contain legally privileged,<br>
confidential, and proprietary data. If you are not the intended recipient,<br>
please advise the sender by replying promptly to this email and then delete<br>
and destroy this email and any attachments without any further use, copying<br>
or forwarding.<br>
</div>
<br>__________________________________________________________________________<br>
OpenStack Development Mailing List (not for usage questions)<br>
Unsubscribe: <a href="http://OpenStack-dev-request@lists.openstack.org?subject:unsubscribe" target="_blank">OpenStack-dev-request@lists.openstack.org?subject:unsubscribe</a><br>
<a href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>
<br></blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature">Duncan Thomas</div>
</div>