[openstack-dev] [Glance] Summit Session Summaries

Zhi Yan Liu lzy.dev at gmail.com
Sat Nov 16 15:08:41 UTC 2013


Awesome, Mark! This is really useful for helping us start the next round of
work to organize, prioritize and implement the Icehouse BPs, thanks.

This seems like a good place to continue our discussion around my proposal,
so I have added some inline replies to the "Enhancing Image Locations"
section; you know, 50 minutes was a little tight for me :) I have a lot of
points to address.

On Sat, Nov 16, 2013 at 8:29 AM, Yongsheng Gong <gongysh at unitedstack.com> wrote:
> great, thanks
>
>
> On Sat, Nov 16, 2013 at 5:10 AM, Mark Washenberger
> <mark.washenberger at markwash.net> wrote:
>>
>> Hi folks,
>>
>> My summary notes from the OpenStack Design Summit Glance sessions follow.
>> Enjoy, and please help correct any misunderstandings.
>>
>>
>>
>> Image State Consistency:
>> ------------------------
>>
>> https://etherpad.openstack.org/p/icehouse-summit-image-state-consistency
>>
>> In this session, we focused on the problem that snapshots which fail
>> after the image is created but before the image data is uploaded
>> result in a pending image that will never become active, and the
>> only operation nova can do is to delete the image. Thus there is
>> not a very good way to communicate the failure to users without
>> just leaving a useless image record around.
>>
>> A solution was proposed to allow Nova to directly set the status
>> of the image, say to "killed" or some other state.
>>
>> A problem with the proposed solution is that we generally have
>> kept the "status" field internally controlled by glance, which
>> means there are some modeling and authorization concerns.
>> However, it is actually something Nova could do today through
>> the hacky mechanism of initiating a PUT with data, but then
>> terminating the connection without sending a complete body. So
>> the authorization aspects are not really a fundamental concern.
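
For illustration, a rough Python sketch of that hacky mechanism could look
like the following; the endpoint, token and image id are just placeholders:

    import http.client

    conn = http.client.HTTPConnection('glance.example.com', 9292)
    conn.putrequest('PUT', '/v1/images/IMAGE_ID')
    conn.putheader('X-Auth-Token', 'TOKEN')
    conn.putheader('Content-Type', 'application/octet-stream')
    conn.putheader('Content-Length', '1048576')  # claim more data than we send
    conn.endheaders()
    conn.send(b'\x00' * 1024)  # upload only part of the body...
    conn.sock.close()          # ...then drop the connection mid-upload
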
>>
>> It was suggested that the solution to this problem
>> is to make Nova responsible for reporting these failures rather
>> than Glance. In the short term, we could do the following:
>>  - have nova delete the image when the snapshot fails (already merged)
>>  - merge the nova patch to report the failure as part of instance
>>    error reporting
>>
>> In the longer term, it was seen as desirable for nova to treat
>> snapshots as asynchronous tasks and reflect those tasks in the
>> api, including the failure/success of those tasks.
>>
>> Another long term option that was viewed mostly favorably was
>> to add another asynchronous task to glance for vanilla uploads
>> so that nova snapshots can avoid creating the image until it
>> is fully active.
>>
>> Fei Long Wang is going to follow up on what approach makes the
>> most sense for Nova and report back for our next steps.
>>
>>
>>
>> What to do about v1?
>> --------------------
>>
>> https://etherpad.openstack.org/p/icehouse-summit-images-v1-api
>>
>> In this discussion, we hammered out the details for how to drop
>> the v1 api and in what timetable.
>>
>> Leaning heavily on cinder's experience dropping v1, we came
>> up with the following schedule.
>>
>> Icehouse:
>>     - Announce plan to deprecate the V1 API and registry in J and remove
>> it in K
>>     - Announce feature freeze for v1 API immediately
>>     - Make sure everything in OpenStack is using v2 (cinder, nova, ?)
>>     - Ensure v2 is being fully covered in tempest tests
>>     - Ensure there are no gaps in the migration strategy from v1 to v2
>>         - after the fact, it seems to me we need to produce a migration
>> guide as a way to evaluate the presence of such gaps
>>     - Make v2 the default in glanceclient
>>     - Turn v2 on by default in glance API
>>
>> "J":
>>     - Mark v1 as deprecated
>>     - Turn v1 off by default in config
>>
>> "K":
>>     - Delete v1 api and v1 registry
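
For reference, the API-side default changes above would presumably just be
the existing enable flags in glance-api.conf, e.g. the eventual "J" defaults
(assuming we keep the current option names):

    [DEFAULT]
    # v2 stays on; v1 is deprecated and off unless explicitly re-enabled
    enable_v2_api = True
    enable_v1_api = False
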
>>
>>
>> A few gotchas were identified. In particular, a concern was raised
>> about breaking stable branch testing when we switch the default in
>> glanceclient to v2--since the latest glanceclient will be used to test
>> glance in, say, Folsom or Grizzly, where the v2 api didn't really
>> work at all.
>>
>> In addition, it was suggested that we should be very aggressive
>> in using deprecation warnings for config options to communicate
>> this change as loudly as possible.
>>
>>
>>
>>
>> Image Sharing
>> -------------
>>
>> https://etherpad.openstack.org/p/icehouse-summit-enhance-v2-image-sharing
>>
>> This session focused on the gaps between the current image sharing
>> functionality and what is needed to establish an image marketplace.
>>
>> One issue was the lack of verification of project ids when sharing an
>> image.
>>
>> A few other issues were identified:
>> - there is no way to share an image with a large number of projects in a
>> single api operation
>> - membership lists are not currently paged
>> - there is no way to share an image with everyone; you must know each
>> individual project id
>>
>> We identified a potential issue with bulk operations and
>> verification--namely there is no way to do bulk verification of project ids
>> in keystone that we know of, so probably keystone work would be needed to
>> have both of these features in place without implying super slow api calls.
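
To make the bulk-sharing gap concrete, today sharing with N projects means N
separate member calls against the v2 API, roughly like the sketch below (the
endpoint, token and ids are placeholders):

    import json
    import requests

    def share_image(glance_url, token, image_id, project_ids):
        headers = {'X-Auth-Token': token, 'Content-Type': 'application/json'}
        for project_id in project_ids:  # one API round trip per project
            requests.post('%s/v2/images/%s/members' % (glance_url, image_id),
                          headers=headers,
                          data=json.dumps({'member': project_id}))
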
>>
>> In addition, we spent some time toying with the idea of image catalogs. If
>> publishers put images in catalogs, rather than having shared images show up
>> directly in other users' image lists, things would be a lot safer and we
>> could relax some of our restrictions. However, there are some issues with
>> this approach as well:
>> - How do you find the catalog of a trusted image publisher?
>> - Are we just pushing the issue of sensible world-listings to another
>> resource?
>> - This would be a big change.
>>
>>
>>
>> Enhancing Image Locations:
>> --------------------------
>>
>>
>> https://etherpad.openstack.org/p/icehouse-summit-enhance-image-location-property
>>
>> This session proposed adding several attributes to image locations:
>>
>> 1. Add 'status' to each location.
>>
>> I think consensus was that this approach makes sense moving forward. In
>> particular, it would be nice to have a 'pending-delete' status for image
>> locations, so that when you delete a single location from an image it can be
>> picked up properly by the glance scrubber.
>>
>> There was some concern about how we define the overall image status if we
>> allow other statuses on locations. Is image status just stored independently
>> of image location statuses? Or is it newly defined as a function of those
>> image location statuses?

I prefer to go the former way and keep the original image 'status' field
independent, since IMO we can't derive a 'killed' status from the
locations' statuses.
As I proposed, the available location status value set is: 'pending',
'active', 'pending_delete' and 'deleted'.
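
For example, once a per-location status is added, a location entry could look
roughly like this (the field names are illustrative only), with the image's
own 'status' field still stored independently as today:

    LOCATION_STATUSES = ('pending', 'active', 'pending_delete', 'deleted')

    location = {'url': 'swift://account/container/IMAGE_ID',
                'status': 'pending_delete',  # to be picked up by the scrubber
                'metadata': {}}
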

>>
>> 2. Allow disk_format, container_format, and checksum to vary per location.
>>
>> The use case here is that if you have a multi-hypervisor cloud where
>> different formats are needed, the client can automatically select the
>> correct format when it downloads an image.
>>
>> This idea was initially met with some skepticism because we have a strong
>> view that an image is immutable once it is created, and the checksum is a
>> big part of how we enforce that.
>>
>> However it was correctly pointed out that the immutability we care about
>> is actually a property of the block device that each image format
>> represents. But for the moment we were unsure how to enforce that block
>> device immutability save keeping the checksum and image formats the same.
>>

Agreed. I need to think more about real-world cloud deployments to make
sure this is a use case/function worth pursuing for us/Glance.
Actually I also like the image-grouping idea, where each image in a group
can have its own format specification and version; in the future, image
versioning could support image thin clones, dependency management,
merging, etc.

>>
>> 3. Add metrics to each image location.
>>
>> The essential idea here is to track the performance metrics of each image
>> location to ensure we choose the fastest location. These metrics would not
>> be revealed as part of the API.
>>
>> I think most of us were initially a bit confused by this suggestion.
>> However, after talking with Zhi Yan after the session, I think it makes
>> sense to support this in a local sense rather than storing such information
>> in the database. Locality is critical because different glance nodes likely
>> have different relationships to the underlying locations in terms of network
>> distance, so each node should be gearing towards what is best for it.
>>
>> We can also probably reuse a local metrics tracking library to enable
>> similar optimizations in a future incarnation of the glance client.
>>

I think the objective is correct, but TBH we had a little
miscommunication about the realization of this part, sorry. IMO the
metrics can be added on both the Glance server/API side and the client
side; they are independent. Both the Glance server side and the client
side need to select a location from the list to handle a request, whether
a download or direct URL consumption. There are two points here: the
metric data, and the selection function based on it. (Data structures and
algorithms, the everlasting topic :-/ )

1. metric data

I think the metric data can be stored in the database with the image
location (just like the metadata field) and used by the image download
handling on the server/API side. The image creator or an admin can add
node-independent or node-dependent indicators to a location via the
existing PATCH API; for the former, the indicator can be a global weight
for the consumer, and for the latter, an ABAC[1]-based selection function
can work from those indicator values. For instance (the indicator key
names may need to change):

location entry: [{'url': 'file:///image1',
                  'metrics': {'weight': 1,
                              'node-weights': {'novahost1': 1, 'novahost2': 2}},
                  'metadata': {}},
                 {'url': 'swift://image1',
                  'metrics': {'weight': 2,
                              'node-weights': {'novahost1': 2, 'novahost2': 1}},
                  'metadata': {}}]

In the above case the 'weight' indicator is node-independent and the
'node-weights' indicator is node-dependent, and the particular selection
function (IMO an admin-configurable option in glance-api.conf) should
take responsibility for using the correct, or say admin-expected, metrics
indicator to select a location to handle the request (for the API this is
'download').

Both glance-api and the client can use this metrics data, since the
metrics can be exposed via the standard 'direct url' mechanism, so we
should allow Glance and the client to share/reuse selection functions.

All of the above metrics are static; they are created by the image owner
or an admin via the PATCH API. And I agree that as a next step we need
dynamic local metrics storage and the related tracking functions/library
on the client side, which is just what you said above, Mark. The two are
not conflicting but different; again, I agree with this point, as I
mentioned in our private discussion (sorry if I confused you :) ).

2. selection function
Selection functions are used to select a location from the image location
list based on particular indicator(s). These indicators can come from the
'metrics' field of a location entry (static, created by the image owner
or an admin) or from client-side local metrics (dynamic, collected by the
tracking functions), and the selection functions should be sharable
between the Glance server/API side and the client side. I'm also
wondering whether we can leverage the existing weighing functions from
oslo?


[1] ABAC = Attribute Based Access Control, where the attribute is
something like the client's host name or IP.
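
For illustration, a minimal sketch of such a selection function, assuming the
static 'metrics' layout from the example above and that a lower weight means
"more preferred" (the key names are still open for discussion):

    def select_location(locations, node_name=None):
        """Pick the location with the lowest effective weight for this node."""
        def effective_weight(loc):
            metrics = loc.get('metrics', {})
            node_weights = metrics.get('node-weights', {})
            if node_name and node_name in node_weights:
                return node_weights[node_name]          # node-dependent indicator
            return metrics.get('weight', float('inf'))  # node-independent fallback
        return min(locations, key=effective_weight)

    # e.g. on the node serving 'novahost2', select_location(locations,
    # node_name='novahost2') would pick 'swift://image1' from the example above.
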

>>
>>

I also mentioned the "image handler" point in the session; sorry to
highlight it here again, but it's really important for Glance and Nova to
work together efficiently, and the patch is currently still under review.

1. It gives Nova a plug-in point, allowing a particular image handler
implementation to consume an image by a special/efficient approach via
the direct location URL, for example zero-copy consumption rather than
transferring the actual bits (refer to my etherpad page if you are
interested).
2. The selection function above can be added to Nova easily, since we can
add the selection handling in front of the "image handler" base code and
keep it transparent to the particular image handling logic.
3. A VMDK/OVA image handler can also be implemented on top of this.
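
Purely as an illustration of the plug-in idea (this is not the interface of
the actual patch under review), a handler could advertise which location
schemes it can consume directly so Nova can skip downloading the bits:

    class ImageHandler(object):
        """Base class for pluggable image handlers (hypothetical sketch)."""

        def can_handle(self, location_url):
            """Return True if this handler understands the location's scheme."""
            raise NotImplementedError()

        def fetch(self, context, location_url, target_path):
            """Make the image available at target_path, ideally zero-copy."""
            raise NotImplementedError()

    class RbdCloneHandler(ImageHandler):
        # hypothetical: clone an RBD-backed image instead of copying the bits
        def can_handle(self, location_url):
            return location_url.startswith('rbd://')

        def fetch(self, context, location_url, target_path):
            pass  # an RBD clone via librbd would go here
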

>>
>> Images and Taskflow
>> -------------------
>>
>> https://etherpad.openstack.org/p/icehouse-summit-taskflow-and-glance
>>
>> In this session we discussed both the general layout of taskflow and the
>> strategy for porting the current image tasks under development to use
>> taskflow, and came up with the following basic outline.
>>
>> Short Term:
>>
>> As we add more and more complexity to the import task, we can try to
>> compose the work as a flow of tasks. With this set up, our local,
>> eventlet-backed executor (glance task execution engine) could be just a thin
>> wrapper around a local taskflow engine.
>>
>> Medium Term:
>>
>> At some point pretty early on we are going to want to have glance tasks
>> running on distributed worker processes, most likely having the tasks
>> triggered by rpc. At this point, we can copy the existing approach in cinder
>> circa Havana.
>>
>> Longer Term:
>>
>> When taskflow engines support distributing tasks across different workers,
>> we can fall back to having a local task engine that is distributing tasks
>> using that engine.
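
As a concrete illustration of the short-term idea above, the import work
could be composed roughly like this (the task names and inputs are
illustrative, not the actual Glance task code):

    from taskflow import engines, task
    from taskflow.patterns import linear_flow

    class FetchImageData(task.Task):
        default_provides = 'image_data'

        def execute(self, import_from):
            return b'image bits'  # download from the source (stubbed)

    class VerifyAndStore(task.Task):
        def execute(self, image_id, image_data):
            pass  # checksum the data and write it to the store (stubbed)

    flow = linear_flow.Flow('image-import').add(FetchImageData(),
                                                VerifyAndStore())
    engines.run(flow, store={'import_from': 'http://example.com/image.qcow2',
                             'image_id': 'IMAGE_ID'})
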
>>
>> During the discussion a few concerns were discussed about working with
>> taskflow.
>> - tasks have to be structured in the right way to make restart, recovery,
>> and rollback work
>>   - in other words, if we don't think about this carefully, we'll likely
>> screw things up
>> - it remains difficult to determine if a task has stalled or failed
>> - we are not sure how to restart a failed task at this point
>>
>> Some of these concerns may already be being addressed in the library.
>>
>> _______________________________________________
>> OpenStack-dev mailing list
>> OpenStack-dev at lists.openstack.org
>> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>>
>
>
> _______________________________________________
> OpenStack-dev mailing list
> OpenStack-dev at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>

Any thoughts?

thanks,
zhiyan


