[openstack-dev] [Magnum] Consistent functional test failures (seems infra not have enough resource)

Kai Qiang Wu wkqwu at cn.ibm.com
Thu Aug 13 11:38:07 UTC 2015


Hi Tom,


I did talked to infra, which I think it is resource issue, But they thought
it is nova issue,


For we boot k8s bay, we use baymodel with falvor m1.small, you can find
devstack



+-----+-----------+-----------+------+-----------+------+-------+-------------+-----------+
| ID  | Name      | Memory_MB | Disk | Ephemeral | Swap | VCPUs |
RXTX_Factor | Is_Public |
+-----+-----------+-----------+------+-----------+------+-------+-------------+-----------+
| 1   | m1.tiny   | 512       | 1    | 0         |      | 1     | 1.0
| True      |
| 2   | m1.small  | 2048      | 20   | 0         |      | 1     | 1.0
| True      |
| 3   | m1.medium | 4096      | 40   | 0         |      | 2     | 1.0
| True      |
| 4   | m1.large  | 8192      | 80   | 0         |      | 4     | 1.0
| True      |
| 42  | m1.nano   | 64        | 0    | 0         |      | 1     | 1.0
| True      |
| 451 | m1.heat   | 512       | 0    | 0         |      | 1     | 1.0
| True      |
| 5   | m1.xlarge | 16384     | 160  | 0         |      | 8     | 1.0
| True      |
| 84  | m1.micro  | 128       | 0    | 0         |      | 1     | 1.0
| True      |
+-----+-----------+-----------+------+-----------+------+-------+-------------+-----------+



From logs below:

[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin]
(devstack-trusty-rax-dfw-4299602, devstack-trusty-rax-dfw-4299602)
ram:5172 disk:17408 io_ops:0 instances:1 does not have 20480 MB usable
disk, it only has 17408.0 MB usable disk. host_passes
/opt/stack/new/nova/nova/scheduler/filters/disk_filter.py:60
2015-08-13 08:26:15.218 INFO nova.filters
[req-e

It is 20GB disk space, so failed for that.


I think it is related with this, the jenkins allocated VM disk space is not
large.
I am curious why it failed so often recently.  Does os-infra changed
something ?




Thanks




Best Wishes,
--------------------------------------------------------------------------------
Kai Qiang Wu (吴开强  Kennan)
IBM China System and Technology Lab, Beijing

E-mail: wkqwu at cn.ibm.com
Tel: 86-10-82451647
Address: Building 28(Ring Building), ZhongGuanCun Software Park,
         No.8 Dong Bei Wang West Road, Haidian District Beijing P.R.China
100193
--------------------------------------------------------------------------------
Follow your heart. You are miracle!



From:	Tom Cammann <tom.cammann at hp.com>
To:	"OpenStack Development Mailing List (not for usage questions)"
            <openstack-dev at lists.openstack.org>
Date:	08/13/2015 06:24 PM
Subject:	[openstack-dev] [Magnum] Consistent functional test failures



Hi Team,

Wanted to let you know why we are having consistent functional test
failures in the gate.

This is being caused by Nova returning "No valid host" to heat:

2015-08-13 08:26:16.303 31543 INFO heat.engine.resource [-] CREATE:
Server "kube_minion" [12ab45ef-0177-4118-9ba0-3fffbc3c1d1a] Stack
"testbay-y366b2atg6mm-kube_minions-cdlfyvhaximr-0-dufsjliqfoet"
[b40f0c9f-cb54-4d75-86c3-8a9f347a27a6]
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource Traceback (most
recent call last):
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource File
"/opt/stack/new/heat/heat/engine/resource.py", line 625, in
_action_recorder
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource     yield
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource File
"/opt/stack/new/heat/heat/engine/resource.py", line 696, in _do_action
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource     yield
self.action_handler_task(action, args=handler_args)
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource File
"/opt/stack/new/heat/heat/engine/scheduler.py", line 320, in wrapper
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource     step =
next(subtask)
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource File
"/opt/stack/new/heat/heat/engine/resource.py", line 670, in
action_handler_task
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource     while not
check(handler_data):
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource File
"/opt/stack/new/heat/heat/engine/resources/openstack/nova/server.py",
line 759, in check_create_complete
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource     return
self.client_plugin()._check_active(server_id)
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource File
"/opt/stack/new/heat/heat/engine/clients/os/nova.py", line 232, in
_check_active
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource     'code':
fault.get('code', _('Unknown'))
2015-08-13 08:26:16.303 31543 ERROR heat.engine.resource
ResourceInError: Went to status ERROR due to "Message: No valid host was
found. There are not enough hosts available., Code: 500"

And this in turn is being caused by the compute instance running out of
disk space:

2015-08-13 08:26:15.216 DEBUG nova.filters
[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin] Starting with 1
host(s) get_filtered_objects /opt/stack/new/nova/nova/filters.py:70
2015-08-13 08:26:15.217 DEBUG nova.filters
[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin] Filter
RetryFilter returned 1 host(s) get_filtered_objects
/opt/stack/new/nova/nova/filters.py:84
2015-08-13 08:26:15.217 DEBUG nova.filters
[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin] Filter
AvailabilityZoneFilter returned 1 host(s) get_filtered_objects
/opt/stack/new/nova/nova/filters.py:84
2015-08-13 08:26:15.217 DEBUG nova.filters
[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin] Filter RamFilter
returned 1 host(s) get_filtered_objects
/opt/stack/new/nova/nova/filters.py:84
2015-08-13 08:26:15.218 DEBUG nova.scheduler.filters.disk_filter
[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin]
(devstack-trusty-rax-dfw-4299602, devstack-trusty-rax-dfw-4299602)
ram:5172 disk:17408 io_ops:0 instances:1 does not have 20480 MB usable
disk, it only has 17408.0 MB usable disk. host_passes
/opt/stack/new/nova/nova/scheduler/filters/disk_filter.py:60
2015-08-13 08:26:15.218 INFO nova.filters
[req-e5bb52cb-387e-4638-911e-8c72aa1b6400 admin admin] Filter DiskFilter
returned 0 hosts

For now a recheck seems to work about 1 in 2, so we can still land patches.

The fix for this could be to clean up our Magnum devstack install more
aggressively, which might be as simple as cleaning up the images we use,
or get infra to provide our tests with a larger disk size. I will
probably test out a patch today which cleans up the images we use in
devstack to see if that helps.

If anyone can help progress this let me know.

Cheers,
Tom



__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: OpenStack-dev-request at lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20150813/fb9c22bd/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20150813/fb9c22bd/attachment-0001.gif>


More information about the OpenStack-dev mailing list