[openstack-dev] Quantum in Full Gate Issues (was Re: Gerrit's Jenkins should stop running tests after first failure)
Miguel Lavalle
miguel at mlavalle.com
Wed Jun 19 15:24:21 UTC 2013
No, we haven't put this document in etherpad. But it's a great idea. This
way other folks can jump in with ideas. I'll put it up in eteherpad over
the next few days
On Wed, Jun 19, 2013 at 10:10 AM, Sean Dague <sean at dague.net> wrote:
> This is great stuff! You guys have this up in etherpad somewhere?
>
> Really great to see people organizing around this. I'd reviewed a bunch of
> Jordan's code, so I knew he was in the mix, sorry for not realizing there
> were other folks working on this as well.
>
> -Sean
>
>
> On 06/19/2013 10:27 AM, Miguel Lavalle wrote:
>
>> Hi,
>>
>> As of last week, we have organized a team to deal with the Quantum in
>> full gate issues. There are 3 people working on this: Jordan Pittier,
>> Ala.Rezmerita and myself. Below you will find a document describing all
>> the identified issues and who is assigned to fix them. Of cource, we can
>> use more hands / heads. If anyone wants to help or has identified other
>> issues not listed below, please get in touch with us.
>>
>> Regards
>>
>> Fix Jenkins gate-tempest-devstack-vm-**quantum-full
>>
>>
>> This document lists all the Tempest tests that are failing on a Devstack
>> + Quantum Trunk setup. The goal is to make these tests pass for the
>> Havana Milestone 3. Alla, Miguel and Jordan are working on this.
>>
>>
>> For each test :
>>
>> *
>>
>> Provide the full path of the test. If the test has a double
>> interface (JSON and XML), it’s enough to provide only the JSON path.
>>
>> *
>>
>> Provide a small stacktrace or a link to a Paste service such as
>> http://paste.openstack.org/
>>
>> *
>>
>> Would be great to provide the CURL URL that triggers the bug
>>
>> *
>>
>> Provide a small analysis of what the bug may be and the components
>> involved (Quantum, Quantum API on Nova’s side, Nova-network etc.)
>>
>> *
>>
>> As far as possible, provide the file and the line number where the
>> Python exception is raised or translated (“casted”)
>>
>> *
>>
>> If already filed, an URL to the bug report in Launchpad
>>
>>
>> Tests related to the Fixed IPs Compute API extension (Miguel)
>>
>>
>> 1)
>> tempest.api.compute.admin.**test_fixed_ips:**FixedIPsTestJson.test_list_*
>> *fixed_ip_details
>>
>> *
>>
>> Trace: http://paste.openstack.org/**show/38363/<http://paste.openstack.org/show/38363/>
>>
>> *
>>
>> curl-H "X-Auth-Token:$TOKEN" -X GET
>> http://$IP:8774/v2/$TENANT_ID/**os-fixed-ips/10.0.0.3 <
>> http://10.0.0.3>
>>
>> *
>>
>> API: http://api.openstack.org/api-**ref.html#ext-os-fixed-ips<http://api.openstack.org/api-ref.html#ext-os-fixed-ips>
>>
>>
>> *
>>
>> Explanation: Call to
>> nova.api.openstack.compute.**contrib.fixed_ips.**
>> FixedIPController::show()
>> This file doesn’t use the Quantum API nor the Nova-Network API. It
>> interacts directly with the DB, which is bad.
>>
>> *
>>
>> A possible fix would be to :
>>
>> 1.
>>
>>
>> Change
>> nova.api.openstack.compute.**contrib.fixed_ips.**
>> FixedIPController::show()
>> to use either nova.network.api.API::get_**fixed_ip() (for Nova
>> Network) or nova.network.quantumv2.api.**API:get_fixed_ip() (for
>> Nova Network)
>>
>> 2.
>>
>>
>> Implement nova.network.quantumv2.api.**API:get_fixed_ip() which
>> currently raises a NotImplementedError exception
>>
>> 2)
>> tempest.api.compute.admin.**test_fixed_ips:**FixedIPsTestJson.test_set_**
>> reserve
>>
>> *
>>
>> Trace: http://paste.openstack.org/**show/38372/<http://paste.openstack.org/show/38372/>
>>
>> *
>>
>> curl-H "X-Auth-Token:$TOKEN" -X POST
>> http://$IP:8774/v2/$TENANT_ID/**os-fixed-ips/10.0.0.3/action
>> <http://10.0.0.3/action>
>>
>> *
>>
>> API: http://api.openstack.org/api-**ref.html#ext-os-fixed-ips<http://api.openstack.org/api-ref.html#ext-os-fixed-ips>
>>
>>
>> *
>>
>> Possible Fix: Should call
>> nova.api.openstack.compute.**contrib.fixed_ips.**
>> FixedIPController::_set_**reserved()
>> (once the result of db.fixed_ip_get_by_address() is made through the
>> API). Here the problem is that neither Nova-Network nor Quatum API
>> implement an equivalent of db.fixed_ip_update()
>>
>> 3)
>> tempest.api.compute.admin.**test_fixed_ips:**FixedIPsTestJson.test_set_**
>> unreserve
>>
>> *
>>
>> Same as 2)
>>
>>
>>
>> Tests related to Quotas Admin(Miguel)
>>
>>
>> 4)tempest.api.compute.admin.**test_quotas:**QuotasAdminTestJSON.test_**
>> security_groups_exceed_limit
>>
>> 5)tempest.api.compute.admin.**test_quotas:**QuotasAdminTestJSON.test_**
>> security_groups_rules_exceed_**limit
>>
>>
>> Tests related to Floating IPs
>>
>>
>> 6)tempest.api.compute.**floating_ips.test_floating_**ips_actions:**
>> FloatingIPsTestJSON.test_**associate_ip_to_server_**
>> without_passing_floating_ip
>>
>> *
>>
>> Trace Tempest http://paste.openstack.org/**show/38431/<http://paste.openstack.org/show/38431/>
>>
>> *
>>
>> Bug : https://bugs.launchpad.net/**quantum/+bug/1190242<https://bugs.launchpad.net/quantum/+bug/1190242>
>>
>> *
>>
>> curl-H "Content-Type: application/json" -H "X-Auth-Token:$TOKEN" -X
>> POST http://$IP:8774/v2/$TENANT_ID/**servers/$SERVER_ID/action
>> <http://10.1.59.157:8774/v2/**de6c5fbc55a34cbcaa3d79eb6b21a7**
>> 84/servers/0b2ad3b6-c14a-4d89-**b2a0-c015f0a88a1f/action<http://10.1.59.157:8774/v2/de6c5fbc55a34cbcaa3d79eb6b21a784/servers/0b2ad3b6-c14a-4d89-b2a0-c015f0a88a1f/action>
>> >-d
>> ‘{"addFloatingIp": {"address": ""}}’
>>
>>
>> *
>>
>> Notice that the address of the FloatingIP is empty.
>>
>> *
>>
>> API : http://api.openstack.org/api-**ref.html#ext-floating-ips<http://api.openstack.org/api-ref.html#ext-floating-ips>
>>
>>
>> *
>>
>> Explanation : Mismatch in the 2 API. If there is only one IP in the
>> pool, Quantum allows the floating IP to be empty and returns the one
>> and only IP in the pool. Nova-network doesn’t allow this and returns
>> a 404
>>
>> *
>>
>> Review https://review.openstack.org/#**/c/32740/<https://review.openstack.org/#/c/32740/>
>>
>>
>> 7)tempest.api.compute.**floating_ips.test_floating_**ips_actions:**
>> FloatingIPsTestJSON.test_**delete_floating_ip
>>
>> *
>>
>> Trace Tempest : http://paste.openstack.org/**show/38505/<http://paste.openstack.org/show/38505/>
>>
>> *
>>
>> Bug : https://bugs.launchpad.net/**tempest/+bug/1160309(see<https://bugs.launchpad.net/tempest/+bug/1160309(see>comment #2)
>>
>>
>> *
>>
>> This is related to bug 9)
>>
>>
>> 8)tempest.api.compute.**floating_ips.test_floating_**ips_actions:**
>> FloatingIPsTestJSON.test_**delete_nonexistant_floating_ip
>>
>>
>> *
>>
>> Related to 9)
>>
>>
>> 9)tempest.api.compute.**floating_ips.test_list_**floating_ips:**
>> FloatingIPDetailsTestJSON.**test_get_nonexistant_floating_**ip_details
>>
>> *
>>
>> Bug :https://bugs.launchpad.net/**tempest/+bug/1160309<https://bugs.launchpad.net/tempest/+bug/1160309>
>>
>> *
>>
>> Trace tempest:http://paste.**openstack.org/show/38430/<http://paste.openstack.org/show/38430/>
>>
>> *
>>
>> Trace nova: http://paste.openstack.org/**show/38433/<http://paste.openstack.org/show/38433/>
>>
>>
>> *
>>
>> Curl:curl -H "X-Auth-Token:$TOKEN" -X GET
>> http://$IP:8774/v2/$PROJECT_**ID/os-floating-ips/**99987878787878
>>
>> Proposed Fix: https://review.openstack.org/**33024<https://review.openstack.org/33024>
>>
>>
>> Tests Related to Security Groups (Jordan)
>>
>>
>> 10)tempest.api.compute.**security_groups.test_security_**group_rules:**
>> SecurityGroupRulesTestJSON.**test_security_group_rules_**
>> create_with_invalid_id
>>
>> *
>>
>> TRACE: http://paste.openstack.org/**show/38373/<http://paste.openstack.org/show/38373/>
>>
>> *
>>
>> curl-H "Content-Type: application/json" -H "X-Auth-Token:$TOKEN" -X
>>
>> POST http://$IP:8774/v2/$TENANT_ID/**os-security-group-rules -d
>> ‘{"security_group_rule": {"from_port": 22, "ip_protocol": "tcp",
>> "to_port": 22, "parent_group_id": "9991393497170", "cidr": null,
>> "group_id": null}}’
>>
>> *
>>
>> Explanation: Notice that the parent_goup_id is a numerical ID and
>> not a UUID. Quantum has an additional check to validate that the ID
>> is an UUID (see
>> nova/network/security_group/**quantum_driver.py::validate_**id())
>>
>> *
>>
>> API: http://api.openstack.org/api-**ref.html#ext-os-security-**groups<http://api.openstack.org/api-ref.html#ext-os-security-groups>
>>
>> *
>>
>> Bug: https://bugs.launchpad.net/**tempest/+bug/1182384<https://bugs.launchpad.net/tempest/+bug/1182384>
>>
>>
>> *
>>
>> Review: https://review.openstack.org/#**/c/29899/<https://review.openstack.org/#/c/29899/>
>>
>>
>> 11)tempest.api.compute.**security_groups.test_security_**group_rules:**
>> SecurityGroupRulesTestJSON.**test_security_group_rules_**
>> delete_with_invalid_id
>>
>> *
>>
>> http://paste.openstack.org/**show/38424/<http://paste.openstack.org/show/38424/>
>>
>>
>> *
>>
>> curl -H "X-Auth-Token:$TOKEN" -X DELETE
>> http://$IP:8774/v2$TENANT_ID/**os-security-group-rules/**
>> 9991407551273
>>
>> *
>>
>> Explanation : Same bug as 10
>>
>> *
>>
>> API: http://api.openstack.org/api-**ref.html#ext-os-security-**groups<http://api.openstack.org/api-ref.html#ext-os-security-groups>
>>
>>
>> *
>>
>> Review: https://review.openstack.org/#**/c/29899/<https://review.openstack.org/#/c/29899/>
>>
>> 12)tempest.api.compute.**security_groups.test_security_**
>> groups:SecurityGroupsTestJSON.**test_delete_nonexistant_**security_group
>>
>> *
>>
>> Same as bug 10
>>
>>
>> *
>>
>> Review: https://review.openstack.org/#**/c/29899/<https://review.openstack.org/#/c/29899/>
>>
>>
>> 13)tempest.api.compute.**security_groups.test_security_**
>> groups:SecurityGroupsTestJSON.**test_security_group_get_**
>> nonexistant_group
>>
>> *
>>
>> Same as bug 10
>>
>>
>> *
>>
>> Review: https://review.openstack.org/#**/c/29899/<https://review.openstack.org/#/c/29899/>
>>
>>
>> 14)tempest.api.compute.**security_groups.test_security_**
>> groups:SecurityGroupsTestJSON.**test_security_group_create_**
>> with_duplicate_name
>>
>> Security Group with duplicate name should not be created, but two groups
>> with the same name can be created in quantum. We have here the same
>> problem as in 15 and in 16. With Quantum, there is no validation that a
>> group with given name exists already or if the given SG name is empty
>> or is composed of white spaces or is more than 255 chars.
>>
>>
>> In the description of bug
>> https://bugs.launchpad.net/**nova/+bug/1161411this<https://bugs.launchpad.net/nova/+bug/1161411this>issue is generally
>>
>> discussed. SecurityGroup API are based on the ID and not the names,
>> except for adding an instance to a security group. In order to solve
>> the last problem the bug https://bugs.launchpad.net/**
>> nova/+bug/1161473was <https://bugs.launchpad.net/nova/+bug/1161473was>
>> added.
>>
>>
>> The major question is if these 3 tests (14, 15, 16) : does the name of a
>> security group is really that important? If so, we must add some
>> validation methods. If not the test suit concerning this part must be
>> disable in tempest. What do you think Miguel?
>>
>>
>>
>> 15)tempest.api.compute.**security_groups.test_security_**
>> groups:SecurityGroupsTestJSON.**test_security_group_create_**
>> with_invalid_group_description
>>
>> 16)tempest.api.compute.**security_groups.test_security_**
>> groups:SecurityGroupsTestJSON.**test_security_group_create_**
>> with_invalid_group_name
>>
>> BUG:https://bugs.launchpad.**net/nova/+bug/1161411+<https://bugs.launchpad.net/nova/+bug/1161411+>
>> https://bugs.launchpad.net/**nova/+bug/1161473<https://bugs.launchpad.net/nova/+bug/1161473>
>>
>> Traceback(tempest) :http://paste.openstack.org/**show/38423/<http://paste.openstack.org/show/38423/>
>>
>> The Security Group should not be created with group name an empty string
>> or with white spaces/chars more than 255
>>
>> CURL:curl -H "Content-Type: application/json" -H "X-Auth-Token:$TOKEN"
>> -X POST http://$IP:8774/v2/$PROJECT_**ID/os-security-groups -d
>> '{"security_group": {"name": " ", "description":
>> "description-1554950088"}}'
>>
>> curl -H "Content-Type: application/json" -H "X-Auth-Token:$TOKEN" -X
>> POST http://$IP:8774/v2/$PROJECT_**ID/os-security-groups -d
>> '{"security_group": {"name": " ", "description":
>> "description-1554950088"}}'
>>
>>
>> 17)tempest.api.compute.**security_groups.test_security_**
>> groups:SecurityGroupsTestJSON.**test_server_security_groups
>>
>> *
>>
>> Tempest Trace : http://paste.openstack.org/**show/38427/<http://paste.openstack.org/show/38427/>
>>
>> *
>>
>> Fixed by https://review.openstack.org/#**/c/32288/(merged<https://review.openstack.org/#/c/32288/(merged>on June,12th)
>>
>>
>>
>>
>> Tests related to servers(Ala)
>>
>>
>> 18)tempest.api.compute.**servers.test_list_server_**filters:**
>> ListServerFiltersTestXML.test_**list_servers_filtered_by_ip_**regex
>>
>> *
>>
>> BUG: https://bugs.launchpad.net/**quantum/+bug/1182883<https://bugs.launchpad.net/quantum/+bug/1182883>
>>
>> *
>>
>> BP: https://blueprints.launchpad.**net/quantum/+spec/like-op-list<https://blueprints.launchpad.net/quantum/+spec/like-op-list>
>>
>>
>> *
>>
>> CURL : GET http://$IP:8774/v2/$PROJECT_**ID/servers?ip=10.
>>
>> *
>>
>> Explanation: The regex search is not supported by Quantum. Thus
>> Quantum returns a 404 Not Found (0 server match) where Tempest
>> expects one server to be found.
>>
>> *
>>
>> Possible Fix : For "search port by IP with regex" feature, I think
>> the best place to hack it would be in file
>> quantum/db/db_base_plugin_v2.**py::_get_ports_query()
>>
>>
>> 19)tempest.api.compute.**servers.test_servers_negative:**
>> ServersNegativeTest.test_**create_with_nonexistent_**security_group
>>
>> FIXED : https://review.openstack.org/#**/c/30271/<https://review.openstack.org/#/c/30271/>
>>
>>
>> 20)tempest.api.compute.**servers.test_virtual_**interfaces:**
>> VirtualInterfacesTestXML.test_**list_virtual_interfaces
>>
>> *
>>
>> TRACE (NOVA): http://paste.openstack.org/**show/38371/<http://paste.openstack.org/show/38371/>
>>
>> *
>>
>> BUG: https://bugs.launchpad.net/**tempest/+bug/1183436<https://bugs.launchpad.net/tempest/+bug/1183436>
>>
>>
>> *
>>
>> CURL: GET
>> http://$IP:8774/v2/$PROJECT_**ID/servers/$SERVER/os-virtual-**
>> interfaces
>>
>> *
>>
>> Explanation: This HTTP request calls the Quantum API
>> (nova/nova/network/quantumv2/**api.py) and specifically the
>> get_vifs_by_* methods which are not implemented (raise
>> NotImplementedError())
>>
>> *
>>
>> Possible Fix:
>>
>>
>> o
>>
>> skip this test if Quantum is enabled as set in Tempest
>> configuration. Or
>>
>> o
>>
>> Implement the get_vifs_by_* methods
>>
>> PATCH: (still not approved) https://review.openstack.org/#**/c/31755/<https://review.openstack.org/#/c/31755/>
>>
>>
>> On Tue, Jun 18, 2013 at 5:56 PM, Sean Dague <sean at dague.net
>> <mailto:sean at dague.net>> wrote:
>>
>> On 06/18/2013 03:32 PM, Monty Taylor wrote:
>>
>>
>>
>> On 06/18/2013 03:14 PM, David Ripton wrote:
>>
>> On 06/18/2013 12:43 PM, Martina Kollarova wrote:
>>
>> Jenkins keeps running all the tests, even if the basic
>> pep8 test fails,
>> and runs all of the (very slow) Tempest Quantum tests,
>> even though
>> almost all of them are failing.
>>
>> I propose that it should fail and stop all of the other
>> tests once there
>> is a failure in a voting test. For non-voting tests, it
>> should stop only
>> itself, not the others.
>>
>> This would decrease the feedback loop and we wouldn't
>> have to wait for
>> the non-voting Quantum tests to see that they failed as
>> always.
>>
>>
>> -1
>>
>> In addition to the other objections, we currently get a lot
>> of false
>> positives (fail, retry, fail, retry, succeed), and it would
>> be harder to
>> debug these problems if the output was truncated differently
>> each time.
>>
>> Is anyone working on fixing the perma-failing Quantum test?
>> When the
>> Postgres test was perma-failing, one of the infrastructure
>> folks gave us
>> an ultimatum that if nobody fixed it soon, it would be
>> disabled. (Happy
>> ending: Mauro fixed it before it got disabled.)
>>
>>
>> That was brought up a little while ago, but we had already spent
>> so much
>> effort to get it working in the first place, none of us had the
>> heart to
>> put in such an ultimatum. But seriously- it might be time for an
>> all-hands-on-deck dogpile to figure out what's up the the
>> quantum gate.
>>
>>
>> The biggest cause of the Quantum vs. Full Tempest runs is that a lot
>> of the network api's in nova currently don't do translation of
>> errors. So under nova-network certain data validation and error
>> codes are returned, when quantum is used others are returned.
>>
>> This is a nova-api, so it needs to be consistent regardless of
>> backend (i.e. we don't return different API responses on different
>> databases).
>>
>> Issues like this one -
>> https://bugs.launchpad.net/__**nova/+bug/1160309<https://bugs.launchpad.net/__nova/+bug/1160309>
>>
>> <https://bugs.launchpad.net/**nova/+bug/1160309<https://bugs.launchpad.net/nova/+bug/1160309>
>> >
>>
>> Jordan Pittier has been working on some of these issues (he's the
>> only one I've seen working them from a Tempest / nova side), and got
>> to the crux of the problem. It could use more hands though to
>> organize the rest of those and get them banged out.
>>
>> I'm sure there are other issues once we get past this class. But
>> that would go a long way.
>>
>> -Sean
>>
>> --
>> Sean Dague
>> http://dague.net
>>
>> ______________________________**___________________
>> OpenStack-dev mailing list
>> OpenStack-dev at lists.openstack.**__org
>> <mailto:OpenStack-dev at lists.**openstack.org<OpenStack-dev at lists.openstack.org>
>> >
>> http://lists.openstack.org/__**cgi-bin/mailman/listinfo/__**
>> openstack-dev<http://lists.openstack.org/__cgi-bin/mailman/listinfo/__openstack-dev><
>> http://lists.openstack.org/**cgi-bin/mailman/listinfo/**openstack-dev<http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev>
>> >
>>
>>
>>
>>
>>
>> ______________________________**_________________
>> OpenStack-dev mailing list
>> OpenStack-dev at lists.openstack.**org <OpenStack-dev at lists.openstack.org>
>> http://lists.openstack.org/**cgi-bin/mailman/listinfo/**openstack-dev<http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev>
>>
>>
>
> --
> Sean Dague
> http://dague.net
>
> ______________________________**_________________
> OpenStack-dev mailing list
> OpenStack-dev at lists.openstack.**org <OpenStack-dev at lists.openstack.org>
> http://lists.openstack.org/**cgi-bin/mailman/listinfo/**openstack-dev<http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20130619/a230877f/attachment-0001.html>
More information about the OpenStack-dev
mailing list