Hola,
Les comento el problema con el que estoy lidiando para ver si alguien puede darme una mano.
Empezamos a tener problemas con el DHCP en nuestro cluster, de repente muchas de las instancias dejaron de obtener IP. Este es el log del servidor DHCP:
2017-04-26 09:25:07.948 18193 ERROR neutron.plugins.openvswitch.agent.ovs_neutron_agent [-] No local VLAN available for net-id=664f64fd-9844-4f1e-a040-9bac2dfa83bb
2017-04-26 09:25:07.951 18193 ERROR neutron.plugins.openvswitch.agent.ovs_neutron_agent [-] Error while processing VIF ports
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent Traceback (most recent call last):
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent File "/usr/lib/python2.7/dist-packages/neutron/plugins/openvswitch/agent/ovs_neutron_agent.py", line 1345, in rpc_loop
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent ovs_restarted)
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent File "/usr/lib/python2.7/dist-packages/neutron/plugins/openvswitch/agent/ovs_neutron_agent.py", line 1160, in process_network_ports
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent devices_added_updated, ovs_restarted)
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent File "/usr/lib/python2.7/dist-packages/neutron/plugins/openvswitch/agent/ovs_neutron_agent.py", line 1051, in treat_devices_added_or_updated
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent ovs_restarted)
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent File "/usr/lib/python2.7/dist-packages/neutron/plugins/openvswitch/agent/ovs_neutron_agent.py", line 957, in treat_vif_port
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent ovs_restarted)
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent File
"/usr/lib/python2.7/dist-packages/neutron/plugins/openvswitch/agent/ovs_neutron_agent.py", line 620, in port_bound
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent lvm = self.local_vlan_map[net_uuid]
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent KeyError: u'664f64fd-9844-4f1e-a040-9bac2dfa83bb'
2017-04-26 09:25:07.951 18193 TRACE neutron.plugins.openvswitch.agent.ovs_neutron_agent
2017-04-26 09:25:07.952 18193 INFO neutron.plugins.openvswitch.agent.ovs_neutron_agent [-] Agent out of sync with plugin!
Al revisar los puertos puedo ver lo siguiente:
# neutron --os-tenant-name tenant-340856 port-list
+--------------------------------------+------+-------------------+--------------------------------------------------------------------------------------+
| id | name | mac_address | fixed_ips |
+--------------------------------------+------+-------------------+--------------------------------------------------------------------------------------+
| 5b5a2eea-2604-4120-afa4-53bb242d1a97 | | fa:16:3e:f8:42:bf | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.5"} |
| 9892ca71-24c4-4d37-805f-d96deba775e8 | | fa:16:3e:67:4d:af | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.1"} |
| d8f623d0-9570-4049-992c-8faa6a35da78 | | fa:16:3e:99:97:94 | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.2"} |
| fcd2b665-c161-40e0-bfc5-7a737a6abaa8 | | fa:16:3e:13:b5:32 | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.6"} |
+--------------------------------------+------+-------------------+--------------------------------------------------------------------------------------+
# neutron --os-tenant-id cb06a97924f74001817c22ffb99d8b4c port-show 5b5a2eea-2604-4120-afa4-53bb242d1a97
+-----------------------+--------------------------------------------------------------------------------------+
| Field | Value |
+-----------------------+--------------------------------------------------------------------------------------+
| admin_state_up | True |
| allowed_address_pairs | |
| binding:vnic_type | normal |
| device_id | bdb86d6a-32e7-46c3-97bc-4a094e4e3ca3 |
| device_owner | compute:None |
| extra_dhcp_opts | |
| fixed_ips | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.5"} |
| id | 5b5a2eea-2604-4120-afa4-53bb242d1a97 |
| mac_address | fa:16:3e:f8:42:bf |
| name | |
| network_id | 664f64fd-9844-4f1e-a040-9bac2dfa83bb |
| security_groups | 4940d595-853a-4735-9347-e6eb2e62d8e8 |
| | 8c552636-b33b-4241-8a76-89ec83104105 |
| status | ACTIVE |
| tenant_id | cb06a97924f74001817c22ffb99d8b4c |
+-----------------------+--------------------------------------------------------------------------------------+
# neutron --os-tenant-id cb06a97924f74001817c22ffb99d8b4c port-show 9892ca71-24c4-4d37-805f-d96deba775e8
+-----------------------+--------------------------------------------------------------------------------------+
| Field | Value |
+-----------------------+--------------------------------------------------------------------------------------+
| admin_state_up | True |
| allowed_address_pairs | |
| binding:vnic_type | normal |
| device_id | c5064be4-82dd-48bb-8a48-fd677673ba1e |
| device_owner | network:router_interface |
| extra_dhcp_opts | |
| fixed_ips | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.1"} |
| id | 9892ca71-24c4-4d37-805f-d96deba775e8 |
| mac_address | fa:16:3e:67:4d:af |
| name | |
| network_id | 664f64fd-9844-4f1e-a040-9bac2dfa83bb |
| security_groups | |
| status | ACTIVE |
| tenant_id | cb06a97924f74001817c22ffb99d8b4c |
+-----------------------+--------------------------------------------------------------------------------------+
# neutron --os-tenant-id cb06a97924f74001817c22ffb99d8b4c port-show d8f623d0-9570-4049-992c-8faa6a35da78
+-----------------------+--------------------------------------------------------------------------------------+
| Field | Value |
+-----------------------+--------------------------------------------------------------------------------------+
| admin_state_up | True |
| allowed_address_pairs | |
| binding:vnic_type | normal |
| device_id | |
| device_owner | |
| extra_dhcp_opts | |
| fixed_ips | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.2"} |
| id | d8f623d0-9570-4049-992c-8faa6a35da78 |
| mac_address | fa:16:3e:99:97:94 |
| name | |
| network_id | 664f64fd-9844-4f1e-a040-9bac2dfa83bb |
| security_groups | 4940d595-853a-4735-9347-e6eb2e62d8e8 |
| status | DOWN |
| tenant_id | cb06a97924f74001817c22ffb99d8b4c |
+-----------------------+--------------------------------------------------------------------------------------+
# neutron --os-tenant-id cb06a97924f74001817c22ffb99d8b4c port-show fcd2b665-c161-40e0-bfc5-7a737a6abaa8
+-----------------------+--------------------------------------------------------------------------------------+
| Field | Value |
+-----------------------+--------------------------------------------------------------------------------------+
| admin_state_up | True |
| allowed_address_pairs | |
| binding:vnic_type | normal |
| device_id | reserved_dhcp_port |
| device_owner | network:dhcp |
| extra_dhcp_opts | |
| fixed_ips | {"subnet_id": "28fe403f-b06a-4d58-b6ba-3f1002634838", "ip_address": "192.168.200.6"} |
| id | fcd2b665-c161-40e0-bfc5-7a737a6abaa8 |
| mac_address | fa:16:3e:13:b5:32 |
| name | |
| network_id | 664f64fd-9844-4f1e-a040-9bac2dfa83bb |
| security_groups | |
| status | BUILD |
| tenant_id | cb06a97924f74001817c22ffb99d8b4c |
+-----------------------+--------------------------------------------------------------------------------------+
Como se ve el puerto correspondiente al DHCP queda en estado BUILD.
En el servidor de DHCP, veo que empieza a sincronizarse hasta que falla con el error del inicio de este mensaje, desanctiva todos los puertos y comienza a activarlos y así sucesivamente.
Alguien tuvo algún problema similar, ya se me están acabando las ideas, agradecería una pista por donde poder buscar la solución.
Gracias!