<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body text="#000000" bgcolor="#FFFFFF">

    <div class="moz-cite-prefix">On 12/03/2012 05:32 PM, Gary Kotton

      wrote:<br>

    </div>

    <blockquote cite="mid:50BC71C8.2030903@redhat.com" type="cite">

      <meta content="text/html; charset=ISO-8859-1"

        http-equiv="Content-Type">

      On 12/03/2012 04:16 AM, gong yong sheng wrote:

      <blockquote cite="mid:50BC0B91.907@linux.vnet.ibm.com" type="cite">

        <meta content="text/html; charset=ISO-8859-1"

          http-equiv="Content-Type">

        <div class="moz-cite-prefix">On 12/03/2012 02:29 AM, Vinay

          Bannai wrote:<br>

        </div>

        <blockquote

cite="mid:CAO48XRG=UQbCEQaWx70SeKZynZ6Obd09J7VDpnogVn0cNHi7Jg@mail.gmail.com"

          type="cite">My understanding of the "scheduler" approach based

          on what I read on the ML's is to have a mechanism where the

          DHCP agents can land on different nodes. For example, just

          like we have compute hosts in nova, we have a bunch of DHCP

          capable hosts (and L3 capable hosts etc) that can be selected

          to host the network service for a tenant when the

          network/subnet is created. The process of selecting the host

          to run the service is based on a "scheduler". This allows a

          graceful horizontal scaling. This approach is similar to what

          nova does. You have a bunch of hosts capable of providing a

          network service and the "scheduler" picks them based on

          filters and other tunable knobs. I think you already know

          this:-). I  was spelling it out so that you can see where I am

          coming from. <br>

        </blockquote>

        If we don't want all dhcp agents to host the data of all the

        networks,<br>

        My Idea is:<br>

        1. let quantum server have the ability to know all about the

        dhcp agents. for example we can have quantum agents-list to show

        all the agents running in the quantum deloyment,<br>

        and the network they are hosting.<br>

      </blockquote>

      <br>

      ok, this may be useful for debugging. will this display and have

      the status of the dhcp agent, say for example i deploy and agent

      and it has an exception do to a bug? nn<br>

    </blockquote>

    I think it is good for management, not just good for debugging. If

    the agent cannot start, the quantum agents-list will not have it in

    a :) status. If it has exception and is still running, we can have

    it report its status with exception. But regarding the log,  I think

    we should leave it to upper management tools. For example, admin

    user can start the agent with integrated log facility.<br>

    <blockquote cite="mid:50BC71C8.2030903@redhat.com" type="cite"> <br>

      <blockquote cite="mid:50BC0B91.907@linux.vnet.ibm.com" type="cite">

        2. let admin user have the ability to config the dhcp agents

        what networks they should host.   For example, quantum

        dhcpagent-update dhcpagent1 --networks network1 network2

        network3. or quantum net-create network1 --dhcpagents agent1

        agent2. And if admin user does not specify which agent to host

        which network, we can let scheduler to decide automatically<br>

      </blockquote>

      <br>

      this is exactly what i am suggesting, except we do not need to

      change the quantum api to provide this. the agent can receive this

      as an input parameter. in principle we agree on what needs to be

      done, but the question is how.<br>

    </blockquote>

    I am suggesting to control agents from quantum server. we can use

    quantum cli or API to command which agents host which networks. Of

    course admin user also can configure the dhcp agents to accept only

    some networks data. We can support both.<br>

    We can add quantum api ( maybe by extension) and store the networks

    and dhcp agents mapping in db. then notify related dhcp agents with

    related data.<br>

    <blockquote cite="mid:50BC71C8.2030903@redhat.com" type="cite"> <br>

      <blockquote cite="mid:50BC0B91.907@linux.vnet.ibm.com" type="cite">

        So for scale vertically:<br>

        we can specify much agents host some same networks<br>

        So for scale horizontally:<br>

        we can add as many as dhcp agents. quantum scheduler will

        distribute new networks automatically or admin user can specify.<br>

      </blockquote>

      <br>

      i have a number of problems with a quantum "scheduler". the first

      being a single point of failure. the second being the fact that it

      needs to be aware of the state and load of a dhcp agent. how will

      the scheduler provide for HA?<br>

    </blockquote>

    What do you mean by single point of failure? In fact we don't need

    to run it in a separate binary at all. It is just a module of

    quantum-server. <br>

    state and load of a dhcp agent is in db. If you are meaning the

    quantum-server's HA, I don't think we already have a good one by

    now. I think we should break the current quantum-server into two

    parts:<br>

    1. one part is for just API (rest): for which we can use multiple

    process just like other nova api servers. and operator will also be

    able to use load balancer to HA it.<br>

    2. anyone part is for incoming queue message processing. We can run

    multiple this part nodes, this is the AMQP feature to scale out.<br>

    <blockquote cite="mid:50BC71C8.2030903@redhat.com" type="cite"> <br>

      <br>

      <blockquote cite="mid:50BC0B91.907@linux.vnet.ibm.com" type="cite">

        For us to run multiple dhcp agents, we need to make sure our

        dhcp anti spoofing work.<br>

        <blockquote

cite="mid:CAO48XRG=UQbCEQaWx70SeKZynZ6Obd09J7VDpnogVn0cNHi7Jg@mail.gmail.com"

          type="cite">

          <div> <br>

          </div>

          <div>Either way we look at it, I think it will be helpful if

            we decoupled the horizontal (scaling to multiple nodes) and

            vertical scaling (redundancy and failover). One should not

            imply the other. In your last paragraph, you mention

            "orchestration tool" and dhcp agents configured to handle

            specific networks. I have not been able to wrap my head

            around this completely but it appears to b ea different

            variant of the "scheduler" approach where it is configured

            manually. Is my understanding correct? Or if you don't mind,

            can you elaborate further on that idea. </div>

          <div><br>

          </div>

          <div>Thanks</div>

          <div>Vinay<br>

            <br>

            <div class="gmail_quote">On Sun, Dec 2, 2012 at 7:16 AM,

              Gary Kotton <span dir="ltr"><<a moz-do-not-send="true"

                  href="mailto:gkotton@redhat.com" target="_blank">gkotton@redhat.com</a>></span>

              wrote:<br>

              <blockquote class="gmail_quote" style="margin:0 0 0

                .8ex;border-left:1px #ccc solid;padding-left:1ex">

                <div bgcolor="#FFFFFF" text="#000000">

                  <div class="im"> On 12/01/2012 03:31 AM, gong yong

                    sheng wrote:

                    <blockquote type="cite">

                      <div>On 12/01/2012 07:49 AM, Vinay Bannai wrote:<br>

                      </div>

                      <blockquote type="cite">Gary and Mark,

                        <div><br>

                        </div>

                        <div>You brought up the issue of scaling

                          horizontally and vertically in your earlier

                          email. In the case of horizontal scaling, I

                          would agree that it would have to be based on

                          the "scheduler" approach proposed by Gong and

                          Nachi. <br>

                        </div>

                      </blockquote>

                    </blockquote>

                    <br>

                  </div>

                  I am not sure that I understand the need for a

                  scheduler when it comes to the DHCP agent.  In my

                  opinion this is unnecessary overhead and it is not

                  necessarily required. <br>

                  <br>

                  Last week Mark addressed the problem with all of the

                  DHCP agents all listening on the same message queue.

                  In theory we are able to run more than one DHCP agents

                  in parallel. This offers HA at the expense of an IP

                  per DHCP agent per subnet. <br>

                  <br>

                  I think that for the DHCP agents we need to look into

                  enabling the DHCP agents to treat specific networks.

                  This can be done in a very rudimentary way - have a

                  configuration variable for the DHCP agent indicating a

                  list of networks to be treated by the agent. A

                  orchestration tool can just configure the network ID's

                  and launch the service - then we will have scalable

                  and highly available DHCP service. I would prefer not

                  to have to add this into the Quantum API as it just

                  complicates things.

                  <div>

                    <div class="h5"><br>

                      <br>

                      <br>

                      <br>

                      <br>

                      <blockquote type="cite">

                        <blockquote type="cite">

                          <div>On the issue of vertical scaling (I am

                            using the DHCP redundancy as an example), I

                            think it would be good to base our

                            discussions on the various methods that have

                            been discussed and do pro/con analysis in

                            terms of scale, performance and other such

                            metrics. </div>

                          <div><br>

                          </div>

                          <div>- Split scope DHCP (two or more servers

                            split the IP address and there is no

                            overlap)</div>

                          <div>  pros: simple</div>

                          <div>  cons: wastes IP addresses,</div>

                          <div><br>

                          </div>

                          <div>- Active/Standby model (might have run

                            VRRP or hearbeats to dictate who is active)</div>

                          <div>  pros: load evenly shared</div>

                          <div>  cons: needs shared knowledge of address

                            assignments, </div>

                          <div>            need hearbeats or VRRP to

                            keep track of failovers</div>

                        </blockquote>

                        another one is the IP address waste. we need one

                        VIP, and 2+ more address for VRRP servers. ( we

                        can use dhcp server's ip if we don't want to do

                        load balancing behind the VRRP servers)<br>

                        another one is it will make system complicated.<br>

                        <blockquote type="cite">

                          <div><br>

                          </div>

                          <div>- LB method (use load balancer to fan out

                            to multiple dhcp servers)</div>

                          <div>  pros: scales very well </div>

                          <div>  cons: the lb becomes the single point

                            of failure,</div>

                          <div>           the lease assignments needs to

                            be shared between the dhcp servers</div>

                          <div><br>

                          </div>

                        </blockquote>

                        LB method will also wast ip address. First we at

                        lease need a VIP address. then we will need more

                        dhcp servers running for one network.<br>

                        If we need to VRRP the VIP, we will need 2+ more

                        addresses.<br>

                        another one is it will make system complicated.<br>

                        <blockquote type="cite">

                          <div>I see that the DHCP agent and the quantum

                            server communicate using RPC. Is the plan to

                            leave it alone or migrate it towards

                            something like AMQP based server in the

                            future when the "scheduler" stuff is

                            implemented. <br>

                          </div>

                        </blockquote>

                        I am not very clear your point. But current RPC

                        is on AMQP.<br>

                        <blockquote type="cite">

                          <div><br>

                          </div>

                          <div>Vinay</div>

                          <div><br>

                          </div>

                          <div><br>

                          </div>

                          <div class="gmail_quote">On Wed, Nov 28, 2012

                            at 8:03 AM, Mark McClain <span dir="ltr"><<a

                                moz-do-not-send="true"

                                href="mailto:mark.mcclain@dreamhost.com"

                                target="_blank">mark.mcclain@dreamhost.com</a>></span>

                            wrote:<br>

                            <blockquote class="gmail_quote"

                              style="margin:0 0 0 .8ex;border-left:1px

                              #ccc solid;padding-left:1ex">

                              <div><br>

                                On Nov 28, 2012, at 8:03 AM, gong yong

                                sheng <<a moz-do-not-send="true"

                                  href="mailto:gongysh@linux.vnet.ibm.com"

                                  target="_blank">gongysh@linux.vnet.ibm.com</a>>

                                wrote:<br>

                                <br>

                                > On 11/28/2012 08:11 AM, Mark

                                McClain wrote:<br>

                                >> On Nov 27, 2012, at 6:33 PM,

                                gong yong sheng <<a

                                  moz-do-not-send="true"

                                  href="mailto:gongysh@linux.vnet.ibm.com"

                                  target="_blank">gongysh@linux.vnet.ibm.com</a>>

                                wrote:<br>

                                >><br>

                                >> Just wanted to clarify two

                                items:<br>

                                >><br>

                                >>>> At the moment all of

                                the dhcp agents receive all of the

                                updates. I do not see why we need the

                                quantum service to indicate which agent

                                runs where. This will change the manner

                                in which the dhcp agents work.<br>

                                >>> No. currently, we can run

                                only one dhcp agent  since we are using

                                a topic queue for notification.<br>

                                >> You are correct.  There is a

                                bug in the underlying Oslo RPC

                                implementation that sets the topic and

                                queue names to be same value.  I didn't

                                get a clear explanation of this problem

                                until today and will have to figure out

                                a fix to oslo.<br>

                                >><br>

                                >>> And one problem with

                                multiple agents serving the same ip is:<br>

                                >>> we will have more than one

                                agents want to update the ip's leasetime

                                now and than.<br>

                                >> This is not a problem.  The

                                DHCP protocol was designed for multiple

                                servers on a network.  When a client

                                accepts a lease, the server that offered

                                the accepted lease will be the only

                                process attempting to update the lease

                                for that port.  The other DHCP instances

                                will not do anything, so there won't be

                                any chance for a conflict.  Also, when a

                                client renews it sends a unicast message

                                to that previous DHCP server and so

                                there will only be one writer in this

                                scenario too.  Additionally, we don't

                                have to worry about conflicting

                                assignments because the dhcp agents use

                                the same static allocations from the

                                Quantum database.<br>

                                > I mean dhcp agent is trying to

                                update leasetime to quantum server. If

                                we have more than one dhcp agents, this

                                will cause confusion.<br>

                                >    def update_lease(self,

                                network_id, ip_address, time_remaining):<br>

                                >        try:<br>

                                >          

                                 self.plugin_rpc.update_lease_expiration(network_id,

                                ip_address,<br>

                                >                                    

                                               time_remaining)<br>

                                >        except:<br>

                                >            self.needs_resync = True<br>

                                >            LOG.exception(_('Unable

                                to update lease'))<br>

                                > I think it is our dhcp agent's

                                defect. Why does our dhcp agent need the

                                lease time? all the IPs are managed in

                                our quantum server, there is not need

                                for dynamic ip management in dhcp server

                                managed by dhcp agent.<br>

                                <br>

                              </div>

                              There cannot be confusion.  The dhcp

                              client selects only one server to accept a

                              lease, so only one agent will update this

                              field at a time. (See RFC2131 section

                              4.3.2 for protocol specifics).  The

                              dnsmasq allocation database is static in

                              Quantum's setup, so the lease renewal

                              needs to propagate to the Quantum Server.

                               The Quantum server then uses the lease

                              time to avoid allocating IP addresses

                              before the lease has expired.  In Quantum,

                              we add an additional restriction that

                              expired allocations are not reclaimed

                              until the associated port has been deleted

                              as well.<br>

                              <div>

                                <div><br>

                                  mark<br>

                                  <br>

                                  <br>

_______________________________________________<br>

                                  OpenStack-dev mailing list<br>

                                  <a moz-do-not-send="true"

                                    href="mailto:OpenStack-dev@lists.openstack.org"

                                    target="_blank">OpenStack-dev@lists.openstack.org</a><br>

                                  <a moz-do-not-send="true"

                                    href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev"

                                    target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>

                                </div>

                              </div>

                            </blockquote>

                          </div>

                          <br>

                          <br clear="all">

                          <div><br>

                          </div>

                          -- <br>

                          Vinay Bannai<br>

                          Email: <a moz-do-not-send="true"

                            href="mailto:vbannai@gmail.com"

                            target="_blank">vbannai@gmail.com</a><br>

                          Google Voice: <a moz-do-not-send="true"

                            href="tel:415%20938%207576"

                            value="+14159387576" target="_blank">415 938

                            7576</a><br>

                          <br>

                          <br>

                          <fieldset></fieldset>

                          <br>

                          <pre>_______________________________________________

OpenStack-dev mailing list

<a moz-do-not-send="true" href="mailto:OpenStack-dev@lists.openstack.org" target="_blank">OpenStack-dev@lists.openstack.org</a>

<a moz-do-not-send="true" href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a>

</pre>

                        </blockquote>

                        <br>

                        <br>

                        <fieldset></fieldset>

                        <br>

                        <pre>_______________________________________________

OpenStack-dev mailing list

<a moz-do-not-send="true" href="mailto:OpenStack-dev@lists.openstack.org" target="_blank">OpenStack-dev@lists.openstack.org</a>

<a moz-do-not-send="true" href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a>

</pre>

                      </blockquote>

                      <br>

                    </div>

                  </div>

                </div>

                <br>

                _______________________________________________<br>

                OpenStack-dev mailing list<br>

                <a moz-do-not-send="true"

                  href="mailto:OpenStack-dev@lists.openstack.org">OpenStack-dev@lists.openstack.org</a><br>

                <a moz-do-not-send="true"

                  href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev"

                  target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>

                <br>

              </blockquote>

            </div>

            <br>

            <br clear="all">

            <div><br>

            </div>

            -- <br>

            Vinay Bannai<br>

            Email: <a moz-do-not-send="true"

              href="mailto:vbannai@gmail.com">vbannai@gmail.com</a><br>

            Google Voice: 415 938 7576<br>

            <br>

          </div>

          <br>

          <fieldset class="mimeAttachmentHeader"></fieldset>

          <br>

          <pre wrap="">_______________________________________________

OpenStack-dev mailing list

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:OpenStack-dev@lists.openstack.org">OpenStack-dev@lists.openstack.org</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a>

</pre>

        </blockquote>

        <br>

      </blockquote>

      <br>

    </blockquote>

    <br>

  </body>

</html>