On Thu, 2024-08-15 at 01:49 +0530, engineer2024 wrote:
> Thanks to both of you for the response . I try to insert pdb breakpoint at
> some point in the code files and start the service. Then I issue a nova
> command to check how the flow works. From then on I use the pdb commands to
> check the function calls. This to me gives a very convenient way of
> learning the code connections, since I don't have an idea of how these
> functions and modules r related. U people as developers have a design to
> start with and work on from there. So u may know the purpose of each
> function, class, etc.
>
> Since nova api doesn't support pdb becos of eventlet design, I learned to
> print those objects and find their information.
>
> But with pdb, it's a smooth flow to inspect the class attributes, their
> functions, arguments, types etc. I can even test those objects the python
> way since pdb supports python prompt as well. Of course I do all this on
> dev systems though...
this is going to be a long responce and its a bit rambely but im goign to provide
you with the context and poitn er try and figure out how to do this for your self.
first thing to note is while its possibel it not trivial and the payoff is proably not
worth it in the long run. we do not have up to day docs for how to do this because
this is not how peole learn, contibute to ro develop nova in genral.
nova does have a remote debug facilitate that is sepreate for the eventlet backdor to enable
you to use an ide to debug remotely.
that was added around 2012 and has worked on an off to vering degrees since then.
the first thing to understand about nova and several other project is that they are a distibuted system
i.e. the applciaiton that compise a given service like nova run in multiple service which are interconnected
vai a message bus with in a service and rest apis are exposed for inter service comunication.
when you enable a debugger and stop on a breakpoitn only one of the process is stop and the other contiue
to exsculte which means that RPC calls, heathbeats and other asynconos tasks can and will timeout and fail.
you cannot debug a distibuted system in teh same way as you woudl a simple applction where all state
is executed in a signle main loop.
the act of observing the internal behavior of the nova api or nova-comptue agent will change its behavior.
nova has suppoort for remote debugging that was added about 12 years ago
https://wiki.openstack.org/wiki/Nova/RemoteDebugging
https://github.com/openstack/nova/blob/master/nova/debugger.py
with that supprot you can use an idea like pycharm to connect to the remote debugger and attempt to
debug a process.
im currently propsoing removing that as part of the removal of eventlet partly because it shoudl not be needed
when we dont have enventlet and mainly because it didnt work very well when i last used it.
some ides have supprot fo gevent
gevent is a fork of eventlet which was forked form some of hte libiares in stackless python
eventlet converts a single thread python interperater in a cooperative concurent multi userland process.
what does that mean , each python function is not operating in a seperate os stackframe, it has heap allcoated
stack frames that allow context swtichs between userland thread at any tiem a funnction woudl do blocking io
the reason single step debugging does not work well with gdb is that when you single step the eventlet engine
is often going to end up context switching into the midel of another function.
to work around this you just set breakpoint at the line you want to go to next and use continue instead fo single step
one of the rules when using eventlet is that if your going to monkey patch you need to monkey patch everyting early
in addtion too the remote debuging facilites we can disabl mokey patching entirly.
you can do that by settign OS_NOVA_DISABLE_EVENTLET_PATCHING=1
https://github.com/openstack/nova/blob/master/nova/monkey_patch.py#L87
this will entirly break nova-compute and several other nova process as they have
effectivly infinity loops to pool for rpc messags, monitor hyperviors ectra.
if the remote debugger is used it disables patching the thread lib
https://github.com/openstack/nova/blob/master/nova/monkey_patch.py#L44-L46
that is prartly why usign the remote debugger is kind fo buggy in itsself.
the code is not inteded to work that way.
these fucntionlatiy mainly exist to allow debuging tests not the actulaly running code.
if i need to debug test i normaly jsut comment out
https://github.com/openstack/nova/blob/master/nova/monkey_patch.py#L83-L89
and run the test in my ides(pycharm,vscodes) test debbuger.
i normally code in nano/emacs but is do use ides if i really cant figure out what happening
however i genreally start with just adding log lines.
when i first started workign on opentack in 2013 i very much liked using an ide and debugger
and i woudl often spend time trying to get nova to work in a debugger.
over tiem i generally found that that was not worth the effort.
keepign a devstack deployment working that way was often more work then doing print staement
or better writing functional or unit test to repoduce the problem.
you previous mails ask about debuging nova's api.
if you want to do that its one of the easiest thigns to debug.
first do _not_ use the wsgi script
adding apache of modwsgi is just going to make your life harder.
just disable eventlet monkey patching, by commetiing or usign the env varible
and run the nova-api console script directly under pdb or your debugger of choice.
that will allow you to basiclly single step debug without any issues as it will not be using eventlet
anymore and it has not infinitly loops or perodic tasks so it mostly just works when not using eventlet.
that becase it didnt use eventlet at all in the api until about 2016 and even today it only really uses it in one place
in my eventlet removal serise i split nova in to the part that use eventlet directly and the parts that dont in the
second path
https://review.opendev.org/c/openstack/nova/+/904424
anything under nova/cmd/eventlet/ needs eventlet today
everything under nova/cmd/standalone/ does not.
the approch of doign an api call and following it in a debugger is not a good way in my expericne to learn how somethign
like nova really works. it can hellp to a limtied degree but you cant eaislly debug across multipel processes.
a minium nova deployment has 4 process typically more.
you can try but each openstack service (nova, neutorn, cinder, ...) is typically compised of several micocservices
some like keystone and palcement are just rest apis in front of a db and dont use eventlet at all so are trivial to
debug as a result. the rest have non tirival runtime interactions that cant be as eaislly decompsed.
becuase of that nova and other project invest in our functional tests to allwo ues to emulate that multi process env
and better reason about the code.
the debugging functionality is documetned in the contibutor guide
https://github.com/openstack/nova/blob/master/doc/source/contributor/development-environment.rst#using-a-remote-debugger
but we dont really adverstise that because none of the core maintainers use that any more and it has not been maintained
for the better part of a decade. since we cant use this type of debugging with our ci system
if we cant debug why a failure happens form the ci logs we add more logs because thyat will be useful for future us
and operators in production. so our focus is on logging/monitoring for production rather then for developer eases
simply because of the large techdebt that eventlet creates in this aready.
i know nuetorn has had some better look with using debuggers then nova has,
keystone/placmnet are eventlest free adn are just python web apps/rest apis so should "just work"
in any python debugger.
nova is prehaps the hardest to debug because of the legacy of the proejct so its not the best palce to start looking
how this works. too be clear i have ran nova api, nova-conductor, nova scheduler and nova-compute all in 4 ide windows
with 4 debuggers at once
it was proably 3 year of working on openstack before i understood how to hack that ot work and since i learned
how to properly debug without a debuggger, logs, code inspection, unit/fucntioal testing
i have nver really need to do that again.
gaining the ablity to run openstack in a debugger simply again, in a maintainable way is deffienly one of the reason im
looking forward to removing eventlet but its still a lot of work.
im not sure how helpful this was but this is proably as clsoe to a blog post as you will get.
>
> On Thu, 15 Aug 2024, 01:04 Jeremy Stanley, <fungi@yuggoth.org> wrote:
>
> > On 2024-08-14 22:38:43 +0530 (+0530), engineer2024 wrote:
> > > How many more releases are you planning to include eventlet in
> > > them? Seems they are removing any support for further development
> > > according to their official website. So, is the community thinking
> > > of any alternatives?
> > [...]
> >
> > Just to expand on the first part of Sean's reply, most of what you
> > want to know is probably answered here:
> >
> > https://governance.openstack.org/tc/goals/proposed/remove-eventlet.html
> >
> > --
> > Jeremy Stanley
> >