<div dir="ltr"><br><div class="gmail_extra">Hi Roey, </div><div class="gmail_extra"><br></div><div class="gmail_extra">I made this change and have been running this fix on 4 different servers. I believe this fix works.  Things are working very smoothly. </div>
<div class="gmail_extra"><br></div><div class="gmail_extra" style>I think we need to incorporate this change into devstack scripts or capture it in the documentation so that it saves some grief to the next person. </div><div class="gmail_extra" style>
<br></div><div class="gmail_extra" style>Thanks</div><div class="gmail_extra" style>-Sukhdev</div><div class="gmail_extra" style><br></div><div class="gmail_extra"><br></div><div class="gmail_extra"><br><br><div class="gmail_quote">
On Tue, Mar 11, 2014 at 3:06 AM, Roey Chen <span dir="ltr"><<a href="mailto:roeyc@mellanox.com" target="_blank">roeyc@mellanox.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">






<div lang="EN-US" link="blue" vlink="purple">
<div>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">Forwarding the answer to the relevant mailing lists:<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">---<u></u><u></u></span></p><div class="">
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">Hi,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">Hope this could help,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">I've encountered this issue myself not to long ago on Ubuntu 12.04 host,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">it didn't happen again after messing with the Kernel Semaphore Limits parameters [1]:<u></u><u></u></span></p>

<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">Adding this [2] line to `/etc/sysctl.conf` seems to do the trick.<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">- Roey<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">[1] <a href="http://paste.openstack.org/show/73086/" target="_blank">http://paste.openstack.org/show/73086/</a><u></u><u></u></span></p>

<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">[2] <a href="http://paste.openstack.org/show/73082/" target="_blank">http://paste.openstack.org/show/73082/</a><u></u><u></u></span></p>

<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
</div><p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">From:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""> Sukhdev Kapur [mailto:<a href="mailto:sukhdevkapur@gmail.com" target="_blank">sukhdevkapur@gmail.com</a>]
<br>
<b>Sent:</b> Monday, March 10, 2014 5:56 PM<br>
<b>To:</b> Dane Leblanc (leblancd)<br>
<b>Cc:</b> OpenStack Development Mailing List (not for usage questions); <a href="mailto:openstack-infra@lists.openstack.org" target="_blank">openstack-infra@lists.openstack.org</a>; <a href="mailto:openstack-qa@lists.openstack.org" target="_blank">openstack-qa@lists.openstack.org</a></span></p>
<div class=""><br>
<b>Subject:</b> Re: [OpenStack-Infra] tgt restart fails in Cinder startup "start: job failed to start"<u></u><u></u></div><p></p>
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<p class="MsoNormal">I see the same issue. This issue has crept in during the latest flurry of check-ins. I started noticing this issue a day or two before the Icehouse Feature Freeze deadline.<u></u><u></u></p><div><div class="h5">

<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I tried restarting tgt as well, but, it does not help. <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">However, rebooting the VM helps clear it up.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Has anybody else seen it as well? Does anybody have a solution for it? <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Thanks<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">-Sukhdev<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div></div></div><div><div class="h5">
<div>
<p class="MsoNormal" style="margin-bottom:12.0pt"><u></u> <u></u></p>
<div>
<p class="MsoNormal">On Mon, Mar 10, 2014 at 8:37 AM, Dane Leblanc (leblancd) <<a href="mailto:leblancd@cisco.com" target="_blank">leblancd@cisco.com</a>> wrote:<u></u><u></u></p>
<p class="MsoNormal">I don't know if anyone can give me some troubleshooting advice with this issue.<br>
<br>
I'm seeing an occasional problem whereby after several DevStack <a href="http://unstack.sh/stack.sh" target="_blank">
unstack.sh/stack.sh</a> cycles, the tgt daemon (tgtd) fails to start during Cinder startup.  Here's a snippet from the stack.sh log:<br>
<br>
2014-03-10 07:09:45.214 | Starting Cinder<br>
2014-03-10 07:09:45.215 | + return 0<br>
2014-03-10 07:09:45.216 | + sudo rm -f /etc/tgt/conf.d/stack.conf<br>
2014-03-10 07:09:45.217 | + _configure_tgt_for_config_d<br>
2014-03-10 07:09:45.218 | + [[ ! -d /etc/tgt/stack.d/ ]]<br>
2014-03-10 07:09:45.219 | + is_ubuntu<br>
2014-03-10 07:09:45.220 | + [[ -z deb ]]<br>
2014-03-10 07:09:45.221 | + '[' deb = deb ']'<br>
2014-03-10 07:09:45.222 | + sudo service tgt restart<br>
2014-03-10 07:09:45.223 | stop: Unknown instance:<br>
2014-03-10 07:09:45.619 | start: Job failed to start<br>
jenkins@neutronpluginsci:~/devstack$ 2014-03-10 07:09:45.621 | + exit_trap<br>
2014-03-10 07:09:45.622 | + local r=1<br>
2014-03-10 07:09:45.623 | ++ jobs -p<br>
2014-03-10 07:09:45.624 | + jobs=<br>
2014-03-10 07:09:45.625 | + [[ -n '' ]]<br>
2014-03-10 07:09:45.626 | + exit 1<br>
<br>
If I try to restart tgt manually without success:<br>
<br>
jenkins@neutronpluginsci:~$ sudo service tgt restart<br>
stop: Unknown instance:<br>
start: Job failed to start<br>
jenkins@neutronpluginsci:~$ sudo tgtd<br>
librdmacm: couldn't read ABI version.<br>
librdmacm: assuming: 4<br>
CMA: unable to get RDMA device list<br>
(null): iser_ib_init(3263) Failed to initialize RDMA; load kernel modules?<br>
(null): fcoe_init(214) (null)<br>
(null): fcoe_create_interface(171) no interface specified.<br>
jenkins@neutronpluginsci:~$<br>
<br>
The config in /etc/tgt is:<br>
<br>
jenkins@neutronpluginsci:/etc/tgt$ ls -l<br>
total 8<br>
drwxr-xr-x 2 root root 4096 Mar 10 07:03 conf.d<br>
lrwxrwxrwx 1 root root   30 Mar 10 06:50 stack.d -> /opt/stack/data/cinder/volumes<br>
-rw-r--r-- 1 root root   58 Mar 10 07:07 targets.conf<br>
jenkins@neutronpluginsci:/etc/tgt$ cat targets.conf<br>
include /etc/tgt/conf.d/*.conf<br>
include /etc/tgt/stack.d/*<br>
jenkins@neutronpluginsci:/etc/tgt$ ls conf.d<br>
jenkins@neutronpluginsci:/etc/tgt$ ls /opt/stack/data/cinder/volumes<br>
jenkins@neutronpluginsci:/etc/tgt$<br>
<br>
I don't know if there's any missing Cinder config in my DevStack localrc files. Here's one that I'm using:<br>
<br>
MYSQL_PASSWORD=nova<br>
RABBIT_PASSWORD=nova<br>
SERVICE_TOKEN=nova<br>
SERVICE_PASSWORD=nova<br>
ADMIN_PASSWORD=nova<br>
ENABLED_SERVICES=g-api,g-reg,key,n-api,n-crt,n-obj,n-cpu,n-cond,cinder,c-sch,c-api,c-vol,n-sch,n-novnc,n-xvnc,n-cauth,horizon,rabbit<br>
enable_service mysql<br>
disable_service n-net<br>
enable_service q-svc<br>
enable_service q-agt<br>
enable_service q-l3<br>
enable_service q-dhcp<br>
enable_service q-meta<br>
enable_service q-lbaas<br>
enable_service neutron<br>
enable_service tempest<br>
VOLUME_BACKING_FILE_SIZE=2052M<br>
Q_PLUGIN=cisco<br>
declare -a Q_CISCO_PLUGIN_SUBPLUGINS=(openvswitch nexus)<br>
declare -A Q_CISCO_PLUGIN_SWITCH_INFO=([10.0.100.243]=admin:Cisco12345:22:neutronpluginsci:1/9)<br>
NCCLIENT_REPO=git://<a href="http://github.com/CiscoSystems/ncclient.git" target="_blank">github.com/CiscoSystems/ncclient.git</a><br>
PHYSICAL_NETWORK=physnet1<br>
OVS_PHYSICAL_BRIDGE=br-eth1<br>
TENANT_VLAN_RANGE=810:819<br>
ENABLE_TENANT_VLANS=True<br>
API_RATE_LIMIT=False<br>
VERBOSE=True<br>
DEBUG=True<br>
LOGFILE=/opt/stack/logs/stack.sh.log<br>
USE_SCREEN=True<br>
SCREEN_LOGDIR=/opt/stack/logs<br>
<br>
Here are links to a log showing another localrc file that I use, and the corresponding stack.sh log:<br>
<br>
<a href="http://128.107.233.28:8080/job/neutron/1390/artifact/vpnaas_console_log.txt" target="_blank">http://128.107.233.28:8080/job/neutron/1390/artifact/vpnaas_console_log.txt</a><br>
<a href="http://128.107.233.28:8080/job/neutron/1390/artifact/vpnaas_stack_sh_log.txt" target="_blank">http://128.107.233.28:8080/job/neutron/1390/artifact/vpnaas_stack_sh_log.txt</a><br>
<br>
Does anyone have any advice on how to debug this, or recover from this (beyond rebooting the node)? Or am I missing any Cinder config?<br>
<br>
Thanks in advance for any help on this!!!<br>
Dane<br>
<br>
<br>
<br>
_______________________________________________<br>
OpenStack-Infra mailing list<br>
<a href="mailto:OpenStack-Infra@lists.openstack.org" target="_blank">OpenStack-Infra@lists.openstack.org</a><br>
<a href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-infra" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-infra</a><u></u><u></u></p>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div></div></div>
</div>

<br>_______________________________________________<br>
OpenStack-dev mailing list<br>
<a href="mailto:OpenStack-dev@lists.openstack.org">OpenStack-dev@lists.openstack.org</a><br>
<a href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>
<br></blockquote></div><br></div></div>