<div dir="ltr">As a follow-up: when performing a distcp from HDFS to Swift, segments ARE being created in the Swift container with a .distcp- prefix. Each temporary file appears to correspond to an attempt of the map/reduce job.<div><br></div><div>Just as the last temporary segment appears in the remote container, the job aborts, all of the .distcp- temporary objects are deleted, and Hadoop proceeds to the next "attempt".</div><div><br></div><div>For example, for the currently running test case, the Swift container listing shows the following:</div><div><br></div><div>
<p class=""><span class=""><font face="monospace, monospace">zantac:~ lillie$ swift list --lh backups</font></span></p>
<p class=""><span class=""><font face="monospace, monospace">2.4G 2015-05-05 20:13:16 .distcp.tmp.attempt_1430771817173_0010_m_000000_0/000001</font></span></p>
<p class=""><span class=""><font face="monospace, monospace">2.4G 2015-05-05 20:14:45 .distcp.tmp.attempt_1430771817173_0010_m_000000_0/000002</font></span></p>
<p class=""><span class=""><font face="monospace, monospace">2.4G 2015-05-05 20:16:15 .distcp.tmp.attempt_1430771817173_0010_m_000000_0/000003</font></span></p>
<p class=""><span class=""><font face="monospace, monospace">7.2G</font></span></p>
<p class="">Once the entire file is "copied", the operation reports Error 413, and all of the above files are deleted. It's as though the Swift file system isn't able to close the file.</p><p class=""><br></p><p class="">/ross</p></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, May 5, 2015 at 2:54 PM, Ross Lillie <span dir="ltr">&lt;<a href="mailto:ross.lillie@motorolasolutions.com" target="_blank">ross.lillie@motorolasolutions.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">We're currently running OpenStack Juno and are experiencing errors when performing large object copies between Hadoop HDFS and our Swift object store. While not using the Savanna service directly, we are relying upon the Swift file system extension for Hadoop created as part of the Savanna project.<div><br></div><div>In each case, the large object copy (using Hadoop's distcp) results in Swift reporting an Error 413 - Request entity too large.</div><div><br></div><div>As a test case, I created a 5.5 GB file of random data and tried to upload the file to Swift using Swift's CLI command. Once again Swift returned Error 413. If, however, I explicitly set a segment size of 1G on the Swift command line, then the file uploads correctly. </div><div><br></div><div>When using Hadoop's distcp to move data from HDFS to Swift, the job always exits with Swift reporting Error 413. Explicitly setting the fs.swift.service.x.partsize does not appear to make any difference.</div><div><br></div><div>My understanding is that Swift should automagically split files greater than 5G into multiple segments grouped under a metafile, but this appears to not be working. This was working under the Havana release (Ubuntu) using the Swift File System jar file downloaded from the Mirantis web site. 
All current testing is based on the Juno release, performing a distcp with the openstack-hadoop jar file shipped as part of the latest Hadoop distros.</div><div><br></div><div>Has anyone else seen this behavior?</div><div><br></div><div>Thanks,</div><div>/ross<span class="HOEnZb"><font color="#888888"><br clear="all"><div><br></div>-- <br><div><div dir="ltr"><div><div dir="ltr">Ross Lillie</div><div dir="ltr">Application Software & Architecture Group</div><div dir="ltr"><br><div><br></div></div></div></div></div>
</font></span></div></div>
</blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature"><div dir="ltr"><div><div dir="ltr">Ross Lillie</div><div dir="ltr">Application Software & Architecture Group</div><div dir="ltr"><br></div></div></div></div>
</div>
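<div>For reference, the segment arithmetic behind the 5.5 GB test case can be sketched as below. This is only an illustration, not the Swift or Hadoop implementation: the helper names are hypothetical, and the 5 GiB ceiling is an assumption based on Swift's default single-object limit, which a given cluster may override.</div>

```python
# Sketch of the segment arithmetic only; nothing here talks to Swift.
# Assumption: the cluster uses Swift's default single-object limit of
# about 5 GiB. A deployment may configure a different value.
GIB = 1024 ** 3
MAX_OBJECT_BYTES = 5 * GIB


def needs_segmentation(file_bytes, max_object_bytes=MAX_OBJECT_BYTES):
    """Return True if the file is too large to upload as a single object."""
    return file_bytes > max_object_bytes


def plan_segments(file_bytes, segment_bytes):
    """Return the list of segment sizes for a file split at segment_bytes."""
    if segment_bytes <= 0:
        raise ValueError("segment_bytes must be positive")
    full, rest = divmod(file_bytes, segment_bytes)
    sizes = [segment_bytes] * full
    if rest:
        sizes.append(rest)  # final, shorter segment
    return sizes


if __name__ == "__main__":
    size = int(5.5 * GIB)  # the 5.5 GB test file from the original message
    print(needs_segmentation(size))
    print(len(plan_segments(size, 1 * GIB)))
```

<div>With a 1G segment size, every segment stays well below the single-object limit, which is consistent with the CLI upload succeeding once a segment size is given explicitly.</div>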