<html><body><div style="font-family: times new roman, new york, times, serif; font-size: 12pt; color: #000000"><div>Hi Mirantis guys,<br><br>I have set up two Cassandra backups:<br>The first backup procedure was similar to the one you want to achieve.<br>The second backup used SAN features (EMC VNX snapshots) so it was very specific to the environment.</div><div><br>Backup an entire cluster (therefore all replicas) is challenging when dealing with big data and not really needed. If your replicas are spread accross several data centers then you could backup just one data center. In that case you backup only one replica.<br>Depending on your needs you may want to backup twice (I mean "backup the backup" using a tape library for example) and then store it in an external location for disaster recovery, requirements specification, norms, etc.<br><br>The snapshot command issues a flush before to effectively take the snapshot. So the flush command is not necessary.<br>https://github.com/apache/cassandra/blob/c7ebc01bbc6aa602b91e105b935d6779245c87d1/src/java/org/apache/cassandra/db/ColumnFamilyStore.java#L2213<br>(snapshotWithoutFlush() is used by the scrub command)<br><br>Just out of curiosity, have you tried the leveled compaction strategy? It seems that you use STCS.<br>Does your use case imply many updates? What is your read/write ratio?<br><br>Best,<br><br>Romain</div><div><br></div><hr id="zwchr"><div style="color:#000;font-weight:normal;font-style:normal;text-decoration:none;font-family:Helvetica,Arial,sans-serif;font-size:12pt;"><b>From: </b>"Denis Makogon" <dmakogon@mirantis.com><br><b>To: </b>"OpenStack Development Mailing List (not for usage questions)" <openstack-dev@lists.openstack.org><br><b>Sent: </b>Friday, August 29, 2014 4:33:59 PM<br><b>Subject: </b>Re: [openstack-dev] [magnetodb] Backup procedure for Cassandra backend<br><div><br></div><div dir="ltr"><br><div class="gmail_extra"><br><div><br></div><div class="gmail_quote">On Fri, Aug 29, 2014 at 4:29 PM, Dmitriy Ukhlov <span dir="ltr"><<a href="mailto:dukhlov@mirantis.com" target="_blank">dukhlov@mirantis.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div>Hello Denis,<br></div>Thank you for very useful knowledge sharing.<br><div><br></div></div>
But I have one more question. As far as I understood if we have replication factor 3 it means that our backup may contain three copies of the same data. Also it may contain some not compacted sstables set. Do we have any ability to compact collected backup data before moving it to backup storage?<br>
</div></blockquote><div><br></div><div>Thanks for fast response, Dmitriy.<br></div><div><br></div><div>With replication factor 3 - yes, this looks like a feature that allows to backup only one node instead of 3 of them. In other cases, we would need to iterate over each node, as you know.<br>
</div><div>Correct, it is possible to have not compacted SSTables. To accomplish compaction we might need to use compaction mechanism provided by the nodetool, see <a href="http://www.datastax.com/documentation/cassandra/2.0/cassandra/tools/toolsCompact.html" target="_blank">http://www.datastax.com/documentation/cassandra/2.0/cassandra/tools/toolsCompact.html</a>, we just need take into account that it's possible that sstable was already compacted and force compaction wouldn't give valuable benefits.<br>
</div><div> <br><div><br></div></div><div>Best regards,<br>Denis Makogon<br><div><br></div></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">
</div><div class="gmail_extra"><br><div><br></div><div class="gmail_quote"><div><div class="h5">On Fri, Aug 29, 2014 at 2:01 PM, Denis Makogon <span dir="ltr"><<a href="mailto:dmakogon@mirantis.com" target="_blank">dmakogon@mirantis.com</a>></span> wrote:<br>
</div></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><div class="h5"><div dir="ltr"><span><p style="margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify">
<span style="color: #000000; font-family: Times New Roman;" data-mce-style="color: #000000; font-family: Times New Roman;" color="#000000" face="Times New Roman"><span style="font-size:19px;line-height:21.85px;white-space:pre-wrap">Hello, stackers. I'd like to start thread related to backuping procedure for MagnetoDB, to be precise, for Cassandra backend.</span></span></p>
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">In order to accomplish backuping procedure for Cassandra we need to understand how does backuping work.</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">To perform backuping:</span></p>
<ol style="margin-top:0pt;margin-bottom:0pt"><li dir="ltr" style="list-style-type:decimal;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">We need to SSH into each node</span></p></li><li dir="ltr" style="list-style-type:decimal;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent">
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Call ‘nodetool snapshot’ with appropriate parameters</span></p>
</li><li dir="ltr" style="list-style-type:decimal;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Collect backup.</span></p></li><li dir="ltr" style="list-style-type:decimal;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent">
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Send backup to remote storage.</span></p></li>
<li dir="ltr" style="list-style-type:decimal;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Remove initial snapshot</span></p></li></ol><br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Lets take a look how does ‘</span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">nodetool snapshot</span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">’ works. Cassandra backs up data by taking a snapshot of all on-disk data files (SSTable files) stored in the data directory. Each time an SSTable gets flushed and snapshotted it becomes a hard link against initial SSTable pinned to specific timestamp.</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Snapshots are taken per keyspace or per-CF and while the system is online. However, nodes must be taken offline in order to restore a snapshot.</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Using a parallel ssh tool (such as pssh), you can flush and then snapshot an entire cluster. This provides an </span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">eventually consistent</span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"> backup. Although no one node is guaranteed to be consistent with its replica nodes at the time a snapshot is taken, a restored snapshot can resume consistency using Cassandra's built-in consistency mechanisms.</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">After a system-wide snapshot has been taken, you can enable incremental backups on each node (disabled by default) to backup data that has changed since the last snapshot was taken. Each time an SSTable is flushed, a hard link is copied into a /backups subdirectory of the data directory.</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Now lets see how can we deal with snapshot once its taken. Below you can see a list of command that needs to be executed to prepare a snapshot:</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"> </span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Flushing SSTables for consistency</span></p>
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"> </span><span style="font-size:16px;font-family:'Courier New';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span><span style="font-size:16px;font-family:'Courier New';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">'nodetool flush'</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"> </span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Creating snapshots (for example of all keyspaces)</span></p>
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"> </span><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span><span style="font-size:16px;font-family:'Courier New';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">"nodetool snapshot -t %(backup_name)s 1>/dev/null",</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">where</span></p>
<ul style="margin-top:0pt;margin-bottom:0pt"><li dir="ltr" style="list-style-type:disc;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">backup_name - is a name of snapshot</span></p></li></ul><br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify">
<span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Once it’s done we would need to collect all hard links into a common directory (with keeping initial file hierarchy):</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:16px;font-family:'Courier New';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">sudo tar cpzfP /tmp/all_ks.tar.gz\</span></p>
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:16px;font-family:'Courier New';color:rgb(0,0,0);font-style:italic;vertical-align:baseline;white-space:pre-wrap;background-color:transparent"> $(sudo find %(datadir)s -type d -name %(backup_name)s)"</span></p>
<br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;margin-left:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">where</span></p>
<ul style="margin-top:0pt;margin-bottom:0pt"><li dir="ltr" style="list-style-type:disc;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">backup_name - is a name of snapshot,</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent">
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">datadir - storage location (/var/lib/cassandra/data, by the default)</span></p>
</li></ul><br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent"><span style="white-space:pre-wrap"> </span></span></p>
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-indent:36pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Note that this operation can be extended:</span></p>
<ul style="margin-top:0pt;margin-bottom:0pt"><li dir="ltr" style="list-style-type:disc;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">if cassandra was launched with more than one data directory (see </span><a href="http://www.datastax.com/documentation/cassandra/2.0/cassandra/configuration/configCassandra_yaml_r.html" style="text-decoration:none" target="_blank"><span style="text-decoration:underline;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">cassandra.yaml</span></a><span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">)</span></p>
</li><li dir="ltr" style="list-style-type:disc;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">if we want to backup only:</span></p></li><ul style="margin-top:0pt;margin-bottom:0pt"><li dir="ltr" style="list-style-type:circle;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent">
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">certain keyspaces at the same time</span></p>
</li><li dir="ltr" style="list-style-type:circle;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent"><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify">
<span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">one keyspace</span></p></li><li dir="ltr" style="list-style-type:circle;font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);vertical-align:baseline;background-color:transparent">
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="vertical-align:baseline;white-space:pre-wrap;background-color:transparent">a list of CF’s for given keyspace</span></p>
</li></ul></ul><br><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><span style="font-size:19px;font-family:'Times New Roman';color:rgb(0,0,0);text-decoration:underline;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">Useful links</span></p>
<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt;text-align:justify"><a href="http://www.datastax.com/documentation/cassandra/2.0/cassandra/tools/toolsNodetool_r.html" style="text-decoration:none" target="_blank"><span style="font-size:13px;font-family:'Times New Roman';text-decoration:underline;vertical-align:baseline;white-space:pre-wrap;background-color:transparent">http://www.datastax.com/documentation/cassandra/2.0/cassandra/tools/toolsNodetool_r.html</span></a><br data-mce-bogus="1"></p>
<br>Best regards,</span><div><span>Denis Makogon</span></div></div>
<br></div></div>_______________________________________________<br>
OpenStack-dev mailing list<br>
<a href="mailto:OpenStack-dev@lists.openstack.org" target="_blank">OpenStack-dev@lists.openstack.org</a><br>
<a href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>
<br></blockquote></div><span class=""><span style="color: #888888;" data-mce-style="color: #888888;" color="#888888"><br><br clear="all"><br>-- <br><div dir="ltr"><div><div>Best regards,<br></div>Dmitriy Ukhlov<br></div>Mirantis Inc.<br></div>
</span></span></div>
<br>_______________________________________________<br>
OpenStack-dev mailing list<br>
<a href="mailto:OpenStack-dev@lists.openstack.org" target="_blank">OpenStack-dev@lists.openstack.org</a><br>
<a href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev" target="_blank">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>
<br></blockquote></div><br></div></div>
<br>_______________________________________________<br>OpenStack-dev mailing list<br>OpenStack-dev@lists.openstack.org<br>http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev<br></div><div><br></div></div></body></html>