Cassandra Backup with EBS

Currently I am looking how the backup/restore be done in Cassandra. We've setup a three node cluster in AWS. I understand that using nodetool snapshot tool we can take a snapshot but it's bit cumbersome process.

My idea is : Make use of EBS snapshot because they're more durable and easy to setup but one problem which I see with EBS is inconsistency backup. Hence, my plan is run a script prior to taking EBS snapshot which would just run flush command to flush out all the memtable data and copies it on to the disk(SSTable) and then prepares the hard link with flushed SSTable. Once that's done, initiate the EBS snapshot, this was we can address the inconsistency issue which we might face if we only use EBS snapshost.

Please let me know if you see any issue with this approach or share your suggestions.
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

I haven't worked that much with Cassandra, but assuming the consistency issue is similar to other databases, I would probably steer clear of anything that has a high risk of having inconsistency issues (at least issues that Cassandra otherwise does not expect to handle via its own consistency-handling tools). In my experience, those sorts of issues always make a bigger mess of things during restores, and when the time comes to use them, you're usually in a situation where you need to provide guarantees about the data that was restored.

From what I've read, your best bet is to use pssh to create the snapshots of the entire cluster, and let Cassandra do its thing when it comes time to restore those snapshots, even if it's note quite AS simple as an EBS snapshot. That said, you could always pop the question over to the Amazon folks and ask if they have anyone with a similar configuration that has been through this process with Cassandra and what the outcome was.

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
cloudtechnicianAuthor Commented:
Did not get the exact answer what I was looking for. But it's still useful info.
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Storage Software

From novice to tech pro — start learning today.