Over the past couple of days, I’ve started playing around with Cassandra in an effort to satisfy my curiosity and to see what it’s all about. In terms of databases, SQL Server is my main skill - I first started using it nearly 10 years ago, and will always remember the first book I read cover to cover on it: Inside Microsoft SQL Server 6.5 by Ron Soukup. So Cassandra is a step outside of my comfort zone; something new and a little alien to me….a challenge. The aim of this post, is mainly to dump my early thoughts, key points and useful links - basically an overflow area for my brain; things that I find are important/helpful resources.
Along with SQL Server my other core technology is C#. So what I’m trying to do is get up and running with Cassandra and knock up some C# to talk to it. Just something basic will do - a “Hello Cassandra” app. It’s not about the pros/cons of NoSQL, SQL Server vs Cassandra vs anything else, and it’s definitely not a guide on best practices when it comes to this subject (I’m hardly an expert after 2 days). End of disclaimer.
First things first, what is Cassandra?
You can install it on any OS for which there is a Java runtime. Windows is what I’m comfortable with, so that’s the host OS I used (XP to be exact - yes, my laptop is old-school, cue violins). Future steps will be to get more comfortable with Linux and have it running on a number of VMs.
Data model - key terminology
Cluster, ColumnFamily, Keyspace, SuperColumn - sounds strange. A read through of the DataModel WIKI page is a must.
From the WIKI:
Thrift is a software framework for scalable cross-language services development. Thrift allows you to define data types and service interfaces in a simple definition file. Taking that file as input, the compiler generates code to be used to easily build RPC clients and servers that communicate seamlessly across programming languages.
You can write your code directly using the Thrift API to Cassandra, but for beginners (like me) it’s definitely worth downloading a high-level client. Plenty available for a wide variety of languages including C# / .NET, Python, Ruby, Java and more.
For .NET I originally went to try Fluent Cassandra (Github) which is being developed by Nick Berardi - after all, his site gave me the guide to get up and running. Plus he has a number of useful Cassandra articles from a .NET perspective. However, I currently don’t have .NET 4.0/VS2010 so to get something going quickly, I looked at Aquiles (Codeplex) which requires .NET 3.5 or above.
That got me to the point where I am now - able to connect to Cassandra from .NET. Nothing shiny, nothing flash. But the main point is the links I’ve collated in this post are the ones I’ve used to get going with the basics.