Cassandra: Scalable and high availability database without compromising performance

When it comes to databases, often it’s a balancing act between performance on one hand, and high availability and scalability on the other hand. Often you’ll need to compromise. But with Cassandra, you get all these benefits together. What’s more, it’s open source. You can download it and use it as is, configure it for your own needs, or integrate it with other projects like Hadoop.

Cassandra is used by more than 1,500 companies with large and active data sets, and these include Apple, eBay, Reddit, Hulu, and GoDaddy. It’s quite robust, as it can handle high activity. It’s used in Netflix, where it handles 420 TB of data and over a trillion requests per day. It’s also used by the Chinese search engine Easou, with 300 TB and more than 800 million requests a day.

KEY FEATURES

  • P2P architecture: It doesn’t use the master/slave architecture, so there are no single points of failure. You won’t have network bottlenecks as any server can entertain a request form any client.
  • Fault tolerant: It offers high availability with its data replication across multiple nodes and multiple data centers. Failed nodes can be easily replaced without any downtime.
  • Elastic scalability: Nodes can be deleted or added without causing a disturbance. When you scale, the read and write throughput both increase together with zero downtime or any hitch in the application.
  • High performance: Because of its features and architectural design, it regularly outperforms other popular NoSQL options in real applications and in benchmark tests.

Related Posts