Materialized Views in Cassandra

Tilmann Rabl and Hans-Arno Jacobsen.

In Proceedings of CASCON, 2014.

Abstract

Many web companies deal with enormous data sizes and request rates beyond the capabilities of traditional database systems. This has led to the development of modern Big Data Platforms (BDPs). BDPs handle large amounts of data and activity through massively distributed infrastructures. To achieve performance and availability at Internet scale, BDPs restrict querying capability and provide weaker consistency guarantees than traditional ACID transactions. The reduced functionality found in key-value stores is sufficient for many web applications.

An important requirement of many big data systems is an online view of the current status of the data and activity. Typical big data systems such as key-value stores allow only key-based access. To enable more complex querying mechanisms while satisfying the necessary latencies, materialized views are employed. The efficiency of maintaining these views is a key factor in the usability of the system. Expensive operations such as full table scans are impractical for small, frequent modifications on Internet-scale data sets. In this paper, we present an efficient implementation of materialized views in key-value stores that enables complex query processing and is tailored for efficient maintenance.
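
As a rough illustration of the general idea (not the implementation presented in the paper), the sketch below maintains a materialized view incrementally in a toy, single-process key-value store. Each write touches only the affected view entries, so a query on a non-key attribute never needs a full table scan. The class `KVStoreWithView`, its methods, and the `city` attribute are hypothetical names chosen for this example; distribution, replication, and consistency are deliberately ignored.

```python
from collections import defaultdict


class KVStoreWithView:
    """Toy in-memory key-value store with one materialized view.

    The view maps a value of a non-key attribute to the set of primary
    keys whose row currently holds that value. All names are hypothetical.
    """

    def __init__(self, view_attr):
        self.view_attr = view_attr
        self.base = {}                 # primary key -> row (a dict)
        self.view = defaultdict(set)   # attribute value -> set of primary keys

    def put(self, key, row):
        # Remove the stale view entry (if any) before applying the write.
        old = self.base.get(key)
        if old is not None:
            self._remove_from_view(key, old)
        self.base[key] = dict(row)
        if self.view_attr in row:
            self.view[row[self.view_attr]].add(key)

    def delete(self, key):
        old = self.base.pop(key, None)
        if old is not None:
            self._remove_from_view(key, old)

    def _remove_from_view(self, key, row):
        if self.view_attr not in row:
            return
        value = row[self.view_attr]
        keys = self.view.get(value)
        if keys is not None:
            keys.discard(key)
            if not keys:
                # Drop empty entries so the view does not grow unboundedly.
                del self.view[value]

    def query_view(self, value):
        # Answered from the view alone; the base table is not scanned.
        return sorted(self.view.get(value, ()))


store = KVStoreWithView("city")
store.put("u1", {"name": "Ada", "city": "Toronto"})
store.put("u2", {"name": "Bob", "city": "Toronto"})
store.put("u1", {"name": "Ada", "city": "Berlin"})  # update moves u1 in the view
print(store.query_view("Toronto"))
print(store.query_view("Berlin"))
```

The cost of each `put` or `delete` here depends only on the single row being modified, which is the property that makes incremental maintenance attractive for small, frequent updates.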

Tags: cassandra, key-value stores, materialized views


Readers who enjoyed the above work may also like the following:


  • CaSSanDra: An SSD Boosted Key-Value Store.
    Prashanth Menon, Tilmann Rabl, Mohammad Sadoghi, and Hans-Arno Jacobsen.
    In 30th IEEE International Conference on Data Engineering, pages 1162-1167, 2014.
    Tags: cassandra, big data, key-value store, nosql
  • Optimizing Key-Value Stores for Hybrid Storage Architectures.
    Prashanth Menon, Tilmann Rabl, Mohammad Sadoghi, and Hans-Arno Jacobsen.
    In Proceedings of CASCON, 2014.
    Tags: key-value stores, leveldb
  • Solving Big Data Challenges for Enterprise Application Performance Management.
    Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen, Sergio Gomez-Villamor, Victor Muntes-Mulero, and Serge Mankovskii.
    In Proceedings of the 38th International Conference on Very Large Data Bases (VLDB), 2012.
    Tags: big data, apm, application performance management, key-value stores, benchmarking