Apache Cassandra - Data Model

Data Model

Cassandra is essentially a hybrid between a key-value and a row-oriented (or tabular) database.

A column family resembles a table in an RDBMS. Column families contain rows and columns. Each row is uniquely identified by a row key. Each row has multiple columns, each of which has a name, value, and a timestamp. Unlike a table in an RDBMS, different rows in the same column family do not have to share the same set of columns, and a column may be added to one or multiple rows at any time.

Each key in Cassandra corresponds to a value which is an object. Each key has values as columns, and columns are grouped together into sets called column families. Also, each column family can be grouped in super column families.

Thus, each key identifies a row of a variable number of elements. These column families could be considered then as tables. A table in Cassandra is a distributed multi dimensional map indexed by a key.

Furthermore, applications can specify the sort order of columns within a Super Column or Simple Column family.

Read more about this topic:  Apache Cassandra

Famous quotes containing the words data and/or model:

    To write it, it took three months; to conceive it three minutes; to collect the data in it—all my life.
    F. Scott Fitzgerald (1896–1940)

    She represents the unavowed aspiration of the male human being, his potential infidelity—and infidelity of a very special kind, which would lead him to the opposite of his wife, to the “woman of wax” whom he could model at will, make and unmake in any way he wished, even unto death.
    Marguerite Duras (b. 1914)