Apache Cassandra is NoSQL database distributed management system which is free and open-source. Cassandra is originally developed at Facebook and it was released as open-source in google code, later apache taken as incubator project for further development. Without compromising on performance, Cassandra can provide great linear scalability and high availability.
Who can Learn ?
Software professionals who have basic RDBMS knowledge and interested to learn this NoSQL database technology can be benefitted through this tutorial with many basic operations which can be performed.
Basic RDBMS knowledge is required to start with this technology. This database is being used along with big data technologies and where the high scalability and performance is needed.
So, Let’s have a quick view of cassandra tutorial…
NoSQL databases differ from RDBMS technology in four main areas such as Data model, Data structure, Scaling, Usage cost.NoSQL or ‘Not only SQL’ is a non-relational database. It supports a very simple query language with no fixed schema. Learn more..
Cassandra is a Ring based model designed for Bigdata applications, where data is distributed across all nodes in the cluster evenly using consistent hashing algorithm with no single point of failure.In Cassandra, multiple nodes that forms a cluster in a datacentre which communicates with all nodes in other datacenters using gossip protocol. Learn more..
It is a NoSQL format which contains columns of key-value pair, where the key is mapped to value. Generally, In relational databases we will work with tables in which every key-value pair is a row.In NoSQL each column is a tuple (Triplet) consists of column name , value and timestamp. whereas in RDBMS this data will be in the form of table. Learn more..
There are many such as, Node is a server/system where data is stored.Data Center:A Collection of nodes are called as Data center.Data center may be physical or virtual. Datacenters should never span physical locations.Cluster: It contains one or more data centers which can span physical locations. Learn more..
Before Installation of Cassandra , we need to go through some prerequisites. Below are the steps provided to install Cassandra on Linux environments.Ensure that you have latest version of Java. Java 1.8 and Python 2.7 Learn more..
Cassandra data model was clearly explained with detailed pictorial representations as below.Column is a Key-value pair which also contains a time-stamp along with it. A Super ColumnColumn that contains one or more columns (Array of columns) can be called as Super Column. Learn more..
In this tutorial, we will learn about the Data Types in Cassandra CQL language. DataTypes generally define the type of data a column can hold along with their own significance. CQL language supports the below list Data types:Native Types, Collection Types, User-defined Types, tuple types. Learn more..
CQLSH – This is the Command Line Utility used to execute the commands to communicate with Cassandra database.To start the utility we need to give the command cqlsh either in linux terminal or windows command prompt. The default listen port for cqlsh is 9042. Learn more..
In this tutorial, we will learn about the cqlsh commands. In Cassandra, cqlsh commands can be used through CQL interactive terminal. These commands will work only within the Cqlsh shell. Learn more..
KeySpace in NoSQL database is just like a schema in regular RDBMS concept, Anyhow it does not have any concrete structure. In NoSQL database, there will be one keyspace per application.A Keyspace contains column families or super columns. Each super column contains one or more column family, each column family contains at least one column. Learn more..
Alter Keyspace command allows you to alter the replicaton factor, strategy name and durable writes attributes. Below the syntax for Alter KeySpace command.Using Alter command Keyspace Name cannot be changed in cassandra. Learn more..
In this tutorial, we will learn about the DROP Keyspace command which allows you to drop the keyspace which includes all the data, column families,UTD,indexes from cassandra. Learn more..
In this tutorial, we will learn about the CREATE Table command in Cassandra which allows us to create new table using cqlsh by providing the valid column names, data types.Compounded PrimaryKey consists more than one column as primary key. Cassandra treats the first column as partition key. Learn more..
Using Alter Table command you can change the data type of column, can add or remove the columns to existing table and change table properties too. Learn more..
Drop table can be used to drop the table.Before giving this command , you should check that if you are in proper keyspace and you are dropping the exact table what you want.Validate,
If the table get deleted , then you should not be able to see the table name after giving the below command in the respective KeySpace. Learn more..
In this tutorial, we will learn about TRUNCATE Table command in Cassandra which will allow to truncate all the data from a table. The Date removed is irreversible.Truncating a table triggers an automatic snapshot which backs up the data only, not the schema. Learn more..
Insert command allows us to creat or insert the data records into the columns. Here it is not required to define all columns and all those missing columns will get no space on disk.So if columns Exists, it is updated. Learn more..
Update Command allows us to update the one or more columns values in cassandra table. To update the multiple columns , we need to seperate the name-value pairs using commas. Learn more..
In this tutorial, we will learn about the DELETE command in Cassandra which allows to remove one or more columns data in a table and also it removes entire row if no column is specified. Learn more..
In this tutorial, we will learn about the SELECT command in Cassandra which is used to retrieve the data from the cassandra table. We can perform various projections using the SELECT statement.Select Expression will be given as input to the select statement, where the output depends on the select expression. Learn more..
Create Index command allows to create new index on the specified column for a table. Cassandra indexes the data during the execution of command and also the new data that is being inserted once after the creation of index. Learn more..
A DROP INDEX command allows us to drop the existing index. If the index was not given a name during creation, the index name is <table_name>_<column_name>_idx. Learn more..
Collections provide the simplest way to handle multiple tasks.The maximum size of an item in collection is 64K. Also it is advisable to keep collections small to avoid delay during query execution.Even though if you insert more than 64K data that will result into dataloss, because only 64K is queryable. Learn more..
In tutorial, we will learn about the Batch command in Cassandra which allows us to write multiple DML statements in a single shot. DML statements include the Insert, Update, Delete commands. Learn more..