Cassandra Developer

Learn about Cassandra Architecture, Data Modeling, CQL, & More

Overview

Cassandra Developer Training

In today's times, businesses need to process a very large volume and variety of data. This generates a requirement for a highly available, low latency database. Cassandra solves this challenge by allowing high-speed reads and writes across a large and distributed system.

Cassandra Developer Certification Objective

The Cassandra Developer Training courses educates the learner in –
– Architect and engineer NoSQL Cassandra databases
– Model data in Cassandra
– Accessing Cassandra using CQL and Java
– Ensuring high read/write speeds while maintaining data consistency
– Integrating Cassandra with Hadoop, Pig and Hive
– Implementing commonly used Cassandra design patterns

Duration: 32 Hours

Prerequisites for Cassandra Developer Online Course

Basic knowledge of Linux

Curriculum


Cassandra Fundamentals and Features

  • Distributed and Decentralized
  • Elastic Scalability
  • High Availability and Fault Tolerance
  • Tune-able Consistency
  • Row-Oriented
  • Schema-Free
  • High Performance

The Cassandra Data Model

  • The Relational Data Model
  • A Simple Introduction
  • Clusters
  • Keyspaces
  • Hands-on Session

Installation and Setup of Cassandra

  • Single Node Setup
  • Multi-Node Cluster Setup
  • Key Configurations for Cassandra
  • CLI and Hands-On with Cassandra

Cassandra Modeling

  • Cassandra (Column Family NoSQL DB)
  • Key Concepts
    • Key Space
    • Column Family
    • Column Family Options
    • Wide Rows
    • Skinny Row
    • Column Sorting
    • Super Columns
    • Counter Column Family
    • Composite Keys and Columns
    • Time To Live
  • Secondary Indexes in Cassandra
  • A comparison between Custom Indexes and Secondary Indexes
  • A comparison between Relational Modeling and Cassandra Modeling
  • Key Points while modeling a Cassandra Database
  • Patterns and Anti-Patterns in Cassandra Modeling

Cassandra Architecture & Intro to CQL

  • Anatomy of Reading operation in Cassandra
  • Anatomy of the Write operation in Cassandra
  • How is Deletes handled in Cassandra
  • System Keyspace
  • Peer to Peer Model Logical Data Model: Keyspace, Column Family/Table, Rows, Columns
  • Traditional Ring design vs. VNodes
  • Partitioners: Murmer3, Random (md5) and ByteOrdered
  • Gossip and Failure Detection
  • Anti-Entropy and Read Repair
  • Memtables, SSTables and Commit Log
  • Compaction fundamentals to reduce SSTable data files
  • Hinted Handoff
  • Compaction
  • Bloom Filters, Tombstones
  • Managers and Services
  • VNodes
  • Indexes and Caches
  • Coordinator node
  • Seed nodes
  • Write/Read consistency levels: Any, One, Two, Three, Quorum
  • Snitches: Dynamic snitching, Simple Snitch, Rack Inferring Snitch, Property File Snitch, Gossiping Property File Snitch
  • Routing Client requests
  • Nodetool commands: gossipinfo, cfstats, describing
  • YAML file fundamentals
  • Operations management web GUI
  • Stress testing Cassandra
  • CQL command fundamentals

Cassandra API

  • Key concepts for Reading and Write in Cassandra
  • Tunable Consistency
  • Simple Get, Multi-get Slice
  • Range and Slice
  • Slice Predicate
  • Delete
  • Hands-on CLI commands

Cassandra CQSHL

  • SQL over Cassandra
  • Composite Keys
  • Hands-on examples on CQL 3.0

Cassandra Clients

  • How to establish Client Connections
  • Thrift Client
  • Connection Pooling
  • Auto-discovery and Failover in Hector
  • Client with CQL

Cassandra Monitoring and Administration

  • Backup and Recovery methods
  • Tuning Cassandra
  • Balancing
  • Bootstrapping
  • Node Tools Commands
  • Upgrades
  • Monitoring critical metrics
  • Bulk Loading Data to Cassandra
  • Bulk Export of Data from Cassandra
  • Hands-on Examples for each of them

Cassandra Analytics Cluster

  • Cassandra Hadoop Integration

Cassandra Search Cluster

  • Integration of Solr with Cassandra
  • Search Query on Cassandra