Stepleaf’s Apache Cassandra Certification Training is designed by professionals as per the industry requirements and demands. This Cassandra Certification Training helps you to master the concepts of Apache Cassandra including Cassandra Architecture, its features, Cassandra Data Model, and its Administration. Throughout the Cassandra course, you will learn to install, configure, and monitor Cassandra, along with its integration with other Apache frameworks like Hadoop, Spark, and Kafka.
Jul 25 | base64:type251:e1NhdCxTdW59 (12 Weeks) Weekend Batch | Filling Fast 08:30 PM 10:30 PM |
Can't find a batch you were looking for?
Cassandra is a distributed database from Apache that is highly scalable and designed to manage huge amount of unstructured data. Apache Cassandra Certification Training covers Database Operations, Table Operations, Node Operations in a Cluster, Managing & Monitoring the Cluster, Backup/Restore, Performance Tuning, and Hosting Cassandra Database on Cloud. You will also learn to integrate Cassandra with other Apache frameworks like Hadoop, Spark, and Kafka.
Although we use SQL Server to interrogate data throughout this course, SQL is a common language easily adapted to other RDBMS (relational database management systems) such as MySQL, IBM DB2, PostgreSQL, Oracle, Ingres, Sybase and Microsoft Access.
Course Objectives
This Apache Cassandra Training is designed by industry experts to help you master Apache Cassandra. The Cassandra Course offers:
During the Cassandra Training, you will be guided and trained by our expert instructors to:
Who should go for this training?
Our Apache Cassandra Training helps you to grab this opportunity and accelerate your career. It is best suited for:
Pre-requisites
bigdata, etc, etc, Cassandra, Data Model, Keyspace, Cassandra Architecture, Cassandra Database, Node Operations, Backup & Restore, Performance Tuning, Hosting Cassandra Database on Cloud
Goal: In this module you will get a brief introduction of Big Data and how it creates problems for traditional Database Management Systems like RDBMS. You will also learn how Cassandra solves these problems and understand Cassandra’s features.
Skills:
Basic concepts of Cassandra
Objectives:
At the end of this module, you will be able to
Topics:
Hands On:
Stepleaf VM tour
Structure your learning and get a certificate to prove it.
What are the system requirements for this Cassandra Training?
The following are the requirements for the system to smoothly run the programs:
How will I execute the practicals in this Cassandra Training?
For this Cassandra training, we will help you to setup StepLeaf's Virtual Machine in your System with local access. The detailed installation guides are provided in the LMS for setting up the environment. For any doubt, the 24*7 support team will promptly assist you. StepLeaf Virtual Machine can be installed on Mac or Windows machine.
Participant’s machines must support a 64-bit VirtualBox guest image
Which projects are included in StepLeaf's Online Apache Cassandra Training Course?
Case Study 1: Product Liking Functionality [Ecommerce]
Scenario -
David is CEO of www.purhaseitnow.com. Currently, he is selling 300k products per day across multiple categories. There are thousands of sellers having millions of products, registered on the portal.
Soon David realizes that his sale is decreasing monthly due to the poor quality of products sold by some of the sellers. He then decided to categorize the products so that the site can recommend good products to his customers. He asked his CTO John, to develop the same functionality.
John has suggested him that If they allow customers to give feedback about the product they purchased in the form of like & dislike, then they can recommend those products over other similar products.
John and Product Manager have gathered some requirements and decided to develop using Agile methodology.
Requirements:
1. Get User Details by User Id
2. Get Product Details by Product Id
3. Get all products liked by User
4. Get Product liked by Multiple Users
John is aware of RDBMS only and has suggested database schema as follows:
Tables
1. User
a. User Id
b. User Name
c. Address
2. Product
a. Product Id
b. Product Name
c. Product Description
3. User Product Likes
a. User Id (FK user table)
b. Product Id (FK product table)
c. Timestamp
Soon after, huge data got accumulated in the last table, resulting in system imbalance. They tried to apply all optimization techniques but failed to overcome the issue.
After some digging, they realized that last 2 queries were not performing good due to.
1. Tables will be huge due to large catalogue
2. Retrieval products/users will take more time
To solve this, they hired you because you have some experience in NoSQL databases. You must come up with proper database selection and schema design.
Once you have finalized design you have to:
1. Provide information about database type which you are opting RDBMS/NoSQL/GRAPH?
2. Provide information about database why you selected?
3. Provide schema details along with Primary/Partition/Composite/Clustering keys?
Extension to above problem:
4. Get all products liked by a user should also return product names
Get all user names who have liked any products
Case Study 2: DOMAIN: BANK
Problem Statement:
Our consulting firm has been retained by a major bank to help improve the scalability of their current infrastructure. There are lots of transaction logs generated by various systems. Current database MySQL is not able to handle all the logs. The Firm also wants to run some aggregation jobs.
Key issues:
You must revamp existing code and migration of existing data.
Tasks
You have given end points or log files path where data is being produced.
You have different pages on the website which can be search page, promotional page, deal of day page etc. You must use this log and design schema such that it can get daily request counts per day.
1. Number of clicks on deal of the day page with Android device on 11 May 2017
2. Number of clicks on deal of the page with IOS device on 11 May 2017
3. Number of clicks on home page with Chrome browser on 11 May 2017
4. Number of clicks on home page of Firefox browser on 11 May 2017
Case Study 3: Customer Help Desk Application
Problem Statement:
Model a Customer Help Desk application where customer complaints are logged and captured in a Cassandra column family. The Cassandra table HelpDesk shown in the following screenshot captures these details.
The columns CustomerId, TicketId, ActionTime constitute the Primary key. The column CustomerId becomes the Partition key. The records are stored in the descending order of TicketId, ActionTime. This is to make sure that the recent action details are accessible first.
Task:
1. Create a table HelpDesk as per the above requirement
2. Insert data into HelpDesk. For every record inserted, ActionTime should get the current timestamp.
3. Use the CQL command to display all the data in the specified format.
4. Write range query to retrieve data from to specific date and time. For example, between time-period 2017-11-12 19:14:00 and 2017-11-13 19:20:00
Case Study 4: Hotel Booking Application
Problem Statement:
Design a hotel room reservation application data model. Access available_rooms.csv file provided. The available_rooms.csv file contains a month’s worth of inventory for two small hotels with five rooms each.
Tasks:
1. Create a table available_rooms_by_hotel_date as per the requirement with hotel_id as the partition key, while date and room_number are clustering columns.
2. Bulk load to table available_rooms_by_hotel_data FROM available_rooms.csv
3.Display all the records in available_rooms_by_hotel_date for a particular hotel_id (ex: AZ123) and room_number (ex: 101). Remember both hotel_id and room_number are part of composite primary key.
4. Display all records for a particular hotel between two specific date range in descending order of date.
5. Write an UDF is available which return 1 if a room is available else return 0 Make a call to the UDF to display the results for table available_rooms_by_hotel_date.
6. Create UDF/UDFs to return the total available rooms.
All the instructors at StepLeaf are practitioners from the Industry with minimum 10-12 yrs of relevant IT experience. They are subject matter experts and are trained by StepLeaf for providing an awesome learning experience to the participants.