Extensive hands-on workshop on Big Data using Hadoop, Spark, and NoSQL/Cassandra in Santa Clara
Attend all three days, any two days or any one day
Workshop Description
Learner's Place Professional Academy is offering an extensive three-day workshop on Big Data. The workshop is aimed at technical people who want to get a jumpstart on Big Data, with a specific focus on Hadoop, Spark, NoSQL and Cassandra.
The workshop is designed for a small, focused group of attendees, and more than 70% of the time is spent on hands-on labs. You will not only learn the technology but also become familiar with industry-specific applications. Workshop instructor Sujee Maniyam is an industry practitioner and a Big Data veteran. For more information, please visit http://www.learnersplace.com
Instructor:
Sujee has been developing software for 15 years. In the last few years he has been consulting on and teaching Hadoop, NoSQL and Cloud technologies. Sujee stays active in the Hadoop and open source community. He runs a developer-focused meetup and Hadoop hackathons called ‘Big Data Gurus’. He has presented at a variety of meetups. Sujee contributes to the Hadoop project and other open source projects. He writes about Hadoop and other technologies on his website.
Agenda:
Day 1: NoSQL with Cassandra
Learn NoSQL data modeling with the popular Cassandra database
9:00AM - 10:00AM - NoSQL landscape
10:00AM - 12:00PM - Cassandra architecture and concepts
12:00PM - 1:00PM - Lunch Break
1:00PM - 2:00PM - CQL
2:00PM - 3:00PM - Data modeling in Cassandra using CQL
3:00PM - 3:15PM - Break
3:15PM - 4:15PM - Queries
4:15PM - 5:15PM - Indexes
5:15PM - 5:45PM - Composite keys
5:45PM - 6:00PM - Wrap-up Day 1
Day 2: Hadoop
Learn to use Hadoop - the Big Data platform
9:00AM - 11:30AM - Hadoop intro
11:30AM - 12:00PM - HDFS
12:00PM - 1:00PM - Lunch Break
1:00PM - 3:00PM - MapReduce primer
3:00PM - 3:15PM - Break
3:15PM - 4:45PM - Hive
4:45PM - 5:45PM - Querying data in Hadoop
5:45PM - 6:00PM - Wrap-up Day 2
Day 3: Spark
Continue learning Big Data analytics with an emerging technology: Apache Spark
9:00AM - 10:00AM - Scala primer (quick introduction)
10:00AM - 12:00PM - Spark architecture / design
12:00PM - 1:00PM - Lunch Break
1:00PM - 2:00PM - Spark Shell
2:00PM - 3:00PM - RDDs
3:00PM - 3:15PM - Break
3:15PM - 5:00PM - Spark SQL / DataFrames
5:00PM - 5:45PM - Spark Streaming
5:45PM - 6:00PM - Wrap-up and Q&A
NOTE: Agenda subject to change without notice
Lab Requirements
- A reasonably modern laptop (it must be able to connect to clusters running on cloud services; corporate laptops with overly restrictive firewalls are not recommended)
- SSH client (on Windows, use PuTTY or SecureCRT; Mac and Linux come with SSH clients)
- Chrome browser with the Markdown Preview Plus plugin
- Nice to have: a programmer’s editor
  - Windows: Sublime, Notepad++, Programmer’s Notepad, TextPad
  - Mac: Sublime, TextWrangler
  - Linux: Sublime, gedit, vim, Emacs
Who should attend?
This course is appropriate for any Big Data enthusiast, including software programmers, project managers, product managers, architects, DBAs and quality analysts. Prior programming experience is not necessary.
Location
Dates
to 10th July 2016 - 06:00 PM