Learn By Example: Hadoop, MapReduce for Big Data problems
Zoom-in, Zoom-Out: This course is each broad and deep. It covers the individual elements of Hadoop in nice detail, and additionally provides you the next level image of however they act with one another.
Hands-on sweat involving Hadoop, MapReduce : This course can get you active with Hadoop terribly early. you may learn the way to line up your own cluster exploitationeach VMs and therefore the Cloud. All the main options of MapReduce area unit lined – as well as advanced topics like Total type and Secondary type.
The art of thinking parallel:MapReduce fully modified the means individuals considered process massive knowledge. Breaking down associatey downside into parallelizable units is an art. The examples during this course can train you to “think parallel”.
What’s Covered:
Lot’s of cool stuff ..
Using MapReduce to
Recommend friends in an exceedingly Social Networking site: Generate prime ten friend recommendations employing a cooperative filtering algorithmic program.
Build associate Inverted Index for Search Engines: Use MapReduce to put the large task of building associate inverted index for an exploration engine.
Generate Bigrams from text: Generate bigrams and calculate their distribution in an exceedingly corpus of text.
Build your Hadoop cluster:
Install Hadoop in Standalone, Pseudo-Distributed and absolutely Distributed modes
Set up a hadoop cluster exploitation UNIX system VMs.
Set up a cloud Hadoop cluster on AWS with Cloudera Manager.
Understand HDFS, MapReduce and YARN and their interaction
Customize your MapReduce Jobs:
Chain multiple Mr jobs along
Write your own tailored Partitioner
Total type : Globally type an oversized quantity of information by sampling input files
Secondary sorting
Unit tests with Mr Unit
Integrate with Python exploitation the Hadoop Streaming API
.. and after all the basics:
MapReduce : clerk, Reducer, Sort/Merge, Partitioning, Shuffle and kind
HDFS & YARN: Namenode, Datanode, Resource manager, Node manager, the anatomy of a MapReduce application, YARN programing, Configuring HDFS and YARN to performance tune your cluster.
Who is that the target audience?
Yep! Analysts UN agency need to leverage the facility of HDFS wherever ancient databases do not cut it any longer
Yep! Engineers UN agency need to develop advanced distributed computing applications to method lot’s of information
Yep! knowledge Scientists UN agency need to feature MapReduce to their bag of tricks for process knowledge
Some exceptional benefits associated with this course enrollment are:
- Quality course material
- Instant & free course updates
- Access to all Questions & Answers initiated by other students as well
- Personalized support from the instructor’s end on any issue related to the course
- Few free lectures for a quick overview
Curriculum
- 15 Sections
- 72 Lessons
- 52 Weeks
- Introduction1
- Why is Big Data a Big Deal6
- Installing Hadoop in a Local Environment3
- The MapReduce "Hello World"7
- Run a MapReduce Job2
- Juicing your MapReduce - Combiners, Shuffle and Sort and The Streaming API6
- 7.1Parallelize the reduce phase – use the Combiner15 Minutes
- 7.1Not all Reducers are Combiners14 Minutes
- 7.1How many mappers and reducers does your MapReduce have?8 Minutes
- 7.1Parallelizing reduce using Shuffle And Sort15 Minutes
- 7.1MapReduce is not limited to the Java language – Introducing the Streaming API5 Minutes
- 7.1Python for MapReduce12 Minutes
- HDFS and Yarn7
- 8.1HDFS – Protecting against data loss using replication16 Minutes
- 8.1HDFS – Name nodes and why they’re critical7 Minutes
- 8.1HDFS – Checkpointing to backup name node information11 Minutes
- 8.1Yarn – Basic components9 Minutes
- 8.1Yarn – Submitting a job to Yarn13 Minutes
- 8.1Yarn – Plug in scheduling policies14 Minutes
- 8.1Yarn – Configure the scheduler13 Minutes
- MapReduce Customizations For Finer Grained Control4
- The Inverted Index, Custom Data Types for Keys, Bigram Counts and Unit Tests!6
- 10.1The heart of search engines – The Inverted Index15 Minutes
- 10.1Generating the inverted index using MapReduce11 Minutes
- 10.1Custom data types for keys – The Writable Interface10 Minutes
- 10.1Represent a Bigram using a WritableComparable13 Minutes
- 10.1MapReduce to count the Bigrams in input text9 Minutes
- 10.1Test your MapReduce job using MRUnit30 Minutes
- Input and Output Formats and Customized Partitioning7
- 11.1Introducing the File Input Format14 Minutes
- 11.1Text And Sequence File Formats13 Minutes
- 11.1Data partitioning using a custom partitioner10 Minutes
- 11.1Make the custom partitioner real in code7 Minutes
- 11.1Total Order Partitioning10 Minutes
- 11.1Input Sampling, Distribution, Partitioning and configuring these10 Minutes
- 11.1Secondary Sort9 Minutes
- Recommendation Systems using Collaborative Filtering4
- Hadoop as a Database7
- 13.1Structured data in Hadoop14 Minutes
- 13.1Running an SQL Select with MapReduce14 Minutes
- 13.1Running an SQL Group By with MapReduce15 Minutes
- 13.1A MapReduce Join – The Map Side14 Minutes
- 13.1A MapReduce Join – The Reduce Side14 Minutes
- 13.1A MapReduce Join – Sorting and Partitioning13 Minutes
- 13.1A MapReduce Join – Putting it all together9 Minutes
- K-Means Clustering7
- 14.1What is K-Means Clustering?14 Minutes
- 14.1A MapReduce job for K-Means Clustering14 Minutes
- 14.1K-Means Clustering – Measuring the distance between points17 Minutes
- 14.1K-Means Clustering – Custom Writables for Input/Output14 Minutes
- 14.1K-Means Clustering – Configuring the Job8 Minutes
- 14.1K-Means Clustering – The Mapper and Reducer11 Minutes
- 14.1K-Means Clustering : The Iterative MapReduce Job11 Minutes
- Setting up a Hadoop Cluster3
- Appendix2
An ex-Google, Stanford and Flipkart team
Loonycorn is a team by Janani Ravi and Vitthal Srinivasan, product of Stanford University and IIM Ahmedabad.
We hold several years of working experience in the field of technology in Bay Area, New York, Singapore and Bangalore.
Janani Ravi: 7 Years of work experience (Google, Flipkart and Microsoft)
Vitthal Srinivasan: Worked at Google, Flipkart, Credit Suisse and INSEAD
We have come together to teach and educate on various technological courses in the most easiest and entertaining manner. Also, our courses will be based on practical elaborations & illustrations.
Courses you might be interested in
-
15 Lessons
-
10 Lessons
-
13 Lessons
-
39 Lessons