- What is Big Data
- Data Facts
- Aspects / Principle of Big Data
- Difference b/w Big Data & Traditional BI
- Examples: Where to use Big Data
- Big Data Business Opportunities
- Distributed File System computation with Facebook Example
- Understanding File System working
- Basic commands of LINUX
- Shell scripting
- Use Cases – Assignments
- Introduction to OOP’s concept
- Understanding on Data types
- Functions
- Methods
- Setup of Eclipse
- Coding examples
- Use Cases : Assignments
- Concepts of Python
- Data Types in Python
- Exception Handling in Python
- File Handling in Python
- Introduction to Flume
- Setup of Flume Components
- Source
- Sink
- Channel
- Agents
- Use Cases – Assignments
- Introduction to SPARK
- Understanding of RDD, Contexts
- Developing Application in SPARK
- Use Cases – Assignments
- Understanding Hadoop Architect
- Basic Understanding of Hadoop core components
- In depth understanding of HDFS
- Understanding HDFS services – NameNode & DataNode
- Understanding on File System Read & Write
- Real-time Cluster setup based on requirement
- Understanding YARN Architect
- Architect Difference b/w Hadoop 1.x & Hadoop 2.x
- Understanding File System Read & Write
- Hadoop 1.x
- Environment Settings
- Pseudo Mode Installation
- Distributed Mode Installation
- Basic configuration of Hadoop properties
- Understanding in-built scripts
- Running Basic Map Reduce code
- Hadoop 2.x
- Environment Settings
- Distributed Mode Installation
- Configuration of Hadoop properties
- Running Basic Map Reduce Code
- Hadoop File system commands
- Introduction to Sqoop
- Setup of Sqoop
- Sqoop Import commands
- Sqoop Export commands
- Formats in Sqoop
- Use Cases – Assignments
- Introduction to PIG Latin
- Setup of Pig
- Independent Mode
- Map Reduce Mode
- Basic commands in Pig
- Functions in Pig
- Developing UDF’s in Java
- Use Cases – Assignments
- Understanding of Map Reduce services – JobTracker &
TaskTracker - Map Reduce Flow Chart
- Map Reduce Phases
- Mapper
- Reducer
- Splitting
- Sorting
- Shuffling
- Combiner
- Partitioning
- Developing Map Reduce applications – JAVA Code
- Developing Map Reduce applications – Python Code
- Discussion on Input File Formats
- Difference b/w Old MR API & New MR API
- Use Cases – Assignments
- Introduction to HIVE Architecture
- Setup of Hive
- Basic queries in HIVE
- Advance Features of HIVE
- Partitioning
- Bucketing
- Serialize & De-serialize
- Query optimization in Hive
- Use Cases – Assignments