Map Reduce Implementation with 4 jobs.
(1) Parsing Of DataSet
(2) Computing the overall dataset statistics of given data
(3) Computing per-category stats of given data
(4) Computing category overall stats of given data
Installing Anaconda,Hadoop,Hive,Scala,Spark in windows environment and execution of basic Python MapReduce Code.
Skills:Python, PySpark