Hadoop with Python


Title: Hadoop with Python

Authors: Donald Miner, Zachary Radka

License: Available for free from O’Reilly

Why This Book?

Hadoop is one of the most popular open-source distributed processing frameworks for storing big data and managing data processing. Hadoop is written mostly in Java, but it can also be used with other programming languages, such as Python.

Python can be used with Hadoop's distributed file system, and that is what this book teaches. You will also learn about MapReduce, the Apache Pig platform and Pig Latin scripts, and the Apache Spark cluster-computing framework, all using Python with Hadoop.

The two authors do their best to explain each concept clearly through a variety of examples.

Some other interesting books:

Hadoop and Kerberos: The Madness Beyond the Gate

Disruptive Possibilities: How Big Data Changes Everything
