Sadržaj:
  • Introducing Hadoop and seeing what it's good for
  • Common use cases for big data in Hadoop
  • Setting up your Hadoop environment
  • Storing data in Hadoop : the Hadoop distributed file system
  • Reading and writing data
  • MapReduce programming
  • Frameworks for processing data in Hadoop : YARN and MapReduce
  • Pig : Hadoop programming made easier
  • Statistical analysis in Hadoop
  • Developing and scheduling application workflows with Oozie
  • Hadoop and the data warehouse : friends or foes?
  • Extremely big tables : storing data in HBase
  • Applying structure to Hadoop data with Hive
  • Integrating Hadoop with relational databases using Sqoop
  • The holy grail : native SQL access to Hadoop data
  • Deploying Hadoop
  • Administering your Hadoop cluster
  • Ten Hadoop resources worthy of a bookmark
  • Ten reasons to adopt Hadoop.