Hadoop: The Definitive Guide is a book about Apache Hadoop by Tom White, published by O'Reilly Media.
You can buy it in electronic and paper forms from O'Reilly (including via Safari Books Online), or in paper form from Amazon (US, UK), and many other sources. A Japanese translation is also available.
Browse the table of contents.
The book's example code (also on GitHub) comes with a subset of the NCDC weather dataset that is used throughout the book.
I have made the full dataset available as a public Amazon EBS snapshot (snap-6cb6d205). To use this snapshot with a Hadoop cluster running on EC2, you need to: i) start the cluster (in the "us-east-1a" zone), ii) create an EBS volume from the snapshot (75GB or larger), iii) attach it to the master instance in the cluster, iv) use hadoop fs put to transfer the data to HDFS.
The O'Reilly site allows you to view and submit errata and reviews. You can also email comments to tom@hadoopbook.com.
I occasionally write about the book on my blog, and on Twitter.
Now you have the opportunity to learn about Hadoop from a master—not only of the technology, but also of common sense and plain talk.
Hadoop: The Definitive Guide just landed on my desk. Looks awesome. A must-have skillset for 21st c developers...
Tom White's book, Hadoop: The Definitive Guide, is the best technical book I've ever read. Hands down.
"Hadoop: The Definitive Guide" by Tom White is both information dense and highly readable, I'm learning a huge amount from it #bookreview
Deep Diving into #hadoop. Being very tippety-toed. OReilly Hadoop book by Tom White is the shining beacon of hope in these dark waters