E-books in Russian and English



<<< >>>

Welcome
(Seattle, Washington, United States)

 


Enter · Register · Search

 
 
   
 
 
 
« Ноябрь 2012
Пн Вт Ср Чт Пт Сб Вс
 1234
567891011
12131415161718
19202122232425
2627282930 
 
One Week Top10:
 2  3   4

Hadoop in Practice

date: 27 ноября 2012 / author: izograv / категория: Java/Javacript / views: 996 / comments: 0

Hadoop in Practice by Alex Holmes



Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log file loader. You'll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you'll find yourself growing more comfortable with Hadoop and at home in the world of big data.
About the Technology

Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.
About the Book

Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs.

This book assumes the reader knows the basics of Hadoop.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.
What's Inside

Conceptual overview of Hadoop and MapReduce
85 practical, tested techniques
Real problems, real solutions
How to integrate MapReduce and R

Table of Contents

PART 1 BACKGROUND AND FUNDAMENTALS
Hadoop in a heartbeat
PART 2 DATA LOGISTICS
Moving data in and out of Hadoop
Data serialization?working with text and beyond
PART 3 BIG DATA PATTERNS

Applying MapReduce patterns to big data
Streamlining HDFS for big data

Diagnosing and tuning performance problems
PART 4 DATA SCIENCE
Utilizing data structures and algorithms
Integrating R and Hadoop for statistics and more
Predictive analytics with Mahout
PART 5 TAMING THE ELEPHANT
Hacking with Hive
Programming pipelines with Pig

Crunch and other technologies
Testing and debugging




 

Comments: 0

 
 
Year Top:
2011
2010
2009
2008
2007
2006
 
  

 


 

Design/Web/Support/Anti-Leech by izograv @ yandex.ru
Optimized for Firefox | Anti-Leech tested on IE, Firefox, Reget