Apache Hadoop* Community Spotlight: Hadoop MapReduce* PDF

Devaraj Das describes Apache MapReduce*, a powerful model for processing large data sets in parallel and the heart of the Apache Hadoop* system. This overview by an expert from the Apache Hadoop open-source community explains the MapReduce master-slave architecture, how MapReduce works, how jobs run in isolation in multitenant environments, the limitations of MapReduce, the significance of Apache YARN for overcoming scalability issues, and where the software is headed. Part of the Intel IT Center’s Hadoop* Community Spotlight series. A podcast of the interview is also available.