《HadoopMapReduceV2参考手册(第2版影印版英文版)》开篇介绍了HadoopYARN、MapReduce、HDFs以及其他Hadoop生态系统组件的安装。在《HadoopMapReduceV2参考手册(第2版影印版英文版)》的指引下,你很快就会学习到很多激动人心的主题,例如MapReduce模式,使用Hadoop处理分析、归类、在线销售、推荐、数据索引及搜索。你还会学习到如何使用包括Hive、HBase、Pig、Mahout、Nutch~BGiraph在内的Hadoop生态系统项目以及如何在云环境下进行部署。 PrefaceChapter1:GettingStartedwithHadooov2IntrOductiOnSettingupHadoopv2onyourlocalmachineWritingaWordCountMapReduceapplication,bundlingitandrunningitusingtheHadooplocalmodeAddingacombinersteptotheWordCountMapReduceprogramSettingupHDFSSettingupHadoopYARNinadistributedclusterenvironmentusingHadoopv2SettingupHadoopecosysteminadistributedclusterenvironmentusingaHadoopdistributionHDFScommand-linefileoperationsRunningtheWordCountprograminadistributedclusterenvironmentBenchmarkingHDFSusingDFSIOBenchmarkingHadoopMapReduceusingTeraSortChapter2:CloudDeployments—UsingHadoopYARNonCloudEnvironmentsIntroductionRunningHadoopMapReducev2computationsusingAmazonElasticMapReduceSavingmoneyusingAmazonEC2SpotInstancestoexecuteEMRjobflowsExecutingaPigscriptusingEMRExecutingaHivescriptusingEMRCreatinganAmazonEMRjobflowusingtheAWSCommandLineInterfaceDeployinganApacheHBaseclusteronAmazonEC2usingEMRUsingEMRbootstrapactionstoconfigureVMsfortheAmazonEMRjobsUsingApacheWhirrtodeployanApacheHadoopclusterinacloudenvironmentChapter3:HadoopEssentials—C0nfigurations,UnitTests,andOtherAPIsIntroductionOptimizingHadoopYARNandMapReducecOnfiguratiOnsforclusterdeploymentsShareduserHadoopclusters—-usingFairandCapacityschedulersSettingclasspathprecedencetouser-providedJARsSpeculativeexecutionofstragglingtasksUnittestingHadoopMapReduceapplicationsusingMRUnitIntegrationtestingHadoopMapReduceapplicationsusingMiniYarnClusterAddinganewDataNodeDecommissioningDataNodesUsingmultipledisks/volumesandlimitingHDFSdiskusageSettingtheHDFSblocksizeSettingthefilereplicationfactorUsingtheHDFsJavaAPIChapter4:Develooin~ComDlexHadoooMaoReduceAoolicationsIntrOductiOnChoosingappropriateHadoopdatatypesImplementingacustomHadoopWritabledatatypeImplementingacustomHadoopkeytypeEmittingdataofdifferentvaluetypesfromaMapperChoosingasuitableHadoopInputFormatforyourinputdataformatAddingsupportfornewinputdataformats——implementingacustomInputFormatFormattingtheresultsofMapReducecomputations——usingHadoopOutputFormatsWritingmultipleoutputsfromaMapReducecomputationHadoopintermediatedatapartitioningSecondarysorting——sortingReduceinputvaluesBrOadcastinganddistributingsharedresourcestotasksinaMapReducejob—HadoopDistributedCacheUsingHadoopwithlegacyapplications—-HadoopstreamingAddingdependenciesbetweenMapReducejobsHadoopcounterstoreportcustommetricsChapter5:AnalvticsIntroductionSimpleanalyticsusingMapReducePerformingGROUPBYusingMapReduceCalculatingfrequencydistributionsandsortingusingMapReducePlottingtheHadoopMapReduceresultsusinggnuplotCalculatinghistogramsusingMapReduceCalculatingScatterplotsusingMapReduceParsingacomplexdatasetwithHadoopJoiningtwodatasetsusingMapReduceChapter6:HadoooEcosystem—Apach
阅读更多