Hadoop::Streaming - Contains Mapper, Combiner and Reducer roles to simplify writing Hadoop Streaming jobs River stage zero No dependents

Hadoop::Streaming::* provides a simple perl interface to the Streaming interface of Hadoop. Hadoop is a system "reliable, scalable, distributed computing." Hadoop was developed at Yahoo! and is now maintained by the Apache Software Foundation. Hadoop...

SPAZM/Hadoop-Streaming-0.143060 - 02 Nov 2014 03:51:16 GMT - Search in distribution
  • Hadoop::Streaming::Mapper - Simplify writing Hadoop Streaming Mapper jobs. Write a map() function and let this role handle the Stream interface.
  • Hadoop::Streaming::Reducer - Simplify writing Hadoop Streaming jobs. Write a reduce() function and let this role handle the Stream interface. This Reducer roll provides an iterator over the multiple values for a given key.
  • Hadoop::Streaming::Combiner - Simplify writing Hadoop Streaming jobs. Combiner follows the same interface as Reducer. Requires a combine() function which will be called for each line of combiner data. Combiners are run on the same machine as the mapper as a pre-reduce reduction step.
  • 5 more results from Hadoop-Streaming ยป

Fsdb - a flat-text database for shell scripting River stage zero No dependents

JOHNH/Fsdb-2.68 - 19 Sep 2019 15:23:50 GMT - Search in distribution

Net::Amazon::HadoopEC2::Cluster - Representation of Hadoop-EC2 cluster River stage zero No dependents

A class Representing Hadoop-EC2 cluster...

DANJOU/Net-Amazon-HadoopEC2-0.02 - 22 Oct 2008 05:33:13 GMT - Search in distribution

3 results (0.035 seconds)