hadoop
Here are some of my initial thoughts.
RFC: Efficient file caching (on Hadoop Task nodes, for benefit of MapReduce Tasks)
------------------------------------------------------
We will start implementing this soon. Please provide feedback and improvements to this plan.
In this tutorial, I will describe the steps required to set up a pseudo-distributed, single-node Hadoop cluster, backed by the Hadoop Distributed File System (HDFS), on Ubuntu Linux.