Running Hadoop On Ubuntu Linux (Single-Node Cluster) @ Michael G. Noll

In this tutorial I will describe the required steps for setting up a pseudo-distributed, single-node Hadoop cluster backed by the Hadoop Distributed File System, running on Ubuntu Linux. Hadoop is a framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File System (GFS) and of the MapReduce computing paradigm. Hadoop’s HDFS is a highly fault-tolerant distributed file system and, like Hadoop in general, designed to be deployed on low-cost hardware. It provides high throughput access to application data and is suitable for applications that have large data sets. The main goal of this tutorial is to get a simple Hadoop installation up and running so that you can play around with the software and learn more about it. This tutorial has been tested with the following software versions: Ubuntu Linux 10.04 LTS (deprecated: 8.10 LTS, 8.04, 7.10, 7.04) Hadoop 1.0.3, released May 2012 Sun Java 6 Disabling IPv6
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
Installing Ubuntu inside Windows using VirtualBox
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. The screenshots in this tutorial use Ubuntu 12.04, but the same principles apply also to Ubuntu 12.10, 11.10, 10.04, and any future version of Ubuntu. Actually, you can install pretty much any Linux distribution this way.
Installing Ubuntu
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. Future versions of this will be posted to my blog. NotesInstalling Ubuntu Notes
Login as Root in Ubuntu 11.04 (Natty Narwhal)
For new users who just starting with Ubuntu and wanting to know about the root account, here’s a brief post that gives you a little insight about the root account and how / why to use it. You see, every Ubuntu edition comes with a root account. The root account is also known as the administrator account.
» VirtualBox: access Windows-host shared folders from Ubuntu-guest
This is the scenario that you run Windows as your host operating system and Ubuntu in a VirtualBox, and that you want to access a specific Windows folder from Ubuntu. First you have to make sure that have install Guest Additions. From the VirtualBox’s menu go to Devices → Install Guest Additions… This will mount a virtual CD on your /media/cdrom.
Running Apache Ant
Command Line If you've installed Apache Ant as described in the Installing Ant section, running Ant from the command-line is simple: just type ant. When no arguments are specified, Ant looks for a build.xml file in the current directory and, if found, uses that file as the build file and runs the target specified in the default attribute of the <project> tag.
Apache ANT Configuration in Ubuntu « D.Lak Says
I am using Ubuntu Linux 9.04. JDK is located in “/usr/lib/jvm/java-6-sun” and ANT is located in “/usr/share/ant” in my machine. Following instruction shows instruction for configuration. If you don’t know JAVA_HOME, you can retrieve with command “sudo update-alternatives –config java” as following.
Pig Setup
Overview Requirements Unix and Windows users need the following: Hadoop 20 - Java 1.6 - Set JAVA_HOME to the root of your Java installation.
Pig Tutorial
Overview The Pig tutorial shows you how to run two Pig scripts in local mode and mapreduce mode. Local Mode: To run the scripts in local mode, no Hadoop or HDFS installation is required. All files are installed and run from your local host and file system.
How to Install MySQL in Ubuntu 10.04
Database management system is necessary every where whether it’s a web application or other business purpose software;in ubuntu 10.04 you may like to use MySQL,the fastest database management system for web applications. MySQL is one of the most widely used database management system and it is light,free and open source so it is also in resonance with ubuntu philosophy.The aim of this post is to help you in installing mysql on your ubuntu 10.04 to help you in getting started with mysql. Install MySQL on Ubuntu 10.04 1. First,Open SPM(Synaptic Package Manager) from System->Administration->Synaptic Package Manager.Enter your password(Ubuntu login account) if it ask else,proceed to next step.2.
GettingStartedWithHadoop
Note: for the 1.0.x series of Hadoop the following articles will probably be easiest to follow: The below instructions are primarily for the 0.2x series of Hadoop. Hadoop can be downloaded from one of the Apache download mirrors. You may also download a nightly build or check out the code from subversion and build it with Ant. Select a directory to install Hadoop under (let's say /foo/bar/hadoop-install) and untar the tarball in that directory. A directory corresponding to the version of Hadoop downloaded will be created under the /foo/bar/hadoop-install directory.
Welcome to Apache Pig!
Welcome to Apache™ Hadoop™!
Related:
Related: