products

TwitterFacebook
Get flash to fully experience Pearltrees
Mavuno: A Hadoop-Based Text Mining Toolkit From the webpage: Mavuno is an open source, modular, scalable text mining toolkit built upon Hadoop. http://tm.durusau.net/?p=21139

Mavuno: Hadoop-Based Text Mining Toolkit

http://www.jamesserra.com/archive/2011/08/microsoft-sql-server-parallel-data-warehouse-pdw-explained/

Parallel Data Warehousing (PDW) Explained | James Serra's Blog

Microsoft SQL Server Parallel Data Warehouse (PDW), formally called by its code name “Project Madison”, is an edition of Microsoft’s SQL Server 2008 R2 that was released in December 2010. PDW is Microsoft’s reworking of the DatAllegro Inc. massive parallel processing ( MPP ) product that Microsoft acquired in July 2008. It only works with certain hardware (two so far), the first of which is HP Enterprise Data Warehouse Appliance ( Dell Parallel Data Warehouse Appliance is the other, with a couple more to come in the near future: IBM and Bull).
http://www.sys-con.com/node/2145007

Hadoop Quickstart: Use Whirr to automate standup of your distributed cluster on Rackspace

We have previously provided a Quickstart guide to standing up Rackspace cloud servers (and have one for Amazon servers as well).