Toolboxes

http://www.datasciencetoolkit.org/ API: /street2coordinates (Street Address to Location) calculates the latitude/longitude coordinates for a postal address; it is currently restricted to the US and UK. API: /file2text converts PDFs, Word documents and Excel spreadsheets to text, and recovers text from JPEG, PNG or TIFF images of scanned documents. Geodict

Data Science Toolkit
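As a rough illustration of the /street2coordinates call listed for the Data Science Toolkit, the sketch below builds a request URL and pulls coordinates out of a reply. It rests on two assumptions that the listing does not confirm: that the address is passed as a URL-encoded path segment, and that the reply is a JSON object keyed by address with "latitude" and "longitude" fields. The helper names and the sample reply are hypothetical, and no network request is made.

```python
import json
from urllib.parse import quote

BASE_URL = "http://www.datasciencetoolkit.org"  # site named in the listing


def street2coordinates_url(address, base_url=BASE_URL):
    """Build a GET URL for the /street2coordinates endpoint.

    Assumption: the address travels as a URL-encoded path segment.
    """
    return f"{base_url}/street2coordinates/{quote(address, safe='')}"


def extract_lat_lon(response_text):
    """Map each address in a JSON reply to a (latitude, longitude) pair.

    Assumption: the reply is a JSON object keyed by address whose values
    hold "latitude" and "longitude", or null when nothing matched.
    """
    reply = json.loads(response_text)
    coords = {}
    for address, details in reply.items():
        if details is None:
            coords[address] = None
        else:
            coords[address] = (details["latitude"], details["longitude"])
    return coords


# Hypothetical reply, hand-written for illustration only.
sample_reply = (
    '{"10 Example St, Springfield": {"latitude": 1.5, "longitude": -2.5},'
    ' "nowhere": null}'
)

print(street2coordinates_url("10 Example St, Springfield"))
print(extract_lat_lon(sample_reply))
```

Keeping URL construction and reply parsing in separate functions means the parsing step can be checked against a saved reply before any live request is attempted.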

HiDE: Hierarchical Data Explorer

HiDE is software for visually exploring categorical data using hierarchical layouts. Different orders, sizes, colours and layouts help reveal or emphasise different aspects of the data. HiDE lets you interactively create novel hierarchical displays that share many of the characteristics of well-known and established statistical graphics such as small multiples, spine plots, cartograms, scatter plots, mosaic plots and treemaps. This work was produced as part of the vizTweets project, a Virtual Research Environment Rapid Innovation (VRERI) Programme funded by JISC. Although the project is complete, work on HiDE continues. HiDE lets you build graphical views of categorical data where categories can be sized, ordered and coloured by various summary statistics, and tweet views that are interesting. http://www.gicentre.org/hide/

Homepage — Modular toolkit for Data Processing (MDP)

From the user’s perspective, MDP is a collection of supervised and unsupervised learning algorithms and other data processing units that can be combined into data processing sequences and more complex feed-forward network architectures. From the scientific developer’s perspective, MDP is a modular framework, which can easily be expanded. The implementation of new algorithms is easy and intuitive. http://mdp-toolkit.sourceforge.net/index.html
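The idea of combining trainable processing units into a sequence can be sketched in plain Python. This is a minimal illustration of the concept only: the classes `CenterNode`, `ScaleNode` and `Flow` below are hypothetical, written for this example, and are not MDP's own classes or API.

```python
class CenterNode:
    """Trainable unit: learns the mean in training, subtracts it afterwards."""

    def __init__(self):
        self.mean = 0.0

    def train(self, xs):
        self.mean = sum(xs) / len(xs)

    def execute(self, xs):
        return [x - self.mean for x in xs]


class ScaleNode:
    """Trainable unit: learns the largest magnitude, then divides by it."""

    def __init__(self):
        self.scale = 1.0

    def train(self, xs):
        # Fall back to 1.0 so all-zero input does not cause division by zero.
        self.scale = max(abs(x) for x in xs) or 1.0

    def execute(self, xs):
        return [x / self.scale for x in xs]


class Flow:
    """Sequence of units; each one is trained on the previous one's output."""

    def __init__(self, nodes):
        self.nodes = list(nodes)

    def train(self, xs):
        for node in self.nodes:
            node.train(xs)
            xs = node.execute(xs)

    def execute(self, xs):
        for node in self.nodes:
            xs = node.execute(xs)
        return xs


flow = Flow([CenterNode(), ScaleNode()])
flow.train([2.0, 4.0, 6.0])
print(flow.execute([2.0, 4.0, 6.0]))
```

Because every unit exposes the same `train` and `execute` methods, a new unit can be dropped into the sequence without changing `Flow`, which is the kind of modularity the description refers to.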

DataSift: Realtime Social Data Mining Platform

Any content posted from the Middle East, including Yemen, Oman, the UAE, Qatar, Bahrain, Saudi Arabia, Kuwait, Israel, Jordan, Turkey, Cyprus, Lebanon, Syria, Iraq and Iran. http://datasift.net/
http://www.knime.org/

KNIME | Konstanz Information Miner

KNIME (Konstanz Information Miner) is a user-friendly and comprehensive open-source data integration, processing, analysis, and exploration platform. From day one, KNIME has been developed using rigorous software engineering practices and is used by professionals in both industry and academia in over 60 countries. KNIME grows to accommodate your demand for data analytics. While KNIME Team Space suits small teams, the KNIME Server provides support for a full corporate setting including user authentication, remote execution, scheduling, SOA integration and a configurable web browser user interface.
http://openbixo.org/

Open Source Web Mining Toolkit | Bixo

Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading pipe assembly, you can quickly create specialized web mining applications that are optimized for a particular use case. Take a look at the Getting Started page, and also the list of resources (mailing list, bug database, source code, etc.). Bixo is an open source project released under the Apache License, Version 2.0. Note that Bixo relies on Cascading, which is released under the GNU General Public License, version 3. Bixo Architecture
"Thank you so much for a great product and great support. I am very pleased with this support package so far, it has increased my productivity amazingly." http://rapid-i.com/content/view/181/190/

Rapid - I - RapidMiner