background preloader

Data Mining

Facebook Twitter

Thank you - OutWit Hub. So, what is OutWit, in a word?

Thank you - OutWit Hub

OutWit is a Web collection engine for everyone. It runs on your Windows, MacOS or Linux machine and allows you to browse through and easily grab information, images, contacts or files from the Internet, in just a few clicks. Originally conceived for researchers and data managers, the program is bringing Web scraping tools to everyone for both business and personal use. Just browse the Web for pages that include the information you seek. OutWit will scan the pages to recognize the data structure and format it into tables, allowing you to rate it and easily export it to spreadsheets for later use. How to start? Have a look at our quick start examples. Here are a few more things you might enjoy testing, to start exploring OutWit Hub's features: In your search engine, search for any topic and click on the Next in Series arrow once or twice.

In the feedback menu, you will find an access to the bug report and suggestion pages. Available KNIME Extensions. KNIME extensions provide additional functionality for KNIME ranging from Excel support, R integration, JFreeChart for advanced visualisations, 100+ nodes wrapping machine learning algorithms from Weka, the Reporting extension, and much more.

Available KNIME Extensions

Below you can find a list of features. How they are installed is described on the update manager help page. In addition, we provide extensions from KNIME Labs and KNIME Community. R Integration With this feature that integrates the statistics project R into KNIME it is possible to run snippets of R code as one step of the workflow, open R views, or even learn models within R.

Sign in or create an account. Infochimps Data Marketplace + Commons: Download Sell or Share Databases, statistics, datasets for free.

Linked Data

Some Datasets Available on the Web » Data Wrangling Blog. Some Datasets Available on the Web The Datawrangling blog was put on the back burner last May while I focused on my startup.

Some Datasets Available on the Web » Data Wrangling Blog

Now that I have some bandwidth again, I am getting back to work on several pet projects (including the Amazon EC2 Cluster). I'm giving an EC2 talk at Pycon in March, so I'm really on the hook to wrap up that series of posts now. The event which prompted this long overdue blog post was another pet project: collecting public datasets. I keep an eye on topics of interest using tag subscriptions, and yesterday my feed was flooded with links to

Over the past year, I've been tagging interesting data I find on the web in This list is semi-static, but I'll keep adding links at Update (02/10/09): I have around 400 dataset bookmarks now (more than double the count when this post first appeared), so I've updated the list below. Web Search and Data Mining. Event: WSDM. My Data Mining Weblog » Web Search Interest of Data Mining in 2010. Following to my previous post titled “Data Mining Trends (2004-2010): By Country, City and Language“, I am eager to know what are the latest worldwide interests for data mining in 2010.

My Data Mining Weblog » Web Search Interest of Data Mining in 2010

Using Google Insights for Search, we can figured out Top 10 searches and what are the rising searches for data mining terms in 2010. Well, as expected top search term for data mining is “data mining techniques” with top three web pages in Google search engine listed as below: Meanwhile, top rising searches for data mining is “IEEE papers” with 120% rising percentage. With the current popularity of Facebook social networking site, it happened that “Facebook data mining” is also gaining top three rising searches for data mining terms. By the way, if you are interested, there is one Facebook group focusing in data mining on the first page of Google search.

Other Related Blogs. Data Mining Research - If you're new here, you may want to subscribe to my RSS feed.

Data Mining Research -

Thanks for visiting! I posted an earlier version of this data mining blog list in a previously on DMR. Here is an updated version (blogs recently added to the list have the logo “new”). I will keep this version up-to-date. You can access it at any time from the DMR top bar. Ryan Rosario. An Introduction to Data Mining-online book. Tags: Analytics, Australia, bi, book, Chart, conference, consulting, data, data mining, data visualization, editor, Free, Hospital, images, interview, KNIME, learning, media, Microsoft, mining, nature, network, online, open, open source, Paper, Pie chart, Professor S.

An Introduction to Data Mining-online book

Data Mining and Predictive Analytics.