background preloader

Data Mining

Facebook Twitter

RapidMiner. RapidMiner is a software platform developed by the company of the same name that provides an integrated environment for machine learning, data mining, text mining, predictive analytics and business analytics. It is used for business and industrial applications as well as for research, education, training, rapid prototyping, and application development and supports all steps of the data mining process including results visualization, validation and optimization.[1] RapidMiner is developed on a business source model which means only the previous version of the software is available under an OSI-certified open source license on Sourceforge.[2] A Starter Edition is available for free download, a Personal Edition is offered for US$999, a Professional Edition is $2,999 and pricing for the Enterprise Edition is available from the developer.[3] History[edit] Description[edit] RapidMiner uses a client/server model with the server offered as Software as a Service or on cloud infrastructures.[7]

AlphaWorks : Data Discovery and Query Builder : Overview. Free download: 10 terabytes of patents and trademarks. Pskomoroch's dataset Bookmarks on Delicious. Directory of Certified Firms | Washington State Office of Minority and Women's Business Enterprises. Search for Ratecenter in State. Shington Courts - Search Case Records. What is this website? It is a search engine of cases filed in the municipal, district, superior, and appellate courts of the state of Washington. The search results can point you to the official or complete court record. How can I obtain the complete court record? You can contact the court in which the case was filed to view the court record or to order copies of court records. How can I contact the court? Click here for a court directory with information on how to contact every court in the state. Can I find the outcome of a case on this website? How do I verify the information contained in the search results?

Can I use the search results to find out someone’s criminal record? Where does the information come from? Do the government agencies that provide the information for this site and maintain this site: Guarantee that the information is accurate or complete? Spatial data on the web by state: Geographic Information Systems (GIS) Lab at MIT. Corporations Data Extract Download. You can download an entire extract of the corporations search database in Text or XML format by clicking the links below. The data is provided for use in custom searches, mash-ups and other purposes as desired. For quicker download, the file has been compressed in ZIP format. The data is extracted nightly to reflect the changes made in the previous 24 hours.

The date of the extract is noted in the file. Average file size is 70 Mb compressed, 750 Mb uncompressed. Corporations Data Extract Text (tab delimited) - (last generated on 08/28/2014 2:35 AM) Corporations Data Extract XML - (last generated on 08/28/2014 2:37 AM) The text version contains two tab-delimited files, one with corporations data and the other with governing persons.

The XML contains all the data in one file. Because the files are so large, it is recommended that you use software that can import large amounts of data. This data is provided for informational purposes only. Data Mining SDK. Show me the data. Posted by Jon Udell under Uncategorized[20] Comments The emerging discipline of social data analysis and visualization faces two challenges. First, obviously, you need data. Then, more interestingly, you need to figure out ways for people to create, share, and collaboratively refine interpretations of the data. There are a handful of well-known and powerful sources of data. The OECD’s data, for example, drives several of the visualizations at IBM’s Many Eyes site. Where else can you find data for these kinds of tools and services to chew on? Sources I’ve used and discussed include Washington DC’s CAPStat and the Dartmouth Atlas of Health Care. For my own purposes, I’ve decided to keep track of these kinds of public data sources at del.icio.us/judell/publicdata.

There’s not a whole lot there, yet, but here’s one gem I discovered by way of a link to Gapminder: the United Nations Common Database. UN statistics finally liberated and free of charge! Amen. Like this: Like Loading... Some Datasets Available on the Web » Data Wrangling Blog.