AI

TwitterFacebook
Get flash to fully experience Pearltrees
The process of extracting patterns from data is called data mining. It is recognized as an essential tool by modern business since it is able to convert data into business intelligence thus giving an informational edge. At present, it is widely used in profiling practices, like surveillance, marketing, scientific discovery, and fraud detection. There are four kinds of tasks that are normally involve in Data mining: * Classification - the task of generalizing familiar structure to employ to new data * Clustering - the task of finding groups and structures in the data that are in some way or another the same, without using noted structures in the data.

5 of the Best Free and Open Source Data Mining Software | TechSource

http://www.junauza.com/2010/11/free-data-mining-software.html
November 5, 2011 Bob and Alice sit in a cubicle at the end of the floor. For much of their working day they are pissed off. They are forced to do their job using a tool implemented by The Project some years ago.

:julianbrowne => @home

http://www.julianbrowne.com/
http://en.wikipedia.org/wiki/Factor_graph In probability theory and its applications, a factor graph is a particular type of graphical model , with applications in Bayesian inference , that enables efficient computation of marginal distributions through the sum-product algorithm . One of the important success stories of factor graphs and the sum-product algorithm is the decoding of capacity-approaching error-correcting codes , such as LDPC and turbo codes . A factor graph is a bipartite graph representing the factorization of a function. Given a factorization of a function

Factor graph - Wikipedia, the free encyclopedia

Sebastian Thrun's Homepage

http://robots.stanford.edu/ You can take my latest class online and for free at UDACITY . Udacity stands for "we are audacious, for you, the student". I am against education that is only available to the top 1% of all students. I am against tens of thousands of dollars of tuition expenses. I am against the imbalance that the present system brings to the world.
Playgrounds

http://aichallenge.org/

Home | AI Challenge

Congratulations the ants contest winner, Mathis Lichtenberger. He is a second year computer science student at the University of Lubeck. His winning bot ( post mortem ) is called xathis .
Because we want to give kick-ass product recommendations. I'm showing you how to find related items based on a really simple formula. If you pay attention, this technique is used all over the web (like on Amazon) to personalize the user experience and increase conversion rates. To get one question out of the way: there are already many available libraries that do this, but as you'll see there are multiple ways of skinning the cat and you won't be able to pick the right one without understanding the process, at least intuitively. To find similar items to a certain item, you've got to first define what it means for 2 items to be similar and this depends on the problem you're trying to solve: on a blog, you may want to suggest similar articles that share the same tags, or that have been viewed by the same people viewing the item you want to compare with http://bionicspirit.com/blog/2012/01/16/cosine-similarity-euclidean-distance.html

Data Mining: Finding Similar Items and Users

http://wiki.xkcd.com/irc/Bucket#Bucket_Overview From XKCD Wiki Bucket has an outer shell of metal [ citation needed ] ; within the metal is a protective layer of high density plastic [ citation needed ] , in which may or may not reside pure HOH [ citation needed ] . There can only be speculation about what else the Bucket contains. [ citation needed ] Only the ops are allowed to cull stupid or useless factoids. And they will, boy howdy they will. The only exception to this rule is if the factoid is about you -- if someone writes an abusive factoid about you and the ops have not corrected it, please feel free to edit it yourself.

Bucket - XKCD Wiki

http://ias.cs.tum.edu/home

Intelligent Autonomous Systems - Home

The kick-off meeting of the RoboHow project took place on Tuesday, March 13th, at TU München. With more than 40 participants, it was a very successful meeting and a great start into this exciting project! The meeting was complemented by a workshop about the different control engineering approaches to be used in the project.
http://www.readwriteweb.com/tag/Semantic+Web

Semantic web

The question being posed by a new generation of news readers who now depend more upon online sources than any other, is whether the editorial process for deciding the precedence of articles in a publication - for deciding what you read, when you read the publication - matters. In a world full of thousands of "sources," some of them actually legitimate, most Web readers today have adopted a pick-and-choose mentality. In many cases, they end up making those choices based on headlines and not their sources. (Just a reminder to that end, you're currently reading ReadWriteWeb.)
We design and fabricate interactive exhibits, ball machine sculptures, public art, and a variety of special projects. We are artists, designers, craftspeople, and engineers who create objects and environments that encourage creativity, support social interaction, and inspire self-confidence. We begin each project with extensive research, brainstorming and testing to understand the intended experience. We then build and test prototypes with representative users, complete the final design and fabrication ourselves, install our work, and make modifications if necessary.

Creative Machines Inc.

http://www.creativemachines.com/

Front | Cornell Creative Machines Lab

The Creative Machines Lab comprises researchers from the various disciplines of engineering, computer science, physics, math and biology. We look at self-organization and evolutionary phenomena, and their application to both engineering design automation and understanding the emergence of complexity in natural systems. We develop abstract models and applications in various domains as well as build working systems to test new ideas.

5 de los mejores software de minería de datos de Código Libre y Abierto | El rincón de JMACOE

El proceso de extracción de patrones a partir de datos se llama minería de datos. Es reconocida como una herramienta esencial de los negocios modernos, ya que es capaz de convertir los datos en inteligencia de negocios dando así una ventaja de información. Actualmente, es ampliamente utilizado en las prácticas de perfil, como vigilancia, comercialización, descubrimientos científicos, y detección de fraudes. Agrupamiento – la tarea de encontrar grupos y estructuras en los datos que son de alguna manera u otra lo mismo, sin necesidad de utilizar las estructuras observadas en los datos. Para aquellos que están buscando algunas herramientas de minería de datos, aquí están cinco de las mejores herramientas de software de código abierto para minería de datos que puedes obtener de forma gratuita:
The latest release of LingPipe is LingPipe 4.1.0, which is a feature release, as well as patching some bugs. It is fully backward compatible with LingPipe version 4.0.1. Character, Token, and Document Suffix Arrays

LingPipe Home

Open source is a great choice for many text analytics users, especially folks who have programming skills, who need custom capabilities or who are trying to get a feel for possibilities before committing themselves. Excellent options are available for all these users. Tools such as Gate, NLTK, R and RapidMiner share the low cost, power, flexibility and community that have driven adoptionof open-source software by individual users and enterprises alike. RapidMiner even combines text processing with business intelligence (BI) and visualization functions. This article will look at open source text analytics, focusing on those four tools. ( UIMA , the open source Unstructured Information Management Architecture, is a rich topic in itself, one that merits its own article.) I will suggest a number of resources that will help you get started.

Open Source Text Analytics by Seth Grimes - BeyeNETWORK

Inside Google, MapReduce is used for 80% of all the data processing needs. That includes indexing web content , running the clustering engine for Google News , generating reports for popular queries ( Google Trends ), processing satellite imagery , language model processing for statistical machine translation and even mundane tasks like data backup and restore. The other 20% is handled by a lesser known infrastructure called “Pregel” which is optimized to mine relationships from “graphs”. According to wikipedia a “graph” is a collection of vertices or ‘nodes’ and a collection of ‘edges’ that connect pair of ‘nodes’. Depending on the requirements, a ‘graph’ can be undirected which means there is no distinction between the two ‘nodes’ in the graph, or it could be directed from one ‘node’ to another.

Pregel: Google’s other data-processing infrastructure | Scalable web architectures

ROC

Books

Vizz