
AI
Get flash to fully experience Pearltrees
The process of extracting patterns from data is called data mining. It is recognized as an essential tool by modern business since it is able to convert data into business intelligence thus giving an informational edge. At present, it is widely used in profiling practices, like surveillance, marketing, scientific discovery, and fraud detection. There are four kinds of tasks that are normally involve in Data mining: * Classification - the task of generalizing familiar structure to employ to new data * Clustering - the task of finding groups and structures in the data that are in some way or another the same, without using noted structures in the data.
5 of the Best Free and Open Source Data Mining Software | TechSource
November 5, 2011 Bob and Alice sit in a cubicle at the end of the floor. For much of their working day they are pissed off. They are forced to do their job using a tool implemented by The Project some years ago.
:julianbrowne => @home
Factor graph - Wikipedia, the free encyclopedia
Sebastian Thrun's Homepage
Playgrounds
Home | AI Challenge
Congratulations the ants contest winner, Mathis Lichtenberger. He is a second year computer science student at the University of Lubeck. His winning bot ( post mortem ) is called xathis .Because we want to give kick-ass product recommendations. I'm showing you how to find related items based on a really simple formula. If you pay attention, this technique is used all over the web (like on Amazon) to personalize the user experience and increase conversion rates. To get one question out of the way: there are already many available libraries that do this, but as you'll see there are multiple ways of skinning the cat and you won't be able to pick the right one without understanding the process, at least intuitively. To find similar items to a certain item, you've got to first define what it means for 2 items to be similar and this depends on the problem you're trying to solve: on a blog, you may want to suggest similar articles that share the same tags, or that have been viewed by the same people viewing the item you want to compare with
Data Mining: Finding Similar Items and Users
Bucket - XKCD Wiki
Intelligent Autonomous Systems - Home
The kick-off meeting of the RoboHow project took place on Tuesday, March 13th, at TU München. With more than 40 participants, it was a very successful meeting and a great start into this exciting project! The meeting was complemented by a workshop about the different control engineering approaches to be used in the project.Semantic web
The question being posed by a new generation of news readers who now depend more upon online sources than any other, is whether the editorial process for deciding the precedence of articles in a publication - for deciding what you read, when you read the publication - matters. In a world full of thousands of "sources," some of them actually legitimate, most Web readers today have adopted a pick-and-choose mentality. In many cases, they end up making those choices based on headlines and not their sources. (Just a reminder to that end, you're currently reading ReadWriteWeb.)We design and fabricate interactive exhibits, ball machine sculptures, public art, and a variety of special projects. We are artists, designers, craftspeople, and engineers who create objects and environments that encourage creativity, support social interaction, and inspire self-confidence. We begin each project with extensive research, brainstorming and testing to understand the intended experience. We then build and test prototypes with representative users, complete the final design and fabrication ourselves, install our work, and make modifications if necessary.
Creative Machines Inc.
Front | Cornell Creative Machines Lab
The Creative Machines Lab comprises researchers from the various disciplines of engineering, computer science, physics, math and biology. We look at self-organization and evolutionary phenomena, and their application to both engineering design automation and understanding the emergence of complexity in natural systems. We develop abstract models and applications in various domains as well as build working systems to test new ideas.5 de los mejores software de minería de datos de Código Libre y Abierto | El rincón de JMACOE
El proceso de extracción de patrones a partir de datos se llama minería de datos. Es reconocida como una herramienta esencial de los negocios modernos, ya que es capaz de convertir los datos en inteligencia de negocios dando así una ventaja de información. Actualmente, es ampliamente utilizado en las prácticas de perfil, como vigilancia, comercialización, descubrimientos científicos, y detección de fraudes. Agrupamiento – la tarea de encontrar grupos y estructuras en los datos que son de alguna manera u otra lo mismo, sin necesidad de utilizar las estructuras observadas en los datos. Para aquellos que están buscando algunas herramientas de minería de datos, aquí están cinco de las mejores herramientas de software de código abierto para minería de datos que puedes obtener de forma gratuita:The latest release of LingPipe is LingPipe 4.1.0, which is a feature release, as well as patching some bugs. It is fully backward compatible with LingPipe version 4.0.1. Character, Token, and Document Suffix Arrays
LingPipe Home
Open source is a great choice for many text analytics users, especially folks who have programming skills, who need custom capabilities or who are trying to get a feel for possibilities before committing themselves. Excellent options are available for all these users. Tools such as Gate, NLTK, R and RapidMiner share the low cost, power, flexibility and community that have driven adoptionof open-source software by individual users and enterprises alike. RapidMiner even combines text processing with business intelligence (BI) and visualization functions. This article will look at open source text analytics, focusing on those four tools. ( UIMA , the open source Unstructured Information Management Architecture, is a rich topic in itself, one that merits its own article.) I will suggest a number of resources that will help you get started.
Open Source Text Analytics by Seth Grimes - BeyeNETWORK
Inside Google, MapReduce is used for 80% of all the data processing needs. That includes indexing web content , running the clustering engine for Google News , generating reports for popular queries ( Google Trends ), processing satellite imagery , language model processing for statistical machine translation and even mundane tasks like data backup and restore. The other 20% is handled by a lesser known infrastructure called “Pregel” which is optimized to mine relationships from “graphs”. According to wikipedia a “graph” is a collection of vertices or ‘nodes’ and a collection of ‘edges’ that connect pair of ‘nodes’. Depending on the requirements, a ‘graph’ can be undirected which means there is no distinction between the two ‘nodes’ in the graph, or it could be directed from one ‘node’ to another.
Pregel: Google’s other data-processing infrastructure | Scalable web architectures
ROC
Books
Vizz

