Natural Language Processing (NLP)

Facebook Twitter
nltk (Natural Language Toolkit)
Latent Semantics Analysis

Text extraction

Leximancer: From Words to Meaning to Insight | Hypermancer Search the ocean. Hypermancer allows you to find the stories in complex, unstructured information, in real time. It is designed for situations where a message is not a self-contained document, but just a fragment in one of many conversations, and knowing all the needed search terms at any time is impossible. Hypermancer is a latent search system for any unstructured data, in any domain, which guides the user from what they know to what they want to know. Leximancer: From Words to Meaning to Insight | Hypermancer
Leximancer: From Words to Meaning to Insight | Hypermancer
web mining module Latest Version: 2.6 Pattern is a web mining module for Python 2.4+. It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics) and data visualization (graph networks). Pattern 2.2 Pattern 2.2
Book - Natural Language Toolkit