DocumentCloud. Visualization to connect the dots. This is a description of some of the proof-of-concept work that led to the Overview prototype, originally posted elsewhere.
Last month, my colleague Julian Burgess and I took a shot a peering into the Iraq War Logs by visualizing them in bulk, as opposed to using keyword searches in an attempt to figure out which of the 391,832 SIGACT reports we should be reading. Other people have created visualizations of this unique document set, such as plots of the incident locations on a map of Iraq, and graphs of monthly casualties. We wanted to go a step further, by designing a visualization based on the richest part of each report: the free text summary, where a real human describes what happened, in jargon-inflected English. We’ve found at least one technique that yields interesting results, a graph visualization where each document is node, and edges between them are weighted using cosine-similarity on TF-IDF vectors.
Google unveils 'smarter search' Web giant Google has unveiled new products that it says will push search in a new direction.
Google is using so-called semantic web technology to leverage the underlying data on websites to enhance results. "The race in search is far from over and innovation and continued improvement is absolutely pivotal," said Google's Marissa Mayer. Google said it could not afford to rest on its laurels in the quest to build the perfect search engine.