
crawler/scraper
Get flash to fully experience Pearltrees
HTTrack Website Copier - Offline Browser
It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site's relative link-structure. Simply open a page of the "mirrored" website in your browser, and you can browse the site from link to link, as if you were viewing it online.The Open Graph protocol enables any web page to become a rich object in a social graph. For instance, this is used on Facebook to allow any web page to have the same functionality as any other object on Facebook. While many different technologies and schemas exist and could be combined together, there isn't a single technology which provides enough information to richly represent any web page within the social graph. The Open Graph protocol builds on these existing technologies and gives developers one thing to implement. Developer simplicity is a key goal of the Open Graph protocol which has informed many of the technical design decisions . To turn your web pages into graph objects, you need to add basic metadata to your page.
The Open Graph Protocol
MIT Computer Science and Artificial Intelligence Laboratory | CS
MIT CSAIL Project Could Transform Robotic Design and Productionnutch/mapReduce/hadloop
references
google
algorithms
applications
DanWeld

