Crawler/scraper

Facebook Twitter

HTTrack Website Copier - Offline Browser

Version 3.48-3 (04/11/2014) Engine fixes (keep-alive, redirects, new hashtables, unit tests) Installing HTTrack: Go to the download section now! HTTrack Website Copier - Offline Browser
Introduction The Open Graph protocol enables any web page to become a rich object in a social graph. For instance, this is used on Facebook to allow any web page to have the same functionality as any other object on Facebook. While many different technologies and schemas exist and could be combined together, there isn't a single technology which provides enough information to richly represent any web page within the social graph. The Open Graph protocol builds on these existing technologies and gives developers one thing to implement. Developer simplicity is a key goal of the Open Graph protocol which has informed many of the technical design decisions.

The Open Graph Protocol

The Open Graph Protocol
nutch/mapReduce/hadloop

references

google

algorithms

applications

DanWeld

Software Agent: MIT Media Lab
Nutch Latest step by Step Installation guide for dummies: Nutch 0.9 By Peter P. Wang, Zillionics LLC Try the search engine I developed for The Christian Life: Malachi Search Please support my effort by using the best free/low price web hosting: 1&1 Inc peterwang@zillionics.com Nutch