Open Source Software

Facebook Twitter

JoBo. JoBo is a simple program to download complete websites to your local computer.

JoBo

Internally it is basically a web spider. The main advantage to other download tools is that it can automatically fill out forms (e.g. for automated login) and also use cookies for session handling. Compared to other products the GUI seems to be very simple, but the internal features matters ! YaCy Distributed Web Search. Writing a Web Crawler in the Java Programming Language. Oracle Oracle Oracle Technology Network > Java JavaOne JavaOne Sessions Available JavaOne sessions are now available for replay.

Writing a Web Crawler in the Java Programming Language

Posted 10/29/14 // Tags: java, JavaOne, Java 7 // Headlines Archive Software Downloads View All Downloads Top Downloads New Downloads What's New Java in the Cloud: Rapidly develop and deploy Java business applications in the cloud. How to write a multi-threaded webcrawler in Java. Table of Contents This page Here you can... ... learn how to write a multithreaded Java application... learn how to write a webcrawler... by the way learn how to write stuff that is object-oriented and reusable... or use the provided webcrawler more or less off-the-shelf.

How to write a multi-threaded webcrawler in Java

More or less in this case means that you have to be able to make minor adjustments to the Java source code yourself and compile it. You will need the Sun Java 2 SDK for this. BotSpot 2005 ®: the spot for all bots. WebSPHINX: A Personal, Customizable Web Crawler. Contents About WebSPHINX WebSPHINX ( Website-Specific Processors for HTML INformation eXtraction) is a Java class library and interactive development environment for web crawlers.

WebSPHINX: A Personal, Customizable Web Crawler

A web crawler (also called a robot or spider) is a program that browses and processes Web pages automatically. WebSPHINX consists of two parts: the Crawler Workbench and the WebSPHINX class library. Java tip: How to get a web page. Technologies: Java 5+ The starting point for building a link checker, web spider, or web page analyzer is, of course, to get the web page from the web server.

Java tip: How to get a web page

Java's java.net package includes classes to manage URLs and to open web server connections. This tip shows how to use them to a get text, image, audio, or data file from a web server. Introduction. Capturing Screen in Java,Capture Screen Shot,How to Capture Screen Using Java Swing. HTML Parser - HTML Parser. Download a Website for offline browsing. In this article, I guide you through the steps involved in designing a utility to download a Website.

Download a Website for offline browsing

This utility downloads only text and image files, but it can easily be extended to download files of any type. At the end of the article I'll provide tips on how you can extend the utility. First, a brief introduction to URLs (Uniform Resource Locators) would not be out of place. The general form of a URL is: An absolute URL -- such as -- has all the components required to identify the resource on the Web. The utility I describe in this article uses the URL class in the java.net package. Some of the commonly used protocols are HTTP, FTP, Gopher, and News. The main idea Suppose you visit a Webpage containing links to several other pages that, in turn, have links to still other pages. This utility lets you download all the pages of a Website in a graceful manner.

The utility consists of four classes: DownloadSite, Downloader, URLlist, and ExtendedURL. DownloadSite The DownloadSite main() HTTrack Website Copier - Offline Browser. Open Source Freeware : 400+ free applications and utilities : eC. Top 40 Free Downloadable Open Source Social Networking Software. This is Vivalogo's list of best free, downloadable, open source social networking software / scripts (kinda hard to say all these words :) ).

Top 40 Free Downloadable Open Source Social Networking Software

Unlike some other lists you may find on the net, this one contains only really downloadable and functional software.Note: listed in no particular order. SocialEngine SocialEngine is social networking software powered by PHP and Zend. The script lets you easily create your own social network or online community. Includes customizable groups, photo albums, messaging, member profiles, videos, news feeds, a drag-and-drop CMS, and more. iSocial is a free social networking script platform that allows you to create your own Friendster and Orkut like sites. Screen Capture Tools: 40+ Free Tools and Techniques. Screen capture, or print screen is perhaps the most efficient way to share whatever appears on your desktop. They help tech users like us to share and communicate better with friends and peers. Major operating systems today comes with basic screen capture and print screen function, but if these can’t fulfill what you need from a screen capture then you are probably looking for a screen capturing tool.

Screen capturing tools do what the basic tool don’t. What these tools could do varies, including the ability to include sketches and text, instantly upload image online, audio capturing, dimension-specific capturing and more. Open Source Windows. Open Source Windows The promise of open source software is best quality, flexibility and reliability.

Open Source Windows

This is the updated list of the best open source software. The only way to have TRUE "Open Source Windows" is to have all equivalent native Windows programs uninstalled and removed. [Contents] [hide] 1 Most Popular 2 Databases. Open Source Crawlers in Java - Heritrix. Open Source Crawlers in Java. HTML Screen Scraping Tools Written in Java.