Open Source Software

JoBo is a simple program to download complete websites to your local computer. Internally it is basically a web spider. Its main advantage over other download tools is that it can automatically fill out forms (e.g. for automated login) and also use cookies for session handling. Compared to other products the GUI seems very simple, but it is the internal features that matter!
YaCy Distributed Web Search: YaCy is a free search engine that anyone can use to build a search portal for their intranet or to help search the public internet. When contributing to the world-wide peer network, the scale of YaCy is limited only by the number of users in the world, and it can index billions of web pages. It is fully decentralized: all users of the search engine network are equal, the network does not store user search requests, and it is not possible for anyone to censor the content of the shared index.



Writing a Web Crawler in the Java Programming Language

How to write a multi-threaded webcrawler in Java: Here you can learn how to write a multithreaded Java application, learn how to write a webcrawler, and, by the way, learn how to write stuff that is object-oriented and reusable, or use the provided webcrawler more or less off-the-shelf. "More or less" in this case means that you have to be able to make minor adjustments to the Java source code yourself and compile it.
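
As a rough illustration of the multithreaded crawling idea described above, here is a minimal sketch. It is not the code from the linked tutorial. The class name `CrawlSketch`, the `crawl` method, and the `linkSource` function (which stands in for "download a page and return its links") are all hypothetical names chosen for this example. The sketch uses a fixed thread pool, a concurrent set of visited URLs, and a `Phaser` to detect when no crawl tasks remain.

```java
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Phaser;
import java.util.function.Function;

// Hypothetical sketch: a breadth-agnostic crawl over a link graph using a thread pool.
class CrawlSketch {

    // linkSource is an assumption: it maps a URL to the links found on that page.
    static Set<String> crawl(String seed,
                             Function<String, List<String>> linkSource,
                             int threads) {
        Set<String> visited = ConcurrentHashMap.newKeySet();
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        Phaser pending = new Phaser(1); // one party for the calling thread
        try {
            submit(seed, linkSource, visited, pool, pending);
            // Blocks until every registered crawl task has finished.
            pending.arriveAndAwaitAdvance();
        } finally {
            pool.shutdown();
        }
        return visited;
    }

    private static void submit(String url,
                               Function<String, List<String>> linkSource,
                               Set<String> visited,
                               ExecutorService pool,
                               Phaser pending) {
        // add() returns false if another thread already claimed this URL.
        if (!visited.add(url)) {
            return;
        }
        pending.register(); // one party per outstanding task
        pool.execute(() -> {
            try {
                for (String link : linkSource.apply(url)) {
                    submit(link, linkSource, visited, pool, pending);
                }
            } finally {
                pending.arriveAndDeregister();
            }
        });
    }

    public static void main(String[] args) {
        // An in-memory link graph replaces real HTTP requests in this demo.
        Map<String, List<String>> graph = Map.of(
                "a", List.of("b", "c"),
                "b", List.of("a", "c"),
                "c", List.of("d"),
                "d", List.of());
        Set<String> seen = crawl("a", u -> graph.getOrDefault(u, List.of()), 4);
        System.out.println(seen.size() + " pages visited");
    }
}
```

Passing the link lookup in as a function keeps the threading logic separate from networking, so the same skeleton could be tried with a real page fetcher or with a test graph as in `main`.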
BotSpot 2005 ®: the spot for all bots
WebSPHINX: A Personal, Customizable Web Crawler. WebSPHINX (Website-Specific Processors for HTML INformation eXtraction) is a Java class library and interactive development environment for web crawlers. A web crawler (also called a robot or spider) is a program that browses and processes Web pages automatically. WebSPHINX consists of two parts: the Crawler Workbench and the WebSPHINX class library.
Java tip: How to get a web page (Technologies: Java 5+). The starting point for building a link checker, web spider, or web page analyzer is, of course, to get the web page from the web server. Java's java.net package includes classes to manage URLs and to open web server connections. This tip shows how to use them to get a text, image, audio, or data file from a web server.
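
The following is a minimal sketch of that idea, not the code from the linked tip. It assumes a Java 7+ runtime (for try-with-resources) and uses only `java.net.URL` and `java.net.URLConnection`. The class name `PageFetchSketch`, the method names, and the 10-second timeouts are assumptions made for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.net.URL;
import java.net.URLConnection;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch: read the raw bytes behind a URL with java.net classes.
class PageFetchSketch {

    // Returns the full response body as bytes (suitable for text, image, or audio data).
    static byte[] fetchBytes(URL url) throws IOException {
        URLConnection connection = url.openConnection();
        connection.setConnectTimeout(10_000); // assumed timeout, in milliseconds
        connection.setReadTimeout(10_000);    // assumed timeout, in milliseconds
        try (InputStream in = connection.getInputStream();
             ByteArrayOutputStream out = new ByteArrayOutputStream()) {
            byte[] buffer = new byte[8192];
            int read;
            while ((read = in.read(buffer)) != -1) {
                out.write(buffer, 0, read);
            }
            return out.toByteArray();
        }
    }

    // Decodes the body with a caller-supplied charset; the charset is not detected here.
    static String fetchText(URL url, Charset charset) throws IOException {
        return new String(fetchBytes(url), charset);
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PageFetchSketch <url>");
            return;
        }
        URL url = URI.create(args[0]).toURL();
        System.out.println(fetchText(url, StandardCharsets.UTF_8));
    }
}
```

A real link checker would also want to inspect the response's content type and declared character encoding rather than assuming UTF-8 as `main` does here.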
Capturing Screen in Java, Capture Screen Shot, How to Capture Screen Using Java Swing
HTML Parser: Welcome to the homepage of HTMLParser, a super-fast real-time parser for real-world HTML. What has attracted most developers to HTMLParser has been its simplicity of design, its speed, and its ability to handle streaming real-world HTML. The two fundamental use-cases handled by the parser are extraction and transformation (the synthesis use-case, where HTML pages are created from scratch, is better handled by other tools closer to the source of data). While prior versions concentrated on data extraction from web pages, Version 1.4 of the HTMLParser has substantial improvements in the area of transforming web pages, with simplified tag creation and editing, and verbatim toHtml() method output. In general, to use the HTMLParser you will need to be able to write code in the Java programming language.
Download a Website for offline browsing: In this article, I guide you through the steps involved in designing a utility to download a Website. This utility downloads only text and image files, but it can easily be extended to download files of any type. At the end of the article I'll provide tips on how you can extend the utility. First, a brief introduction to URLs (Uniform Resource Locators) would not be out of place. The general form of a URL is:
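
As a small illustration of the parts a URL is made of, the sketch below uses the standard `java.net.URI` class to split a URL into its scheme, host, port, path, query, and fragment. It is not taken from the linked article. The class name `UrlPartsSketch`, the `describe` method, and the example address are assumptions made for this example.

```java
import java.net.URI;

// Hypothetical sketch: list the components that java.net.URI reports for a URL.
class UrlPartsSketch {

    // Returns one line naming each component; getPort() is -1 when no port is given.
    static String describe(String url) {
        URI uri = URI.create(url);
        return "scheme=" + uri.getScheme()
                + " host=" + uri.getHost()
                + " port=" + uri.getPort()
                + " path=" + uri.getPath()
                + " query=" + uri.getQuery()
                + " fragment=" + uri.getFragment();
    }

    public static void main(String[] args) {
        // The address below is a made-up example, not a real resource.
        System.out.println(describe("http://www.example.com:8080/docs/index.html?lang=en#top"));
        // prints: scheme=http host=www.example.com port=8080 path=/docs/index.html query=lang=en fragment=top
    }
}
```

A download utility could use the path component to decide where a fetched file should be stored on disk.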

HTTrack Website Copier - Offline Browser

Version 3.48-3 (04/11/2014): Engine fixes (keep-alive, redirects, new hashtables, unit tests). HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.

Open Source Freeware : 400+ free applications and utilities : eC

Extremely useful open source applications and utilities, available free under various licenses. Free (but NOT open-source) software is listed separately: I want a Freeware Utility to ... 450+ common problems solved. Also: I want Wordpress Plugin to ... 450+ solutions to blogging problems. Anti-Spyware/Anti-Virus/Anti-Rootkit Freeware Utilities: I want to ...
This is Vivalogo's list of the best free, downloadable, open source social networking software / scripts (kinda hard to say all these words :) ). Unlike some other lists you may find on the net, this one contains only software that is really downloadable and functional. Note: listed in no particular order. SocialEngine is social networking software powered by PHP and Zend. The script lets you easily create your own social network or online community.

Top 40 Free Downloadable Open Source Social Networking Software

Screen Capture Tools: 40+ Free Tools and Techniques. Screen capture, or print screen, is perhaps the most efficient way to share whatever appears on your desktop. It helps tech users like us share and communicate better with friends and peers. Major operating systems today come with basic screen capture and print screen functions, but if these can’t fulfill what you need from a screen capture, then you are probably looking for a screen capturing tool. Screen capturing tools do what the basic tools don’t. What these tools can do varies, and includes the ability to add sketches and text, instantly upload images online, capture audio, capture specific dimensions, and more.
Open Source Windows: The promise of open source software is the best quality, flexibility and reliability. This is the updated list of the best open source software. The only way to have TRUE "Open Source Windows" is to have all equivalent native Windows programs uninstalled and removed.
Open Source Crawlers in Java - Heritrix
Open Source Crawlers in Java
HTML Screen Scraping Tools Written in Java