Displaying items 6 - 10 of 38 matching your query
[ source ], ordered by score:
-
The Web Robots Pages
Web Robots are programs that traverse the Web automatically. Some people call
them Web Wanderers, Crawlers, or Spiders.
This in the main source for information on the robots.txt Robots Exclusion
Standard and other articles about writing well-behaved Web robots.
Added: Tue Jul 02 2002; URL: http://www.robotstxt.org/;
id = 15
-
The Great Computer Language Shootout
A benchmark comparison of a (rather large) number of programming languages,
by Doug Bagley. Also interesting as a source of code samples illustrating
implementation of simple & typical programming tasks, in languages ranging
from AWK through Pike to TCL (31 in all).
*DEFUNCT*
Added: Wed Jun 26 2002; URL: http://www.bagley.org/~doug/shootout/;
id = 7
-
H2 Database Engine
Very fast database engine ;
Free, with source code ;
Written in Java ;
Supports standard SQL, JDBC API ;
Embedded and Server mode, Clustering support ;
Strong security features ;
Experimental native version (GCJ) and ODBC drivers;
Added: Mon Jun 18 2007; URL: http://www.h2database.com/;
id = 81
-
Orange - Data Mining Fruitful & Fun
Open source data visualization and analysis for novice and experts. Data mining through visual programming or Python scripting. Extensions for bioinformatics and text mining. Comprehensive, flexible and fast.
Added: Wed Dec 09 2009; URL: http://www.ailab.si/orange/;
id = 196
-
The Lucene search engine: Powerful, flexible, and free
Lucene is a Java-based open source toolkit for text indexing and searching.
Though this article is a little old, it's notable for an easily-digestible
and instructive high-level overview of Lucene's implementation.
Currently, Lucene is part of the Apache-Jakarta project.
Added: Sun Dec 01 2002; URL: http://www.javaworld.com/javaworld/jw-09-2000/jw-0915-lucene.html;
id = 35