Eine Sammlung an Links zu Projekten und Wissenschaftlichen Arbeiten rund um das Thema Suchmaschinen und Information Retrieval Frameworks.
-
-
Pepper: Peer-to-Peer-Architekturen für die föderierte Suche in komplexen digitalen Bibliotheken
Proof: A DHT-based Peer-to-Peer Search Engine
-
DSphere: A Source Centric Approach to Crawling, Indexing and Searching the World Wide Web
UbiCrawler: A Scalable Fully Distributed Web Crawler
Terrier: A High Performance and Scalable Information Retrieval Platform
Disadvantages: Terrier does not support incremental indexing
Xapian
an Open Source Search Engine Library, […] written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C# and Ruby (so far!)
MG4J (Managing Gigabytes for Java)
-
Textractor is a software framework. The framework is designed to facilitate the development of software tools that process text to extract information.
Textractor uses MG4J. |
Download
Carrot2: an Open Source Search Results Clustering Engine
-
-
-
-
A Comparison of Open Source Search Engines:
ASPSeek, BBDBot, Datapark, ebhath, Eureka, ht:Dig, Indri, ISearch, IXE, Lucene, Managing Gigabytes (MG), MG4J, mnoGoSearch, MPS Information Server, Namazu, Nutch, Omega, OmniFind IBM Yahoo! Ed., OpenFTS, PLWeb, SWISH-E, SWISH++, Terrier, WAIS/ freeWAIS, WebGlimpse, XML Query Engine, XMLSearch, Zebra, and Zettair.
-
-
-