Commit Graph

72 Commits (5a00793b2f9c13e7ceb5bd267149a4115103f7d5)

Author SHA1 Message Date
karlchenofhell 41bc31d2c2 - ConfigAdvanced_p => XHTML (no invalid IDs)
18 years ago
orbiter 1d2d1854b9 added size of rwi and urls to WatchCrawler
18 years ago
orbiter 0a050bc043 enhanced ranking
18 years ago
orbiter 61798f0ae6 added option to distinguish between text crawl and media crawl
18 years ago
orbiter febe6b114a design update of crawler monitor
18 years ago
orbiter e4570bffaf -implemented a specialized snippet-fetch for media content
18 years ago
orbiter 1377c53aa3 extraction of media links from search results
18 years ago
orbiter fb9e0f0284 preparations for media snippets
18 years ago
orbiter 937ccd4e76 fix for snippet-generation
18 years ago
orbiter 9a85f5abc3 cleanup
18 years ago
orbiter 109ed0a0bb - cleaned up code; removed methods to write the old data structures
18 years ago
orbiter ceb9e3aa17 - enhanced parser: collection of audio, video, image and application links
18 years ago
orbiter b5a29e9651 - fix for snippets that are too short
18 years ago
orbiter 30888e7a2f implementation of search constraints
18 years ago
orbiter d34f10c63d some tests with reverse dns lookup
18 years ago
orbiter 497428c8ec refactoring
18 years ago
allo a75f895884 memory and traffic informations
18 years ago
allo 2ba56f70a8 XML-safe put.
18 years ago
allo a17c43779f removed wrong part of template
18 years ago
allo 27f9e0b1c6 xml interface for blacklists
18 years ago
allo 74f09a0510 some more xml-backend files.
18 years ago
allo e25172853a fixed license notice
18 years ago
allo 1d0c0edda3 first version of posts/get from the del.icio.us api
18 years ago
orbiter 5a40ea7866 refactoring of wget string list generation
18 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
18 years ago
orbiter b59d4576af increased version number to emphasise that the snippet fix
18 years ago
orbiter d4c239e4be - fixed problem in collection index with deletion of single url references
18 years ago
orbiter df1629b05a - code cleanup
18 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
18 years ago
orbiter 5015e780c2 - simplified watchCrawler code
18 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
18 years ago
theli 92e986bb91 *) adding missing return prop (requested by allo)
18 years ago
allo f0529fe53e update for ftp urls
18 years ago
theli 413e6b9855 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
18 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
18 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
18 years ago
orbiter 7df572756a fist step+attempt so solve the snippet marking problem.
18 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL
19 years ago
allo 933a9e02ab fix for broken build
19 years ago
allo 360056b30c fix ajax bug (no valid xml)
19 years ago
orbiter 90d569d70f refactoring of index management:
19 years ago
allo 44d72f06c4 more Caching
19 years ago
allo 1a13c8b78e right wordCachesize after orbiters commit.
19 years ago
allo 6b056610e3 updated watchcrawler for the recent changes
19 years ago
orbiter bcd99fe83e introduced a second RAM cache for DHT transfer
19 years ago
orbiter bae3783d38 added a snippet marking
19 years ago
allo fb5d8fdc59 removed encoding attribute
19 years ago
allo f1b91b1266 xml with right encoding
19 years ago
orbiter 3703f76866 - fixed re-search bug: after a search with several words, a second search could not
19 years ago
theli dc9174c809 *) Implementing snippet fetching via ajax
19 years ago