171 Commits (bb0076c3ddfbf0062d38e2c67643c8afc9f2a262)

Author   SHA1        Message  Date
orbiter  61798f0ae6  added option to distinguish between text crawl and media crawl  18 years ago
orbiter  1377c53aa3  extraction of media links from search results  18 years ago
orbiter  109ed0a0bb  - cleaned up code; removed methods to write the old data structures  18 years ago
orbiter  bb7d4b5d5e  refactoring to prepare new RWI entry object  18 years ago
orbiter  b79e06615d  - added new LURL.Entry class for next database migration  18 years ago
orbiter  a5dd0d41af  - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format  18 years ago
theli    a2e3095044  *) Bugfix. Add missing plasmaParserDocument.close() calls  18 years ago
allo     4922ab8920  try to fix a nullpointer on snippet generation  18 years ago
theli    b6c7b91582  *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)  18 years ago
orbiter  3aac5b26da  - added automatic tag generation when a web page from the search results is added  18 years ago
orbiter  c543028dd4  fixed double/missing null check for LURLs  18 years ago
orbiter  9340dbb501  fixed all possible problems with nullpointer exception for LURLs  18 years ago
hermens  ff4362b02d  some more fixes for new plasmaCrawlLURL.load behavior  18 years ago
orbiter  4866868c0e  added write cache for LURLs  18 years ago
orbiter  3879a0ecd0  replaced java.net.URL usage by use of new class de.anomic.net.URL  19 years ago
allo     44d72f06c4  more Caching  19 years ago
allo     918445a2f4  Bugfix for last commit.  19 years ago
allo     c58789177f  bookmarkCache  19 years ago
allo     d7da273d7e  using ArrayList instead of Vector  19 years ago
allo     e3dd67bba0  bookmarks import.  19 years ago
allo     e6c2f700b1  public Tagview  19 years ago