Commit Graph

45 Commits (444575e33d5b51ba2e61551b59aae6a43e3407a5)

Author SHA1 Message Date
orbiter 0edec2b760 FULL redesign of algorithms in htmlTools to encode/decode strings from/to unicode and html.
16 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions
16 years ago
lotus a81cb78211 finally some putHTML on htroot/xml/
16 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
16 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
orbiter 3ca98fee42 removed superfluous copyright statement
17 years ago
orbiter 474659a71f - modified and enhanced the crawl balancer: better list export, fixing of damaged crawl queue at start-up, re-sorting at start-up to enhance domain order
17 years ago
danielr 7feae906aa - organize imports
17 years ago
orbiter cfe6790498 - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),
17 years ago
orbiter 1689030ee8 refactoring: moved all crawler classes into their own package
17 years ago
orbiter d2ba1fd2ab major step forward to network switching (target is easy switch to intranet or other networks .. and back)
17 years ago
orbiter 968c775025 - preparation of parsing/indexing queue for concurrent execution
17 years ago
orbiter d6050b9ffb - separated the LURL data storage and Crawl result stack for process supervision.
17 years ago
orbiter 541b817502 refactoring of switchboard queueing
17 years ago
orbiter 433ff855f7 - fixed another concurrency problem in collection sorting
17 years ago
orbiter 9d693ee635 more generics
17 years ago
orbiter 89b9b2b02a redesigned remote crawl process:
17 years ago
orbiter 55c87b3b12 changed behavior of crawl stacker
17 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
17 years ago
fuchsi 0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects:
17 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation:
17 years ago
orbiter 511dcbb172 fixed encoding bug made in SVN 3993
18 years ago
orbiter 40b0547611 - documentaton changes (removed old forum links)
18 years ago
orbiter a4e8ad95ab enhancements to news and switchboard queue processing
18 years ago
karlchenofhell 601fc7d1c5 - added source to J7Zip-modifed.jar and it's license (changelog is still to come)
18 years ago
orbiter 861f41e67e redesigned NURL-handling:
18 years ago
karlchenofhell bf7a69197d - fix for possible NPE in queues_p
18 years ago
allo 0c81bd39d4 XSS-safe put as default.
18 years ago
karlchenofhell 41bc31d2c2 - ConfigAdvanced_p => XHTML (no invalid IDs)
18 years ago
orbiter 1d2d1854b9 added size of rwi and urls to WatchCrawler
18 years ago
orbiter 61798f0ae6 added option to distinguish between text crawl and media crawl
18 years ago
orbiter febe6b114a design update of crawler monitor
18 years ago
orbiter 109ed0a0bb - cleaned up code; removed methods to write the old data structures
18 years ago
orbiter df1629b05a - code cleanup
18 years ago
orbiter 5015e780c2 - simplified watchCrawler code
18 years ago
theli 413e6b9855 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
18 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
18 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
18 years ago
allo 933a9e02ab fix for broken build
19 years ago
allo 360056b30c fix ajax bug (no valid xml)
19 years ago
allo 3fd1641893 queuesizes in queues_p.xml
19 years ago
allo 26d7e8dd0d more escapes
19 years ago
allo 127396436f more queues in the xml backend
19 years ago