Commit Graph

4744 Commits (7eece0256f42156053c4ccd744919ec3cabc2d2d)

Author SHA1 Message Date
Michael Peter Christen 5b3acc12cd Pattern.quote() replaces \\Q and \\E according to publication in
13 years ago
Michael Peter Christen e7e381d110 added configuration to switch off redirection following in crawler
13 years ago
Michael Peter Christen 70505107ca enhanced crawler/balancer: better remaining waiting-time guessing
13 years ago
Michael Peter Christen f150bc218b fixed bug in solr error document
13 years ago
Roland 'Quix0r' Haeder a093ccf5eb Now used synchronization in all close() methods to make sure all objects
13 years ago
Michael Peter Christen ba6aaabc51 refactoring + parser bugfixes
13 years ago
Michael Peter Christen 659178942f - Redesigned crawler and parser to accept embedded links from the NOLOAD
13 years ago
Michael Peter Christen f5efdb21fd refactoring
13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not
13 years ago
Michael Peter Christen a1a5b015d8 refactoring: moved document Classification to cora package
13 years ago
Michael Peter Christen a5d7da68a0 refactoring: removed dependency from switchboard in Balancer/CrawlQueues
13 years ago
Michael Peter Christen 33d1062c79 refactoring: the cache belongs to the crawler
13 years ago
Michael Peter Christen 046f3a7e8d check if httpc has decompressed the release file and rename the file
13 years ago
Michael Christen 22f05c83ff fixed default must-match filter for full domain crawls - the old filter
13 years ago
Michael Peter Christen 0cc0290978 bugfix for a must-not-match pattern check. This bug did not make the
13 years ago
Michael Peter Christen 2fc8ecee36 ConcurrentLinkedQueue has a VERY long return time on the .size() method.
13 years ago
Michael Peter Christen 8aba045ba1 if a new pop-up page is set in config portal, then this page applies
13 years ago
Michael Peter Christen c6c61be3f0 fix for http://bugs.yacy.net/view.php?id=148
13 years ago
Michael Peter Christen 0d148c3353 more logging in resource observer
13 years ago
Michael Peter Christen 2fa037ae1d enhanced crawler
13 years ago
low012 2120db289a *) Small change which should solve problem with cgitb module in Python CGI scripts.
13 years ago
Lotus ee89cf5ae5 fix must match filter for full domain crawl
13 years ago
Michael Peter Christen 9ad1d8dde2 complete redesign of crawl queue monitoring: do not look at a
13 years ago
Michael Peter Christen 4540174fe0 memory hacks
13 years ago
Michael Peter Christen 9ebcae2fbc enhanced url parser to understand urls with & instead of & in post
13 years ago
Michael Peter Christen 1f4f60654a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen e6d26a023f fix for bookmark crash with possible side-effects on crawl start after
13 years ago
Michael Peter Christen 190b77c55e added Ukrainian translation
13 years ago
Marek Otahal 72adbeae90 !Important: move from Hashtable to HashMap
13 years ago
Marek Otahal c1af123ddd just a little faster toString
13 years ago
Marek Otahal 64e4bcee82 serverSwitch get(App/Data)Path() use common helper method
13 years ago
Marek Otahal 371fbb4deb just comment + shorter code in serverSwitch
13 years ago
Marek Otahal ed253b7aff update javadoc, does not throw IOException
13 years ago
Michael Peter Christen 2ee8cbeb2c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 992dbdf4bb added noload statistic to servlets
13 years ago
Michael Christen 354b976110 fix for concurrency problem and endless loop in /suggest.json
13 years ago
Michael Christen c21966bb43 fix
13 years ago
Michael Christen 13b05f9c08 fix
13 years ago
Michael Christen e5d878c59e Merge branch 'master' of ssh://gitorious.org/yacy/rc1
13 years ago
Michael Christen ec26b2bea4 Merge commit 'fa08ed5ae5d72bddc3cc6a662b23103579e86109' into quix0r
13 years ago
Michael Christen 216a287a85 Merge commit '6d4e08ed06c5cd28c45981b2ebe31c7f7ec6fd83' into quix0r
13 years ago
stbrumm d18095dc48 Patch fuer Issue 0000102
13 years ago
Roland 'Quix0r' Haeder 901f37d608 Also this ... :( #2
13 years ago
Roland 'Quix0r' Haeder a985717ed2 Also this ... :(
13 years ago
Roland 'Quix0r' Haeder 5f490de554 Fix for ported fix from my old days ...
13 years ago
Roland 'Quix0r' Haeder fa08ed5ae5 Fixed a lot CHMOD rights (no need for execute flag on *.java/*.html) and introduced local/remote crawl size ratio based check
13 years ago
Michael Christen 9e5894c784 Removed handling of components objects for URIMetadataRows.
13 years ago
Michael Christen c04bfaa51b refactoring
13 years ago
Michael Christen 17f962fceb translator updates:
13 years ago
Michael Christen 752b092b8a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago