Commit Graph

7365 Commits (8fd4e8ea98150a18b19019c0d2d9e55f090edfc6)
 

Author SHA1 Message Date
orbiter 4c013d9088 more UTF8 getBytes() performance hacks
14 years ago
apfelmaennchen 78d6d6ca06 refactoring for ymarks
14 years ago
f1ori 399d7d6878 * fix permissions of bin/-folder in debian package
14 years ago
cominch 9ac02caf00 different initialization of empty variables in alternative constructor. This leads to wrong interpretation of user credentials, resulting in unnecessary "@" in front of host, and different urlhash values.
14 years ago
orbiter a47bdc405b better logging for robinson selection according to peer tag
14 years ago
orbiter cafcb1f9ed removed the DNS resolving for web structure computation from the indexing queue and placed it in a concurrent computation queue that does not block the crawler. Makes crawling faster and less DNS-speed-dependent
14 years ago
orbiter 57ce1fb491 reverted synchronization from SVN 7641
14 years ago
orbiter 17530ca7b5 fix for bug http://bugs.yacy.net/view.php?id=10
14 years ago
orbiter 7c8e764201 removed synchronization again...
14 years ago
orbiter 96c32e87b0 fixes to crawler and new user-agent crawl-delay handling
14 years ago
orbiter b2fe4b7b1a added a handling of appearances of yacy bot entries in robots.txt if this entry addresses the yacy peer
14 years ago
f1ori 21fe5e6c6a * add bin-folder to debian package
14 years ago
low012 e25c1f2ea3 *) preventing whitespace keys in config file
14 years ago
orbiter cb6f709a16 - enhancements in surrogate reading
14 years ago
low012 1ff9947f91 *) added new user right: extended search right (allows to define users who can query more results than anonymous users)
14 years ago
orbiter 564184909a enhanced the surrogate parser: better reading of UTF-8 characters
14 years ago
orbiter 156cf02703 - added an index constraint 'has location' to the condenser
14 years ago
orbiter 41b8d7f655 fix for url normalization (no backpath resolving in post parameters)
14 years ago
orbiter 0430a94eaa the location search shows now not re-evaluated locations but only such locations that are attached as metadata to web pages
14 years ago
orbiter 8412f8787d fix for http://bugs.yacy.net/view.php?id=8
14 years ago
orbiter 9b25d07295 - added geo information parsing to html parser
14 years ago
f1ori efcf37a953 * show info in log, if robots.txt is rejected due to wrong mime-type
14 years ago
lotus cbf87fe72f write PID to yacy.running
14 years ago
lotus 06afa94f9d hups
14 years ago
lotus a9a9db98c8 better rename modified version
14 years ago
lotus e19ca27004 do not autocomplete on mouseover. this has resulted in unwanted autocomplete.
14 years ago
low012 16cd919795 *) fixed Exceptions which caused 500 error when entering invalid URL mask or invalid prefer mask, invalid masks are ignored, error message is displayed on yacysearch.html (what about yacysearch.rss and yacysearch.json?)
14 years ago
low012 1a24917cea *) fixed NPE which occured when empty String was entered as search word
14 years ago
orbiter 01b968d836 better concurrency in ViewImage icon cache and OOM protection for too large icon caches
14 years ago
orbiter b1a8d0c020 enhancements to web cache and less strict caching rules
14 years ago
orbiter f3baaca920 - enhancements to DNS IP caching and crawler speed
14 years ago
low012 e7860b1239 *) <mode="Homer">D'oh!</Homer>
14 years ago
low012 82f1580a60 *) trying to fix ConcurrentModificationException
14 years ago
f1ori df71776929 * fix bug #7
14 years ago
low012 9f0286b380 *) fixed potential "java.lang.IllegalArgumentException: Illegal group reference" which occured if special characters which are also used as metacharacters in regular expression were used inside of <pre>...</pre> (see: http://veerasundar.com/blog/2010/01/java-lang-illegalargumentexception-illegal-group-reference-in-string-replaceall/)
14 years ago
orbiter 78d4c45d09 enhancement during search process: fast fail of search in case that all index feeder have terminated.
14 years ago
orbiter ba03ca8620 added more configuration options for search:
14 years ago
f1ori e0c7d490f9 * fix bug #6
14 years ago
orbiter 18ec7fe53c added a clearall.sh script that deletes the complete index and everything else that belongs to crawling
14 years ago
orbiter d98884f1d5 added script for importmediawiki.sh in build.xml
14 years ago
orbiter a50f28e6e7 - fixed missing save operation for peer name change
14 years ago
orbiter 2b5f8585bf performance hack for Balancer and ip address parsing
14 years ago
orbiter 43e1660512 fix/enhancement in Crawler: do not generate domain match pattern if crawl depth is 0
14 years ago
orbiter b1d133b69f another anhancement to the ThreadDump function: better multiple dumps and filtering out of not interesting dump parts
14 years ago
orbiter a35d513bd8 fix for not-deleted .gap and .idx files
14 years ago
orbiter a6935e7dc8 fix for active dns resolving: do not resolve in case that the dns server is not available (offline mode)
14 years ago
orbiter 859c99886c fix for multiple thread dump
14 years ago
orbiter 61acf55da4 avoided using a synchronized(this) for the hash computation to prevent that the lock on the object is (accidently) stolen by another thread and replaced this synchronization using the protocol object. Made also the protocol object final.
14 years ago
orbiter c2a968c23f fix for bug in formatting in ThreadDump
14 years ago
low012 2861d0888a *) simplified code\n*) fixed potential NumberFormatExceptions
14 years ago