Commit Graph

5503 Commits (96c8119b500f2ffc90ebbb1da4001ea605c149dc)

Author SHA1 Message Date
Michael Peter Christen f294f2e295 bugfix to http://bugs.yacy.net/view.php?id=181 13 years ago
Michael Peter Christen acf8d521a2 fix for http://bugs.yacy.net/view.php?id=126 13 years ago
Michael Peter Christen bb88878b4d the last commit was incomplete.. 13 years ago
Michael Peter Christen d320a31ae1 bugfix for http://bugs.yacy.net/view.php?id=186 13 years ago
Michael Peter Christen fa735f4f04 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 3e1bc9477f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 6f8a2fef1f small speed enhancement using a column factory 13 years ago
Roland 'Quix0r' Haeder d10627d591 More sync in close() methods 13 years ago
Roland 'Quix0r' Haeder b3ae2aa41f With or without 'final'? At least please try it in other methods 13 years ago
Roland 'Quix0r' Haeder fbb946f913 Made a method static (Eclipse suggested it), removed unused import, pk=null check does now output a warning in logfile 13 years ago
Michael Peter Christen 52d307c735 prevent that the snippet fetch process removes catchall entries 13 years ago
Michael Peter Christen 7eece0256f moved yacy.logging to defaults according to request in 13 years ago
Michael Peter Christen 5b3acc12cd Pattern.quote() replaces \\Q and \\E according to publication in 13 years ago
Michael Peter Christen 89142d1e8d removed (not all) warnings 13 years ago
Michael Peter Christen 5deebd02ea added serialization 13 years ago
reger b2175ea4ef Add possibility to set custom Solr field names for the YaCy default Solr attributes. 13 years ago
Michael Peter Christen 15db703808 added missing serialization to remove all warnings 13 years ago
Michael Peter Christen 1795a7325b made HandleSet serializable 13 years ago
Michael Peter Christen e7e381d110 added configuration to switch off redirection following in crawler 13 years ago
Michael Peter Christen 2717c1b749 fixed bug in solr interface 13 years ago
Michael Peter Christen 70505107ca enhanced crawler/balancer: better remaining waiting-time guessing 13 years ago
Michael Peter Christen f150bc218b fixed bug in solr error document 13 years ago
Michael Peter Christen cb54c1737b solrj connector bugfix 13 years ago
Roland 'Quix0r' Haeder a093ccf5eb Now used synchronization in all close() methods to make sure all objects 13 years ago
Michael Peter Christen 49cab2b85f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 0d58fea210 made multiple connector default 13 years ago
Michael Peter Christen 7740c02c56 - enhanced the solr connector 13 years ago
Michael Peter Christen 0cf3d36eae more tolerance in case of corrupted file 13 years ago
Michael Peter Christen acc6db28ff added missing classes for solr interface 13 years ago
Michael Peter Christen adeb33bb36 better abstraction for solr objects 13 years ago
Michael Peter Christen 8864141872 more abstraction in solr connection classes 13 years ago
Michael Peter Christen c00efc2717 made the solr connection more generic 13 years ago
Michael Peter Christen ea2bd43b28 patch for broken configurations 13 years ago
Michael Peter Christen e5ca7f22b1 enhancement in circle drawing 13 years ago
Michael Peter Christen 34f4225d7e less 'wellformed' calls without asserts 13 years ago
Marc Nause a691023d04 *) better formatting for network QPM 13 years ago
Michael Peter Christen 77f8e9fb9b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen ba6aaabc51 refactoring + parser bugfixes 13 years ago
Michael Peter Christen 2a0434efa4 Merge commit 'c1f6b4fb5226d3d2f8b2bec9e361f6b3476e03ff' 13 years ago
Michael Peter Christen 942896fe46 removed methods not supported by new solrj connector for httpclient 4 13 years ago
Michael Peter Christen 22e1f68c0b solrj user authentication patch 13 years ago
Michael Peter Christen 09484955dc added new entry class for embed tags 13 years ago
Michael Peter Christen 62f2554a01 - fixed build problems (deprecated methods using httpclient 3.1) 13 years ago
Michael Peter Christen a6d60fc21f concurrency enhancement in ConfigurationSet 13 years ago
Michael Peter Christen 453010bd68 - solved problems with backpath normalization 13 years ago
Michael Peter Christen 5f5ed33ed8 patch for media search (audio, video apps) 13 years ago
Michael Peter Christen 7860c1df80 fix needed for new solrj library 13 years ago
Michael Peter Christen 0e13022147 - enhanced solr field documentation 13 years ago
Michael Peter Christen 19efbf1b0f - apply directDocByURL to NOLOAD Queue 13 years ago
Michael Peter Christen 659178942f - Redesigned crawler and parser to accept embedded links from the NOLOAD 13 years ago
Michael Peter Christen a3badd3205 changed search process for images: no more media snippet load process, 13 years ago
Michael Peter Christen f5efdb21fd refactoring 13 years ago
reger c1f6b4fb52 lookupByIP: prevent comparing of port parameter if called with port -1 (=unknown) 13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not 13 years ago
Michael Peter Christen 14f67f217c refactoring of ContentDomain: now subclass of Classification 13 years ago
Michael Peter Christen 8a08c96a82 removed dependency from logging 13 years ago
Michael Peter Christen a1a5b015d8 refactoring: moved document Classification to cora package 13 years ago
Michael Peter Christen a5d7da68a0 refactoring: removed dependency from switchboard in Balancer/CrawlQueues 13 years ago
Michael Peter Christen 33d1062c79 refactoring: the cache belongs to the crawler 13 years ago
Michael Peter Christen 4d5da75814 fix for parser problem if a <a>-tag is 'within' html tags with unclosed 13 years ago
Michael Peter Christen 91a86f0b06 fixed to network graph testing 13 years ago
Michael Peter Christen 7b5b9baee0 added citation rank to ranking profile 13 years ago
Michael Peter Christen 046f3a7e8d check if httpc has decompressed the release file and rename the file 13 years ago
Michael Christen 02e4dedff2 fix to url citation collection 13 years ago
Michael Christen e32055aa15 added stub classes for 13 years ago
Michael Christen ac5d124ee0 experimental implementation of a citation ranking as post-ranking 13 years ago
Michael Christen 8fc86fe397 added storage of full anchor link structure: 13 years ago
Michael Christen 22f05c83ff fixed default must-match filter for full domain crawls - the old filter 13 years ago
Lotus 0b3f39136e allow custom ppm lower than minimum button on /Crawler_p.html 13 years ago
Michael Peter Christen 532c7cf827 added physics experiment to the graph plotter. not active by default 13 years ago
Michael Peter Christen aba9b1bfa0 better names for elements of a linked graph 13 years ago
Michael Peter Christen 0cc0290978 bugfix for a must-not-match pattern check. This bug did not make the 13 years ago
Michael Peter Christen 2fc8ecee36 ConcurrentLinkedQueue has a VERY long return time on the .size() method. 13 years ago
Michael Peter Christen 8aba045ba1 if a new pop-up page is set in config portal, then this page applies 13 years ago
Michael Peter Christen 8c06925984 animation of the web structure picture 13 years ago
Michael Peter Christen 898fa7c3f3 use tld heuristic to check if a domain is local or global 13 years ago
Michael Peter Christen 213c8d97f2 use less processes in process pool 13 years ago
Michael Peter Christen c639248c23 protection against strange answers from remote peers during search 13 years ago
Michael Peter Christen 36e4d82b27 changed ranking 13 years ago
Michael Peter Christen 096c17e7cd added test code 13 years ago
Michael Peter Christen 665626a51b catch OOM errors during scanning 13 years ago
Michael Peter Christen 1cd711d005 added classes for citation references (for new citation ranking) 13 years ago
Michael Peter Christen 33a405dab8 ipv6 bugfix 13 years ago
Michael Peter Christen c6c61be3f0 fix for http://bugs.yacy.net/view.php?id=148 13 years ago
Michael Peter Christen e0f1e7d904 added new citation reference data structure that shall be used for a 13 years ago
Michael Peter Christen e18a4f6b74 more tolerant merge iterator 13 years ago
Michael Peter Christen 0d148c3353 more logging in resource observer 13 years ago
Michael Peter Christen 2fa037ae1d enhanced crawler 13 years ago
Michael Peter Christen e101c2e0e2 added changes from copperdust (submitted by email): 13 years ago
low012 2120db289a *) Small change which should solve problem with cgitb module in Python CGI scripts. 13 years ago
Lotus ee89cf5ae5 fix must match filter for full domain crawl 13 years ago
Michael Peter Christen 8d63a5887c bugfixes 13 years ago
Michael Peter Christen 9ad1d8dde2 complete redesign of crawl queue monitoring: do not look at a 13 years ago
Michael Peter Christen 7e4e3fe5b6 free some memory after parsing html 13 years ago
Michael Peter Christen 4540174fe0 memory hacks 13 years ago
Michael Peter Christen b4409cc803 small redesign of blob column index and usage 13 years ago
Michael Peter Christen d5c1f2746e performance hack 13 years ago
Michael Peter Christen 803963aebd performance hack: better space grow in CharBuffer (speeds up html 13 years ago
Michael Peter Christen 8b0920b0b5 tried to fix the ipv6 problem as reported in bug 13 years ago
Michael Peter Christen e2f8f263e8 changed storage of search words: keep order 13 years ago