Commit Graph

319 Commits (6c1b14c8e1e9013ad74a1a06e16d6881cc135e2b)

Author SHA1 Message Date
orbiter eeca2ded92 fix for http://forum.yacy-websuche.de/viewtopic.php?p=18500#p18500
15 years ago
orbiter a37878b7d5 url parser regex performance hack
15 years ago
orbiter 362b7a929b added extensive memory protection logic to avoid out of memory errors that may be caused by the RowCollection memory allocation function
15 years ago
orbiter 4a5100789f replaced _all_ size() == 0 with isEmpty() and all size() > 0 with !isEmpty(). The isEmpty() method is much faster in some cases, especially when used to access badly balanced hashtables where an size() operation becomes a large iteration.
15 years ago
orbiter 4df88a4e7a - fixes for missing or bad hashCode computation
15 years ago
orbiter 4431b9767e added about 450 replacements for printStackTrace() methods to pipe such traces into the log at DATA/LOG/
15 years ago
orbiter 5e8038ac4d - refactoring of blacklists
16 years ago
orbiter b79f4f062f refactoring of yacy documents and parsers: they depend now only on the kelondro classes
16 years ago
orbiter e7f18ba24b refactoring
16 years ago
orbiter ce8dc575ca refactoring
16 years ago
orbiter bea3b99aff moved table and util classes
16 years ago
orbiter 1e4f8b56ed accumulated classes from different packages into the new rwi package
16 years ago
orbiter 4446acc8cd moved kelondro order
16 years ago
orbiter 04a548a1e3 - temporary integrated the transferURL servlet as static class instead as a class that is called using reflection to investigate the OOM problems in that class
16 years ago
orbiter 61748285c3 more refactoring of search
16 years ago
orbiter 72e5407115 refactoring of snippet cache
16 years ago
orbiter 1d8d51075c refactoring:
16 years ago
orbiter ca72ed7526 -removed superfluous crawl cache
16 years ago
orbiter 0e8647d62f refactoring of search classes
16 years ago
orbiter dafffd0153 refactoring of parsers and document processing
16 years ago
orbiter 154bbc3364 code cleanup: call of static methods directly to the class
16 years ago
orbiter be1c7ddc64 refactoring of search classes -- moved Ranking Profile to search package
16 years ago
orbiter e0b3984805 added navigation keys for site and author facets to remote search interface
16 years ago
orbiter ab06a6edd2 renamed topwords to topics and enhanced computation methods of topics
16 years ago
orbiter a5d481eab1 enhanced navigation
16 years ago
orbiter 88426912ad more refactoring to make the segment object easier to use and to be prepared to integrate author navigation
16 years ago
orbiter 99bf0b8e41 refactoring of plasmaWordIndex:
16 years ago
orbiter dfb96ecb72 more fixes
16 years ago
f1ori 0b0a46d35a * fix transferRWI as suggested by celle (thanks!)
16 years ago
orbiter c8624903c6 full redesign of index access data model:
16 years ago
orbiter 89ec3acb3e - full abstraction of index content type: the kelondro full text index may now also contain indexes about other content than text, i.e. navigation indexes or reverse linking indexes.
16 years ago
orbiter 44e01afa5b - refactoring
16 years ago
orbiter c2359f20dd refactoring: better abstraction of reference and metadata prototypes.
16 years ago
orbiter 83792d9233 more refactoring
16 years ago
orbiter 209f25f5f5 refactoring to integrate indexCell data structures
16 years ago
orbiter 7f67238f8b refactoring of plasmaWordIndex: less methods in the class, separated the index to CachedIndexCollection
16 years ago
orbiter 14a1c33823 refactoring of wordIndex class
16 years ago
orbiter aa44d9bad9 more refactoring of kelondro.text / deleted de.anomic.index
16 years ago
orbiter 6ffc6e3389 more refactoring of indexer and kelondro classes;
16 years ago
orbiter 76ef5f0f14 refactoring of index package: better names for the classes (to be continued)
16 years ago
orbiter c12bb8a6d0 - refactoring of the http client
16 years ago
orbiter 94c42691d8 - reject less transmissions as transmission receiver
16 years ago
orbiter c25c334b75 replaced old DHT transmission method with new method. Many things have changed! some of them:
16 years ago
orbiter 65a1de6c05 longer timeout for remote crawl queries
16 years ago
orbiter 94110df85a moved logging partially to kelondro
16 years ago
orbiter 83ce65707a (almost) completed partition of classes in kelondro
16 years ago
orbiter 7ee494fde5 more refactoring of kelondro:
16 years ago
orbiter bf93767ec6 refactoring of kelondro database classes
16 years ago
orbiter fc27bf8c4c refactoring of kelondro classes:
16 years ago
orbiter dedfc7df7f removed distinction between DHT-in and DHT-out. This is necessary to make room for the new cell data structure, which cannot use this this distinction in the first place, but will enable the same meaning with different mechanisms (segments, later)
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
orbiter 2802138787 - refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck)
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
f1ori 7e1fe05e3c * added utf8-encoding to many getBytes-calls
16 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target - computing methods
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch
17 years ago
orbiter 00c1535f84 added ranking and evaluation of language type in a search
17 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls
17 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
17 years ago
danielr be28af50f5 - fixed "yacy2yacy no proxy"-problem
17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections.
17 years ago
orbiter 474659a71f - modified and enhanced the crawl balancer: better list export, fixing of damaged crawl queue at start-up, re-sorting at start-up to enhance domain order
17 years ago
hermens 75b4a5ced4 reinstate old timeout values for transferRWI and transferURL
17 years ago
danielr 7feae906aa - organize imports
17 years ago
orbiter e0e7f86f82 some bugfixes for the peer-ping process
17 years ago
danielr cbe722c480 small code cleanUp
17 years ago
orbiter b21598bdd0 - enhanced handling of own IP address inside seed
17 years ago
orbiter 40d7f485f3 - fixed several NPE bugs
17 years ago
orbiter 2f381b8d7a - fixed at least two causes for a NPE after a use case switch.
17 years ago
orbiter 11e00a0849 - refactoring of seedURL handling
17 years ago
orbiter 2ba7914f0b fix for NPE exception while fetching remote crawl jobs
17 years ago
orbiter cfe6790498 - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),
17 years ago
orbiter 1689030ee8 refactoring: moved all crawler classes into their own package
17 years ago
orbiter d2ba1fd2ab major step forward to network switching (target is easy switch to intranet or other networks .. and back)
17 years ago
danielr d7b21bc90c re-added gzip POST for transferRWI/URL (HTTP/1.1 compliant)
17 years ago
danielr d4bce6affd refactoring (initialized static fields, removed empty if/else, serialized some fields in serializable classes)
17 years ago
orbiter 724bbdf9b2 refactoring of RSS reader
17 years ago
orbiter 1995faef8d - refactoring of Colage back-end: move to plasma package
17 years ago
orbiter 5e3ce46339 - better logging when rejecting a url because it is not in declared domain
17 years ago
orbiter 8fe39ebd74 -fixed file transmission with POST. The only usage was in ranking transmission, therefore:
17 years ago
orbiter 202a3adb3e refactoring of HttpClient Writer processes
17 years ago
orbiter c3342e1178 - removed class with only one static method
17 years ago
danielr 7a35126e91 http timeouts von alten httpc wieder gesetzt
17 years ago
danielr d96e2badc7 - fixed POST in proxy
17 years ago
danielr ab330cfdca Network.html: removed ; from location
17 years ago
danielr 5c3c1fdf41 replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;)
17 years ago
orbiter 7f9f639d20 - refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
orbiter d6050b9ffb - separated the LURL data storage and Crawl result stack for process supervision.
17 years ago
orbiter 7150b463ff changed handling of default values and database paths:
17 years ago
orbiter 9c989fe5f7 fixed deadlock
17 years ago
orbiter a3dadcd89b preventing that peer which return a false search result are disconnected
17 years ago
orbiter ba622bb240 addendum to svn 4553
17 years ago
orbiter 4fdf695064 - fixed a bug in remote search that prevented that any results had been generated (!)
17 years ago
orbiter 3f321ece7d added a search history to the new search page
17 years ago
borg-0300 77ba446332 seedDB helpers update/cleanup
17 years ago