Commit Graph

340 Commits (d27a0a67ff6c6af894d7cf96473a3eac2f216924)

Author SHA1 Message Date
orbiter 123375bfba added a new yacy protocol servlet 'idx'. This returns an index to one of the data entities that is stored in YaCy.
14 years ago
orbiter deda54d684 - relaxed matching of string-search (this is now case-insensitive)
14 years ago
orbiter 3b578a28ef some patches to prevent that empty or bad IP information is broadcasted
14 years ago
orbiter 8b95a26866 better magic
14 years ago
orbiter 2700a58e5a added a magic to the peer ping that will be used in case that the contacting peer requests that it's reported IP shall be used for a back-ping. The back-ping now also returns the same magic which will make it possible that the requested peer can verify that the back-pinged peer is actually the same peer.
14 years ago
orbiter d8e934c085 better abstraction of http client identification
14 years ago
sixcooler a3e707283d not using HTTPConnector anymore
14 years ago
orbiter 19fd13d3bc Added federated index storage to solr.
14 years ago
orbiter 4c013d9088 more UTF8 getBytes() performance hacks
14 years ago
orbiter cb6f709a16 - enhancements in surrogate reading
14 years ago
orbiter 78d4c45d09 enhancement during search process: fast fail of search in case that all index feeder have terminated.
14 years ago
orbiter cb1f49d0f2 replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'.
14 years ago
orbiter 799c534935 one more patch again OOM during secondary remote search
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
orbiter 38dce547c0 better concurrency (less locking on date formatting) more logging and minor bug fixes
14 years ago
orbiter dec24244cf added convenience class to generate UTF StringBody objects with a default UTF8 charset.
14 years ago
orbiter 57e6728cb7 - removed usage of /etc/alternatives/www-browser because of problems with lynx, see:
14 years ago
orbiter 5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a muti-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased.
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 28f669bf0b - fixed/enhanced move to SD/16:9 images (network, web structure)
14 years ago
orbiter 6c1b14c8e1 - more control in access tracker: count number of returned search results (not only info how much is in the index)
14 years ago
orbiter 54e77e6255 refactoring
14 years ago
orbiter 10ae8d961b - cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring)
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
low012 9b3fae9496 *) cleaning up the code a little bit
14 years ago
orbiter 25a8e55bc9 more logging about bad seeds
14 years ago
orbiter 959b8c6fa0 - allow greater seed size
14 years ago
orbiter e103419a56 - removed <3 peers barrier for peer ping feedback
14 years ago
orbiter d607b30b6a performance enhancements for search and code review for database functions
15 years ago
sixcooler 61c82f3105 gzip-compresson @ transferRWI & transferURL back again
15 years ago
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed
15 years ago
orbiter 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes
15 years ago
orbiter 84a023cbc8 fixed several search bugs
15 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
15 years ago
orbiter 0cf006865e refactoring and enhanced concurrency
15 years ago
orbiter 5870b13f3a - code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart
15 years ago
orbiter 7ebef56add - redesign of a part of the remote search client to make it possible to have a test environment for remote search performance tests
15 years ago
orbiter 64860dc1bb enhanced search event logging (to be used for further improvements)
15 years ago
orbiter 570ca577c6 performance hacks
15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
15 years ago
orbiter 844f158686 - removed dependencies in header framework:
15 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
15 years ago
orbiter 90531f78ff refactoring of the cora package to get subpackages for http and ftp (smb to come)
15 years ago
sixcooler 661867923a ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler b7102eff92 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 22dbbcfa56 better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures.
15 years ago
sixcooler 0e56d29335 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler e1316d12d0 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler c5c67f0504 start migrating to HttpComponents-Client-4.x
15 years ago