Commit Graph

4648 Commits (4d7ae76017d6faf8f2f467bd5b2658b4786c5055)

Author SHA1 Message Date
orbiter 1b86d06d1e fix for http://bugs.yacy.net/view.php?id=62
13 years ago
orbiter 9e4875230f performance hacks
13 years ago
orbiter a9838f8b99 fix for http://bugs.yacy.net/view.php?id=59
13 years ago
orbiter a7df70221e refactoring
13 years ago
orbiter cf4fd525ee added directDocByURL attribute in crawl profile
13 years ago
orbiter c61e4cfd78 - fix for incomplete clear() in balancer
13 years ago
orbiter 813f297a95 another performance hack: re-use of known host addresses for isLocal property; avoids look-up in local hash
13 years ago
orbiter 035ebfbf3b - performance hacks (should affect the crawl balancer and reduce CPU load during crawl stack re-fill)
13 years ago
orbiter b250e6466d implemented crawl restrictions for IP pattern and country lists
13 years ago
f1ori e207c41c8e * fix urlproxy for urls containing dollar signs
13 years ago
orbiter 5ad7f9612b added crawl settings for three new filters for each crawl:
13 years ago
orbiter d2ea250d99 refactoring:
13 years ago
low012 42b5f09f68 *) this should fix a bug in snippet creation (also cleaned up a little bit)
13 years ago
orbiter 6b22865dbc - removed some warnings
13 years ago
orbiter 0c6d95e57b - more tolerance against failure of table opening
13 years ago
orbiter 4f31869c5a enhanced search result timing
13 years ago
orbiter 6b02b696b0 - add number of search results to end of rss and json output to reflect latest status of retrieval
13 years ago
f1ori 87e6abd168 * fix urls containing a port number in urlproxy
13 years ago
f1ori 97045022fa * pass cookies to Server Side Includes
13 years ago
orbiter ce2a76d603 performance hack for search process
13 years ago
orbiter 2c4a672fe2 bugfixes and performance hacks for table index
13 years ago
orbiter dad5b586a4 added a concurrent warm-up of Table data structures. That should speed up the start-up process but may also cause higher CPU load at that time.
13 years ago
orbiter 734059d33e performance hacks
13 years ago
orbiter 23e81b28b2 synchronization enhancements
13 years ago
orbiter dd4635e323 patches
13 years ago
orbiter bb0c045036 fix for problem with relocation of network
13 years ago
orbiter 85a5487d6d YaCy can now use the solr index to compute text snippets. This makes search result preparation MUCH faster because no document fetching and parsing is necessary any more.
13 years ago
orbiter 52a2b3f110 try to fix bug http://bugs.yacy.net/view.php?id=26
13 years ago
orbiter 2cba860693 - fix for wrong entries in NOLOAD indexing queue (which caused urls to be indexed based only on their url, without being loaded)
13 years ago
orbiter cec3836e73 added reference limitation to IndexControlRWIs_p.html servlet
13 years ago
orbiter 49e5ca579f added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), then all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
13 years ago
f1ori 41e146116a fixes size of document in case the server doesn't give the size in the header
13 years ago
orbiter e1a3d609aa moved merger object from Segment to IndexCell to enable a correct shutdown sequence. This solves a bug where yacy cannot be shut down during an index merge that appears during the shutdown phase.
13 years ago
sixcooler 2cf61a40ce fixed a bug from 7856, where Snippet returned an error by mistake when Metadata was found
13 years ago
orbiter 610b01e1c3 - added an 'add every media object linked in a html document as a new document' option to the html parser. As a result, every image, app, video or audio file that is linked in a html file is added as a document. In effect, parsing a single html document may cause a number of documents to be inserted into the search index.
13 years ago
orbiter 3da21c4266 protection against starting of a (second) yacy peer while another one is already running on the same port
13 years ago
orbiter 3e6767d66c limitation of reference evaluation (protection against crawler pits)
13 years ago
orbiter 2c595a6a47 added new methods to count the number of objects in RWIs. lots of refactoring was necessary to introduce new Rating class and to unify naming of methods
13 years ago
orbiter 9f9f634de2 fix in search
13 years ago
sixcooler 5f8a5ca32d - not doing merge-jobs while short on Memory
13 years ago
orbiter 22d69a6368 refactoring in cora: added sorting package
13 years ago
orbiter 51cf697acd refactoring: moved all score-related classes to new ranking package
13 years ago
sixcooler 169236c6d9 almost revert changes in this class of 7880 and 7882
13 years ago
sixcooler 4fec99115b Implementation of strategies for controlling memory resources.
13 years ago
orbiter c64faf41e2 addon to svn 7880
13 years ago
sixcooler 06408a9428 since many POST-requests come as gzip they report a contentlength of -1
13 years ago
orbiter 594d8f546a #cccamp11 maintenance fix: anons may find up to 1000 items in interactive search (was: 100)
13 years ago
sixcooler eb14111200 encapsulate potentially expensive objects in TextSnippet so that they can be garbage-collected as soon as possible
13 years ago
orbiter e3fc1efbef performance hack and ensuring termination in serverAccessTracker. cause:
13 years ago
orbiter 44d74f8f89 performance hacks for seed generation (because thread dumps showed multiple occurrences at these code points)
13 years ago