Commit Graph

7732 Commits (d4ad01643c2c4e304e740e35f0d80681431a1a16)
 

Author SHA1 Message Date
orbiter 610b01e1c3 - added an 'add every media object linked in an html document as a new document' feature to the html parser. As a result, every image, app, video or audio file that is linked in an html file is added as a document. This means that parsing a single html document may insert a number of documents into the search index.
13 years ago
orbiter 9a8937f8b6 be more liberal when evaluating search results. This may make it possible to pass off fraudulent content on fresh peers, but that is better than long waiting times for the evaluation of every link, which cause everybody to reject YaCy as 'too slow'. That slowness is only a consequence of the high standards that YaCy sets for itself. If we are able to gain more users by lowering the standard, then that is useful. The option to set the flag that verifies each link is still there.
13 years ago
orbiter 3da21c4266 protection against starting of a (second) yacy peer while another one is already running on the same port
13 years ago
orbiter 2d03dc1804 removed unnecessary warning
13 years ago
orbiter cf8e3b0df8 small fix for count: overXX includes the count
13 years ago
orbiter 6db8921a0f enhanced termlist
13 years ago
orbiter b5252ef91f added new word recommendation library in DictionaryLoader_p.html
13 years ago
orbiter 1c007188ad bugfixes in html parser
13 years ago
orbiter b00e69c5df removed test output
13 years ago
orbiter 231074bf0a fixed a parsing bug by reverting SVN 7766
13 years ago
low012 ce11b7b6d2 *) Changed action to "" instead of "yacysearch.html". This should not do any harm, but helps a lot if the page is accessed not by its original name but by a different name which can be done by adding a symbolic link to the file system of the peer. (See http://www.yacy-forum.org/viewtopic.php?f=2&t=464)
13 years ago
low012 30a8a2f76b *) replacing one ugly hack with an extended ugly hack ;-)
13 years ago
low012 95379ce0b1 *) should fix some problems with RSS Importer (see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=3253)
13 years ago
low012 c660f8862a *) changed links to be underlined again since lots of links were not obvious anymore
13 years ago
low012 24e76a7b69 *) Replaced occurrences of "Wikimedia" with "MediaWiki" where applicable. (Thanks to the folks of 0x20.be for pointing this out.)
13 years ago
sixcooler d40a177c05 Generation Memory Strategy fine tuning
13 years ago
sixcooler 839f407fe4 Generation Memory Strategy fine tuning:
13 years ago
orbiter 3e6767d66c limitation of reference evaluation (protection against crawler pits)
13 years ago
orbiter a5541751a8 - added memory computation to termlist_p.xml
13 years ago
orbiter 45e497a9bd fix for term iteration
13 years ago
orbiter 9bdee5c71c added a servlet that produces a list of term hashes that appear more than 10000 times
13 years ago
orbiter 5dd2efc9a2 - bugfixes in html parser
13 years ago
orbiter 2c595a6a47 added new methods to count the number of objects in RWIs. lots of refactoring was necessary to introduce new Rating class and to unify naming of methods
13 years ago
orbiter 75df87832c refactoring/better naming of methods and classes
13 years ago
orbiter 9f9f634de2 fix in search
13 years ago
sixcooler 5f8a5ca32d - not doing merge-jobs while short on Memory
13 years ago
orbiter 965fabfb87 enhanced sorting speed (affects all DB operations)
13 years ago
orbiter 41a8ee4569 added iterable implementation in KeyList
13 years ago
orbiter 22d69a6368 refactoring in cora: added sorting package
13 years ago
orbiter 51cf697acd refactoring: moved all score-related classes to new ranking package
13 years ago
orbiter a0d5e7b6e6 added new score comparator
13 years ago
sixcooler 169236c6d9 almost revert changes in this class of 7880 and 7882
13 years ago
sixcooler 4fec99115b Implementation of strategies for controlling memory resources.
13 years ago
sixcooler 63a375b801 do not look at external dtd, because this makes the reader stay forever(?) on faulty dtd-locations
13 years ago
orbiter c39d63e7ad by default show only domain navigator
13 years ago
orbiter 2c58af6874 - added a short memory status simulation mode
13 years ago
orbiter c64faf41e2 addon to svn 7880
13 years ago
sixcooler 7b7a196243 ignore cookies in httpclient per default
13 years ago
sixcooler 06408a9428 since many POST-requests come as gzip they report a contentlength of -1
13 years ago
sixcooler 411ed159f8 do some extra sleep while running low on memory
13 years ago
orbiter 6361f1d875 select the search window on focus so it's easy to type in another query
13 years ago
sixcooler 9ab0ba41e2 using GzipDecompressingEntity from httpclient instead of our own
13 years ago
sixcooler 52b477cf6f bump to httpclient-4.1.2, httpcore-4.1.3 - bugfixrelease
13 years ago
orbiter ca09081341 better interaction
13 years ago
orbiter 3f0349e362 added a 'loading...' message
13 years ago
orbiter feac494f26 switch off real-time search if index is large
13 years ago
sixcooler 07f5954570 try better handling of corrupt blobs
13 years ago
orbiter f970670a7c - bugfix in ServerScannerList
13 years ago
orbiter 8e03b8ee8b better integration of server list in interactive search
13 years ago
orbiter 606c5a9b40 added a servlet that shows all scanned servers inside of the yacyinteractive search page
13 years ago