Commit Graph

1670 Commits (910c2aaed6bcdd408dedc0bed196cd7fc54dda73)

Author SHA1 Message Date
orbiter 3c6e8d2015 set default ppm when network is switched
16 years ago
orbiter 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM
16 years ago
orbiter 77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet
16 years ago
orbiter 4fbee21cea - added fetch-ahead again (had been removed in last commit)
16 years ago
orbiter fc03b0437a fixed a error case where a second search after a first search with a different search word failed
16 years ago
orbiter ead39064c5 fixed problem with wrong result number calculation
16 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls
16 years ago
orbiter d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before)
16 years ago
orbiter df4ff423c4 added additional properties to query id's to distinguish search events better
16 years ago
danielr 9ff4fc11da partial fix (images,audio,video) for proxy and content-type problem http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374
16 years ago
lotus d9d9c522a1 addendum to last commit
16 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
16 years ago
danielr 3c68905540 remove redundant null checks
16 years ago
danielr 753a1ae430 - changed default browser from netscape to firefox
16 years ago
orbiter 7989335ed6 Preparations to replace the HTCache with a new storage data structure:
16 years ago
danielr be28af50f5 - fixed "yacy2yacy no proxy"-problem
16 years ago
f1ori f99c307eff * correct debian build dependencies
16 years ago
orbiter bdae051d9a - extended new performance graph (better timing)
16 years ago
danielr a087090bbb fixed starting crawl results in "No parser available to parse mimetype 'application/octet-stream'"
17 years ago
danielr 8422ee5ec4 - fixed UnsupportedEncoding (in proxy) using defaultCharset if no characterEncoding can be determined
17 years ago
hermens cff4393f0c Fix HTCache so oldest Files get deleted first
17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
orbiter ebb40d324b enhanced memory chart: shows now also the size of the word cache as third vector.
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
lotus 0b2f67577e Index Transfer:
17 years ago
lotus 5f77f55ed7 possible fix for negative speed values
17 years ago
orbiter 50ef5c406f - refactoring of robots parser (removed opaque Objects[] result vector)
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
lotus 62afea0c9f some improvements for yacyTray
17 years ago
lotus fa695c2d9f tray is now only shown on Windows and doesn't block on linux
17 years ago
lotus d77ed28e2f temporary disabled tray because of flaws on only-shell-linux
17 years ago
lotus f8a1e3175e new yacyTray
17 years ago
orbiter 7b1c9e6aee discovered and removed a (possibly large) memory leak:
17 years ago
orbiter 0f5fe8cc53 refactoring of method calling for objects from kelondroMapDataMining
17 years ago
orbiter 4acf0a61cd refactoring of kelondroObjects (mainly renaming to kelondroMap)
17 years ago
orbiter 441e9c861e fix for npe in HTCache cleaning process
17 years ago
orbiter 1e6d12f146 Major update to BLOB data structures:
17 years ago
orbiter b38f467e3c better SRU compliance
17 years ago
orbiter 7052f2f61f - added copyright header of ResourceObserver
17 years ago
orbiter 1400cdc91e - refactoring of resourceObserver (moved it to crawler)
17 years ago
f1ori b6301a54fa * added class ListDirs to provoid generic listing of directories in systemdirectories and jar-files
17 years ago
lotus f2e2d09916 - fix for index transfer
17 years ago
orbiter a6719dfd2b - refactoring of robots parser
17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections.
17 years ago
orbiter 474659a71f - modified and enhanced the crawl balancer: better list export, fixing of damaged crawl queue at start-up, re-sorting at start-up to enhance domain order
17 years ago
orbiter d37fd064f9 changed peer selection for search targets:
17 years ago
orbiter 69aac0d74c modified the diskUsage class regarding the following two aspects:
17 years ago
danielr 0c1dc703e4 - set staticIP at startUp
17 years ago
orbiter b928ae492a some code-cleanup and possible speed enhancements in different core methods
17 years ago
orbiter c998dc6556 - added security functions to flush url and search caches in case that memory is full
17 years ago
danielr 68c38c2d34 - WatchCrawler shows status without JavaScript
17 years ago
orbiter f5ef7f222e - fixed a bug in parser (directory paths had not been recognized)
17 years ago
orbiter 3330181aa0 refactoring:
17 years ago
danielr 7feae906aa - organize imports
17 years ago
det f597185026 Initial import of the resource observer framework
17 years ago
orbiter e0e7f86f82 some bugfixes for the peer-ping process
17 years ago
orbiter 40d7f485f3 - fixed several NPE bugs
17 years ago
orbiter 2f381b8d7a - fixed at least two causes for a NPE after a use case switch.
17 years ago
orbiter 2a604b7402 added superfast search result computation which can be obtained for local search when snippet fetching is disabled. An example search for the rss interface would be:
17 years ago
orbiter 9bef20b537 - added cleanup for unused server loggings: they are removed after the client had not been seen since one hour
17 years ago
orbiter 1a1841392c small fix
17 years ago
orbiter 25192e0d36 added a deletion button to indexControlRWIs that deletes the complete web index
17 years ago
orbiter 0c173821fd more access security regarding database access and snippet retrieval: restrict number of results for not-authorized searchers
17 years ago
orbiter faed00d75d added use cases to basic configuration
17 years ago
orbiter 4229cd275c fixed several details about network switching, default password, random password and localhost authentification
17 years ago
orbiter c1d721dd2d fix for attacks on localhost-authorized peers from web pages with links to localhost addresses:
17 years ago
orbiter 56a300f92a bugfix / local-search predicate
17 years ago
orbiter 2f29ab8779 more target server access security
17 years ago
orbiter 3bd1db776a implemented switch for admin authorization from localhost:
17 years ago
orbiter cfe6790498 - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),
17 years ago
orbiter 78087da287 - changed seed file storage to clear text
17 years ago
orbiter 5fde679acb - fixed problem in performance configuration
17 years ago
orbiter 239cc4428d - better domain graph, faster when more links exist, looks better
17 years ago
orbiter 415b92bb07 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1148&hilit=&p=7711#p7711
17 years ago
orbiter dd75b3cabc - patch for bad profiles
17 years ago
lotus e021278bf0 unescape link display in search results
17 years ago
danielr 74b1a60043 fixed "java.lang.NoClassDefFoundError: org/a"
17 years ago
orbiter f42c8cf69c updated terminal and dynamic webstructure applet: can now change when crawl is running
17 years ago
orbiter 7ec01d444a fix for npe
17 years ago
danielr ae03a54d23 pdfParser: updated lib, fixed ClassNotFoundException: CMSError
17 years ago
orbiter 719f5defb1 updated some grafics at new terminal_p
17 years ago
lotus 9bc56a9edc xss protection
17 years ago
orbiter b32736762c enhanced rssTerminal
17 years ago
orbiter fbb712c669 refactoring:
17 years ago
orbiter 1689030ee8 refactoring: moved all crawler classes into their own package
17 years ago
orbiter d2ba1fd2ab major step forward to network switching (target is easy switch to intranet or other networks .. and back)
17 years ago
danielr d4bce6affd refactoring (initialized static fields, removed empty if/else, serialized some fields in serializable classes)
17 years ago
orbiter d0678f7ab9 refactoring as result of
17 years ago
orbiter 483e9a2066 - shifted tld recognition methods from yacyURL to serverDomains
17 years ago
orbiter a3df23659c re-implementation of charset checking
17 years ago
orbiter 32b5b057b9 - modified, simplified old kelondroHTCache object; I believe it should be replaced by something completely new
17 years ago
orbiter 88216c1f1f fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1103&hilit=&p=7362#p7362
17 years ago
orbiter d0b893523e - protection against RAM overflow caused by new peer rss news
17 years ago
orbiter 685794e7e7 fix for parser/encoding Exception
17 years ago
orbiter 9935e83c86 added new news window into the status page. At this moment it is just a test.
17 years ago
orbiter bac38cfa18 added very rudimentary peer news as rss feed. An example can be retrieved with
17 years ago
orbiter 724bbdf9b2 refactoring of RSS reader
17 years ago
orbiter b9a2a2d287 more search performance hacks
17 years ago
orbiter ff755fb858 small corrections and enhancements after search timing profiling
17 years ago
orbiter e024e3b9cf added new default profiles to distinguish snippet fetch for local and global search
17 years ago
orbiter 1995faef8d - refactoring of Colage back-end: move to plasma package
17 years ago
orbiter d7e89c2aca fixed near-deadlock situation when deleting crawl profiles
17 years ago
orbiter 5e3ce46339 - better logging when rejecting a url because it is not in declared domain
17 years ago
orbiter 512f48e7d6 - removed unused methods
17 years ago
danielr 96e39b297a reduced StackTraces (by connect timed out)
17 years ago
orbiter 8313d58ae7 - integrated the collage into the Web Visualization menu
17 years ago
orbiter 82bf9ac1c8 - added Collage servlet from datengrab and modified it:
17 years ago
danielr 959f448e5f - disabled redirects in proxy (so client sees real path)
17 years ago
orbiter 5d1fbb25e7 fix for bad deploy:
17 years ago
orbiter 202a3adb3e refactoring of HttpClient Writer processes
17 years ago
orbiter 2c2dcd12a2 - enhanced performance of Eco-Tables: less time-consuming size() - operations
17 years ago
orbiter c3342e1178 - removed class with only one static method
17 years ago
orbiter 2c1c3bb6eb - some refactoring (sorry Daniel, hab in deinem Code rumgewütet)
17 years ago
orbiter 14404d31a8 - enhanced performance graph (more info)
17 years ago
orbiter 696b8ee3f5 fix for http://forum.yacy-websuche.de/viewtopic.php?p=6806#p6806
17 years ago
danielr 081ed1d3ec HTTPLoader: reduced stackTraces
17 years ago
orbiter 225f9fd429 various fixes
17 years ago
orbiter f099061944 protection against bad dht-flush word selection
17 years ago
orbiter 117ae78001 speed enhancement for reading of eco-table indexes
17 years ago
danielr 5c3c1fdf41 replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;)
17 years ago
orbiter 783a4c9edb strong speed enhancements for the index cache dump and restore:
17 years ago
orbiter 442204a1c8 fix for concurrentModificationException
17 years ago
orbiter d2f4926951 - more logging for balancer to get a hint where the problem is
17 years ago
orbiter 20dadba426 - added a deadlock prevention function in cache flushing
17 years ago
orbiter 764a40e37d speed enhancements for crawler and url retrieval (affects also search speed)
17 years ago
orbiter 368593e449 enhanced the concurrency handling of indexing process (better queue size control, better data concept, better shutdown behavior)
17 years ago
orbiter be58135b3e possible fix for deadlock in search execution
17 years ago
orbiter 0241d070bc added concurrency to indexing process:
17 years ago
orbiter bca87f1e38 - refactoring of serverThreads: renaming to distinguish busy-threads and blocking-threads
17 years ago
orbiter 968c775025 - preparation of parsing/indexing queue for concurrent execution
17 years ago
orbiter 9b0e20fb06 next refactoring step in document indexing to prepare concurrency environment for document parsing
17 years ago
orbiter 7f9f639d20 - refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
orbiter d6050b9ffb - separated the LURL data storage and Crawl result stack for process supervision.
17 years ago
orbiter 8d6a13bc07 refactoring of parsing-condensing-indexing process:
17 years ago
orbiter 5aa96dbc36 fix for shutdown configuration
17 years ago
orbiter 93633abed8 - removed some debugging code from search process - should speed up now
17 years ago
orbiter 541b817502 refactoring of switchboard queueing
17 years ago
orbiter 433ff855f7 - fixed another concurrency problem in collection sorting
17 years ago
danielr 7008a218b3 avoid ConcurrentModificationException in plasmaCrawlerQueues
17 years ago
orbiter 7150b463ff changed handling of default values and database paths:
17 years ago
danielr f51bad8ae5 FTP:
17 years ago
orbiter 9c989fe5f7 fixed deadlock
17 years ago
danielr c565906050 FTP:
17 years ago
danielr 1a7870df0d FTP: source cleanup (added finals, indention for easier diffs)
17 years ago
orbiter fa1090113d - next try to fix the networking problem:
17 years ago
orbiter b664a53553 fix for NPE during search
17 years ago
orbiter b4ed937f1e - modified zone navigation (does still not work correctly)
17 years ago
orbiter 8d0470a5c6 new method to compute search history IDs
17 years ago
orbiter 9eddc1506b - one try to fix the httpd problem
17 years ago