Commit Graph

1747 Commits (fadf311b97f883ce474ba38e288ad01ce14c5025)

Author SHA1 Message Date
orbiter c6880ce28b removed the permanent cache flush and replaced it with a periodic cache flush
16 years ago
orbiter 6c7e83909b - refactoring of data access methods to be prepared for new cell data structure
16 years ago
orbiter c4c4c223b9 fixed a problem with attribute flags on RWI entries that prevented proper selection of index-of constraint
16 years ago
orbiter 6072831235 no cr transmission for robinson peers
16 years ago
orbiter be4c458951 refactoring (implemented Iterable in kelondroRowCollection)
16 years ago
orbiter b6bba18c37 replaced the storing procedure for the index ram cache with a method that generates BLOBHeap-compatible dumps
16 years ago
f1ori 025094675f * remove empty directory
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
f1ori 963da8c3f9 * updated tm-extractors to new version 1.0
16 years ago
orbiter e34ac22fbd - added new monitoring servlet at
16 years ago
orbiter d376d81fc4 replaced busy thread control of crawl stacker by blocking threads
16 years ago
orbiter 8cb7170b75 - set status of kelondroTree, kelondroBLOBTree and kelondroFlexTable to deprecated
16 years ago
orbiter 7535fd7447 - refactoring of CrawlEntry and CrawlStacker
16 years ago
orbiter 2802138787 - refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck)
16 years ago
orbiter 1779c3c507 - added a read cache to the RAFile interface to RandomAccessFile
16 years ago
orbiter 4a2dac659e more speed hacks:
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
lotus 1951d30a62 addendum to last commit
16 years ago
lotus 325ba7bfb8 only query words with length > 2
16 years ago
f1ori 5af8923f37 * distribute forgotten jar-file in parser
16 years ago
orbiter b0f2003792 fast database initialization and fast start.up of yacy:
16 years ago
orbiter 867d0f2f56 removed some unnecessary pause delays
16 years ago
orbiter 8c96bc2ac1 do not use proxy caching rules for crawling
16 years ago
orbiter dba7ef5144 extended crawling constraints:
16 years ago
orbiter 96174b2b56 more debugging / better result status logging for parser/caching errors
16 years ago
f1ori 90e78b2cf6 * improve encoding detection of http service
16 years ago
orbiter ef66438662 - more space in error db to store larger error messages
16 years ago
orbiter 674ad2d55b different handling of error cases that occur during loading files with http or ftp:
16 years ago
f1ori 7e1fe05e3c * added utf8-encoding to many getBytes-calls
16 years ago
lotus fad044fb54 update to snippet marker:
16 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target - computing methods
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off
16 years ago
f1ori 340ecd919d * include non ascii characters in visible characters
16 years ago
low012 00e27e5050 *) fixed bug which made it possible to write files outside of the DATA/LIST directory when creating a new blacklist
17 years ago
orbiter b098522977 some very small advances to index utf-8 (not working yet), inserted also debugging code
17 years ago
orbiter 2f49666908 integrated the character decoding into the parser, removed old code
17 years ago
orbiter 0edec2b760 FULL redesign of algorithms in htmlTools to encode/decode strings from/to unicode and html.
17 years ago
f1ori 2e53cbc66a should compile now
17 years ago
f1ori f3bf2e379e should compile again
17 years ago
f1ori dd8441f102 fix bug: data from plasmaParser is allready converted to UTF-8
17 years ago
orbiter 6941bf42b1 performance hacks
17 years ago
orbiter 9b0c4b1063 redesign of parts of the new BLOB buffer
17 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer
17 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage:
17 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache:
17 years ago
orbiter ffed5fc415 fixed problem with lost peers in database
17 years ago
orbiter 2d65887723 - fix for bug in new profile handling
17 years ago
orbiter ff68f394dd fix for problem with balancer and lost crawl profiles:
17 years ago
lotus fb8d9850ea fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1462
17 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions
17 years ago
orbiter 820a03f9d6 - removed some warnings
17 years ago
orbiter c8bdd965ec - larger update time for status page
17 years ago
orbiter ce4715e305 removed indexing of anchor links and tagging such words as part of urls (that was wrong)
17 years ago
orbiter ce57de6cb3 - fixed re-setting of DHT Send/Receive settings
17 years ago
f1ori 7afa084207 * add nativ java trayicon, using reflections
17 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch
17 years ago
orbiter 7b35d54c6c fixed some problems with network switching (was not completely 'clean')
17 years ago
orbiter f0b42e5a98 fixed NPE
17 years ago
orbiter 8e0de7f180 update to language statistic evaluation:
17 years ago
orbiter 1198eeecc7 added language selection to search query:
17 years ago
orbiter 00c1535f84 added ranking and evaluation of language type in a search
17 years ago
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information
17 years ago
orbiter e1f67262f7 - added and removed some debugging output
17 years ago
orbiter ce2a7ed116 integrated language detection classes into condenser environment
17 years ago
orbiter 2b13705839 fixed a mistake in indexing queue processing: documents had been parsed before it was checked if they should be indexed or not. parsing was not necessary for this check, so the check was moved in the queue in front of the document parsing
17 years ago
orbiter 1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction
17 years ago
orbiter 0cd0fee546 fixed bug with wrong proxy result enqueueing. See:
17 years ago
orbiter 670244849d fix for http://forum.yacy-websuche.de/viewtopic.php?p=9835#p9835
17 years ago
orbiter 5fbccfd75e fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1366&p=9348#p9348
17 years ago
orbiter 1fb1665e71 increased dht interval to avoid peer selection failure
17 years ago
orbiter 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced
17 years ago
lotus 3ded1efe84 kelondroExceptionCounter didn't work
17 years ago
lotus 383d89481e count errors before deleting collection.index
17 years ago
lotus 0bb4fbc403 delete corrupted collecion.index on exit for rebuild on next start
17 years ago
lotus b68d06a6e8 performance settings based on network's remote crawl speed
17 years ago
orbiter bb5c898441 enhancements to localsearch behavior
17 years ago
orbiter 3c6e8d2015 set default ppm when network is switched
17 years ago
orbiter 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM
17 years ago
orbiter 77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet
17 years ago
orbiter 4fbee21cea - added fetch-ahead again (had been removed in last commit)
17 years ago
orbiter fc03b0437a fixed a error case where a second search after a first search with a different search word failed
17 years ago
orbiter ead39064c5 fixed problem with wrong result number calculation
17 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls
17 years ago
orbiter d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before)
17 years ago
orbiter df4ff423c4 added additional properties to query id's to distinguish search events better
17 years ago
danielr 9ff4fc11da partial fix (images,audio,video) for proxy and content-type problem http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374
17 years ago
lotus d9d9c522a1 addendum to last commit
17 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
17 years ago
danielr 3c68905540 remove redundant null checks
17 years ago
danielr 753a1ae430 - changed default browser from netscape to firefox
17 years ago
orbiter 7989335ed6 Preparations to replace the HTCache with a new storage data structure:
17 years ago
danielr be28af50f5 - fixed "yacy2yacy no proxy"-problem
17 years ago
f1ori f99c307eff * correct debian build dependencies
17 years ago
orbiter bdae051d9a - extended new performance graph (better timing)
17 years ago
danielr a087090bbb fixed starting crawl results in "No parser available to parse mimetype 'application/octet-stream'"
17 years ago
danielr 8422ee5ec4 - fixed UnsupportedEncoding (in proxy) using defaultCharset if no characterEncoding can be determined
17 years ago
hermens cff4393f0c Fix HTCache so oldest Files get deleted first
17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
orbiter ebb40d324b enhanced memory chart: shows now also the size of the word cache as third vector.
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
lotus 0b2f67577e Index Transfer:
17 years ago
lotus 5f77f55ed7 possible fix for negative speed values
17 years ago
orbiter 50ef5c406f - refactoring of robots parser (removed opaque Objects[] result vector)
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
lotus 62afea0c9f some improvements for yacyTray
17 years ago
lotus fa695c2d9f tray is now only shown on Windows and doesn't block on linux
17 years ago
lotus d77ed28e2f temporary disabled tray because of flaws on only-shell-linux
17 years ago
lotus f8a1e3175e new yacyTray
17 years ago
orbiter 7b1c9e6aee discovered and removed a (possibly large) memory leak:
17 years ago
orbiter 0f5fe8cc53 refactoring of method calling for objects from kelondroMapDataMining
17 years ago
orbiter 4acf0a61cd refactoring of kelondroObjects (mainly renaming to kelondroMap)
17 years ago
orbiter 441e9c861e fix for npe in HTCache cleaning process
17 years ago
orbiter 1e6d12f146 Major update to BLOB data structures:
17 years ago
orbiter b38f467e3c better SRU compliance
17 years ago
orbiter 7052f2f61f - added copyright header of ResourceObserver
17 years ago
orbiter 1400cdc91e - refactoring of resourceObserver (moved it to crawler)
17 years ago
f1ori b6301a54fa * added class ListDirs to provoid generic listing of directories in systemdirectories and jar-files
17 years ago
lotus f2e2d09916 - fix for index transfer
17 years ago
orbiter a6719dfd2b - refactoring of robots parser
17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections.
17 years ago
orbiter 474659a71f - modified and enhanced the crawl balancer: better list export, fixing of damaged crawl queue at start-up, re-sorting at start-up to enhance domain order
17 years ago
orbiter d37fd064f9 changed peer selection for search targets:
17 years ago
orbiter 69aac0d74c modified the diskUsage class regarding the following two aspects:
17 years ago
danielr 0c1dc703e4 - set staticIP at startUp
17 years ago
orbiter b928ae492a some code-cleanup and possible speed enhancements in different core methods
17 years ago
orbiter c998dc6556 - added security functions to flush url and search caches in case that memory is full
17 years ago
danielr 68c38c2d34 - WatchCrawler shows status without JavaScript
17 years ago
orbiter f5ef7f222e - fixed a bug in parser (directory paths had not been recognized)
17 years ago
orbiter 3330181aa0 refactoring:
17 years ago
danielr 7feae906aa - organize imports
17 years ago
det f597185026 Initial import of the resource observer framework
17 years ago
orbiter e0e7f86f82 some bugfixes for the peer-ping process
17 years ago
orbiter 40d7f485f3 - fixed several NPE bugs
17 years ago
orbiter 2f381b8d7a - fixed at least two causes for a NPE after a use case switch.
17 years ago
orbiter 2a604b7402 added superfast search result computation which can be obtained for local search when snippet fetching is disabled. An example search for the rss interface would be:
17 years ago
orbiter 9bef20b537 - added cleanup for unused server loggings: they are removed after the client had not been seen since one hour
17 years ago
orbiter 1a1841392c small fix
17 years ago
orbiter 25192e0d36 added a deletion button to indexControlRWIs that deletes the complete web index
17 years ago
orbiter 0c173821fd more access security regarding database access and snippet retrieval: restrict number of results for not-authorized searchers
17 years ago
orbiter faed00d75d added use cases to basic configuration
17 years ago
orbiter 4229cd275c fixed several details about network switching, default password, random password and localhost authentification
17 years ago
orbiter c1d721dd2d fix for attacks on localhost-authorized peers from web pages with links to localhost addresses:
17 years ago
orbiter 56a300f92a bugfix / local-search predicate
17 years ago
orbiter 2f29ab8779 more target server access security
17 years ago
orbiter 3bd1db776a implemented switch for admin authorization from localhost:
17 years ago
orbiter cfe6790498 - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),
17 years ago
orbiter 78087da287 - changed seed file storage to clear text
17 years ago
orbiter 5fde679acb - fixed problem in performance configuration
17 years ago