Commit Graph

410 Commits (fe75f326d8db7db083313a560a7464b8016249d7)

Author SHA1 Message Date
Michael Peter Christen 4ca1b76627 less search overhead when first result set is smaller than requested 12 years ago
Michael Peter Christen 4589afe056 fix NPE when solr does not deliver snippets 12 years ago
Michael Peter Christen c3d50d91f8 relaxing site operator for www prefix: 12 years ago
reger d456f69381 SeedUpload url : check to reject localhost url included in saveSeedList (same check as in / copied from Seed.isProper() ), to prevent identity change on next startup (due to rejected seeduploadurl). 12 years ago
Michael Peter Christen 433143ba40 removed protocol, tld, ext from the urlmask and created specific 12 years ago
Michael Peter Christen 01200f06cc using the author field as solr-native facet. this makes it necessary to 12 years ago
Michael Peter Christen bab573361f - using a filter query for facet restriction 12 years ago
Michael Peter Christen 34f8786508 removed dependency of vocabulary navigation from Jena and it's 12 years ago
Michael Peter Christen 9319b90d8a - fixes for host navigation 12 years ago
Michael Peter Christen cb5cbec14d distinguishing modified query string and original query string 12 years ago
Michael Peter Christen d48e9788d2 enhanced search result processing behavior 12 years ago
orbiter 5dfd6359cb redesign of the QueryParams class: introduced QueryGoal which holds the 12 years ago
Michael Peter Christen d64445c3cb because we have the inurl:<term> - searchmodifier, we don't actually 12 years ago
Michael Peter Christen 93001586a0 removed warnings, removed too-fast pausing of crawls 13 years ago
Michael Peter Christen 2371ef031c added solr faceted search support to YaCy search results 13 years ago
Michael Peter Christen 8fb370d9f8 renovated the way how search results are count. should be correct now... 13 years ago
Michael Peter Christen 6629e37685 tried to clean up the search process mess 13 years ago
Michael Peter Christen c5f67a5d6d fixed a problem with local search from solr results: now all results 13 years ago
Michael Peter Christen f8f05ecba7 - added a delete button in host browser to delete a complete subpath 13 years ago
Michael Peter Christen 0fe8be7981 enhaced data structures for balancer and latency computation which 13 years ago
Michael Peter Christen a33e2742cb - removed unnecessary synchronized and deadlock in crawler 13 years ago
sixcooler 2d972f289a rise commitWithinMs to default-value from SwitchBoard 13 years ago
Michael Peter Christen f2d0418218 because the new PngEncoder had a problem with the PixelGrabber which is 13 years ago
Michael Peter Christen d5d64019e5 - added a method for the RasterPlotter to draw arrow endings to lines 13 years ago
orbiter 276dd6452b removed warnings 13 years ago
Michael Peter Christen ae6feb5610 showing the web structure graph as animation in the crawl monitor 13 years ago
Michael Peter Christen 39317a6c66 enhanced webstructure image: introduced 13 years ago
sixcooler 57ddd63888 not hold a expensive cache of references for DHT-out,but but load them 13 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow 13 years ago
Michael Peter Christen e5b3c172ff removed hack which translated Solr documents to virtual RWI entries 13 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to 13 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects 13 years ago
Michael Peter Christen 584663ae8c - redesign of solr query construction 13 years ago
Michael Peter Christen 016ffa7434 increased strength of crawling waves in network image 13 years ago
Michael Peter Christen 24d2ee3c52 - better date ranking 13 years ago
Michael Peter Christen 562183932b - removed ip_s from default profile since that needs a DNS lookup to 13 years ago
Michael Peter Christen c913b2ba77 - fix for NPEs during remote solr configuration 13 years ago
Michael Peter Christen 1533bfd63b refactoring 13 years ago
Michael Peter Christen 872f83ebe0 refactoring 13 years ago
Michael Peter Christen 5683162bd3 simplifications in DHT Distribution class and more documentation 13 years ago
Michael Peter Christen e57bf2ca39 simplified DHT classes 13 years ago
Michael Peter Christen 8219a445f3 refactoring 13 years ago
Michael Peter Christen 00c1c777fa refactoring 13 years ago
orbiter 563d584420 removed more dependencies in cora from kelondro 13 years ago
Michael Peter Christen f75b3f8a47 added more patches to work without RWI data structure 13 years ago
Michael Peter Christen e8acd542b5 - added faceted drill-down for host and geolocation to solr queries 13 years ago
Michael Peter Christen 4716546ef5 - reduced memory usage in index transmission using a transformation of 13 years ago
Michael Peter Christen 653645c1cf corrected solr query syntax 13 years ago
Michael Peter Christen f42a57cd7d gsa format update 13 years ago
Michael Peter Christen b3aad6cc35 bugfix for remote search when search is done to solr 13 years ago
Michael Peter Christen ff3eaa21b0 added remote search to solr on YaCy peers! 13 years ago
Michael Peter Christen a06123aec6 more abstraction and less parameter overhead for remote search 13 years ago
Michael Peter Christen f00733186b code simplifications 13 years ago
orbiter 404b0aab09 refactoring in remote search and stub for remote node peer selection 13 years ago
Michael Peter Christen 0cab06c47c refactoring 13 years ago
Michael Peter Christen 18f989dfb1 - refactoring (load -> getMetadata) 13 years ago
Michael Peter Christen a16206e38b more attempts to clean the index (cleaning is faster then) 13 years ago
Michael Peter Christen 703f427303 fixed some peer-ping connection details 13 years ago
Michael Peter Christen 597bb76e4f get the peer location more quickly 13 years ago
Michael Peter Christen b51df6c7e8 - added coordinate storage in solr schema 13 years ago
orbiter e816b88b55 changed behaviour of metadata storage: in case that any solr is 13 years ago
Michael Peter Christen f9c0e6e950 - Implemented and integrated the URIMetadataNode object which is a 13 years ago
orbiter 67edfd991c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All 13 years ago
Michael Peter Christen 24d9db1613 snippet retrieval loading processes may use a smaller minimum load time 13 years ago
Michael Peter Christen 1687737771 Abstraction of HandleMap and HandleSet 13 years ago
Michael Peter Christen 6f1ddb2519 Moved solr index-add method to the same method where the YaCy index is 13 years ago
Michael Peter Christen 1f41d9c6f5 bugfix for a NPE 13 years ago
Michael Peter Christen d3f243e2e1 fixed node type calculation for principal peers 13 years ago
orbiter 69e743d9e3 - more abstraction for the RWI index as preparation for solr integration 13 years ago
orbiter c00a3cf74d less usage of generic logger to avoid logger generation overhead 13 years ago
orbiter 0cbda0b2b8 - replaced all length() == 0 and size() == 0 with isEmpty() 13 years ago
orbiter 62202e2d71 refactoring of query attribute variable names for better consistency 13 years ago
Michael Peter Christen b0c408788b made class methods static where possible 13 years ago
Michael Peter Christen 0301aba1e9 removed unused method parameters 13 years ago
Michael Peter Christen 241dd8410a removed snippet pattern filter - it was not used 13 years ago
Michael Peter Christen d3964253ae - added @SuppressWarnings to unused servlet method parameters 13 years ago
Michael Peter Christen ea10766bfd cleaned unnecessary nested code 13 years ago
Michael Peter Christen 1481037820 replaced non-generic array with collection 13 years ago
Michael Peter Christen 613b45f604 - better data structures in secondary search 13 years ago
Michael Peter Christen 1825f165b8 better integration of blacklist according to use case 13 years ago
Michael Peter Christen 96aeb127e3 generalized localhost naming. 13 years ago
Michael Peter Christen 77f795756c fixing redirects and status codes: storing of status code in 13 years ago
Michael Peter Christen b9d42fd9c8 using com.google.common.io.Files instead of homebrew methods 13 years ago
Michael Peter Christen 3f55dc7c1e - added solr core and libraries that solr needs (lucene is missing, will 13 years ago
Michael Peter Christen 82a682b31d fixed problem with seed when switching network 13 years ago
Michael Peter Christen 8e97ada7c9 IPv6 bugfix 13 years ago
Roland 'Quix0r' Haeder edaa09b9b1 Rewrote all String blacklist types to enum 'BlacklistType', closes bug 13 years ago
Michael Peter Christen 3b992e6b00 using utf8 String compression in Webstructure database 13 years ago
Michael Peter Christen a1fe65b115 performance hacks 13 years ago
Michael Peter Christen e0d8643226 - performance hacks 13 years ago
Michael Peter Christen 7c1feefb28 introduced a default 10 second time-out in rwi normalization time 13 years ago
Michael Peter Christen 65d37e6a20 only ASCII needed in seed bitflags 13 years ago
Michael Peter Christen 0f82fb3628 using double instead float for a better release ordering 13 years ago
Michael Peter Christen 71c3163f3d - fixes to node identification 13 years ago
Michael Peter Christen ad222be7f8 added node state icon in network list 13 years ago
Michael Peter Christen 3c2bec681f added a root node flag: identifies peers with short ping time 13 years ago
Michael Peter Christen f294f2e295 bugfix to http://bugs.yacy.net/view.php?id=181 13 years ago
Michael Peter Christen 89142d1e8d removed (not all) warnings 13 years ago
Michael Peter Christen 15db703808 added missing serialization to remove all warnings 13 years ago
Roland 'Quix0r' Haeder a093ccf5eb Now used synchronization in all close() methods to make sure all objects 13 years ago
Marc Nause a691023d04 *) better formatting for network QPM 13 years ago
Michael Peter Christen 77f8e9fb9b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen ba6aaabc51 refactoring + parser bugfixes 13 years ago
Michael Peter Christen 2a0434efa4 Merge commit 'c1f6b4fb5226d3d2f8b2bec9e361f6b3476e03ff' 13 years ago
reger c1f6b4fb52 lookupByIP: prevent comparing of port parameter if called with port -1 (=unknown) 13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not 13 years ago
Michael Peter Christen 14f67f217c refactoring of ContentDomain: now subclass of Classification 13 years ago
Michael Peter Christen 33d1062c79 refactoring: the cache belongs to the crawler 13 years ago
Michael Peter Christen 046f3a7e8d check if httpc has decompressed the release file and rename the file 13 years ago
Michael Peter Christen 8c06925984 animation of the web structure picture 13 years ago
Michael Peter Christen c639248c23 protection against strange answers from remote peers during search 13 years ago
Michael Peter Christen 7e4e3fe5b6 free some memory after parsing html 13 years ago
Michael Peter Christen b4409cc803 small redesign of blob column index and usage 13 years ago
Michael Peter Christen 0b67a0a5d8 added a column index for tables in blob files. This is heavily used 13 years ago
Michael Peter Christen 7e728867e5 added a synchronization around iterations to prevent IO-deadlocking 13 years ago
Michael Peter Christen ef5192f8c9 using the generic document parser for crawl starts instead of the html 13 years ago
Marek Otahal 72adbeae90 !Important: move from Hashtable to HashMap 13 years ago
Michael Christen 216a287a85 Merge commit '6d4e08ed06c5cd28c45981b2ebe31c7f7ec6fd83' into quix0r 13 years ago
stbrumm d18095dc48 Patch fuer Issue 0000102 13 years ago
stbrumm 9f1b1b4604 Type for Robinson-Mode/Private Perr added 13 years ago
Roland 'Quix0r' Haeder fa08ed5ae5 Fixed a lot CHMOD rights (no need for execute flag on *.java/*.html) and introduced local/remote crawl size ratio based check 13 years ago
Michael Christen 85bd4cc8bc better lookup for peer names 13 years ago
Michael Christen 20e3084bd4 redesign of fining of peers by ip: more leightweight method to read the 13 years ago
Michael Christen 0797b0de99 new handling of remote search processes: looking for seeds will now not 13 years ago
Michael Christen 9e5894c784 Removed handling of components objects for URIMetadataRows. 13 years ago
Michael Christen c04bfaa51b refactoring 13 years ago
Michael Christen 1f4afb4dc0 performance hacks 13 years ago
Michael Christen 675d557e88 removed debug logging 13 years ago
Michael Christen e9dc99fe15 added rules to set specific RWIs as private RWIs which are not 13 years ago
Michael Christen 044f83feed added some pauses into the search process which shall produce 13 years ago
Michael Christen f14faf503b better ranking because we wait a very little time during the search 13 years ago
Michael Christen e7e429705a - less automatic indexing after a search (needs to reset the default 13 years ago
admin 484c4ad339 Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
orbiter 402e9d71ef changed ording on release files: main criteria is not the svn any more; releases are now ordered by 13 years ago
admin 56ce8488e4 Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
orbiter 4b8ff84705 - search bugfixes (page counter and number of results per page; recognition of new search) 13 years ago
sixcooler aeeae75b8a the timeout of httpclient is not absolut, but till a connection is 13 years ago
hermens 2ac272cfbf Fix for PeerSelection.seedsByAge() for big networks (>1000 Peers) 13 years ago
orbiter 0796b54601 - some speed hacks for network image 13 years ago
orbiter f9216e388c - faster ping to clean up old peers faster 13 years ago
orbiter 35a9e8f307 - fixed network graphic 13 years ago
orbiter 550c881d80 remove more news (all older than one day) because they can be a performance problem if we have too many peers sending news 13 years ago
orbiter ebd840ebf6 - enhanced description on search front page 13 years ago
orbiter 5a55397f99 some last-minute performance hacks 13 years ago
orbiter c9216d5adf fixed secondary remote search (the process that finds distributed join situations) 13 years ago
orbiter c9a0dbd25a added a security check 13 years ago
orbiter 1120f0c93c update to network graphics: slightly less crawling activity, slightly stronger color for query activity 13 years ago
orbiter 8e0b2c5832 fixed cluster search 13 years ago
orbiter e58438c01c - added a new retry connector for solr (for cases where solr responses are slow) 14 years ago
orbiter c31564ef08 stability bugfixes 14 years ago
orbiter 279482a76d fix for npe 14 years ago
hermens d3df03838a make sure myself-target is always inserted at its appropriate position 14 years ago
hermens c3e7efa846 added sender side prevention of rwi flooding as mentioned in SVN 7993 14 years ago
orbiter a7df70221e refactoring 14 years ago
orbiter 813f297a95 another performance hack: re-use of known host addresses for isLocal property; avoids look-up in local hash 14 years ago
orbiter 035ebfbf3b - performance hacks (should affect the crawl balancer and reduce CPU load during crawl stack re-fill) 14 years ago
orbiter 57d5529a01 performance hacks 14 years ago
orbiter 2c3161b4ac refactoring: 14 years ago
orbiter d2ea250d99 refactoring: 14 years ago