Commit Graph

920 Commits (6042dd99c6094eea2d319fd87d95856bf1df759f)

Author SHA1 Message Date
Michael Peter Christen 32aa1d4569 removed unused option for queries
12 years ago
Michael Peter Christen 0c1a018bbd removed 'later' tactic because it used too much RAM, reduced number of
12 years ago
orbiter da621e827e prevent NPE in case RWI is disabled
12 years ago
reger c03f75ebc3 fix DHT url receive see http://bugs.yacy.net/view.php?id=242
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen f7f3e28c5e prevent that the size of the index is computed too many times.
12 years ago
reger 566a3b0294 fix: Index Administration > Reverse Word Index (IndexControlRWIs_p) corrected use of word search to word-hash search
12 years ago
Michael Peter Christen 9406a2e438 fixed NPE during index abstract computation
12 years ago
Michael Peter Christen d725782440 turned severe message to warning message about network failure events
12 years ago
Michael Peter Christen 2d472a39f4 DHT-transferred metadata and crawl receipts now also use the delayed
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen c34af7fe94 extended JSON Response Writer and Opensearch Response Writer for the
12 years ago
Michael Peter Christen 4faa07c214 added a timeout for topic computation (solr is here much slower than the
12 years ago
Michael Peter Christen d2d5be032d added a 'inlink' search option according to the suggestion in the YaCy
12 years ago
reger f143804382 fix configuration for search page navigators
12 years ago
orbiter fe50702eb0 added a filterscannerfail attribute to QueryParams which causes that a
12 years ago
Michael Peter Christen 433143ba40 removed protocol, tld, ext from the urlmask and created specific
12 years ago
Michael Peter Christen 01200f06cc using the author field as solr-native facet. this makes it necessary to
12 years ago
Michael Peter Christen 9319b90d8a - fixes for host navigation
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
orbiter 5dfd6359cb redesign of the QueryParams class: introduced QueryGoal which holds the
12 years ago
Michael Peter Christen 2371ef031c added solr faceted search support to YaCy search results
12 years ago
Michael Peter Christen 8fb370d9f8 renovated the way how search results are count. should be correct now...
12 years ago
Michael Peter Christen 1168d09de8 more refactoring - integrated the code of SnippetProcess into
12 years ago
Michael Peter Christen 6629e37685 tried to clean up the search process mess
12 years ago
Michael Peter Christen c5f67a5d6d fixed a problem with local search from solr results: now all results
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen e5b3c172ff removed hack which translated Solr documents to virtual RWI entries
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the
12 years ago
Michael Peter Christen a06930662c replaced some more .getBytes() with UTF8/ASCII.getBytes()
12 years ago
Michael Peter Christen 2f536cb54d code cleanup: removed unised methods and made more methods and objects
12 years ago
Michael Peter Christen 1533bfd63b refactoring
12 years ago
Michael Peter Christen e49359cc95 removed tenant query attribute since it is not used any more and is
12 years ago
Michael Peter Christen e57bf2ca39 simplified DHT classes
12 years ago
Michael Peter Christen 8219a445f3 refactoring
12 years ago
Michael Peter Christen 00c1c777fa refactoring
12 years ago
Michael Peter Christen f75b3f8a47 added more patches to work without RWI data structure
12 years ago
Michael Peter Christen 31d4d38804 - extended the solr interface by a references-by-word-count method
12 years ago
Michael Peter Christen a06123aec6 more abstraction and less parameter overhead for remote search
12 years ago
orbiter 6f01542aaa explicit double-check in transferURL
12 years ago
Michael Peter Christen 0cab06c47c refactoring
12 years ago
Michael Peter Christen 18f989dfb1 - refactoring (load -> getMetadata)
12 years ago
Michael Peter Christen 6197caf698 added clear-text search words in query params
12 years ago
Michael Peter Christen 597bb76e4f get the peer location more quickly
12 years ago
orbiter 9b88433f45 patch from hint in
12 years ago
orbiter e816b88b55 changed behaviour of metadata storage: in case that any solr is
12 years ago
Michael Peter Christen f9c0e6e950 - Implemented and integrated the URIMetadataNode object which is a
12 years ago
orbiter 67edfd991c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All
12 years ago
Michael Peter Christen 1687737771 Abstraction of HandleMap and HandleSet
12 years ago
orbiter 69e743d9e3 - more abstraction for the RWI index as preparation for solr integration
12 years ago
orbiter 0cbda0b2b8 - replaced all length() == 0 and size() == 0 with isEmpty()
13 years ago
orbiter 62202e2d71 refactoring of query attribute variable names for better consistency
13 years ago
Michael Peter Christen 0301aba1e9 removed unused method parameters
13 years ago
Michael Peter Christen 241dd8410a removed snippet pattern filter - it was not used
13 years ago
Michael Peter Christen d3964253ae - added @SuppressWarnings to unused servlet method parameters
13 years ago
Michael Peter Christen 03280fb161 removed segments-concept and the Segments class:
13 years ago
Michael Peter Christen b9d42fd9c8 using com.google.common.io.Files instead of homebrew methods
13 years ago
Michael Peter Christen 8b53771db2 changed behavior of navigation processing:
13 years ago
Roland 'Quix0r' Haeder edaa09b9b1 Rewrote all String blacklist types to enum 'BlacklistType', closes bug
13 years ago
Michael Peter Christen 2fe207f813 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 5aee19daa4 added show from cache in search results (not yet finished)
13 years ago
Michael Peter Christen e0d8643226 - performance hacks
13 years ago
Michael Peter Christen 9b4c699526 ehanced location search:
13 years ago
Michael Peter Christen 71c3163f3d - fixes to node identification
13 years ago
Michael Peter Christen 7bf421b9dd - fixed image search page navigation
13 years ago
Michael Peter Christen ba6aaabc51 refactoring + parser bugfixes
13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not
13 years ago
Michael Peter Christen 14f67f217c refactoring of ContentDomain: now subclass of Classification
13 years ago
Michael Peter Christen a5d7da68a0 refactoring: removed dependency from switchboard in Balancer/CrawlQueues
13 years ago
Michael Peter Christen a9b4d49b75 removed debug output
13 years ago
Michael Peter Christen 9ad1d8dde2 complete redesign of crawl queue monitoring: do not look at a
13 years ago
Michael Peter Christen b4bc1e2875 remote search does not do snippet generation
13 years ago
Michael Peter Christen 83009d86f7 added the vocabulary navigator. It can be very simply tested by
13 years ago
Michael Christen 20e3084bd4 redesign of fining of peers by ip: more leightweight method to read the
13 years ago
Michael Christen 9e5894c784 Removed handling of components objects for URIMetadataRows.
13 years ago
Michael Christen c04bfaa51b refactoring
13 years ago
Michael Peter Christen 0bcef2d156 added feature as requested in
13 years ago
orbiter ebd840ebf6 - enhanced description on search front page
13 years ago
orbiter 5a55397f99 some last-minute performance hacks
13 years ago
orbiter c9216d5adf fixed secondary remote search (the process that finds distributed join situations)
13 years ago
orbiter 507c9d478d much better timing when search globally; less blocking; more results earlier!
13 years ago
orbiter 8e0b2c5832 fixed cluster search
13 years ago
orbiter 368b51ed5b argh.. fixed bad SVN 8080
13 years ago
orbiter eb4436defb removed limitation to cluster peers if peer is asked remotely. This enables single-linked clusters which naturaly is there first if a new cluster is created
13 years ago
apfelmaennchen 5581be12fb YMarks:
13 years ago
orbiter 84c3fc9d97 local/global fixes in search, better abstraction
13 years ago
orbiter 5a7cec59f3 moved ynetSearch to get all files out of htroot/api/util/
13 years ago
orbiter f8b8c82421 - refactoring of getpageinfo_p.xml (moved out of util)
13 years ago
apfelmaennchen 4d7ae76017 - update to jquery 1.7 (does not apply to all jquery code, old version is additionally kept for compatibility)
13 years ago
orbiter 9e4875230f performance hacks
13 years ago
orbiter 204e98db3a added a protection against rwi flooding
13 years ago
orbiter a7df70221e refactoring
13 years ago
orbiter 813f297a95 another performance hack: re-use of known host addresses for isLocal property; avoids look-up in local hash
13 years ago
orbiter d2ea250d99 refactoring:
13 years ago
orbiter 734059d33e performance hacks
13 years ago
orbiter 22d69a6368 refactoring in cora: added sorting package
13 years ago
orbiter 51cf697acd refactoring: moved all score-related classes to new ranking package
13 years ago
orbiter 11dc653de3 added a visualization of peer pings to the performance graphic
14 years ago
orbiter 115abc8917 - more attributes for search progress bar
14 years ago
orbiter 4bea3f9714 hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources:
14 years ago
orbiter e28bd0d038 fix for some possible causes of memory leaks
14 years ago
orbiter 10e2f588f8 - enhanced ybr ranking computation
14 years ago
orbiter b45701d20f this is a re-implementation of the YaCy Block Rank feature
14 years ago
orbiter 123375bfba added a new yacy protocol servlet 'idx'. This returns an index to one of the data entities that is stored in YaCy.
14 years ago
orbiter 5b579e21a3 code cleanup
14 years ago
orbiter 0621a15f89 fix for wrong search result counter: added a counter for all filtered out entities
14 years ago
orbiter deda54d684 - relaxed matching of string-search (this is now case-insensitive)
14 years ago
orbiter 6e42d4de88 - added full-String search function: find things that match exactly what is quoted in the query
14 years ago
apfelmaennchen 8b8db2aaba YMarks: some small changes/fixes
14 years ago
orbiter 6fa439c82b - refactoring of robots
14 years ago
apfelmaennchen e7c2ea193b YMark:
14 years ago
orbiter 3b578a28ef some patches to prevent that empty or bad IP information is broadcasted
14 years ago
orbiter 8b95a26866 better magic
14 years ago
orbiter 2700a58e5a added a magic to the peer ping that will be used in case that the contacting peer requests that it's reported IP shall be used for a back-ping. The back-ping now also returns the same magic which will make it possible that the requested peer can verify that the back-pinged peer is actually the same peer.
14 years ago
apfelmaennchen b2281f0b7d YMark: intermediate work towards flexigrid support
14 years ago
lotus 06afa94f9d hups
14 years ago
lotus a9a9db98c8 better rename modified version
14 years ago
lotus e19ca27004 do not autocomplete on mouseover. this has resulted in unwanted autocomplete.
14 years ago
low012 2861d0888a *) simplified code\n*) fixed potential NumberFormatExceptions
14 years ago
orbiter 8f11d3a5bb redesigned the ScoreMap classes:
14 years ago
orbiter 694fa3a2a5 - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion
14 years ago
orbiter cb1f49d0f2 replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'.
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
orbiter 5e186e0122 continuing the fight against deadlocks during time formatting: better caching.
14 years ago
orbiter 4473cf8c61 replaced utf-8 with UTF-8
14 years ago
orbiter 5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a muti-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased.
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 28f669bf0b - fixed/enhanced move to SD/16:9 images (network, web structure)
14 years ago
orbiter efb4ca8fa8 modified auto-delete of search failure-words:
14 years ago
orbiter 6c1b14c8e1 - more control in access tracker: count number of returned search results (not only info how much is in the index)
14 years ago
orbiter 10ae8d961b - cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring)
14 years ago
orbiter a4c9d27287 - moved some variables from Stwitchboard to new class AccessTracker
14 years ago
orbiter a563b05b60 enhanced crawler:
14 years ago
orbiter db99db4be9 some redesign of the search-fail-response mechanism:
14 years ago
orbiter 18d33b5c6d fixed several search result navigation bugs
14 years ago
orbiter 49b5a206cd - better caclculation of search result size
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
low012 e7552bd719 *) cleaning up the code a little bit
14 years ago
low012 38fdf43587 *) renamed classes according to standard Java coding conventions
14 years ago
apfelmaennchen 808edffaf6 ymarks
14 years ago
apfelmaennchen 43586a2ace a update to ymarks (please test if you wish):
14 years ago
orbiter 25a8e55bc9 more logging about bad seeds
14 years ago
orbiter 959b8c6fa0 - allow greater seed size
14 years ago
apfelmaennchen 7adfe4a1c1 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=35#p21092
14 years ago
orbiter 863065abc4 added user agent logging to access tracker
14 years ago