Commit Graph

695 Commits (6f9ed439d3c34cc61dd4b7b2b46015018e353acf)

Author SHA1 Message Date
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen e8e558a9b7 fix for content domain classification in URIMetadataNode
11 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
Michael Peter Christen 0f3d8890db removed an assert which causes a shortcut call circuit
11 years ago
Michael Peter Christen 47b1c81d08 - refactoring
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
Roland Haeder 13433d41a1 Log this exception better
11 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
11 years ago
Michael Peter Christen 336f86394c replaced StringBuffer with StringBuilder
11 years ago
Michael Peter Christen aeac2fb763 replaced more containsKey() -> get() usages by a simple get(), followed
11 years ago
Michael Peter Christen 735a66eff3 enhancements to crawler
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5c6946dd5f replaced usage of log4j by ConcurrentLog where possible
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrieval (Double.NaN check)
12 years ago
Michael Peter Christen 14186e815e npe fix
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen e20450e798 patch in HTCache and CitationIndex loading in case that a file is
12 years ago
reger 7480e87386 - fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247
12 years ago
Michael Peter Christen a1644ca0fd new workflow processor in Segment to enqueue indexing documents to solr
12 years ago
Michael Peter Christen 5344a1c5f7 getting the trash out
12 years ago
orbiter 888a985dc6 set a higher limit for table copy usage
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen 44e363f37f refactoring of WorkflowProcessor, added process counter, update of
12 years ago
orbiter aeff31cd44 fix for workflow processor (cause: latest redesign for less threads)
12 years ago
orbiter a1c989002b fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4652
12 years ago
orbiter 7de5b9cfa0 fix for http://bugs.yacy.net/view.php?id=233
12 years ago
Michael Peter Christen bb4bf3d8fd infinity timeout bug protection patch
12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more
12 years ago
Michael Peter Christen c1a2175fbc added transparency to gif image animation and the integration to the
12 years ago
Michael Peter Christen ada3f27de7 added three new fields for a better ranking: references_internal_i,
12 years ago
Michael Peter Christen 342ba1049b - callback fix
12 years ago
orbiter 47114910d5 fix for possible memory leaks
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 2b6c79d347 in method exists() also use the new caching-stacks for
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search results concurrent. This prevents
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 35fa718b77 testing to use solr for portalsearch caused some bugfixing but no full
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr fields in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Marc Nause 75f9568472 *) only install files from the RELEASE directory
12 years ago
Marc Nause 3bc5ee6e3d *) added protection against CSRF in update download page
12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administration -> Federated Solr Index)
12 years ago
Michael Peter Christen 38d3feae65 added separate delete commands for the local+remote solr index, the old
12 years ago
Michael Peter Christen 0f5b6f38c1 enhanced root-url detection
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago