Commit Graph

749 Commits (504327b15c142a88b12016fb8ee75144b822a1f3)

Author SHA1 Message Date
Michael Peter Christen 5afa6e3aee Automatically flush the log cache if a short memory status is reached.
11 years ago
Michael Peter Christen 6aabc4e5c8 reduced logging line memory, 10000 lines had filled up 450MB! grrr.
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which occurred when a document was not recognized with
11 years ago
Michael Peter Christen 7b69c438f7 more methods for the table class
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen e8e558a9b7 fix for content domain classification in URIMetadataNode
11 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
Michael Peter Christen 0f3d8890db removed an assert which causes a shortcut call circuit
11 years ago
Michael Peter Christen 47b1c81d08 - refactoring
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
Roland Haeder 13433d41a1 Log this exception better
11 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
11 years ago
Michael Peter Christen 336f86394c replaced StringBuffer with StringBuilder
11 years ago
Michael Peter Christen aeac2fb763 replaced more containsKey() -> get() usages by a simple get(), followed
11 years ago
Michael Peter Christen 735a66eff3 enhancements to crawler
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5c6946dd5f replaced usage of log4j by ConcurrentLog where possible
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrieval (Double.NaN check)
12 years ago
Michael Peter Christen 14186e815e npe fix
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen e20450e798 patch in HTCache and CitationIndex loading in case that a file is
12 years ago
reger 7480e87386 - fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247
12 years ago
Michael Peter Christen a1644ca0fd new workflow processor in Segment to enqueue indexing documents to solr
12 years ago
Michael Peter Christen 5344a1c5f7 getting the trash out
12 years ago
orbiter 888a985dc6 set a higher limit for table copy usage
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen 44e363f37f refactoring of WorkflowProcessor, added process counter, update of
12 years ago
orbiter aeff31cd44 fix for workflow processor (cause: latest redesign for less threads)
12 years ago
orbiter a1c989002b fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4652
12 years ago
orbiter 7de5b9cfa0 fix for http://bugs.yacy.net/view.php?id=233
12 years ago
Michael Peter Christen bb4bf3d8fd infinity timeout bug protection patch
12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more
12 years ago
Michael Peter Christen c1a2175fbc added transparency to gif image animation and the integration to the
12 years ago
Michael Peter Christen ada3f27de7 added three new field for a better ranking: references_internal_i,
12 years ago
Michael Peter Christen 342ba1049b - callback fix
12 years ago
orbiter 47114910d5 fix for possible memory leaks
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 2b6c79d347 in method exists() also use the new caching-stacks for
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrently. This prevents
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 35fa718b77 testing to use solr for portalsearch caused some bugfixing but no full
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Marc Nause 75f9568472 *) only install files from the RELEASE directory
12 years ago
Marc Nause 3bc5ee6e3d *) added protection against CSRF in update download page
12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administration -> Federated Solr Index)
12 years ago
Michael Peter Christen 38d3feae65 added separate delete commands for the local+remote solr index, the old
12 years ago
Michael Peter Christen 0f5b6f38c1 enhanced root-url detection
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago
reger 276e63401e small sanitary fixes
12 years ago
Michael Peter Christen 24c9bb35f7 extended the Scheduler: introduced scheduled events
12 years ago
reger ad71747525 fix: set default language to "en"
12 years ago
orbiter 712cc37c40 if maxFileSize < 0 then the file size limit is without limit.
12 years ago
Michael Peter Christen 8fc3679c66 using more pre-compile pattern for split methods
12 years ago
Michael Peter Christen 5e182a566f - added another enumeration method in kelondro data structure to get a
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
Michael Peter Christen f5ca5cea44 - added field options to all solr queries. This can be used to restrict
12 years ago
Michael Peter Christen 832eead998 Merge remote-tracking branch 'regerdev/master'
12 years ago
Michael Peter Christen 570e42c4e3 fix for filetype navigator
12 years ago
reger 633fbe9188 Fix Metadata handling
12 years ago
Michael Peter Christen c5f67a5d6d fixed a problem with local search from solr results: now all results
12 years ago
Michael Peter Christen f8f05ecba7 - added a delete button in host browser to delete a complete subpath
12 years ago
Michael Peter Christen a33e2742cb - removed unnecessary synchronized and deadlock in crawler
12 years ago
orbiter 354f0d9acd moved static method from ClusteredScoreMap to MapDataMining because it
12 years ago
Michael Peter Christen 1baf498d59 - show more lines in online log
12 years ago
Michael Peter Christen f2d0418218 because the new PngEncoder had a problem with the PixelGrabber which is
12 years ago
orbiter 276dd6452b removed warnings
12 years ago
Michael Peter Christen ce0e5b1e17 - more refactoring / private methods
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen b400fc7b4d fix for file parser problem
12 years ago
Michael Peter Christen e5b3c172ff removed hack which translated Solr documents to virtual RWI entries
12 years ago
Michael Peter Christen 6017691522 added an exception catch
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects
12 years ago
Michael Peter Christen 613cf7da7f enhancement to post argument parsing - possible fix to zero-filled
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of &amp; parts inside of the
12 years ago
Michael Peter Christen a06930662c replaced some more .getBytes() with UTF8/ASCII.getBytes()
12 years ago
Michael Peter Christen 2f536cb54d code cleanup: removed unused methods and made more methods and objects
12 years ago
Michael Peter Christen 584663ae8c - redesign of solr query construction
12 years ago
Michael Peter Christen a8167e6e5b clean-up: removed unused methods in kelondro
12 years ago
Michael Peter Christen 24d2ee3c52 - better date ranking
12 years ago
Michael Peter Christen ca313e404f - if a "/date" modifier is used, the solr remote query applies an
12 years ago
Michael Peter Christen 24f4ca4d85 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
apfelmaennchen 116f429e35 fix for java.lang.RuntimeException: TableColumnIndex not available...
12 years ago
Michael Peter Christen 1533bfd63b refactoring
12 years ago
Michael Peter Christen 872f83ebe0 refactoring
12 years ago
Michael Peter Christen 8219a445f3 refactoring
12 years ago
Michael Peter Christen 00c1c777fa refactoring
12 years ago
orbiter 563d584420 removed more dependencies in cora from kelondro
12 years ago
Michael Peter Christen e072632a54 no complaints about memory if the database is empty
12 years ago
Michael Peter Christen e65cecc419 - updated lucene libraries to 3.6.1
12 years ago
Michael Peter Christen 4d29f59a27 removed warnings
12 years ago
Michael Peter Christen 8c099d2106 Merge remote-tracking branch 'origin/master'
12 years ago
apfelmaennchen d31a632951 - added dmoz RDF dump importer
12 years ago
Michael Peter Christen d8425e6809 added collections to crawl monitor
12 years ago