802 Commits (5445f38070af55ed56a5c826e20838925cfc2519)

Author SHA1 Message Date
reger ea6c9e9b07 reduce mem buffer overhead for gap files during r/w
10 years ago
orbiter 487021fb0a snippet computation update
10 years ago
Michael Peter Christen 0ceeceb35e more logic on Solr queries; usage of the query terms in postprocessing,
10 years ago
reger 2b8cc5832c fix seek error for 0 file size records file
11 years ago
reger 2ba394333f fix Crawler HostQueue release of stackfile
11 years ago
Michael Peter Christen 501d55cd35 removed superfluous assert
11 years ago
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
Michael Peter Christen 6634b5b737 debug code for index distribution testing
11 years ago
orbiter 97983ba89f fixed generics warnings for generic array instantiation that appeared
11 years ago
orbiter 88f4af90da removed warnings
11 years ago
orbiter 89f76da24b Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
sixcooler b8cee9b7d8 remove tables from tabletracker on close to avoid lots of dead entries in
11 years ago
orbiter f15c832587 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger ffc5b75c73 optimize and fix lat / lon assignment
11 years ago
reger 9313447de2 reimplement tighter lat/lon calc in URIMetadataNode
11 years ago
orbiter a3542f29b4 npe fix
11 years ago
orbiter c48d2a2a02 npe fix
11 years ago
orbiter 12ba890205 removed warnings
11 years ago
reger 727dfb5875 refactor URIMetadataNode to further unify interaction with index
11 years ago
Michael Peter Christen 1aea01fe5b fix for Table in case that requested file does not exist and paths also
11 years ago
Michael Peter Christen da86f150ab - added a new Crawler Balancer: HostBalancer and HostQueues:
11 years ago
Michael Peter Christen 17e0956312 refactoring of SystemLoad calls (only one backend tool)
11 years ago
reger 227c42bc96 eliminate obsolete URIMetaDataRow class
11 years ago
Michael Peter Christen 62a36fa584 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger c9f92abddc fix: application link count
11 years ago
Michael Peter Christen 5b83887da8 npe fix
11 years ago
Michael Peter Christen 56710ecb26 prevent opening of new files as that could be a cause for the latest
11 years ago
Michael Peter Christen 8b44fcf0f4 added missing @Override annotation
11 years ago
Michael Peter Christen 1a764135be one more Thread Dump fix for new bootstrap css style
11 years ago
Michael Peter Christen bb21d825f9 fix for thread dump line spacing
11 years ago
Michael Peter Christen 5f4a6892c1 enhanced RowSet re-sort limit for small sets
11 years ago
Michael Peter Christen 6ed9c0164e attaching names to all Threads to get a better view in profiling tools
11 years ago
Michael Peter Christen fdaeac374a - enhanced postprocessing speed and memory footprint (by using HashMaps
11 years ago
Michael Peter Christen 9eb668e951 enhanced the resource observer
11 years ago
Michael Peter Christen fbee98c06f fixed shortcut self-reference bug
11 years ago
Michael Peter Christen acc8d7faa7 fixed setting of shortMemoryStatus in MemoryControl
11 years ago
Michael Peter Christen 94245ce0a8 fixed "Size in KBytes" calculation in PerformanceQueues_p.html,
11 years ago
Michael Peter Christen ebfaf753b7 - faster initialization of index files
11 years ago
reger a3e2cca8e9 improve isOlder check to not overwrite node index with metadata on equal load date
11 years ago
orbiter c351e47a84 fix for bad-formatted lonlat
11 years ago
Michael Peter Christen c87cdfca2e do not set a load prerequisite that prevents the start of one-time-jobs
11 years ago
Michael Peter Christen 6ada0daae9 making latency_factor and maximum number of same hosts in loader queue
11 years ago
sixcooler 40a4030b55 configurable max-load values for YaCy-Threads:
11 years ago
Michael Peter Christen 1ea17bd9f3 - removed old metadata database and all migration code
11 years ago
Michael Peter Christen 25a6c05008 experimental removal of synchronization. This should work for all cases
11 years ago
Michael Peter Christen 5695280edd removed superfluous synchronization
11 years ago
Michael Peter Christen a1977b7a75 removed debug code
11 years ago
Michael Peter Christen ec10ed45bd better logging in logger
11 years ago
Michael Peter Christen c3dcbdc8d5 try to recover from an OOM during citation index reading and fail-over
11 years ago
Michael Peter Christen 2c39b65409 fixes for searches containing stopwords. The fix was done using a
11 years ago
Michael Peter Christen 191fd3d7e7 added an optimization option to HandleSet mass data storage structure
11 years ago
Michael Peter Christen 1a4a69c226 set more loggers to 'final static'
11 years ago
orbiter 3c3cb78555 - removed a lot of garbage and bloated code from GuiHandler.
11 years ago
Michael Peter Christen 5afa6e3aee Automatically flush the log cache if a short memory status is reached.
11 years ago
Michael Peter Christen 6aabc4e5c8 reduced logging line memory, 10000 lines had filled up 450MB! grrr.
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which occurred when a document was not recognized with
11 years ago
Michael Peter Christen 7b69c438f7 more methods for the table class
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen e8e558a9b7 fix for content domain classification in URIMetadataNode
11 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
Michael Peter Christen 0f3d8890db removed an assert which causes a shortcut call circuit
11 years ago
Michael Peter Christen 47b1c81d08 - refactoring
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
Roland Haeder 13433d41a1 Log this exception better
11 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
11 years ago
Michael Peter Christen 336f86394c replaced StringBuffer with StringBuilder
11 years ago
Michael Peter Christen aeac2fb763 replaced more containsKey() -> get() usages by a simple get(), followed
11 years ago
Michael Peter Christen 735a66eff3 enhancements to crawler
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5c6946dd5f replaced usage of log4j by ConcurrentLog where possible
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrieval (Double.NaN check)
12 years ago
Michael Peter Christen 14186e815e npe fix
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen e20450e798 patch in HTCache and CitationIndex loading in case that a file is
12 years ago
reger 7480e87386 - fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247
12 years ago
Michael Peter Christen a1644ca0fd new workflow processor in Segment to enqueue indexing documents to solr
12 years ago
Michael Peter Christen 5344a1c5f7 getting the trash out
12 years ago
orbiter 888a985dc6 set a higher limit for table copy usage
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen 44e363f37f refactoring of WorkflowProcessor, added process counter, update of
12 years ago
orbiter aeff31cd44 fix for workflow processor (cause: latest redesign for less threads)
12 years ago
orbiter a1c989002b fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4652
12 years ago
orbiter 7de5b9cfa0 fix for http://bugs.yacy.net/view.php?id=233
12 years ago
Michael Peter Christen bb4bf3d8fd infinity timeout bug protection patch
12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more
12 years ago
Michael Peter Christen c1a2175fbc added transparency to gif image animation and the integration to the
12 years ago
Michael Peter Christen ada3f27de7 added three new fields for a better ranking: references_internal_i,
12 years ago
Michael Peter Christen 342ba1049b - callback fix
12 years ago
orbiter 47114910d5 fix for possible memory leaks
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 2b6c79d347 in method exists() also use the new caching-stacks for
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrent. This prevents
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 35fa718b77 testing to use solr for portalsearch caused some bugfixing but no full
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr fields in the core 'webgraph'.
12 years ago