Commit Graph

881 Commits (6b45cd579922574059e5385153b84be3ca07533b)

Author SHA1 Message Date
reger 706f75ddc2 try to fix hang on index blob merge on shutdown
10 years ago
Michael Peter Christen fd4e2c809a Show dates in the content of a document in the search result:
10 years ago
reger df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
10 years ago
Michael Peter Christen ac19690d30 refactoring with CommonPattern.COMMA
10 years ago
Michael Peter Christen 3d717b749a fix for urlmaskfilter
10 years ago
reger 24f68a4eb7 refactor opensearch heuristic
10 years ago
reger 8e751d754a - add javadoc to busythread with hint about the init parameter useage
10 years ago
Michael Peter Christen 3cd7deb3b8 do not flush non-errors to stdout because this is a concurrency issue.
10 years ago
reger 198102304b refactor size() -> filesize() of URIMetadataNode
10 years ago
reger c6f634a4f2 remove redundant caching of urlhash in URIMetadataNode
10 years ago
Michael Peter Christen 413eeefed4 added character set detection library from
10 years ago
Michael Peter Christen a304058840 added Image Events as another option to generate images with a mac if no
10 years ago
Michael Peter Christen 321840fde3 Replaced all fixed thread pools with cached thread pools. The cached
10 years ago
Michael Peter Christen 7bfab5eb9d set Busy- and Blocking-Threads to daemon mode (they will now not prevent
10 years ago
Michael Peter Christen ad0da5f246 added new web page snapshot infrastructure which will lead to the
10 years ago
Michael Peter Christen 4920ab7b76 optimize usage of size() cache
10 years ago
Michael Peter Christen 2beb6abeb6 disabled crazy sleep loop
10 years ago
Michael Peter Christen 8aee7f940e added missing class for latest changes
10 years ago
Michael Peter Christen 97039049e4 fix in key enumeration methods for cases where the enumeration is done
10 years ago
Michael Peter Christen 421ee64f33 another fix to ordering of table indexes; fixes also network stats
10 years ago
Michael Peter Christen 1db476c67e fix for bad table iteration
10 years ago
orbiter 0fcd8097a3 removed unused options from BusyThreads
10 years ago
sixcooler 72561926aa do not overwrite yacy.conf in case of an exception
10 years ago
Michael Peter Christen bc275dca07 added network history graph image /NetworkHistory.png which can show
10 years ago
Michael Peter Christen ee27be3399 misc bugfixes (concurrency, memory protection)
10 years ago
Michael Peter Christen 7817fc50c9 added a high cpu cycle monitor to PerformanceQueues
10 years ago
orbiter 3ac31614a3 added option to reverse-sort YaCy tables (internal API change only)
10 years ago
Michael Peter Christen ec6082c872 very bad language detection hack fix hack
10 years ago
Michael Peter Christen a7dd89c4de changed method to write the citation index: do not catch up references
10 years ago
reger ea6c9e9b07 reduce mem buffer overhead for gap files during r/w
10 years ago
orbiter 487021fb0a snippet computation update
10 years ago
Michael Peter Christen 0ceeceb35e more logic on Solr queries; usage of the query terms in posprocessing,
10 years ago
reger 2b8cc5832c fix seek error for 0 file size records file
11 years ago
reger 2ba394333f fix Crawler HostQueue release of stackfile
11 years ago
Michael Peter Christen 501d55cd35 removed superfluous assert
11 years ago
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
Michael Peter Christen 6634b5b737 debug code for index distribution testing
11 years ago
orbiter 97983ba89f fixed generics warnings for generic array instantiation that appeared
11 years ago
orbiter 88f4af90da removed warnings
11 years ago
orbiter 89f76da24b Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
sixcooler b8cee9b7d8 remove tables from tabletracker on close to avoid lots of dead entrys in
11 years ago
orbiter f15c832587 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger ffc5b75c73 optimize and fix lat / lon assignment
11 years ago
reger 9313447de2 reimplement tighter lat/lon calc in URIMetadataNode
11 years ago
orbiter a3542f29b4 npe fix
11 years ago
orbiter c48d2a2a02 npe fix
11 years ago
orbiter 12ba890205 removed warnings
11 years ago
reger 727dfb5875 refactore URIMetadataNode to further unify interaction with index
11 years ago
Michael Peter Christen 1aea01fe5b fix for Table in case that requested file does not exist and paths also
11 years ago
Michael Peter Christen da86f150ab - added a new Crawler Balancer: HostBalancer and HostQueues:
11 years ago
Michael Peter Christen 17e0956312 refactoring of SystemLoad calls (only one backend tool)
11 years ago
reger 227c42bc96 eleminate obsolete URIMetaDataRow class
11 years ago
Michael Peter Christen 62a36fa584 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger c9f92abddc fix: application link count
11 years ago
Michael Peter Christen 5b83887da8 npe fix
11 years ago
Michael Peter Christen 56710ecb26 prevent opening of new files as that could be a cause for the latest
11 years ago
Michael Peter Christen 8b44fcf0f4 added missing @Override annotation
11 years ago
Michael Peter Christen 1a764135be one more Thread Dump fix for new bootstrap css style
11 years ago
Michael Peter Christen bb21d825f9 fix for thread dump line spacing
11 years ago
Michael Peter Christen 5f4a6892c1 enhanced RowSet re-sort limit for small sets
11 years ago
Michael Peter Christen 6ed9c0164e attaching names to all Threads to get a better view in profiling tools
11 years ago
Michael Peter Christen fdaeac374a - enhanced postprocessing speed and memory footprint (by using HashMaps
11 years ago
Michael Peter Christen 9eb668e951 enhanced the resource observer
11 years ago
Michael Peter Christen fbee98c06f fixed shortcut self-reference bug
11 years ago
Michael Peter Christen acc8d7faa7 fixed setting of shortMemoryStatus in MemoryControl
11 years ago
Michael Peter Christen 94245ce0a8 fixed "Size in KBytes" calculation in PerformanceQueues_p.html,
11 years ago
Michael Peter Christen ebfaf753b7 - faster initialization of index files
11 years ago
reger a3e2cca8e9 improve isOlder check to not overwrite node index with metadata on equal load date
11 years ago
orbiter c351e47a84 fix for bad-formatted lonlat
11 years ago
Michael Peter Christen c87cdfca2e do not set a load prerequisite that prevents the start of one-time-jobs
11 years ago
Michael Peter Christen 6ada0daae9 making latency_factor and maximum number of same hosts in loader queue
11 years ago
sixcooler 40a4030b55 configurable max-load values for YaCy-Threads:
11 years ago
Michael Peter Christen 1ea17bd9f3 - removed old metadata database and all migration code
11 years ago
Michael Peter Christen 25a6c05008 experimental removal of synchronization. This should work for all cases
11 years ago
Michael Peter Christen 5695280edd removed superfluous synchronization
11 years ago
Michael Peter Christen a1977b7a75 removed debug code
11 years ago
Michael Peter Christen ec10ed45bd better logging in logger
11 years ago
Michael Peter Christen c3dcbdc8d5 try to recover from an OOM during citation index reading and fail-over
11 years ago
Michael Peter Christen 2c39b65409 fixes for searches containing stopwords. The fix was done using a
11 years ago
Michael Peter Christen 191fd3d7e7 added an optimization option to HandleSet mass data storage structure
11 years ago
Michael Peter Christen 1a4a69c226 set more logger to 'final static'
11 years ago
orbiter 3c3cb78555 - removed a lot of garbage and bloated code from GuiHandler.
11 years ago
Michael Peter Christen 5afa6e3aee Automatically flush the log cache if a short memory status is reached.
11 years ago
Michael Peter Christen 6aabc4e5c8 reduced logging line memory, 10000 lines had filled up 450MB! grrr.
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which ocurred when a document was not recognized with
11 years ago
Michael Peter Christen 7b69c438f7 more methods for the table class
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen e8e558a9b7 fix for content domain classification in URIMetadataNode
11 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
Michael Peter Christen 0f3d8890db removed an assert which causes a shortcut call circuit
11 years ago
Michael Peter Christen 47b1c81d08 - refactoring
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
Roland Haeder 13433d41a1 Log this exception better
11 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
11 years ago
Michael Peter Christen 336f86394c replaced StringBuffer with StringBuilder
11 years ago
Michael Peter Christen aeac2fb763 replaced more containsKey() -> get() usages by a simple get(), followed
11 years ago
Michael Peter Christen 735a66eff3 enhancements to crawler
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5c6946dd5f replaced usage of log4j by ConcurrentLog where possible
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrival (Double.NaN check)
12 years ago
Michael Peter Christen 14186e815e npe fix
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen e20450e798 patch in HTCache and CitationIndex loading in case that a file is
12 years ago
reger 7480e87386 - fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247
12 years ago
Michael Peter Christen a1644ca0fd new workflow processor in Segment to enqueue indexing documents to solr
12 years ago
Michael Peter Christen 5344a1c5f7 getting the trash out
12 years ago
orbiter 888a985dc6 set a higher limit for table copy usage
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen 44e363f37f refactoring of WorkflowProcessor, added process counter, update of
12 years ago
orbiter aeff31cd44 fix for workflow processor (cause: latest redesign for less threads)
12 years ago
orbiter a1c989002b fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4652
12 years ago
orbiter 7de5b9cfa0 fix for http://bugs.yacy.net/view.php?id=233
12 years ago
Michael Peter Christen bb4bf3d8fd infinity timeout bug protection patch
12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more
12 years ago
Michael Peter Christen c1a2175fbc added transparency to gif image animation and the integration to the
12 years ago
Michael Peter Christen ada3f27de7 added three new field for a better ranking: references_internal_i,
12 years ago
Michael Peter Christen 342ba1049b - callback fix
12 years ago
orbiter 47114910d5 fix for possible memory leaks
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 2b6c79d347 in method exists() also use the new caching-stacks for
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrently. This prevents
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 35fa718b77 testing to use solr for portalsearch caused some bugfixing but no full
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Marc Nause 75f9568472 *) only install files from the RELEASE directory
12 years ago
Marc Nause 3bc5ee6e3d *) added protection against CSRF in update download page
12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administraton -> Federated Solr Index)
12 years ago
Michael Peter Christen 38d3feae65 added separate delete commands for the local+remote solr index, the old
12 years ago
Michael Peter Christen 0f5b6f38c1 enhanced root-url detection
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago
reger 276e63401e small sanitary fixes
12 years ago
Michael Peter Christen 24c9bb35f7 extended the Scheduler: introduced scheduled events
12 years ago
reger ad71747525 fix: set defaul language to "en"
12 years ago
orbiter 712cc37c40 if maxFileSize < 0 then the file size limit is without limit.
12 years ago
Michael Peter Christen 8fc3679c66 using more pre-compile pattern for split methods
12 years ago
Michael Peter Christen 5e182a566f - added another enumeration method in kelondro data structure to get a
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
Michael Peter Christen f5ca5cea44 - added field options to all solr queries. This can be used to restrict
12 years ago
Michael Peter Christen 832eead998 Merge remote-tracking branch 'regerdev/master'
12 years ago
Michael Peter Christen 570e42c4e3 fix for filetype naviagtor
12 years ago
reger 633fbe9188 Fix Metadata handling
12 years ago
Michael Peter Christen c5f67a5d6d fixed a problem with local search from solr results: now all results
12 years ago
Michael Peter Christen f8f05ecba7 - added a delete button in host browser to delete a complete subpath
12 years ago
Michael Peter Christen a33e2742cb - removed unnecessary synchronized and deadlock in crawler
12 years ago