Commit Graph

5942 Commits (342543a6c4c4a691b90f7bd15f2d41c64fe0a6f3)

Author SHA1 Message Date
cominch b5a8fb5fd8 Catch malformed URL when submitted in encoded style 13 years ago
cominch df47f31235 interaction: add special table interaction 13 years ago
cominch e14f2881ae interaction: add special table interaction 13 years ago
cominch d7326079a8 interaction: add global variable store 13 years ago
cominch 4e4e7a99f8 interaction: add global variable store 13 years ago
cominch 8e80894812 create virtual web folder /currentyacypeer/ which always points to local 13 years ago
cominch bde07ed7a8 Add tagging overlay element 13 years ago
cominch b0bc0b4572 Add new demonstration module for client-side key-value store (backend: 13 years ago
cominch c9dc6cda02 Demonstration: include value from interaction in search results 13 years ago
cominch ae8adb0e58 Small changes 13 years ago
cominch bcbd8eee33 Add several parsers, for RDFa and rdf files. 13 years ago
cominch 9ef5a80f4e add interaction for triples and selector for augmented browsing 13 years ago
cominch 282c1620d6 Allow TripleStore to be persistent after reboot 13 years ago
cominch 5d20cd324a Add Triplestore and RDF query interface 13 years ago
cominch bc9a618e0a augmented browsing: ignore js and css, integrate more user interaction 13 years ago
cominch b21048892b augmentedParser add features and integrate external html parser to 13 years ago
cominch 9cbfc1a1c0 augmentedProxy, which forwards every proxy request to a 13 years ago
Michael Peter Christen 3b992e6b00 using utf8 String compression in Webstructure database 13 years ago
Michael Peter Christen 26301a538d bugfix in Domains - dns-lookup 13 years ago
Michael Peter Christen cde20911bb saved a bit more ram using UTF8 String compression for OpenGeoDB and 13 years ago
Michael Peter Christen 225ee42879 made the GeoLocation into an interface with the current 13 years ago
Michael Peter Christen 2280a7b276 - changed initialization order to prefer allocation of memory for table 13 years ago
Michael Peter Christen 0746308bc2 only the metadata tables shall be able to use the tail cache 13 years ago
Michael Peter Christen 7ec9bef0c3 fix for OOM 13 years ago
Michael Peter Christen 41c02cb10e - less restrictions for usage of Table RAM copy 13 years ago
Michael Peter Christen b8f56a9803 npe bugfix 13 years ago
Michael Peter Christen dd14b19c26 lazy initialization of block rank table ... only normal web search uses 13 years ago
Michael Peter Christen ba10caf89a lazy initialization of database tables 13 years ago
Michael Peter Christen 701b9a28a0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen ab7107b34b fixed RWIProcess queue limits: now discovering hidden results for mass 13 years ago
Michael Peter Christen 10c9c17d51 fixed handlemap spread factor and null iterator handling 13 years ago
Michael Peter Christen b0095c8d3c flush the compressor cache when a cleanup is done 13 years ago
Michael Peter Christen a61f44f9e4 lazy initialization of block rank table. 13 years ago
Michael Peter Christen 96e9d77270 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 00f2df1120 a variety of possible memory leak fixes 13 years ago
Michael Peter Christen 3dd8376825 added automatic cleaning of cache if metadata and file database size is 13 years ago
Michael Peter Christen d0ec8018f5 fixes for bad long computation 13 years ago
Michael Peter Christen 6bb07afcc3 accept also files with other file prefix; used to read 'foreign' cache 13 years ago
Michael Peter Christen 96c8119b50 added GeoLocation / GeoPoint classes which uses less memory than 13 years ago
Michael Peter Christen 461a0ce052 removed warnings 13 years ago
Michael Peter Christen 62ae9bbfda allow more POIs, get more at once 13 years ago
Michael Peter Christen 407fdf6968 more bug fixes and performance hacks for search process 13 years ago
Michael Peter Christen a1fe65b115 performance hacks 13 years ago
Michael Peter Christen 2fe207f813 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 5aee19daa4 added show from cache in search results (not yet finished) 13 years ago
Michael Peter Christen 5e562dcdb7 adopted vocabulary usage within anotation/naviagtion feature of search 13 years ago
Michael Peter Christen 514700291a moved Vocabulary to cora package (added in git 13 years ago
Michael Peter Christen 0284a4d88f more fixes for double precision of coordinates 13 years ago
Michael Peter Christen 964406ad17 added concurrency enhancement to xml parser 13 years ago
Michael Peter Christen 240045cf7c fix for bad distance computation 13 years ago
Michael Peter Christen e0d8643226 - performance hacks 13 years ago
Michael Peter Christen 7a329465b3 using pre-compile pattern in blacklist; should enhance search speed 13 years ago
Michael Peter Christen 6e83b02b83 - bugfix for surrogate file reader 13 years ago
Michael Peter Christen 9b4c699526 ehanced location search: 13 years ago
Michael Peter Christen 834dc6b263 store more data from interface access 13 years ago
Michael Peter Christen 1f48d1528b performance hacks 13 years ago
Michael Peter Christen c70aaccdc9 better location to generate a guid for rss messages 13 years ago
Michael Peter Christen 10da7335ea performance hack: use a hash cache for all hashes that are computed by a 13 years ago
Michael Peter Christen f8a0cf6d7c RSSMessages do not need a concurrent hash map -> removed overhead 13 years ago
Michael Peter Christen 07ca7e4dd1 enhanced RSS parsing by ensuring that it is parsed with a buffered input 13 years ago
Michael Peter Christen 7c1feefb28 introduced a default 10 second time-out in rwi normalization time 13 years ago
Michael Peter Christen 8d997d55b6 better logging 13 years ago
Michael Peter Christen 65d37e6a20 only ASCII needed in seed bitflags 13 years ago
Michael Peter Christen 0f82fb3628 using double instead float for a better release ordering 13 years ago
Michael Peter Christen 43c2c6e588 better logging 13 years ago
sixcooler 56087c1f23 bump to httpclient- httpcore-, httpmime- 4.2 13 years ago
Michael Peter Christen 20e0cc0822 fix for bad location evaluation 13 years ago
Michael Peter Christen 71c3163f3d - fixes to node identification 13 years ago
Michael Peter Christen 4d3cc02168 replaced old bzip2 library against better documented commons-compress 13 years ago
Michael Peter Christen ad222be7f8 added node state icon in network list 13 years ago
Michael Peter Christen eff7667554 fix for http://bugs.yacy.net/view.php?id=188 13 years ago
Michael Peter Christen 3c2bec681f added a root node flag: identifies peers with short ping time 13 years ago
Michael Peter Christen c846e9ca14 redesign of the crawler monitor page: show crawled pages instead of 13 years ago
Michael Peter Christen 8b974905ee changed log-in text for all servlets with authentication: 13 years ago
Michael Peter Christen 16b21f7a5b Added more steering in Crawler_p.html interface 13 years ago
Michael Peter Christen acc19e190d hack against 100% cpu during crawl delete 13 years ago
Michael Peter Christen c15fcde1c8 add-on to latest commit 13 years ago
Michael Peter Christen cf47d94888 performance hack to parse numbers inside of substrings without actually 13 years ago
Michael Peter Christen 7e0ddbd275 added a "fromCache" flag in Response object to omit one cache.has() 13 years ago
Michael Peter Christen 81737dcb18 removed stack trace from swf parser since we cant do anything there 13 years ago
Michael Peter Christen 7bf421b9dd - fixed image search page navigation 13 years ago
Michael Peter Christen 125d47b3c1 added more interruptions in DidYouMean because that was the cause for 13 years ago
Michael Peter Christen c6a09eab0b synchronization needed 13 years ago
Michael Peter Christen fb94b47b1a changed queue sizes to have less memory occupied during indexing 13 years ago
Michael Peter Christen 76157dc2c3 bugfix for http://bugs.yacy.net/view.php?id=173 13 years ago
reger 6696cb1313 bugfix: lookup of peernames no result for active peer in page IndexControlRWIs_p.html -> Transfer RWI to other Peer 13 years ago
Michael Peter Christen c6558cba08 more classification bugs 13 years ago
Michael Peter Christen 082831b9d6 search contentdom was checked in wrong way - fixed 13 years ago
reger ee553d971e correct typo in scripts_txt comment 13 years ago
Michael Peter Christen f294f2e295 bugfix to http://bugs.yacy.net/view.php?id=181 13 years ago
Michael Peter Christen acf8d521a2 fix for http://bugs.yacy.net/view.php?id=126 13 years ago
Michael Peter Christen bb88878b4d the last commit was incomplete.. 13 years ago
Michael Peter Christen d320a31ae1 bugfix for http://bugs.yacy.net/view.php?id=186 13 years ago
Michael Peter Christen fa735f4f04 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 3e1bc9477f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 6f8a2fef1f small speed enhancement using a column factory 13 years ago
Roland 'Quix0r' Haeder d10627d591 More sync in close() methods 13 years ago
Roland 'Quix0r' Haeder b3ae2aa41f With or without 'final'? At least please try it in other methods 13 years ago
Roland 'Quix0r' Haeder fbb946f913 Made a method static (Eclipse suggested it), removed unused import, pk=null check does now output a warning in logfile 13 years ago
Michael Peter Christen 52d307c735 prevent that the snippet fectch process removes catchall entries 13 years ago
Michael Peter Christen 7eece0256f moved yacy.logging to defaults according to request in 13 years ago
Michael Peter Christen 5b3acc12cd Pattern.quote() replaces \\Q and \\E according to publication in 13 years ago
Michael Peter Christen 89142d1e8d removed (not all) warnings 13 years ago
Michael Peter Christen 5deebd02ea added serialization 13 years ago
reger b2175ea4ef Add possibility to set custom Solr field names for the YaCy default Solr attributes. 13 years ago
Michael Peter Christen 15db703808 added missing serialization to remove all warnings 13 years ago
Michael Peter Christen 1795a7325b made HandleSet serializable 13 years ago
Michael Peter Christen e7e381d110 added configuration to switch off redirection following in crawler 13 years ago
Michael Peter Christen 2717c1b749 fixed bug in solr interface 13 years ago
Michael Peter Christen 70505107ca enhanced crawler/balancer: better remaining waiting-time guessing 13 years ago
Michael Peter Christen f150bc218b fixed bug in solr error document 13 years ago
Michael Peter Christen cb54c1737b solrj connector bugfix 13 years ago
Roland 'Quix0r' Haeder a093ccf5eb Now used synchronization in all close() methods to make sure all objects 13 years ago
Michael Peter Christen 49cab2b85f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 0d58fea210 made multiple connector default 13 years ago
Michael Peter Christen 7740c02c56 - enhanced the solr connector 13 years ago
Michael Peter Christen 0cf3d36eae more tolerance in case of corrupted file 13 years ago
Michael Peter Christen acc6db28ff added missing classes for solr interface 13 years ago
Michael Peter Christen adeb33bb36 better abstraction for solr objects 13 years ago
Michael Peter Christen 8864141872 more abstraction in solr connection classes 13 years ago
Michael Peter Christen c00efc2717 made the solr connection more generic 13 years ago
Michael Peter Christen ea2bd43b28 patch for broken configurations 13 years ago
Michael Peter Christen e5ca7f22b1 enhancement in circle drawing 13 years ago
Michael Peter Christen 34f4225d7e less 'wellformed' calls without asserts 13 years ago
Marc Nause a691023d04 *) better formatting for network QPM 13 years ago
Michael Peter Christen 77f8e9fb9b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen ba6aaabc51 refactoring + parser bugfixes 13 years ago
Michael Peter Christen 2a0434efa4 Merge commit 'c1f6b4fb5226d3d2f8b2bec9e361f6b3476e03ff' 13 years ago
Michael Peter Christen 942896fe46 removed methods not supported by new solrj connector for httpclient 4 13 years ago
Michael Peter Christen 22e1f68c0b solrj user authentication patch 13 years ago
Michael Peter Christen 09484955dc added new entry class for embed tags 13 years ago
Michael Peter Christen 62f2554a01 - fixed build problems (deprecated methods using httpclient 3.1) 13 years ago
Michael Peter Christen a6d60fc21f concurrency enhancement in ConfigurationSet 13 years ago
Michael Peter Christen 453010bd68 - solved problems with backpath normalization 13 years ago
Michael Peter Christen 5f5ed33ed8 patch for media search (audio, video apps) 13 years ago
Michael Peter Christen 7860c1df80 fix needed for new solrj library 13 years ago
Michael Peter Christen 0e13022147 - enhanced solr field documentation 13 years ago
Michael Peter Christen 19efbf1b0f - apply directDocByURL to NOLOAD Queue 13 years ago
Michael Peter Christen 659178942f - Redesigned crawler and parser to accept embedded links from the NOLOAD 13 years ago
Michael Peter Christen a3badd3205 changed search process for images: no more media snippet load process, 13 years ago
Michael Peter Christen f5efdb21fd refactoring 13 years ago
reger c1f6b4fb52 lookupByIP: prevent comparing of port parameter if called with port -1 (=unknown) 13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not 13 years ago
Michael Peter Christen 14f67f217c refactoring of ContentDomain: now subclass of Classification 13 years ago
Michael Peter Christen 8a08c96a82 removed dependency from logging 13 years ago
Michael Peter Christen a1a5b015d8 refactoring: moved document Classification to cora package 13 years ago
Michael Peter Christen a5d7da68a0 refactoring: removed dependency from switchboard in Balancer/CrawlQueues 13 years ago
Michael Peter Christen 33d1062c79 refactoring: the cache belongs to the crawler 13 years ago
Michael Peter Christen 4d5da75814 fix for parser problem if a <a>-tag is 'within' html tags with unclosed 13 years ago
Michael Peter Christen 91a86f0b06 fixed to network graph testing 13 years ago
Michael Peter Christen 7b5b9baee0 added citation rank to ranking profile 13 years ago
Michael Peter Christen 046f3a7e8d check if httpc has decompressed the release file and rename the file 13 years ago
Michael Christen 02e4dedff2 fix to url citation collection 13 years ago
Michael Christen e32055aa15 added stub classes for 13 years ago
Michael Christen ac5d124ee0 experimental implementation of a citation ranking as post-ranking 13 years ago
Michael Christen 8fc86fe397 added storage of full anchor link structure: 13 years ago
Michael Christen 22f05c83ff fixed default must-match filter for full domain crawls - the old filter 13 years ago
Lotus 0b3f39136e allow custom ppm lower than minimum button on /Crawler_p.html 13 years ago
Michael Peter Christen 532c7cf827 added physics experiment to the graph plotter. not active by default 13 years ago
Michael Peter Christen aba9b1bfa0 better names for elements of a linked graph 13 years ago
Michael Peter Christen 0cc0290978 bugfix for a must-not-match pattern check. This bug did not make the 13 years ago
Michael Peter Christen 2fc8ecee36 ConcurrentLinkedQueue has a VERY long return time on the .size() method. 13 years ago
Michael Peter Christen 8aba045ba1 if a new pop-up page is set in config portal, then this page applies 13 years ago
Michael Peter Christen 8c06925984 animation of the web structure picture 13 years ago
Michael Peter Christen 898fa7c3f3 use tld heuristic to check if a domain is local or global 13 years ago
Michael Peter Christen 213c8d97f2 use less proccesses in process pool 13 years ago
Michael Peter Christen c639248c23 protection against strange answers from remote peers during search 13 years ago
Michael Peter Christen 36e4d82b27 changed ranking 13 years ago
Michael Peter Christen 096c17e7cd added test code 13 years ago
Michael Peter Christen 665626a51b catch OOM errors during scanning 13 years ago
Michael Peter Christen 1cd711d005 added classes for citation references (for new citation ranking) 13 years ago
Michael Peter Christen 33a405dab8 ipv6 bugfix 13 years ago
Michael Peter Christen c6c61be3f0 fix for http://bugs.yacy.net/view.php?id=148 13 years ago
Michael Peter Christen e0f1e7d904 added new citation reference data structure that shall be used for a 13 years ago
Michael Peter Christen e18a4f6b74 more tolerant merge iterator 13 years ago
Michael Peter Christen 0d148c3353 more logging in resource observer 13 years ago
Michael Peter Christen 2fa037ae1d enhanced crawler 13 years ago
Michael Peter Christen e101c2e0e2 added changes from copperdust (submitted by email): 13 years ago
low012 2120db289a *) Small change which should solve problem with cgitb module in Python CGI scripts. 13 years ago
Lotus ee89cf5ae5 fix must match filter for full domain crawl 13 years ago
Michael Peter Christen 8d63a5887c bugfixes 13 years ago
Michael Peter Christen 9ad1d8dde2 complete redesign of crawl queue monitoring: do not look at a 13 years ago
Michael Peter Christen 7e4e3fe5b6 free some memory after parsing html 13 years ago
Michael Peter Christen 4540174fe0 memory hacks 13 years ago
Michael Peter Christen b4409cc803 small redesign of blob column index and usage 13 years ago
Michael Peter Christen d5c1f2746e performance hack 13 years ago
Michael Peter Christen 803963aebd performance hack: better space grow in CharBuffer (speeds up html 13 years ago
Michael Peter Christen 8b0920b0b5 tried to fix the ipv6 problem as reported in bug 13 years ago
Michael Peter Christen e2f8f263e8 changed storage of search words: keep order 13 years ago
Michael Peter Christen ed39ef2890 changed generation of protocol information 13 years ago
Michael Peter Christen 0b67a0a5d8 added a column index for tables in blob files. This is heavily used 13 years ago
Michael Peter Christen 2e5cd6a1b2 fixed parser extension deny list generation and usage 13 years ago
Michael Peter Christen 8bee1472c9 there is no noindex, only nofollow in links 13 years ago
Michael Peter Christen 3cd6dcd352 do not add new solr fields as activated fields 13 years ago
Michael Peter Christen e3bb73c3d6 serialized some database access methods 13 years ago
Michael Peter Christen 7e728867e5 added a synchronization around iterations to prevent IO-deadlocking 13 years ago
Michael Peter Christen 355ecf330f reduced target file site to 64mb 13 years ago
Michael Peter Christen 10ae6d94a1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 2ea585d616 fix for host navigator 13 years ago
Michael Peter Christen 2f6dde92e2 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen c560a582ac fix for single-word vocabulary lines 13 years ago
Michael Peter Christen 4c5edab1ec added option to have exception search result windows 13 years ago
Michael Peter Christen 046d7de95b Merge remote branch 'reger/master' 13 years ago
reger a95f645a61 Bugfix class repository.Loaddispatcher fixed download file limit of 10000 13 years ago
Michael Peter Christen ef78f22ee1 performance hack 13 years ago
Michael Peter Christen 41536eb4a2 performance hack 13 years ago
Michael Peter Christen f91487fc50 added delete-button for host navigation 13 years ago
Michael Peter Christen e8d24fd802 author navigator can be switched off 13 years ago
Michael Peter Christen 558ab7bd4e made the protocol navigator reversible 13 years ago
Michael Peter Christen 96cb75f1d4 made the filetype navigator be able to deselect the search constraint 13 years ago
Michael Peter Christen 9ebcae2fbc enhanced url parser to understand urls with &amp; instead of & in post 13 years ago
Michael Peter Christen 1f4f60654a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
reger 32104360ce PDFParser - return at least first 3 pages of PDF 13 years ago
Michael Peter Christen ef5192f8c9 using the generic document parser for crawl starts instead of the html 13 years ago
Michael Peter Christen a02fdf8625 better error messages 13 years ago
Michael Peter Christen eadb58dd87 small enhancements in pdf parser 13 years ago
Michael Peter Christen c6ba44468e timeout = 5000 instead 3000 13 years ago
reger b616de5973 PDFParser - return at least first 3 pages of PDF 13 years ago
Michael Peter Christen e6d26a023f fix for bookmark crash with possible side-effects on crawl start after 13 years ago
Lotus c73af39e54 refactoring of tray icon class, 13 years ago
Michael Peter Christen 4eff0e26f1 npe bugfix 13 years ago
low012 8776b84c10 *) small fix to make password change function of reconfigureYACY.sh work 13 years ago
Michael Peter Christen 190b77c55e added Ukrainian translation 13 years ago
Michael Peter Christen 1a0b6b3913 get more navigation details to search results 13 years ago
Michael Peter Christen 7f9b6b7a0c added switches to ConfigParser to accept/deny documents by their 13 years ago
Michael Peter Christen 4901cee3cc suppress auto-tagged subject entries when sending out or receiving 13 years ago
Michael Peter Christen 83009d86f7 added the vocabulary navigator. It can be very simply tested by 13 years ago
sixcooler 985b78cf89 correct 'avaiable()' to use max of young / eden 13 years ago
sixcooler 4da8746275 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
sixcooler c9aaa9e00a respect non-reserved Memory in GenerationMemoryStrategy 13 years ago
Michael Peter Christen 37f2d1b3e9 replaced Thread initialization with ExecutorService pool for delete 13 years ago
Michael Peter Christen a58dc4a91f added autotagging to document condenser: 13 years ago
Michael Peter Christen 0d6176804b emergency disabling of GenerationMemoryStrategy because of non-working 13 years ago
Lotus 411aab02e3 Windows installer now detects reliably whether YaCy runs. A file lock on 13 years ago
Michael Peter Christen 87f0210480 enriched log output to find NPE in HeapReader 13 years ago
Michael Peter Christen 987b412491 updated solr scheme: generic declaration of solr schemes 13 years ago
Michael Peter Christen 254adea51c small fixes 13 years ago
Michael Peter Christen 49be60a7c8 WorkflowProcess is forced to make small pauses if shortMemoryStatus is 13 years ago
Michael Peter Christen b7bb84c0bb set a limit to CharBuffer object size to fight against bad/too large 13 years ago
Michael Peter Christen c602eaaf46 enhanced search process 13 years ago
Michael Peter Christen 087f97d4c0 less noise if a browser cannot be opened 13 years ago
Michael Christen eff966f396 fix for search process (it was aborted too early during remote search) 13 years ago
Michael Christen e6d51363ee Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Marek Otahal a231d0eeb9 Run from Java the whole app YACY 13 years ago
Marek Otahal 72adbeae90 !Important: move from Hashtable to HashMap 13 years ago
Marek Otahal c1af123ddd just a little faster toString 13 years ago
Marek Otahal 64e4bcee82 serverSwitch get(App/Data)Path() use common helper method 13 years ago
Marek Otahal 371fbb4deb just comment + shorter code in serverSwitch 13 years ago
Marek Otahal ed253b7aff update javadoc, does not throw IOException 13 years ago
Marek Otahal f40efb39af Blacklist loadList() remove duplicates by using Set 13 years ago
Marek Otahal f75b5e40e0 little fix in copy() 13 years ago
Marek Otahal 1dc5d9f0f3 make ConnectionInfo comparable and sort list of connections in Connections_p 13 years ago
Michael Christen fa8da7f89d vocabularies are now also used as source for a did-you-mean computation 13 years ago
Michael Christen eaec14ecc4 Dictionaries from words caches can now be used as autotagging vocabulary 13 years ago
Michael Peter Christen 91940fdf56 redesign of WordCache to be prepared to hold multiple 13 years ago
Michael Christen bd40a10230 added autotaggig stub .. only reading and parsing of vocabularies at 13 years ago
Michael Peter Christen 2ee8cbeb2c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 992dbdf4bb added noload statistic to servlets 13 years ago
Michael Christen 354b976110 fix for concurrency problem and endless loop in /suggest.json 13 years ago
Michael Christen c21966bb43 fix 13 years ago
Michael Christen 13b05f9c08 fix 13 years ago
Michael Christen e5d878c59e Merge branch 'master' of ssh://gitorious.org/yacy/rc1 13 years ago
Michael Christen ec26b2bea4 Merge commit 'fa08ed5ae5d72bddc3cc6a662b23103579e86109' into quix0r 13 years ago
Michael Christen eebc02f5c1 fix 13 years ago
Michael Christen 216a287a85 Merge commit '6d4e08ed06c5cd28c45981b2ebe31c7f7ec6fd83' into quix0r 13 years ago
stbrumm d18095dc48 Patch fuer Issue 0000102 13 years ago
stbrumm 9f1b1b4604 Type for Robinson-Mode/Private Perr added 13 years ago
Michael Christen 20962a4ed7 added metadata node stub for metadata from blobs 13 years ago
Michael Christen 575dbbaa93 enhancements in Blob retrieval: try to use less CPU resources by testing 13 years ago
Michael Christen 585a8f3c44 fixed a bug in search sequence (caused emtpy results) 13 years ago
Michael Christen 361146dd7a better error handling for file loader 13 years ago
Roland 'Quix0r' Haeder 6d4e08ed06 Rewrote filesize() to (hopefully) avoid a NPE, rewrote Blacklist class to concurrent classes to avoid a CME 13 years ago
Roland 'Quix0r' Haeder 901f37d608 Also this ... :( 13 years ago
Roland 'Quix0r' Haeder a985717ed2 Also this ... :( 13 years ago
Roland 'Quix0r' Haeder 5f490de554 Fix for ported fix from my old days ... 13 years ago
Roland 'Quix0r' Haeder fa08ed5ae5 Fixed a lot CHMOD rights (no need for execute flag on *.java/*.html) and introduced local/remote crawl size ratio based check 13 years ago
Roland Haeder 319fd1f4aa A concurrent access can happen on the blacklist (with latest introduced blacklist check in media snippet computation) 13 years ago
Roland 'Quix0r' Haeder a3083d13bf Blacklist checks are now always turned on, in media searches (e.g. image search) images matching blacklist entries are no longer shown to the user 13 years ago
Michael Christen 52184a1170 fix for search process 13 years ago
Michael Christen 85bd4cc8bc better lookup for peer names 13 years ago
Michael Christen 20e3084bd4 redesign of fining of peers by ip: more leightweight method to read the 13 years ago
Michael Christen 0797b0de99 new handling of remote search processes: looking for seeds will now not 13 years ago
Michael Christen ee9aae5cc0 more about CreativeCommons license vocabulary 13 years ago
Michael Christen ecd74fe34f less dramatic upnp failures 13 years ago
Michael Christen c75e1a3125 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Christen 13f5b5f80d the component part in the YaCy Metadata is filled using the Dubling Core 13 years ago
Michael Peter Christen 8d2cbfb685 more vocabularies and more semantics for lod data structures 13 years ago
Michael Christen 9cd36b4c44 added vocabulary for geolocalization as used in georss 13 years ago
Michael Christen 9e5894c784 Removed handling of components objects for URIMetadataRows. 13 years ago
Michael Christen 66ab51f89d added rdf vocabulary 13 years ago
Michael Christen c04bfaa51b refactoring 13 years ago
Michael Peter Christen 136b514f52 added a Triple Store based on Nodes that fit to the new storage classes. 13 years ago
Michael Peter Christen 613ab6a69d added BEncodedHeapBag and BEncodedHeapShard which are storage container 13 years ago
Michael Christen 6fecd0db88 one more performance hack to prevent costly md5 computation 13 years ago
Michael Christen e13441b069 better digest pool size (smaller by default but unlimited) 13 years ago
Michael Christen 1f4afb4dc0 performance hacks 13 years ago
Michael Christen 675d557e88 removed debug logging 13 years ago
Michael Christen e9dc99fe15 added rules to set specific RWIs as private RWIs which are not 13 years ago
Michael Peter Christen 4243ace863 added phonetic classes 13 years ago
Michael Peter Christen 0bcef2d156 added feature as requested in 13 years ago
Michael Christen 204c29f010 small bugfixes for search result display and cache display 13 years ago
Michael Christen 17f962fceb translator updates: 13 years ago
Michael Christen 752b092b8a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Christen 078fcde0dd bad initialization 13 years ago
admin 391fc9bd57 Merge branch 'master' of gitorious.org:yacy/rc1 13 years ago
admin 23afee58fe Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
Michael Christen 14e45e90fd patch for a bug that I don't understand by now. 13 years ago
Michael Christen 3eccdca63c protection against too long running snippet fetch processes 13 years ago
Michael Christen 86b3385847 fixed a deadlock during secondary remote search 13 years ago
apfelmaennchen ff19fcdb28 bugfix for YMarks XBEL import and export; thanks to Dominic 13 years ago
Michael Christen c715d19c09 fixes for dependency on svn 13 years ago
Michael Christen 404758698a less io operations 13 years ago
Michael Christen 0bc5d76bee ups 13 years ago
Michael Christen 044f83feed added some pauses into the search process which shall produce 13 years ago
Michael Christen 943b670738 less terrible warning if uPnP fails 13 years ago
sixcooler 448656087a probably fix for http://bugs.yacy.net/view.php?id=94 13 years ago
Michael Christen f14faf503b better ranking because we wait a very little time during the search 13 years ago
Michael Christen 762e0ecfb6 fixed localization dictionaries, see 13 years ago
Michael Christen 6e66c9d7f1 fix for http://bugs.yacy.net/view.php?id=87 13 years ago
Michael Christen d35bdc2df6 removed npe 13 years ago
Michael Christen e7e429705a - less automatic indexing after a search (needs to reset the default 13 years ago
admin a4ac051029 Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
low012 7cfdc2c092 Improved CGI capabilities: 13 years ago
Michael Christen 9cd469e6d6 added pull request from als plus an NPE fix 13 years ago
admin 484c4ad339 Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
orbiter 402e9d71ef changed ording on release files: main criteria is not the svn any more; releases are now ordered by 13 years ago
admin 29f07fea33 Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
orbiter 11729061f2 added an option in the bookmark import process to put everything into the crawler 13 years ago
admin b9c5ce6cae Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
apfelmaennchen 70bcfc150a - small bug fix to ymarks html importer 13 years ago
admin 56ce8488e4 Merge branch 'master' of git://github.com/f1ori/yacy 13 years ago
orbiter 4b8ff84705 - search bugfixes (page counter and number of results per page; recognition of new search) 13 years ago
sixcooler aeeae75b8a the timeout of httpclient is not absolut, but till a connection is 13 years ago
hermens 2ac272cfbf Fix for PeerSelection.seedsByAge() for big networks (>1000 Peers) 13 years ago
apfelmaennchen b5d9f631e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8128 6c8d7289-2bf4-0310-a012-ef5d649a1542 13 years ago
orbiter 83335c3b09 fix for http://bugs.yacy.net/view.php?id=78 13 years ago
Al Sutton 39898cb94a Added try/finally protection to ensure streams are closed. Added initial size guess for the CharBuffer 13 years ago
Al Sutton 4c67a964a1 Added try/finally protection to ensure streams are closed. Added initial size guess for the CharBuffer 13 years ago
Al Sutton 3f9b9f953f Added close() to ensure buffer close actions are invoked 13 years ago
Al Sutton d73c84f9a0 Allow initial buffer size definition in TransformWriter, and use available() method to set it in htmlParser. In this situation a ByteArrayInputStream is used so the available() method gives a good size estimation and avoid the buffer needing to be continually grown 13 years ago
Al Sutton f02ea27b31 Added missing closure of ByteArrayInputSteam 13 years ago
orbiter 0796b54601 - some speed hacks for network image 13 years ago
orbiter f9216e388c - faster ping to clean up old peers faster 13 years ago
orbiter 35a9e8f307 - fixed network graphic 13 years ago
Al Sutton 8993cac4d8 Initial performance improvements 13 years ago
orbiter d9c066227a fix for npe 13 years ago
orbiter 8895d8c1cd removed unnecessary log entries 13 years ago
orbiter 550c881d80 remove more news (all older than one day) because they can be a performance problem if we have too many peers sending news 13 years ago
orbiter ebd840ebf6 - enhanced description on search front page 13 years ago
apfelmaennchen 77a080ced9 smaller fixes for YMarks 13 years ago
orbiter e22f8497c9 - tested the ARC methods 13 years ago
orbiter bc5df0eef5 updated ranking tables (fresh computation) 13 years ago
orbiter 5a55397f99 some last-minute performance hacks 13 years ago
apfelmaennchen dd1482aaf5 further update to YMarks 13 years ago
orbiter c9216d5adf fixed secondary remote search (the process that finds distributed join situations) 13 years ago
orbiter 64fd20b857 new default ranking profile 13 years ago
orbiter 0cf9ebc3b0 speed enhancements when parsing RWI rows (makes search slightly faster) 13 years ago
orbiter c9a0dbd25a added a security check 13 years ago
orbiter ee8b1d4de1 fixed unresolved pattern and unwanted local/global switch when using votes on search results 13 years ago
orbiter c584db991f creating a bookmark from the search results now works again .. with new YMarks 13 years ago
orbiter 1120f0c93c update to network graphics: slightly less crawling activity, slightly stronger color for query activity 13 years ago
orbiter 6cd27473f5 - better default values for caching and cache usage 13 years ago
orbiter 709013385a fix for language fix 13 years ago
orbiter 1019c36dad bug fixes and speed enhancements for search 13 years ago
orbiter 507c9d478d much better timing when search globally; less blocking; more results earlier! 13 years ago
orbiter 8e0b2c5832 fixed cluster search 13 years ago
orbiter c0c6e9e7a5 fix for bad language encoding 13 years ago
apfelmaennchen 564374d1fe - included YMarks in addition to old bookmarks in yacysearchitem.html; don't get confused by the old bookmark dialog, the ymark is automatically added silently beforehand. 13 years ago
orbiter 05f34a3fa7 added a full, complete, database insert, update and delete API for the tables. 13 years ago
lotus 3cc93325f0 temporary remove compare search from tray 13 years ago
orbiter c93f10417a add a bookmark automatically each time a new crawl is started 13 years ago
orbiter e4a82ddd8b produce a bookmark entry from every crawl start. these bookmarks are always private. 13 years ago
apfelmaennchen 6287c2b4a9 YMarks: 13 years ago
cominch 2236e01137 Minor correction to prevent useless comma at beginning of string, created from list 13 years ago
apfelmaennchen 5581be12fb YMarks: 13 years ago
orbiter 804e48888b smaller bug fixes for search behavior; should produce less unnecessary removals and an exact number of results as shown in counter 13 years ago
apfelmaennchen a3eebfdcba YMarks: 13 years ago
orbiter c50f8f9a06 code cleanup 13 years ago
orbiter 84c3fc9d97 local/global fixes in search, better abstraction 13 years ago
apfelmaennchen 4f95f72124 YMarks: 13 years ago
orbiter aa322bc6d0 fix 13 years ago
orbiter 97d1347adb added also a default accept field to robots.txt downloads 13 years ago
orbiter f183d3822c added a default accept header in http requests since some http fraud detection functions check that this header field exist 13 years ago
orbiter 06352b8d6b more logging 13 years ago
orbiter a99934226e more logging for debugging of robots.txt 13 years ago
orbiter 7a5841e061 fix for robot parser 13 years ago
orbiter 458c20ff72 fix for robot parser 13 years ago
orbiter 017a01714d - enhanced logging in robots.txt parser for remote debugging 13 years ago
apfelmaennchen a8dfe787ed - updated to jquery flexigrid 1.1 13 years ago
orbiter eb1c7c041d write info about robots.txt evaluation into getpageinfo_p.xml 14 years ago
apfelmaennchen abba31f02e - bugfix for correctly sorting ymarks 14 years ago
orbiter 3a15e58e28 - increased stability when opening the robots table 14 years ago
orbiter 775b44017e refactoring 14 years ago
orbiter e914a30099 fix for npe 14 years ago
apfelmaennchen 5f7dbe1c42 - some refactoring (ymarks) 14 years ago
orbiter 78ce3b13be typo 14 years ago
orbiter 85d6bf4ac4 fixed urls to media content during indexing 14 years ago
orbiter 0d858d48ec replaced String with StringBuilder in suggestion process 14 years ago
orbiter 3a807e10cf - added a cache for active crawl profiles to the crawl switchboard 14 years ago
orbiter 37e35f2741 normalization of url using urlencoding/decoding 14 years ago
orbiter e58438c01c - added a new retry connector for solr (for cases where solr responses are slow) 14 years ago
orbiter d8d9735b4f stability bugfix 14 years ago
orbiter c31564ef08 stability bugfixes 14 years ago
orbiter f121f4bb45 fix for link in Supporter and Suftipps page 14 years ago
orbiter 94eab08794 - updated opensearchdescription text and icon 14 years ago
orbiter 279482a76d fix for npe 14 years ago
orbiter 1b86d06d1e fix for http://bugs.yacy.net/view.php?id=62 14 years ago
orbiter 9e4875230f performance hacks 14 years ago
orbiter eb9c9edb01 enhanced table method (used by almost all yacy api interfaces) 14 years ago
orbiter 4ad9fc2bff new snippet strategy for search hits in metadata: show beginning of text instead of hit position 14 years ago
orbiter a9838f8b99 fix for http://bugs.yacy.net/view.php?id=59 14 years ago
hermens d3df03838a make sure myself-target is always inserted at its appropriate position 14 years ago
hermens c3e7efa846 added sender side prevention of rwi flooding as mentioned in SVN 7993 14 years ago
orbiter 5af9598bd1 enhanced exported row parsing during row import 14 years ago
orbiter 7598a9e26b fix for thread dump 14 years ago
orbiter 8eef8722d1 update to ThreadDump analysis: freerunner and thread state recognition 14 years ago
orbiter 1df43b137d another performance hack 14 years ago
orbiter 7df0643f0e performance hacks 14 years ago
orbiter a7df70221e refactoring 14 years ago
orbiter 1b45e33f04 added robots tag parser to solr scheme 14 years ago
orbiter cf4fd525ee added directDocByURL attribute in crawl profile 14 years ago
orbiter c61e4cfd78 - fix for incomplete clear() in balancer 14 years ago
orbiter 813f297a95 another performance hack: re-use of known host addresses for isLocal property; avoids look-up in local hash 14 years ago
orbiter 035ebfbf3b - performance hacks (should affect the crawl balancer and reduce CPU load during crawl stack re-fill) 14 years ago
orbiter b250e6466d implemented crawl restrictions for IP pattern and country lists 14 years ago
f1ori e207c41c8e * fix urlproxy for urls containing dolar signs 14 years ago
orbiter 57d5529a01 performance hacks 14 years ago
orbiter 5ad7f9612b added crawl settings for three new filters for each crawl: 14 years ago
orbiter 47a8c69745 added a new feature to MultiProtocolURIs to get the locale for each url: 14 years ago
orbiter 2c3161b4ac refactoring: 14 years ago
orbiter d2ea250d99 refactoring: 14 years ago
low012 42b5f09f68 *) this should fix a bug in snippet creation (also cleaned up a little bit) 14 years ago
low012 277b454a62 *) added comments 14 years ago
orbiter 6b22865dbc - removed some warinings 14 years ago
orbiter 0c6d95e57b - more tolerance against failure of table opening 14 years ago
orbiter 4f31869c5a enhanced search result timing 14 years ago
orbiter 6b02b696b0 - add number of search results to end of rss and json output to reflect latest status of retrieval 14 years ago
f1ori 87e6abd168 * fix urls containing a port number in urlproxy 14 years ago
f1ori 97045022fa * pass cookies to Server Side Includes 14 years ago
orbiter ce2a76d603 performance hack for search process 14 years ago
orbiter aaf7a0feaa yet another cache strategy 14 years ago
orbiter 8a428d3e77 ensure termination of pdf parser to avoid deadlocking of other processes during search result preparation 14 years ago
orbiter 2c4a672fe2 bugfixes and performance hacks for tabe index 14 years ago
orbiter dad5b586a4 added a concurrent warmin-up of Table data structures. that should speed-up the start-up process but may also cause stronger CPU load at that time. 14 years ago
orbiter 734059d33e performance hacks 14 years ago
orbiter 23e81b28b2 synchronization enhancements 14 years ago
orbiter dd4635e323 patches 14 years ago
orbiter bb0c045036 fix for problem with relocation of network 14 years ago
orbiter 85a5487d6d YaCy can now use the solr index to compute text snippets. This makes search result preparation MUCH faster because no document fetching and parsing is necessary any more. 14 years ago
orbiter 0819e1d397 protection against OOM cases in image parser. See also bugs.yacy.net/view.php?id=54 14 years ago