Commit Graph

8593 Commits (ad09b786bf85d7fd1962bc4a9616afd62b99a185)
 

Author SHA1 Message Date
Michael Peter Christen ad09b786bf clean up parser data 13 years ago
Michael Peter Christen 276a66a793 Adding a limit of 1000 links that a parser shall store during indexing. 13 years ago
Michael Peter Christen 613b45f604 - better data structures in secondary search 13 years ago
Michael Peter Christen de903a53a0 parser refactoring & hacks 13 years ago
Michael Peter Christen 8a82609360 - smaller caches to save memory 13 years ago
Michael Peter Christen 7249d9c9de bugfix for concurrent seed loader 13 years ago
Michael Peter Christen c72d3b12cd concurrently initialize the seed list during p2p network bootstrap 13 years ago
Michael Peter Christen 1825f165b8 better integration of blacklist according to use case 13 years ago
Michael Peter Christen c18fa9fa75 Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1 13 years ago
Michael Peter Christen ce8d4b87d9 fixes for new eclipse 'Juno' warning 'Resource leak'. 13 years ago
Michael Peter Christen 0c345d1559 giving threads name so its easier to see whats happening during 13 years ago
reger 067728bccc add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages) 13 years ago
Michael Peter Christen 03280fb161 removed segments-concept and the Segments class: 13 years ago
Michael Peter Christen 508a81b86c added solr field 'refresh_s' which stores the refresh url contained in 13 years ago
Michael Peter Christen f3167def64 do not fill the keywords with title content if keywords do not exist. 13 years ago
Michael Peter Christen 9116013c64 - allow lazy initialization of solr value (if using 'lazy', then no 13 years ago
sixcooler 97f60010d8 fix crawl start from file 13 years ago
Michael Peter Christen c03d306afa shorter autocommit time (now: 1 second) to prevent that user cannot see 13 years ago
Michael Peter Christen 0294a53459 - add canonical field only if requested by solr schema 13 years ago
Michael Peter Christen 3fd4a01286 added option to record urls that are forwarded to the solr index 13 years ago
Michael Peter Christen d763e4d94b fixed bad referer computation in SSIs which causes a NPE during host 13 years ago
Michael Peter Christen 358b04885e more IPv6 hacks 13 years ago
Michael Peter Christen 96aeb127e3 generalized localhost naming. 13 years ago
Michael Peter Christen 77f795756c fixing redirects and status codes: storing of status code in 13 years ago
Michael Peter Christen 8dd469b9dd added option to configure the autocommit delay time of solr on-the-fly 13 years ago
Michael Peter Christen 5d9bd4ddc2 Merge remote-tracking branch 'origin/master' 13 years ago
Michael Peter Christen b9dfca4b0a - fixed IndexFederated Servlet / a embedded Solr can now be selected 13 years ago
reger 2931726386 adjusted NetBeans classpath for new and updated libraries in lib 13 years ago
Michael Peter Christen cc1b6762bb root, not yacy 13 years ago
Michael Peter Christen 2589158f44 changed recommended line in /etc/crontab for high-availability 13 years ago
reger 4156d4e12b Merge branch 'master' of git://gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 7b53be141f upgraded to pdfbox 1.7.0 13 years ago
Michael Peter Christen fad3b14813 added jetty libraries, needed for future use as web server and as 13 years ago
Michael Peter Christen a38b0a2c46 extended embedded solr tests to ensure that it will be usable within a 13 years ago
Michael Peter Christen b9d42fd9c8 using com.google.common.io.Files instead of homebrew methods 13 years ago
Michael Peter Christen a5eb91fa60 refactoring 13 years ago
Michael Peter Christen 1be0025a9c - added test for EmbeddedSolrConnector 13 years ago
Michael Peter Christen dbdd697f4d moved RDFaParser.xsl configuration file to defaults 13 years ago
Michael Peter Christen 90b82ce994 using guava for host resolution (non-blocking for ips) and time-out 13 years ago
Michael Peter Christen f094936b89 added new class libraries to mac app 13 years ago
Michael Peter Christen e12bb254b4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen 3f55dc7c1e - added solr core and libraries that solr needs (lucene is missing, will 13 years ago
reger 2fccc4e883 Merge branch 'master' of git://gitorious.org/yacy/rc1.git 13 years ago
Michael Peter Christen c337190a00 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
cominch c63c3a4495 Show additional interaction elements in footer section on each page, if 13 years ago
Michael Peter Christen 786be7d175 better integration of RDFaParser 13 years ago
cominch fa98657bb3 Augmented Browsing: changed the settings page 13 years ago
cominch 751eeade0d Merge remote-tracking branch 'original yacy/master' 13 years ago
cominch 84a11ec48c Corrected loading of default page settings on ConfigPortal.html 13 years ago
sixcooler bea002dc15 correct table in new look of Crawler_p 13 years ago