Commit Graph

8593 Commits (ad09b786bf85d7fd1962bc4a9616afd62b99a185)

Author | SHA1 | Message | Date
Michael Peter Christen | ad09b786bf | clean up parser data | 13 years ago
Michael Peter Christen | 276a66a793 | Adding a limit of 1000 links that a parser shall store during indexing. | 13 years ago
Michael Peter Christen | 613b45f604 | - better data structures in secondary search | 13 years ago
Michael Peter Christen | de903a53a0 | parser refactoring & hacks | 13 years ago
Michael Peter Christen | 8a82609360 | - smaller caches to save memory | 13 years ago
Michael Peter Christen | 7249d9c9de | bugfix for concurrent seed loader | 13 years ago
Michael Peter Christen | c72d3b12cd | concurrently initialize the seed list during p2p network bootstrap | 13 years ago
Michael Peter Christen | 1825f165b8 | better integration of blacklist according to use case | 13 years ago
Michael Peter Christen | c18fa9fa75 | Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1 | 13 years ago
Michael Peter Christen | ce8d4b87d9 | fixes for new eclipse 'Juno' warning 'Resource leak'. | 13 years ago
Michael Peter Christen | 0c345d1559 | giving threads name so its easier to see whats happening during | 13 years ago
reger | 067728bccc | add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages) | 13 years ago
Michael Peter Christen | 03280fb161 | removed segments-concept and the Segments class: | 13 years ago
Michael Peter Christen | 508a81b86c | added solr field 'refresh_s' which stores the refresh url contained in | 13 years ago
Michael Peter Christen | f3167def64 | do not fill the keywords with title content if keywords do not exist. | 13 years ago
Michael Peter Christen | 9116013c64 | - allow lazy initialization of solr value (if using 'lazy', then no | 13 years ago
sixcooler | 97f60010d8 | fix crawl start from file | 13 years ago
Michael Peter Christen | c03d306afa | shorter autocommit time (now: 1 second) to prevent that user cannot see | 13 years ago
Michael Peter Christen | 0294a53459 | - add canonical field only if requested by solr schema | 13 years ago
Michael Peter Christen | 3fd4a01286 | added option to record urls that are forwarded to the solr index | 13 years ago
Michael Peter Christen | d763e4d94b | fixed bad referer computation in SSIs which causes a NPE during host | 13 years ago
Michael Peter Christen | 358b04885e | more IPv6 hacks | 13 years ago
Michael Peter Christen | 96aeb127e3 | generalized localhost naming. | 13 years ago
Michael Peter Christen | 77f795756c | fixing redirects and status codes: storing of status code in | 13 years ago
Michael Peter Christen | 8dd469b9dd | added option to configure the autocommit delay time of solr on-the-fly | 13 years ago
Michael Peter Christen | 5d9bd4ddc2 | Merge remote-tracking branch 'origin/master' | 13 years ago
Michael Peter Christen | b9dfca4b0a | - fixed IndexFederated Servlet / a embedded Solr can now be selected | 13 years ago
reger | 2931726386 | adjusted NetBeans classpath for new and updated libraries in lib | 13 years ago
Michael Peter Christen | cc1b6762bb | root, not yacy | 13 years ago
Michael Peter Christen | 2589158f44 | changed recommended line in /etc/crontab for high-availability | 13 years ago
reger | 4156d4e12b | Merge branch 'master' of git://gitorious.org/yacy/rc1.git | 13 years ago
Michael Peter Christen | 7b53be141f | upgraded to pdfbox 1.7.0 | 13 years ago
Michael Peter Christen | fad3b14813 | added jetty libraries, needed for future use as web server and as | 13 years ago
Michael Peter Christen | a38b0a2c46 | extended embedded solr tests to ensure that it will be usable within a | 13 years ago
Michael Peter Christen | b9d42fd9c8 | using com.google.common.io.Files instead of homebrew methods | 13 years ago
Michael Peter Christen | a5eb91fa60 | refactoring | 13 years ago
Michael Peter Christen | 1be0025a9c | - added test for EmbeddedSolrConnector | 13 years ago
Michael Peter Christen | dbdd697f4d | moved RDFaParser.xsl configuration file to defaults | 13 years ago
Michael Peter Christen | 90b82ce994 | using guava for host resolution (non-blocking for ips) and time-out | 13 years ago
Michael Peter Christen | f094936b89 | added new class libraries to mac app | 13 years ago
Michael Peter Christen | e12bb254b4 | Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git | 13 years ago
Michael Peter Christen | 3f55dc7c1e | - added solr core and libraries that solr needs (lucene is missing, will | 13 years ago
reger | 2fccc4e883 | Merge branch 'master' of git://gitorious.org/yacy/rc1.git | 13 years ago
Michael Peter Christen | c337190a00 | Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git | 13 years ago
cominch | c63c3a4495 | Show additional interaction elements in footer section on each page, if | 13 years ago
Michael Peter Christen | 786be7d175 | better integration of RDFaParser | 13 years ago
cominch | fa98657bb3 | Augmented Browsing: changed the settings page | 13 years ago
cominch | 751eeade0d | Merge remote-tracking branch 'original yacy/master' | 13 years ago
cominch | 84a11ec48c | Corrected loading of default page settings on ConfigPortal.html | 13 years ago
sixcooler | bea002dc15 | correct table in new look of Crawler_p | 13 years ago