Commit Graph

9921 Commits (561ea135af864574f1b462868779dc3a59678682)
 

Author SHA1 Message Date
Michael Peter Christen 07261fe274 Merge remote-tracking branch 'nutomics/blacklist_structure'
12 years ago
Michael Peter Christen dea71851d2 - better concurrency for network scanner
12 years ago
Michael Peter Christen a34e137e27 fix for citation index generation in case that entry.referrerhash() is
12 years ago
Michael Peter Christen a2c8116a8f accept (but ignore) a '+' sign in front of search words
12 years ago
orbiter 9f0cc9b401 enhanced network scanner
12 years ago
orbiter d8354a389c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Lotus 6e120e90fe do not cut text on submit buttons
12 years ago
orbiter f8c28efd66 fix for rssTerminal coloring
12 years ago
sixcooler 308d73f855 do not use remote proxy if not switched on - regardless of the proto
12 years ago
sixcooler 69906b1d2e Revert "do not use remote proxy if not switched on - regardless of the proto"
12 years ago
sixcooler 20f452d228 do not use remote proxy if not switched on - regardless of the proto
12 years ago
sixcooler 9551720d5c re-enable saved setting for proxy-crawl-profile
12 years ago
sixcooler d5d8936f9d For indexes that are changing rapidly in NRT situations, fcs (stands for
12 years ago
Felix Ableitner 44f8fcf62e Changed class structure of Blacklist.
12 years ago
Michael Peter Christen 3054a6d4b9 added a patch from Sebastian M.B., submitted by email for coloring of
12 years ago
Michael Peter Christen 78af998f8f Merge commit 'fd90fcc4e08f80acbfd1c9a7ec62ce04cd309594'
12 years ago
Michael Peter Christen 57ffdfad4c added a crawl option to obey html-meta-robots-noindex. This is on by
12 years ago
Felix Ableitner fd90fcc4e0 Fixes #196.
12 years ago
Michael Peter Christen 5a5d411ec0 new robots_i attribute fields
12 years ago
Michael Peter Christen fa08bd9d5a hack to prevent long waiting times in crawler
12 years ago
Michael Peter Christen f1c5338210 prepartion for greedy crawl profiles and refactoring
12 years ago
Michael Peter Christen e6f361f474 adding the canonical tag to crawl queues
12 years ago
orbiter 40c5ee47c1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter ae23a0badb updated copyright message; included LGPL for 'cora' and a warranty
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrival (Double.NaN check)
12 years ago
Michael Peter Christen 203921006a redesign of citation index storage
12 years ago
orbiter 7c6ccc426c set crawlingQ to true by default because most webpages are dynamic and
12 years ago
Lotus 5de4267a9d windows installer: update to latest jre
12 years ago
reger 83763ee4a4 jpeg parser: extract GPS location from meta data
12 years ago
Michael Peter Christen e92b9275ce Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 56cdcfa2fa fixed greedy learning mode - global is not a search attribute in
12 years ago
Michael Peter Christen 32aa1d4569 removed unused option for queries
12 years ago
Michael Peter Christen 0c5bed7e2c added configuration option for greedy learning function to ConfigPortal
12 years ago
sixcooler 5d1f619f07 possible helpful closing of solr-requests
12 years ago
Michael Peter Christen 9d291764d1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
sixcooler e5abccdfe4 added optimize-option
12 years ago
Michael Peter Christen 8ea6ddf636 removed attributes from ConfigPortal.html which are redundant to
12 years ago
Michael Peter Christen 64140f35cd fix for solr requests if no query part is given (prevent npe)
12 years ago
Michael Peter Christen 8caaf6203a fixed false multiple-generation of remote facet search which
12 years ago
Michael Peter Christen 23fb458963 - fix to gsa searchresult answer in case that no query part is given
12 years ago
Michael Peter Christen 823ae4d6a7 added url_protocol_s to error documents
12 years ago
Michael Peter Christen 660a196989 refactoring
12 years ago
Michael Peter Christen c4538d8d91 added metadata-extractor-2.6.2.jar to eclipse classpath, removed old lib
12 years ago
reger 3760e2616b bump up lib/metadata-extractor-2.6.2.jar (used for image parser) with needed code adjustments
12 years ago
Michael Peter Christen 9a6fcdf597 npe fix
12 years ago
Michael Peter Christen 54024958ac added url_file_name_s in qeury for live-search of urls
12 years ago
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name
12 years ago
reger 8d1c4c423d make imageparser fileextension detection case insensitive (extensions are often upper case)
12 years ago
Michael Peter Christen f542cf7d9c fix for daterange: the to-date is inclusive
12 years ago
Michael Peter Christen f9d859f5dc now writing image alt texts and (camelcase-)parsed urls into a text
12 years ago