Commit Graph

30 Commits (5dbe071b4e5f5c5ed1b185a63c2f6cb76eda6c77)

Author SHA1 Message Date
luccioman 1ca9cb6bd9 Fixed a NullPointerException case, reported in issue #209
6 years ago
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale.
7 years ago
luccioman 6e89d125f2 Added robots.txt support for heuristics federated search.
8 years ago
Michael Peter Christen b5ac29c9a5 added a html field scraper which reads text from html entities of a
10 years ago
reger 24f68a4eb7 refactor opensearch heuristic
10 years ago
reger 6c3f36def1 - fix path to default heuristic.cfg
10 years ago
reger 1d3ea35d69 prevent NPE on host link for to short HeuristicCfg.OpenSearchURL
10 years ago
Michael Peter Christen 453bfd0f17 removed unused variables and warnings
11 years ago
reger 97e84439fb adjusted ConfigHeuristic and changed QueryGoal.getOriginalQueryString to .getQueryString
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
Michael Peter Christen 6115bef335 added a 'greedy learning' mechanismn which will cause that a 'fresh'
12 years ago
Michael Peter Christen 2080fc7406 removed unused tag fields
12 years ago
reger 7804c12976 fix error msg in ConfigHeuristics_p
12 years ago
reger 230a12bfe2 adjust Opensearch discover function to new webgraph Solr schema
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Michael Peter Christen b6de1f42dc Full redesign of solr connection architecture. This was done to support
12 years ago
reger 168b1d130d Adding heuristic to get search results from configured systems which support opensearch specification
12 years ago
Michael Peter Christen 00c1c777fa refactoring
12 years ago
orbiter a55e77a115 added twitter search heuristic
12 years ago
Michael Peter Christen d3964253ae - added @SuppressWarnings to unused servlet method parameters
13 years ago
reger 067728bccc add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages)
13 years ago
Roland 'Quix0r' Haeder af5a597e47 Scroogle is not comming back, remove dead code
13 years ago
orbiter d2ea250d99 refactoring:
13 years ago
orbiter cc6499bf8d - added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed
14 years ago
orbiter 3197ca42ed preparations to move the HTCache into cora:
14 years ago
orbiter 11b7853940 added a configuration page for search heuristics. currently you can switch on there:
15 years ago