sixcooler
61f627eb85
fix for ssl-connections from proxy-usage staying in close-wait-state
+ some extra 'close' in HttpClient
11 years ago
Michael Peter Christen
91fa99e9bb
added new icon/image for latest commit
11 years ago
Michael Peter Christen
9fac9249bc
- replaced 'edit' link with a clone symbol in Table_API_p since that is
what it does: it clones the crawl, it does not change the crawl.
- moved the appearance of this clone link to the type column since this
makes it visible also if the URL column is not visible.
11 years ago
Michael Peter Christen
0f6db6ad5b
Merge remote-tracking branch 'jensbees/crawlexpert-post'
11 years ago
bhoerdzn
3fcf7a94c5
rolling back wrong merge
11 years ago
Jens Bertram
3252c1ec39
Merge upstream/master into crawlexpert-post
11 years ago
Michael Peter Christen
d328cc4a83
fix for didyoumean, added also more asian alphabets
11 years ago
Michael Peter Christen
90c8577840
enhanced ranking; patches to replace old ranking
11 years ago
Jens Bertram
9f6b98d374
Merge master into crawlexpert-post
11 years ago
bhoerdzn
6e33be4ce6
reverting local changes to project.xml
11 years ago
bhoerdzn
a3824dfbaa
check URL on initial load, if set
11 years ago
bhoerdzn
52f49d475b
add a hidden field for "crawlingstart" since jQuery omits the submit button value
11 years ago
bhoerdzn
b0c0ec2dec
link recorded crawl starts back to "CrawlStartExpert_p" in "Process Scheduler"
11 years ago
bhoerdzn
d64d45361c
use integer types for boolean values
11 years ago
bhoerdzn
eda123d6fd
remove debugging code intercepting post requests
11 years ago
bhoerdzn
5057f27bbd
fix typo in parsing "cachePolicy" parameter
11 years ago
bhoerdzn
98f5c9018d
Fixed template vars for "deleteold". Fixed parsing "deleteold" parameter. Stop "setState" overwriting "deleteold" state on load.
11 years ago
bhoerdzn
a6a62986d4
correct state handling for country code restriction
11 years ago
bhoerdzn
4066b85155
correctly set initial state for load filters
11 years ago
bhoerdzn
8c91c3e7cd
set form boolean values to 0 & 1 instead of false & true
11 years ago
bhoerdzn
c27fabc88e
fixed wrong parameter check
11 years ago
bhoerdzn
2214bf5396
Remove some post parameters, if they are set to default values, as their values are already set by YaCy. Added some documentation.
11 years ago
Michael Peter Christen
1b61bd40ed
- Added new solr field url_file_name_tokens_t which stores the file name
tokens. This can be used to enhance the ranking.
- Added also a rating_i field as basis for later usage.
- enhanced the tokenization process.
11 years ago
orbiter
6efa7532d2
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
5f5a97bafc
added the anchor text within web pages to the searchable entities of a
web page. This can be of benefit for the ranking if these fields are
used for boosts.
11 years ago
orbiter
705b3338ee
list more fields available for search and for ranking boosts
11 years ago
sixcooler
d536092fe4
fix for wrongly filling the NAME_CACHE_MISS-DNS-Cache in case of a timeout,
e.g. caused by massive requests when crawling from a file
11 years ago
bhoerdzn
405878182f
Use list template for all other option lists. Fixed some template expressions.
11 years ago
bhoerdzn
8e74098cd4
Use list template for "reloadIfOlderNumber".
11 years ago
bhoerdzn
52bad7b908
Dynamic toggling of form fields, based on passed-in and selected values. This will also cut down the post string by disabling fields that are not needed.
11 years ago
Michael Peter Christen
78e7aadb26
removed unused initialization method
11 years ago
Michael Peter Christen
e56aa4fe93
fixed search navigation
11 years ago
Michael Peter Christen
4fbc4740df
removed warnings
11 years ago
Lotus
202a9fbdad
adding synonyms from German OpenThesaurus ready for use in YaCy
11 years ago
Michael Peter Christen
21aa6a0321
migration to Solr 4.5.0
11 years ago
bhoerdzn
45cf553bc3
try to guess default crawling mode, if none set
11 years ago
bhoerdzn
b4f0c822f2
assign strings before checking contents
11 years ago
Michael Peter Christen
ef31d0f279
fix for rss reader, see http://bugs.yacy.net/view.php?id=294
11 years ago
bhoerdzn
499abe8f91
set default values for string parameters
11 years ago
Jens Bertram
85316b3ac6
Merge branch 'master' into crawlexpert-post
11 years ago
bhoerdzn
42ea56eaad
made CrawlStartExpert_p aware of post variables; extended template where needed
11 years ago
Michael Peter Christen
101a6e6e14
Patch the citation index for links with canonical tags.
This shall fulfill the following requirement:
If a document A links to B and B contains a 'canonical C', then the
citation rank computation shall consider that A links to C and B does
not link to C.
To do so, we first must collect all canonical links, find all references
to them, get the anchor list of the documents and patch the citation
reference of these links.
11 years ago
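The requirement stated in the commit above (101a6e6e14) can be illustrated with a small sketch. This is not YaCy's implementation: the function name `patch_citations` and the plain dictionaries standing in for the citation index and the canonical declarations are assumptions made only for this example.

```python
def patch_citations(links, canonical):
    """Return a patched copy of a citation graph.

    links:     dict mapping a document URL to the set of URLs it links to
    canonical: dict mapping a document URL to the canonical URL it declares
    (both structures are hypothetical stand-ins for the real index)
    """
    patched = {}
    for source, targets in links.items():
        new_targets = set()
        for target in targets:
            canon = canonical.get(target)
            if canon is not None and canon != target:
                # A links to B and B declares 'canonical C': count A -> C
                new_targets.add(canon)
            else:
                new_targets.add(target)
        own = canonical.get(source)
        if own is not None and own != source:
            # B's canonical declaration must not count as a citation B -> C
            new_targets.discard(own)
        patched[source] = new_targets
    return patched


# Document A links to B; B declares C as its canonical URL.
links = {"A": {"B"}, "B": {"C"}, "C": set()}
canonical = {"B": "C"}
patched = patch_citations(links, canonical)
```

In this toy graph the patched result credits the link from A to C and leaves B with no outgoing citation, which is the behaviour the commit message asks for.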
orbiter
ba3c173077
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger
fd119deb00
fix NPE on modified-since check (Response.requestHeader allowed to be null)
11 years ago
orbiter
a3b5d84c81
Merge remote-tracking branch 'origin/master'
Conflicts:
.classpath
11 years ago
orbiter
adfae074cf
added classpath for debugging
11 years ago
Michael Peter Christen
b28d43decc
added two more fields source_cr_host_norm_i,target_cr_host_norm_i in
webgraph and an addition to postprocessing to copy all cr ranking
attributes to the link edges associated to the postprocessing documents
11 years ago
Michael Peter Christen
a52f3a597e
fix for canonical-from-http-header feature
11 years ago
Michael Peter Christen
2dd7c5be44
added parsing of http-canonical tags (untested, could not find an
example page)
11 years ago
Michael Peter Christen
4476dea5ba
do not fail if a wrong boost key is used; instead, print only a warning
See also: http://bugs.yacy.net/view.php?id=293
11 years ago