reger
d5077b94e4
update Netbeans IDE classpath
10 years ago
reger
516c8c02b1
update to commons-compress-1.8.1
...
(bugfix release)
10 years ago
reger
ea6c9e9b07
reduce mem buffer overhead for gap files during r/w
...
(they are typically small compared to idx allowing to use smaller buffersize -> set to 16k records)
10 years ago
reger
8d1b764458
update to Jetty 9.2.2
10 years ago
reger
e88537522d
allow single quote " ' " in query
...
see http://mantis.tokeek.de/view.php?id=379
-add QueryGoal test case for this
10 years ago
reger
f76d81f5c9
fix: hanging text in input fields of WatchWebStructure_p.html
...
in IE11
10 years ago
orbiter
487021fb0a
snippet computation update
10 years ago
orbiter
cf9e7fdbb8
reverted template from latest cherry-picked commit
10 years ago
Alex
f6c7467a90
updated some french translations
10 years ago
orbiter
1c2f1f233a
Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
reger
5a4995ded3
fill solr rss writer dc:subject tag with keyword content
10 years ago
orbiter
927aaa95a6
concurrency bugfix
10 years ago
Alexandre Vouilloz
564f1cf6d3
updated fr traduction of ConfigBasic.html and header.template + fixed some accents in fr.lng
10 years ago
orbiter
c9e593cf78
removed warnings
10 years ago
reger
19e35a9126
add type attribute to atom feed <link> tag
...
(for /yacysearch.atom)
10 years ago
orbiter
51bb21c6b4
reverted .classpath from cherry-picked
...
ce1fd1e901
10 years ago
Alexandre Vouilloz
2dd8be3c66
updated the traduction of Status_p.inc in the french locale
10 years ago
reger
e50b2b4d04
fix test case MultiProtocolURL.toString()
...
(only allowed on AnchorURL)
10 years ago
reger
0a2f4a0e2f
eliminate lat/lon type conversion in osm
...
(define as double)
10 years ago
reger
7584352e7b
use more predefined Solr query parameter constants
...
- use CommonParams and DisMaxParams constants
- fix typo in get sort parameter
- getDocumentCountByParams redundant implementation and risk of not optimized call (row parameter unspecified) -> as only used from getCountByQuery removed from interface
10 years ago
reger
f9db5dd6c5
reduce doublecontent check document (prevent out of memory)
...
see http://mantis.tokeek.de/view.php?id=437
test result (concurrency=7)
2000 docs = eom always
1000 docs = eom always
100 docs = eom never
chosen -> 200 docs (eom not encountered during test with 1GB mem setting)
10 years ago
reger
e9eae45b55
simplify rssreader and improve atom feed link extraction
...
- type detection (rss/atom)
- init type parameter overwritten during parse, parameter obsolete
- detection by endtag changed to simpler first-tag evaluation
- channel image not used, removed related extra parser handling
- remove unused code (set/getImage) in rssfeed
- atom link extraction to account for possible multipe link tags
- spec limits link to one with rel="alternate" or one without rel attribute
not accounting for the follwing type & hreflang exception yet:
o atom:entry elements MUST NOT contain more than one atom:link
element with a rel attribute value of "alternate" that has the
same combination of type and hreflang attribute values.
10 years ago
reger
a8508417d1
catch NPE during crawl (OAI import)
...
- condenseDocument mime=null (allowed)
- collectionconfiguration responseheader = null (allowed)
10 years ago
reger
3dde94422f
center searchevent lines on network graph
...
(PerformanceSearch_p.html)
10 years ago
Michael Peter Christen
3860711aef
fix for possible interruption of concurrent queries
10 years ago
Michael Peter Christen
01bbb20666
increased default logging line count to max
10 years ago
Michael Peter Christen
6344718f8b
reducing the concurrent query stack size and reduced concurrency of
...
postprocessing to avoid OOM situations
10 years ago
Michael Peter Christen
eca9380e3d
bugfix for crawler double-check: if an url is redirected, the
...
redirect-target was not double-checked. This is now done by replacing
the redirect-URL on the crawl queue again (where it is double-checked)
10 years ago
Michael Peter Christen
9ac0c93f17
fix for subpath crawl filter
10 years ago
Michael Peter Christen
9bc3e457dd
fix for termination of all crawls
10 years ago
Michael Peter Christen
66106bdaf0
fix for crawler attribute maxdompages
10 years ago
Michael Peter Christen
49d91b94c3
npe fix in crawler
10 years ago
Michael Peter Christen
8d650ca225
added hint to port forwarding videos
10 years ago
Michael Peter Christen
b7183a7321
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger
ea2e627662
fix ConfigAccounts del user with uppercase letter in name
...
(usernames are case sensitive, userdb.delete used toLower)
10 years ago
Michael Peter Christen
c465b791af
typo
10 years ago
Michael Peter Christen
191ec8c82a
added concurrency to postprocess rewrite process
10 years ago
Michael Peter Christen
a1e8bdd5e9
log ppm instead of docs/second
10 years ago
Michael Peter Christen
cc0ded7abd
set process type of web graph according to fields as defined in the
...
schema
10 years ago
Michael Peter Christen
12fb9d7cd1
log postprocessing constraints in case that postprocessing is not
...
performed
10 years ago
Michael Peter Christen
3c23b89823
less logging
10 years ago
Michael Peter Christen
a0c53174c5
better solr query logging to detect unnecessary sort requests for more
...
performance profiling
10 years ago
Michael Peter Christen
338f574bdc
no sorting if http/www unique fields are not demanded (makes query
...
faster) and some code restrucuring
10 years ago
Michael Peter Christen
1609763be5
toString fix
10 years ago
Michael Peter Christen
b983e68254
more retries, less sleep
10 years ago
Michael Peter Christen
1503ba7794
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger
8f77719091
fix "Ljava.lang.String" in crawl queue anchor name
...
(e.g. IndexCreateQueues_p.html?stack=LOCAL with images in queue)
10 years ago
Michael Peter Christen
0ceeceb35e
more logic on Solr queries; usage of the query terms in posprocessing,
...
saving one query for double document detection now per document
10 years ago
reger
3963bca3b6
catch IndexControlRWIs_p error if RWI not connected
10 years ago
orbiter
38864ae004
Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago