Commit Graph

13777 Commits (92e10d7d1c5ef6aece60b85cc9f0b5ed019a9777)
 

Author SHA1 Message Date
luccioman 9412881230 Added basic support for autotagging microdata annotated item types.
7 years ago
luccioman 5a14d34a7d Refactoring : documented and extracted autotagging processing functions.
7 years ago
luccioman 58b9834729 Added HTML microdata typed items parsing capability.
7 years ago
luccioman 80fb1026d0 Create recrawl requests with the relevant crawl profile.
7 years ago
luccioman 539925a275 Added an utility to generate/update XLIFF master file from lng files.
7 years ago
luccioman 41a6b052d9 Updated master and French translation for the IndexReIndexMonitor_p page
7 years ago
luccioman fa6d030b0b Moved dbtest to the test source folder.
7 years ago
luccioman 6cd3847d0a Fixed NullPointerException case on Table init with relative file path.
7 years ago
luccioman 28883d8a71 Shutdown daemon threads at the end of dbtest
7 years ago
luccioman 929e0d6eae Replaced improper ByteBuffer.equals() implementation by Arrays.equals()
7 years ago
luccioman 098ee63911 Added a manual performance test for the HostBalancer.
7 years ago
luccioman fefe2d1b6e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
7 years ago
reger 5aa4fb1144 upd to metadata-extractor-2.11.0.jar
7 years ago
luccioman 46b5249c20 Removed time condition on HostBalancer initialization in JUnit test.
7 years ago
luccioman 8b572b7337 Commit Solr index before simulating or starting recrawl job.
7 years ago
luccioman 5b943c07ab
Merge pull request #155 from JeremyRand/readme-typo-fixes
7 years ago
JeremyRand dea856c854
Fix some typos in the README.
7 years ago
luccioman 733cacdbb8 Revised the RDFaParser main launcher for minimal proper operation.
7 years ago
luccioman 7baa99f26f Fixed stored URL in web cache when redirection(s) occurs.
7 years ago
luccioman 5e2812c060 Automatically refresh running recrawl report when JavaScript is enabled.
7 years ago
luccioman 19903a984f
Merge pull request #154 from tangdou1/master
7 years ago
tangdou1 49d103ad16
Merge pull request #1 from tangdou1/tangdou1-patch-1
7 years ago
tangdou1 dd4f93f049
Update zh.lng
7 years ago
tangdou1 e585b4f597
Update zh.lng
7 years ago
luccioman 0fce264ba4 Set reindex page to html5 and removed presentational only html tables.
7 years ago
luccioman 83df922afc Removed unused duplicated HTML id on header hidden field
7 years ago
luccioman 9ddf92d143 Removed unncessary reflection usage for workflow tasks.
7 years ago
luccioman 897d3d30cc Added new recrawl job profile to the list of default crawl profiles
7 years ago
luccioman 9624516bf8 Refresh recrawl job profile threshold date like other default profiles
7 years ago
luccioman b712a0671e Added a specific default crawl profile for the recrawl job.
7 years ago
luccioman adf3fa493d Added comments about crawl profiles recrawl cycles
7 years ago
luccioman 3638e16c2e More comprehensive log on rejected recrawls caused by date constraint
7 years ago
luccioman d47afe6fab Use a constant for crawler reject reason prefix with specific processing
7 years ago
luccioman 4e03335625 Added more details to the recrawl job report
7 years ago
luccioman d95d393a0d Add a query link to local Solr to browse selected recrawl candidates
7 years ago
luccioman 59f7763af6 Display recrawl job report also when job is actively running
7 years ago
luccioman 6425963cee Fixed internal tables exact value match iterator
7 years ago
luccioman 0c9e0b3566 Record recrawl calls to make them schedulable
7 years ago
luccioman 433e241e4f Added a report info box about eventual last terminated recrawl job
7 years ago
luccioman b2af25b14f Added a stop condition to the Recrawl busy thread
7 years ago
luccioman 421728d25a Made possible to customize selection query before launching a recrawl
7 years ago
luccioman fab6e54fec Enforced controls (HTTP method, token) on ReIndex and ReCrawl operations
7 years ago
luccioman 36e9b1c5b3 Fixed SegmentTest test case time dependant occasional failures
7 years ago
luccioman 8a4ea1c11e Added UI switch to control content domain constraint per search request
7 years ago
luccioman 36a45b3905 Added UI setting for strictness of content-type checking on media search
7 years ago
reger cedb53be4e upd to commons-io-2.6
7 years ago
reger f8071ac8ae Make TokenizedStringNavigator (used for keyword search facet) active
7 years ago
reger 270b77074e upd to httpclient-4.5.4 and httpmime-4.5.4
7 years ago
reger 6db7f5525b upd to icu4j-60.2
7 years ago
luccioman e6907fdab3 Added optional search parameter/setting to control content domain filter
7 years ago