8604 Commits (7496df93c38ee63c032bf6791c65623faf4e76f8)

Author SHA1 Message Date
luccioman fb3032c530 Added a crawl filtering possibility on documents Media Type (MIME)
7 years ago
luccioman e45afedee4 Added support for enclosures (media links) to the RSS loader
7 years ago
luccioman aaefd5219c Reduce log verbosity of RSS loader on feed items with no link
7 years ago
luccioman cf62b571bd Added RSS reader support for `enclosure` feed item sub element.
7 years ago
luccioman e5f5de0fc7 Added some JavaDoc to the RSSMessage class.
7 years ago
luccioman 0d7625ecfb Handle Solr fields restrict and alias in YaCy html and exml writers
7 years ago
luccioman 3da2739bbd Parse and index more common audio metadata text tag fields.
7 years ago
luccioman 846aba00fa Added parsing of URLs eventually present in audio metadata tags
7 years ago
Michael Peter Christen 187075b878 added nav filter
7 years ago
luccioman bcbd0ae1a4 Enabled partial parsing of audio resources.
7 years ago
luccioman fda0189613 Updated audio file extensions with ones recently added to audioTagParser
7 years ago
luccioman 978e2be95b Let a chance for other parsers on audioTagParser error
7 years ago
luccioman 9e5846a26e Small fix on svg parser error message
7 years ago
luccioman 11611dbdcf Reuse existing File copy function to handle audio parser tmp files
7 years ago
luccioman f77f8f40f9 Factored audio parser tag processing
7 years ago
luccioman 9a7a353d0e Removed some unnecessary intermediate list creation on array copy.
7 years ago
luccioman fb6457f5bc Fixed NPE case when on audio resource parsed with null tag
7 years ago
luccioman c3ff50c17a Updated the list of audio file formats supported by the audioTagParser
7 years ago
luccioman 1b90479a76 Added missing vocabulary navigator increment on results from RWI
7 years ago
luccioman 46c9da6428 Allow creation of vocabularies from remote CSV file URLs.
7 years ago
luccioman 17c7a85f18 Make StreamResponse usable in Java try-with-resources statements
7 years ago
luccioman b67742336e Provide user interface messages on vocabulary creation read/write errors
7 years ago
luccioman 3e8dd90211 Use https rather than http in links and queries to openstreetmap.org
7 years ago
luccioman 3a973dbb23 Removed unused import
7 years ago
luccioman e9527cd0e5 Reuse the same Pattern instance when matching multiple key/values
7 years ago
luccioman dbf4c1cd76 Improved blacklist entries editing operations :
7 years ago
reger 87077b8fb6 Adjust and move Language Navigator to be member of the navigator plugin
7 years ago
luccioman eb20589e29 Fixed issue #158 : completed div CSS class ignore in crawl
7 years ago
luccioman 0cdee4e26a Fixed loss of "meanCount" search param when using facets or page buttons
7 years ago
luccioman 117a859879 Do not clear all search modifiers when unselecting one modifier.
7 years ago
luccioman 33593c22e9 Fixed loss of other modifiers on keywords/tags search navigation links
7 years ago
luccioman a9dc0874c0 Remove old query terms from search results suggestions links.
7 years ago
luccioman 9412881230 Added basic support for autotagging microdata annotated item types.
7 years ago
luccioman 5a14d34a7d Refactoring : documented and extracted autotagging processing functions.
7 years ago
luccioman 58b9834729 Added HTML microdata typed items parsing capability.
7 years ago
luccioman 80fb1026d0 Create recrawl requests with the relevant crawl profile.
7 years ago
luccioman 539925a275 Added an utility to generate/update XLIFF master file from lng files.
7 years ago
luccioman fa6d030b0b Moved dbtest to the test source folder.
7 years ago
luccioman 6cd3847d0a Fixed NullPointerException case on Table init with relative file path.
7 years ago
luccioman 28883d8a71 Shutdown daemon threads at the end of dbtest
7 years ago
luccioman 929e0d6eae Replaced improper ByteBuffer.equals() implementation by Arrays.equals()
7 years ago
luccioman 46b5249c20 Removed time condition on HostBalancer initialization in JUnit test.
7 years ago
luccioman 8b572b7337 Commit Solr index before simulating or starting recrawl job.
7 years ago
luccioman 733cacdbb8 Revised the RDFaParser main launcher for minimal proper operation.
7 years ago
luccioman 7baa99f26f Fixed stored URL in web cache when redirection(s) occurs.
7 years ago
luccioman 9ddf92d143 Removed unnecessary reflection usage for workflow tasks.
7 years ago
luccioman 897d3d30cc Added new recrawl job profile to the list of default crawl profiles
7 years ago
luccioman 9624516bf8 Refresh recrawl job profile threshold date like other default profiles
7 years ago
luccioman b712a0671e Added a specific default crawl profile for the recrawl job.
7 years ago
luccioman adf3fa493d Added comments about crawl profiles recrawl cycles
7 years ago