Commit Graph

14 Commits (29967102a20c31f173ff89e73322ba46b35b61e9)

Author SHA1 Message Date
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
reger 97ab5b90e8 - odt & ooxml (office document) parser correction to add content to fulltext index
12 years ago
reger 160ce568b3 move testing SolrServlet.main to test, making include of jetty*.jar in distribution and classpath obsolete
12 years ago
orbiter 49e5ca579f added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is the default now), then all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored in a Solr index if Solr is also enabled.
13 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
orbiter 3528b970d6 - refactoring
15 years ago
orbiter b79f4f062f refactoring of yacy documents and parsers: they depend now only on the kelondro classes
15 years ago
f1ori 34c71b22e8 fix and enable parser unit tests (tested with eclipse)
15 years ago
orbiter ce8dc575ca refactoring
15 years ago
orbiter ce7924d712 better concurrency for rwi entry parsing during search processing
15 years ago
orbiter 72ac5bd80f refactoring of search process.
15 years ago
f1ori d515bc11e2 added ooxmlparser
15 years ago
f1ori 67da20647f * add new odf parser based on sax-xml-parser
16 years ago
f1ori 06557485f5 * added parser unittest!
16 years ago