yacy_search_server

History

orbiter 8a428d3e77 ensure termination of pdf parser to avoid deadlocking of other processes during search result preparation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7958 6c8d7289-2bf4-0310-a012-ef5d649a1542		14 years ago
..
html	bugfixes in html parser	14 years ago
images	protection against OOM cases in image parser. See also bugs.yacy.net/view.php?id=54	14 years ago
xml	more UTF8 getBytes() performance hacks	14 years ago
bzipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	14 years ago
csvParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
docParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
genericParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
gzipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	14 years ago
htmlParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	14 years ago
mmParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
odtParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
ooxmlParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
pdfParser.java	ensure termination of pdf parser to avoid deadlocking of other processes during search result preparation	14 years ago
pptParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
psParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
rssParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
rtfParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
sevenzipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	14 years ago
sidAudioParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
sitemapParser.java	better abstraction of http client identification	14 years ago
swfParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
tarParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	14 years ago
torrentParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
vcfParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
vsdParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
xlsParser.java	- enhanced html parser: recognized much more details in the content	14 years ago
zipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	14 years ago