yacy_search_server/source/net/yacy/document
orbiter 8a428d3e77
ensure termination of pdf parser to avoid deadlocking of other processes during search result preparation
13 years ago
content abstraction of surrogate main element (xmlns:geo was missing for wiki extracts) 14 years ago
geolocalization
importer added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (now the default), all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This increases the search index size dramatically but also enables a much faster image, audio and video search. If the flag is switched on and a Solr index is enabled, the index entries are also stored in that Solr index. 13 years ago
language enhanced identificator: using AtomicInteger for counter 14 years ago
parser ensure termination of pdf parser to avoid deadlocking of other processes during search result preparation 13 years ago
AbstractParser.java added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (now the default), all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This increases the search index size dramatically but also enables a much faster image, audio and video search. If the flag is switched on and a Solr index is enabled, the index entries are also stored in that Solr index. 13 years ago
Classification.java added an 'add every media object linked in an HTML document as a new document' feature to the HTML parser. As a result, every image, app, video or audio file linked in an HTML file is added as a document, so parsing a single HTML document may insert a number of documents into the search index. 13 years ago
Condenser.java hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources: 14 years ago
Document.java YaCy can now use the solr index to compute text snippets. This makes search result preparation MUCH faster because no document fetching and parsing is necessary any more. 13 years ago
ImageParser.java reduce the effect of 'image search finds generated HTML pages as images' 14 years ago
LargeNumberCache.java
LibraryProvider.java added new word recommendation library in DictionaryLoader_p.html 13 years ago
Parser.java
Phrase.java
SentenceReader.java
SnippetExtractor.java finishing up my commits (7855-7858) which could be helpful for 13 years ago
TextParser.java added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (now the default), all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This increases the search index size dramatically but also enables a much faster image, audio and video search. If the flag is switched on and a Solr index is enabled, the index entries are also stored in that Solr index. 13 years ago
WordCache.java added new word recommendation library in DictionaryLoader_p.html 13 years ago
WordTokenizer.java encapsulate potentially expensive objects in TextSnippet so they can be garbage-collected as soon as possible 13 years ago