You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document
Michael Peter Christen 2e5cd6a1b2
fixed parser extension deny list generation and usage
13 years ago
..
content abstraction of surrogate main element (xmlns:geo was missing for wiki extracts) 14 years ago
geolocalization added autotaggig stub .. only reading and parsing of vocabularies at 13 years ago
importer !Important: move from Hashtable to HashMap 13 years ago
language enhanced identificator: using AtomicInteger for counter 14 years ago
parser Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
AbstractParser.java added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled. 13 years ago
Autotagging.java fix for single-word vocabulary lines 13 years ago
Classification.java - added a 'add every media object linked in a html document as a new document' to the html parser. This causes that all image, app, video or audio file that is linked in a html file is added as document. In fact that means that parsing a single html document may cause that a number of documents is inserted into the search index. 13 years ago
Condenser.java added autotagging to document condenser: 13 years ago
Document.java there is no noindex, only nofollow in links 13 years ago
ImageParser.java - enhanced description on search front page 13 years ago
LargeNumberCache.java
LibraryProvider.java added autotagging to document condenser: 13 years ago
Parser.java *) added SID file (Commodore 64) sound file parser 14 years ago
Phrase.java
SentenceReader.java Initial performance improvements 13 years ago
SnippetExtractor.java performance hack 13 years ago
StringBuilderComparator.java replaced String with StringBuilder in suggestion process 13 years ago
TextParser.java fixed parser extension deny list generation and usage 13 years ago
WordCache.java vocabularies are now also used as source for a did-you-mean computation 13 years ago
WordTokenizer.java performance hack 13 years ago