You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Marek Otahal
72adbeae90
!Important: move from Hashtable to HashMap
...
Hashtable is an obsolete collection v1, now since v2 offers HashMap with same or better
functionality. Please review, almost all code was already moved, so only a few changes. That is not the issue,
but I found notices that some (ugly big) helper classes had to be created in past
to compensate missing Hashtable's functionality. I'd like input if we can remove some of them.
look for //FIX: if these commits
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
..
content
abstraction of surrogate main element (xmlns:geo was missing for wiki extracts)
14 years ago
geolocalization
added autotaggig stub .. only reading and parsing of vocabularies at
13 years ago
importer
!Important: move from Hashtable to HashMap
13 years ago
language
enhanced identificator: using AtomicInteger for counter
14 years ago
parser
refactoring
13 years ago
AbstractParser.java
added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
14 years ago
Autotagging.java
added autotaggig stub .. only reading and parsing of vocabularies at
13 years ago
Classification.java
- added a 'add every media object linked in a html document as a new document' to the html parser. This causes that all image, app, video or audio file that is linked in a html file is added as document. In fact that means that parsing a single html document may cause that a number of documents is inserted into the search index.
14 years ago
Condenser.java
Initial performance improvements
13 years ago
Document.java
some last-minute performance hacks
13 years ago
ImageParser.java
- enhanced description on search front page
13 years ago
LargeNumberCache.java
more performance hacks
15 years ago
LibraryProvider.java
added autotaggig stub .. only reading and parsing of vocabularies at
13 years ago
Parser.java
*) added SID file (Commodore 64) sound file parser
14 years ago
Phrase.java
more performance hacks
15 years ago
SentenceReader.java
Initial performance improvements
13 years ago
SnippetExtractor.java
finishing up my commits (7855-7858) which could be helpful for
14 years ago
StringBuilderComparator.java
replaced String with StringBuilder in suggestion process
14 years ago
TextParser.java
Added missing closure of ByteArrayInputSteam
13 years ago
WordCache.java
redesign of WordCache to be prepared to hold multiple
13 years ago
WordTokenizer.java
Initial performance improvements
13 years ago