.. |
cache
|
- code cleanup
|
18 years ago |
crawler
|
*) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
|
18 years ago |
dbImport
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
parser
|
First version of the MS Powerpoint parser based on Apache POI
|
18 years ago |
urlPattern
|
*) adding missing classes
|
19 years ago |
plasmaCondenser.java
|
lines inside tags without punctuation are extended by a single dot.
|
18 years ago |
plasmaCrawlBalancer.java
|
- code cleanup
|
18 years ago |
plasmaCrawlEURL.java
|
*) Better errorhandling for charset encoding problem during content parsing
|
18 years ago |
plasmaCrawlLURL.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaCrawlLURLEntry.java
|
- some bugfixing and code cleanup
|
18 years ago |
plasmaCrawlLURLOldEntry.java
|
- some bugfixing and code cleanup
|
18 years ago |
plasmaCrawlLoader.java
|
*) plasmaHTCache:
|
18 years ago |
plasmaCrawlLoaderMessage.java
|
added snippet-url re-indexing
|
18 years ago |
plasmaCrawlNURL.java
|
- code cleanup
|
18 years ago |
plasmaCrawlProfile.java
|
* simplified initialization of database objects
|
18 years ago |
plasmaCrawlRobotsTxt.java
|
- code cleanup
|
18 years ago |
plasmaCrawlStacker.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaDHTChunk.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaDHTFlush.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaDHTTransfer.java
|
- Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible
|
19 years ago |
plasmaGrafics.java
|
- code cleanup
|
18 years ago |
plasmaHTCache.java
|
reverse SVN 2744, it is not needed
|
18 years ago |
plasmaParser.java
|
*) Trying to be more tolerant against wrong charset names
|
18 years ago |
plasmaParserConfig.java
|
- code cleanup
|
18 years ago |
plasmaParserDocument.java
|
removed lowercase of snippets (and other things):
|
18 years ago |
plasmaRankingCRProcess.java
|
- code cleanup
|
18 years ago |
plasmaRankingDistribution.java
|
- code cleanup
|
18 years ago |
plasmaRankingRCIEvaluation.java
|
- code cleanup
|
18 years ago |
plasmaSearchEvent.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaSearchImages.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaSearchPreOrder.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaSearchQuery.java
|
- code cleanup
|
18 years ago |
plasmaSearchRankingProfile.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaSearchResult.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaSearchTimingProfile.java
|
- code cleanup
|
18 years ago |
plasmaSnippetCache.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaStore.java
|
code cleanup
|
19 years ago |
plasmaSwitchboard.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaSwitchboardQueue.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaURLPool.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
18 years ago |
plasmaWordConnotation.java
|
* simplified initialization of database objects
|
18 years ago |
plasmaWordIndex.java
|
null pointer bugfix
|
18 years ago |
plasmaWordIndexAssortment.java
|
- code cleanup
|
18 years ago |
plasmaWordIndexAssortmentCluster.java
|
- code cleanup
|
18 years ago |
plasmaWordIndexFile.java
|
- code cleanup
|
18 years ago |
plasmaWordIndexFileCluster.java
|
bugfix for old WORDS storage method
|
18 years ago |