orbiter
764a40e37d
speed enhancements for crawler and URL retrieval (also affects search speed)
...
- concurrency for LURL fetching: lookups now run concurrently against the separate URL databases. This is possible because a lookup performs no IO. The more LURL tables are present, the greater the speedup, and additional CPUs increase speed further
- because crawling performs a large number of LURL lookups (for the double-check), the faster LURL lookup also speeds up crawling
- search speed also benefits from the faster LURL lookup
- changed some flushing parameters in the word index cache, which should make better use of large word index caches and speed up indexing
- removed the flush chunksize parameter, because it was only useful for the IO path enhancement feature, which was removed some weeks ago to prevent blocking and deadlocks during search requests
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4628 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
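The concurrent LURL lookup described in the commit message can be illustrated with a minimal sketch. It is not the YaCy implementation: the class `PartitionedUrlLookup`, its methods, and the use of in-memory maps as stand-ins for the separate URL tables are all assumptions made for illustration. It only shows the general idea that, when a lookup needs no IO, every table can be probed in parallel, so more tables and more CPUs allow more probes to run at once.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch: several in-memory URL tables probed concurrently.
// None of these names come from the YaCy code base.
class PartitionedUrlLookup {
    // Each map stands in for one separate LURL table (assumption).
    private final List<Map<String, String>> tables = new ArrayList<>();
    private final ExecutorService pool;

    PartitionedUrlLookup(int tableCount) {
        for (int i = 0; i < tableCount; i++) {
            tables.add(new ConcurrentHashMap<>());
        }
        // No more worker threads than tables or available CPUs.
        int threads = Math.min(tableCount, Runtime.getRuntime().availableProcessors());
        pool = Executors.newFixedThreadPool(threads);
    }

    // Store an entry in one specific table.
    void put(int table, String urlHash, String url) {
        tables.get(table).put(urlHash, url);
    }

    // Probe every table concurrently; return the first hit or null.
    String lookup(String urlHash) throws InterruptedException, ExecutionException {
        List<Callable<String>> probes = new ArrayList<>();
        for (Map<String, String> table : tables) {
            probes.add(() -> table.get(urlHash));
        }
        // invokeAll blocks until all probes have completed.
        for (Future<String> result : pool.invokeAll(probes)) {
            String hit = result.get();
            if (hit != null) {
                return hit;
            }
        }
        return null;
    }

    // Double-check helper: is the URL hash already known?
    boolean exists(String urlHash) throws InterruptedException, ExecutionException {
        return lookup(urlHash) != null;
    }

    void close() {
        pool.shutdown();
    }

    public static void main(String[] args) throws Exception {
        PartitionedUrlLookup lurl = new PartitionedUrlLookup(4);
        lurl.put(0, "hashA", "http://example.org/a");
        lurl.put(3, "hashB", "http://example.org/b");
        System.out.println(lurl.lookup("hashB"));
        System.out.println(lurl.exists("missing"));
        lurl.close();
    }
}
```

In this sketch the `exists` call plays the role of the crawler's double-check: a faster lookup shortens every such check, which is why the commit message expects crawling and search to benefit as well.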
..
cache
more generics
17 years ago
crawler
added concurrency to indexing process:
17 years ago
dbImport
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
parser
- next try to fix the networking problem:
17 years ago
plasmaCondenser.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaCrawlBalancer.java
fixed deadlock
17 years ago
plasmaCrawlEURL.java
fixed problem with redirection: redirected URLs had not been tested with the double-check
18 years ago
plasmaCrawlEntry.java
fixed image search and favicon loading
18 years ago
plasmaCrawlLoaderMessage.java
joined anomic.net.URL, plasmaURL and url hash computation:
18 years ago
plasmaCrawlNURL.java
speed enhancements for crawler and URL retrieval (also affects search speed)
17 years ago
plasmaCrawlProfile.java
refactoring
17 years ago
plasmaCrawlResults.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaCrawlRobotsTxt.java
- changed order of database initialisation (index first)
17 years ago
plasmaCrawlStacker.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaCrawlZURL.java
added concurrency to indexing process:
17 years ago
plasmaDHTChunk.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaDHTFlush.java
some bugfixes:
18 years ago
plasmaDHTTransfer.java
- added ViewProfile as rdf in foaf syntax
17 years ago
plasmaGrafics.java
more generics
17 years ago
plasmaHTCache.java
- refactoring of serverThreads: renaming to distinguish busy-threads and blocking-threads
17 years ago
plasmaParser.java
- enhanced recognition, parsing, management and double-occurrence-handling of image tags
17 years ago
plasmaParserConfig.java
- added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser
17 years ago
plasmaParserDocument.java
next refactoring step in document indexing to prepare concurrency environment for document parsing
17 years ago
plasmaProfiling.java
- preparation of parsing/indexing queue for concurrent execution
17 years ago
plasmaRankingCRProcess.java
consistent use of de.anomic.server.serverMemory to get information about memory statistics
17 years ago
plasmaRankingDistribution.java
small cosmetic change for static fields in serverCore (special protocol ASCII entities) to improve readability
18 years ago
plasmaRankingRCIEvaluation.java
more generics
17 years ago
plasmaSearchAPI.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaSearchEvent.java
enhanced the concurrency handling of indexing process (better queue size control, better data concept, better shutdown behavior)
17 years ago
plasmaSearchImages.java
- enhanced recognition, parsing, management and double-occurrence-handling of image tags
17 years ago
plasmaSearchQuery.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaSearchRankingProcess.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaSearchRankingProfile.java
- probably slightly better default ranking
17 years ago
plasmaSnippetCache.java
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
17 years ago
plasmaStore.java
code cleanup
20 years ago
plasmaSwitchboard.java
speed enhancements for crawler and URL retrieval (also affects search speed)
17 years ago
plasmaSwitchboardQueue.java
added concurrency to indexing process:
17 years ago
plasmaWebStructure.java
next refactoring step in document indexing to prepare concurrency environment for document parsing
17 years ago
plasmaWordConnotation.java
- changed order of database initialisation (index first)
17 years ago
plasmaWordIndex.java
speed enhancements for crawler and URL retrieval (also affects search speed)
17 years ago