Commit Graph

77 Commits (191ec8c82abe99f13506bf8d148aab563f9b76d5)

Author SHA1 Message Date
Michael Peter Christen c115f3869c enhanced snippet computation and test method in ViewFile
11 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
11 years ago
reger 431a5f9c4e added test case for TextSnippet,
11 years ago
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
reger 727dfb5875 refactore URIMetadataNode to further unify interaction with index
11 years ago
Michael Peter Christen 10cf8215bd added crawl depth for failed documents
11 years ago
Marc Nause 809b4e1fd9 Team added support for URLs with unicode characters in host part to
11 years ago
Michael Peter Christen fdaeac374a - enhanced postprocessing speed and memory footprint (by using HashMaps
11 years ago
Michael Peter Christen 0db8e34625 enhanced webgraph processing
11 years ago
reger e4f49fb175 for searchresults with empty title use filename as title
11 years ago
Michael Peter Christen 9bb7eab389 hacks to prevent storage of data longer than necessary during search and
11 years ago
Michael Peter Christen 2602be8d1e - removed ZURL data structure; removed also the ZURL data file
12 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
12 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
12 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
12 years ago
reger 29967102a2 optimized QueryGoal (reducing mem and computation by removing all_hashes)
12 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
12 years ago
Michael Peter Christen 47b1c81d08 - refactoring
12 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
12 years ago
Michael Peter Christen 89c0aa0e74 added collection_sxt to error documents
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
Michael Peter Christen 2a8b99ea82 remove text_t in search result after snippet has been computed to save
12 years ago
orbiter da621e827e prevent NPE in case RWI is disabled
12 years ago
Michael Peter Christen 8f2d3ce2f9 reduced locking situation in crawler: shifted synchronized location and
12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
reger f301336adf fix: no results with configuration citation reference index switched off
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
orbiter 5dfd6359cb redesign of the QueryParams class: introduced QueryGoal which holds the
12 years ago
Michael Peter Christen 2371ef031c added solr faceted search support to YaCy search results
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen e5b3c172ff removed hack which translated Solr documents to virtual RWI entries
12 years ago
Michael Peter Christen 5d16c23a1f specified more URIMetadata as URIMetadataNode
12 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the
12 years ago
Michael Peter Christen a06930662c replaced some more .getBytes() with UTF8/ASCII.getBytes()
12 years ago
Michael Peter Christen 0cec7e761a enhanced snippet extractor to find snippets also inside of tokens of an
13 years ago
Michael Peter Christen 1533bfd63b refactoring
13 years ago
Michael Peter Christen 8219a445f3 refactoring
13 years ago
Michael Peter Christen 00c1c777fa refactoring
13 years ago
Michael Peter Christen e54ac38095 - some corrections in usage of getFile() and getFileName()
13 years ago
Michael Peter Christen 0cab06c47c refactoring
13 years ago
Michael Peter Christen 9bece5ac5f enhanced snippet fetch - removed a bug that caused documents to be
13 years ago
Michael Peter Christen f9c0e6e950 - Implemented and integrated the URIMetadataNode object which is a
13 years ago
Michael Peter Christen 24d9db1613 snippet retrieval loading processes may use a smaller minimum load time
13 years ago
Michael Peter Christen 1687737771 Abstraction of HandleMap and HandleSet
13 years ago
orbiter 69e743d9e3 - more abstraction for the RWI index as preparation for solr integration
13 years ago
orbiter 0cbda0b2b8 - replaced all length() == 0 and size() == 0 with isEmpty()
13 years ago
Michael Peter Christen 7c1ba99755 removed more unused method parameters
13 years ago