Commit Graph

8245 Commits (08dcf3e5d1662bfe2de1e908cdd8309c9a814bce)
 

Author SHA1 Message Date
Michael Peter Christen 08dcf3e5d1 hack to get all results if the actual number is between 10 and 64
13 years ago
Michael Peter Christen 19efbf1b0f - apply directDocByURL to NOLOAD Queue
13 years ago
Michael Peter Christen 5c66880be2 fix for search result selection in case that contentdom is not set
13 years ago
Michael Peter Christen 659178942f - Redesigned crawler and parser to accept embedded links from the NOLOAD
13 years ago
Michael Peter Christen 3bea25c513 increased image preview size
13 years ago
Michael Peter Christen a3badd3205 changed search process for images: no more media snippet load process,
13 years ago
Michael Peter Christen f5efdb21fd refactoring
13 years ago
Michael Peter Christen 4aa0eedead one more scroogle...
13 years ago
Michael Peter Christen 347612ddd4 removed scroogle parser
13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not
13 years ago
Michael Peter Christen 14f67f217c refactoring of ContentDomain: now subclass of Classification
13 years ago
Michael Peter Christen 8a08c96a82 removed dependency from logging
13 years ago
Michael Peter Christen a1a5b015d8 refactoring: moved document Classification to cora package
13 years ago
Michael Peter Christen a5d7da68a0 refactoring: removed dependency from switchboard in Balancer/CrawlQueues
13 years ago
Michael Peter Christen 33d1062c79 refactoring: the cache belongs to the crawler
13 years ago
Michael Peter Christen 8429967ea7 no more SVN
13 years ago
Michael Peter Christen 0466bb0ddf no more SVN..
13 years ago
Michael Peter Christen 4844e124b1 one more warning in case that crawling is paused because of low disk
13 years ago
Michael Peter Christen 0ec2713af8 'download'
13 years ago
Michael Peter Christen 2be327b5ab update location update
13 years ago
Michael Peter Christen f30c577fdb add hint to speed up search results
13 years ago
Michael Peter Christen 6b133de3e9 add hint for consulting support
13 years ago
Michael Peter Christen 4d5da75814 fix for parser problem if a <a>-tag is 'within' html tags with unclosed
13 years ago
Michael Peter Christen eb2c8ffa62 display is not used any more
13 years ago
Michael Peter Christen 91a86f0b06 fixed to network graph testing
13 years ago
Michael Peter Christen f31ad84d98 automatic generation of blacklist pattern, see
13 years ago
Michael Peter Christen 7b5b9baee0 added citation rank to ranking profile
13 years ago
Michael Peter Christen 046f3a7e8d check if httpc has decompressed the release file and rename the file
13 years ago
reger 06951ef751 remove heuristic scroogle from search option help text in index.html
13 years ago
Michael Peter Christen e377092198 fix to xml output format
13 years ago
Michael Christen 41be98dc9d extended webstructure api to show together with incoming links also
13 years ago
Michael Christen 02e4dedff2 fix to url citation collection
13 years ago
Michael Christen e32055aa15 added stub classes for
13 years ago
Michael Christen ac5d124ee0 experimental implementation of a citation ranking as post-ranking
13 years ago
Michael Christen 8f89c8ef07 added information about inbound, outbound and citation links into
13 years ago
Michael Christen 71649a1296 added an api to retrieve the new citation.index with the
13 years ago
Michael Christen 8fc86fe397 added storage of full anchor link structure:
13 years ago
Michael Christen 22f05c83ff fixed default must-match filter for full domain crawls - the old filter
13 years ago
Lotus 3e61287326 some better feedback on properties change
13 years ago
Lotus 96ac95cff9 added hint how to change integration options
13 years ago
Thomas 4f61b8fd82 Fixes for compare-search
13 years ago
Thomas e0680de7b3 Remove Scroogle from compare-search, Scroogle is dead
13 years ago
Lotus 78f0d8f046 no focus on preview frames for search integration
13 years ago
Lotus 0b3f39136e allow custom ppm lower than minimum button on /Crawler_p.html
13 years ago
Lotus e14eb9de82 checkalive.sh: try to fetch only once (default: 20)
13 years ago
Lotus 7792ac6406 fix links & bug #163
13 years ago
Michael Peter Christen 532c7cf827 added physics experiment to the graph plotter. not active by default
13 years ago
Michael Peter Christen aba9b1bfa0 better names for elements of a linked graph
13 years ago
Michael Peter Christen 0cc0290978 bugfix for a must-not-match pattern check. This bug did not make the
13 years ago
Michael Peter Christen 2fc8ecee36 ConcurrentLinkedQueue has a VERY long return time on the .size() method.
13 years ago