Commit Graph

4679 Commits (2512119e5f6641e176e7ec80346525dc9ceff350)

Author SHA1 Message Date
orbiter b6f09a475d - added an index profile editor in the /indexFederated_p.html servlet for solr indexes
14 years ago
f1ori a17351dcfe * navigation bar for filetype constraints
14 years ago
f1ori 96957375cc * fix url proxy for relative links and chromium
14 years ago
orbiter 9ebc75db4b fix for channel authorization
14 years ago
orbiter 6d9e5865ee faster appearance of search result page (but complete search time is the same)
14 years ago
orbiter f7ca84cfc0 enhanced template engine
14 years ago
orbiter 84c9658644 added a file type navigator
14 years ago
orbiter 31283ecd07 - added a search option to filter for specific network protocols, e.g. to get only results from ftp servers. Just add '/ftp' to your search.
14 years ago
orbiter 4b425ffdd2 fix for http://bugs.yacy.net/view.php?id=41
14 years ago
orbiter 7db208c992 performance hacks: more pre-allocated StringBuilder
14 years ago
orbiter 87bd559c42 fixed warning
14 years ago
orbiter f30d36b101 enhanced template engine
14 years ago
orbiter 115abc8917 - more attributes for search progress bar
14 years ago
sixcooler 7bfa6bb4b6 prevent getting a yacySeed from zero-length-hash-string by chance
14 years ago
orbiter bce280a308 update on options for interface graphics
14 years ago
orbiter 2683162ec5 - added more options to access grid picture, web structure picture and network graphics
14 years ago
orbiter 0c1b29f3c9 - applied many small performance hacks
14 years ago
f1ori 900dacbf97 * improve link rewriting in proxy-url
14 years ago
f1ori dc855d881b * further improve proxyurl
14 years ago
orbiter a7a6b392f5 code cleanup
14 years ago
orbiter fe0c08455b more concurrency (enhancement) hacks
14 years ago
orbiter 0e9a99cb05 another resource hack
14 years ago
orbiter 535b6b953c more hacks to omit superfluous string object allocation
14 years ago
orbiter 87082f407e less String object creation during search
14 years ago
orbiter ab5a16b957 less memory occupation during ranking and faster host navigator
14 years ago
orbiter 1489ebeedf one more hack to free ram for search events
14 years ago
f1ori ddcc333acc * fix negative result counts
14 years ago
orbiter fa734bdf9f better memory protection in search logger
14 years ago
orbiter dbea40d536 - changed snippet fetch strategy logic: do not check if entry is in cache. This should reduce IO load on the HTCACHE, which is a showstopper during a large number of search requests
14 years ago
orbiter 4bea3f9714 hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources:
14 years ago
orbiter 746e3c3b06 Replaced a widely-used Property Object in the httpd with HashMap<String, Object> which is not synchronized like Properties
14 years ago
f1ori 14e1666b21 * fix replacing regexes in url proxy
14 years ago
orbiter e28bd0d038 fix for some possible causes of memory leaks
14 years ago
orbiter 09ba6814c0 - non-blocking word hash computation with dynamic digest object generation (this was important!)
14 years ago
orbiter 10e2f588f8 - enhanced ybr ranking computation
14 years ago
orbiter bd55dcee50 - commented out experimental distributed ranking loading
14 years ago
orbiter d1dbbd956a always use a template method cache even if the template cache flag is set to false. This flag is only used to make dynamic updates to the template files, not to make dynamic updates to the rewrite methods (which is not possible without recompiling). Low memory usage is guaranteed by the usage of soft references which are dropped before an OOM is thrown
14 years ago
orbiter 0d040ff6bb fix for bug 0000036: no crawling of https pages
14 years ago
orbiter 3ed4a09368 small features, some bug fixes and performance hacks
14 years ago
orbiter e55c254f7b enhanced logging
14 years ago
orbiter b45701d20f this is a re-implementation of the YaCy Block Rank feature
14 years ago
orbiter 021840e5ba removed (almost) deadlocks and unnecessary CPU load
14 years ago
orbiter 123375bfba added a new yacy protocol servlet 'idx'. This returns an index to one of the data entities that is stored in YaCy.
14 years ago
orbiter 1d8b0f74f4 one more fix for SVN 7713
14 years ago
orbiter 0960261769 fix for svn 7713
14 years ago
orbiter 5b579e21a3 code cleanup
14 years ago
orbiter 039126cfaf better handling of on/off switched solr indexing
14 years ago
orbiter 9248a4eef4 reduce the effect of 'Bildersuche findet generierte HTML-Seiten als Bilder' (image search finds generated HTML pages as images)
14 years ago
orbiter 0621a15f89 fix for wrong search result counter: added a counter for all filtered out entities
14 years ago
orbiter 9c33b2fb58 fix for String Matcher in case that no snippet is returned (NPE)
14 years ago
orbiter 76f2817e00 a fix for the snippet computation and hopefully better snippets
14 years ago
orbiter deda54d684 - relaxed matching of string-search (this is now case-insensitive)
14 years ago
orbiter 6e42d4de88 - added full-String search function: find things that match exactly what is quoted in the query
14 years ago
apfelmaennchen 8b8db2aaba YMarks: some small changes/fixes
14 years ago
apfelmaennchen 441035f1f4 YMarks: some improvements to flexigrid quick search on YMarks.html
14 years ago
orbiter 6fa439c82b - refactoring of robots
14 years ago
apfelmaennchen e7c2ea193b YMark:
14 years ago
orbiter 3b578a28ef some patches to prevent empty or bad IP information from being broadcast
14 years ago
orbiter 361841df16 another patch according to http://bugs.yacy.net/view.php?id=26#c36
14 years ago
orbiter 37fede9d30 better logic for proper seed ip recognition and better error messages
14 years ago
orbiter 8b95a26866 better magic
14 years ago
orbiter 2700a58e5a added a magic to the peer ping that will be used in case that the contacting peer requests that its reported IP shall be used for a back-ping. The back-ping now also returns the same magic, which makes it possible for the requested peer to verify that the back-pinged peer is actually the same peer.
14 years ago
orbiter 8879cc1db2 removed System.out.println
14 years ago
orbiter f6077b3cc0 added more attributes for html parser and enhanced data structures
14 years ago
f1ori 0b02083e97 * function for simple crawl of one url
14 years ago
f1ori d671de8c17 add ranking weight to json-search-results
14 years ago
orbiter d8e934c085 better abstraction of http client identification
14 years ago
sixcooler a3e707283d not using HTTPConnector anymore
14 years ago
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
14 years ago
low012 bc84d2bc9d *) fixed typo in stop script
14 years ago
apfelmaennchen b2281f0b7d YMark: intermediate work towards flexigrid support
14 years ago
low012 06d50fd801 *) fixed stupid bug (introduced in r7663 by myself) which caused wrong parsing of Wiki pages
14 years ago
apfelmaennchen 60412d2bb3 YMark:
14 years ago
orbiter 3d5104d357 - fixed a bug in crawl start with file name (npe in new url)
14 years ago
orbiter fd3baa9025 fix for http://bugs.yacy.net/view.php?id=24
14 years ago
low012 2e9694c9e9 *) removed recursion which hopefully prevents exception
14 years ago
apfelmaennchen a2e86daae9 YMark: more bug fixes
14 years ago
apfelmaennchen 62855f9567 YMark: code clean up and some small fixes
14 years ago
apfelmaennchen 667e912b19 YMark:
14 years ago
apfelmaennchen a0e4960a4d YMark:
14 years ago
orbiter 958ff4778e enhanced location search:
14 years ago
orbiter 19fd13d3bc Added federated index storage to solr.
14 years ago
orbiter c17d102bd8 enhanced speed for OrderedScoreMap inc method and size comparison in concurrent environments
14 years ago
orbiter 01690eab86 fix for mediawiki importer and wikicode parser
14 years ago
orbiter c5352e6872 added new SearchResult class (to be used later)
14 years ago
orbiter 4c013d9088 more UTF8 getBytes() performance hacks
14 years ago
apfelmaennchen 78d6d6ca06 refactoring for ymarks
14 years ago
orbiter a47bdc405b better logging for robinson selection according to peer tag
14 years ago
orbiter cafcb1f9ed removed the DNS resolving for web structure computation from the indexing queue and placed it in a concurrent computation queue that does not block the crawler. Makes crawling faster and less DNS-speed-dependent
14 years ago
orbiter 17530ca7b5 fix for bug http://bugs.yacy.net/view.php?id=10
14 years ago
orbiter 96c32e87b0 fixes to crawler and new user-agent crawl-delay handling
14 years ago
orbiter b2fe4b7b1a added a handling of appearances of yacy bot entries in robots.txt if this entry addresses the yacy peer
14 years ago
orbiter cb6f709a16 - enhancements in surrogate reading
14 years ago
low012 1ff9947f91 *) added new user right: extended search right (allows to define users who can query more results than anonymous users)
14 years ago
orbiter 156cf02703 - added an index constraint 'has location' to the condenser
14 years ago
orbiter 0430a94eaa the location search now shows only locations that are attached as metadata to web pages, not re-evaluated locations
14 years ago
orbiter 9b25d07295 - added geo information parsing to html parser
14 years ago
f1ori efcf37a953 * show info in log, if robots.txt is rejected due to wrong mime-type
14 years ago
low012 16cd919795 *) fixed Exceptions which caused 500 error when entering invalid URL mask or invalid prefer mask, invalid masks are ignored, error message is displayed on yacysearch.html (what about yacysearch.rss and yacysearch.json?)
14 years ago
low012 1a24917cea *) fixed NPE which occurred when empty String was entered as search word
14 years ago
orbiter b1a8d0c020 enhancements to web cache and less strict caching rules
14 years ago
orbiter f3baaca920 - enhancements to DNS IP caching and crawler speed
14 years ago
low012 e7860b1239 *) <mode="Homer">D'oh!</Homer>
14 years ago
low012 82f1580a60 *) trying to fix ConcurrentModificationException
14 years ago
low012 9f0286b380 *) fixed potential "java.lang.IllegalArgumentException: Illegal group reference" which occurred if special characters which are also used as metacharacters in regular expressions were used inside of <pre>...</pre> (see: http://veerasundar.com/blog/2010/01/java-lang-illegalargumentexception-illegal-group-reference-in-string-replaceall/)
14 years ago
orbiter 78d4c45d09 enhancement during search process: fast fail of search in case that all index feeders have terminated.
14 years ago
orbiter ba03ca8620 added more configuration options for search:
14 years ago
f1ori e0c7d490f9 * fix bug #6
14 years ago
orbiter a50f28e6e7 - fixed missing save operation for peer name change
14 years ago
orbiter 2b5f8585bf performance hack for Balancer and ip address parsing
14 years ago
low012 2861d0888a *) simplified code *) fixed potential NumberFormatExceptions
14 years ago
orbiter 1989ebc24b removed more warnings
14 years ago
orbiter b62b79675b removed type cast warnings
14 years ago
orbiter 8f11d3a5bb redesigned the ScoreMap classes:
14 years ago
orbiter a564230c48 more enhancements against blocked threads that occurred in seed age evaluation (blocks httpd in some cases)
14 years ago
orbiter dc0db3550e avoid string conversion
14 years ago
orbiter 694fa3a2a5 - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion
14 years ago
orbiter 30aed9824a moved getBytes() to UTF8.getBytes() to use a default String encoding
14 years ago
orbiter 1214615185 fix for 'invisible entry', see http://forum.yacy-websuche.de/viewtopic.php?p=22133#p22133
14 years ago
orbiter 3820525464 more memory protection: auto-flush of caches in case of memory shortage
14 years ago
orbiter 7962d35425 - removed file upload function in crawl start and replaced it with an input field for a file path where the crawl start file is loaded. This was necessary to support the API steering for file crawl starts, for two reasons:
14 years ago
orbiter e1b6916423 always try to guess the size of a StringBuilder to prevent too many memory re-allocations
14 years ago
low012 3b40b98256 *) set SVN properties
14 years ago
orbiter 2af8e33773 better performance computing search targets with index abstracts
14 years ago
orbiter 619b561a4a enhanced secondary search: index abstracts decompression is now much faster and does not cause strong CPU load after several searches with more than one word
14 years ago
orbiter 27ecdb5444 use less peers for remote search
14 years ago
orbiter cb1f49d0f2 replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'.
14 years ago
orbiter 7138f4036b less synchronization, better thread dump tool
14 years ago
orbiter 8d14916c74 more patches for a better out-of-memory management
14 years ago
orbiter c2c5b12882 - even less memory for circle tool
14 years ago
orbiter 799c534935 one more patch against OOM during secondary remote search
14 years ago
orbiter 77b1e921a9 this assert prevents a network operation in case of sabotage and must therefore be removed
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
orbiter 6dfaf6fef7 fix for bug in deletion of old seeds
14 years ago
orbiter 993b9bc1a8 memory/performance hacks, less synchronization, better concurrency
14 years ago
orbiter 42d90664f3 - fixed a memory leak in the httpc.post method (no finish)
14 years ago
orbiter 38dce547c0 better concurrency (less locking on date formatting) more logging and minor bug fixes
14 years ago
f1ori 59dea3a284 * implement url proxy, a proxy via the url http://peer:port/proxy.html?url=http://domain.tld/path
14 years ago
mikeworks 8b7b783c49 Tray.java: Broke the build with a non-UTF-8 encoded file and French umlauts (unmappable character for encoding UTF8)
14 years ago
mikeworks db65ada467 Tray.java: Added localization for French tray icon command, although this can probably also be done better than with if statements (preferably also from the locales file)
14 years ago
orbiter 89d337841c more logging for OOMs
14 years ago
orbiter b1781d7aae some more performance hacks
14 years ago
orbiter 5e186e0122 continuing the fight against deadlocks during time formatting: better caching.
14 years ago
orbiter dec24244cf added convenience class to generate UTF StringBody objects with a default UTF8 charset.
14 years ago
orbiter 1110d16af9 performance hack: replaced generic row.getColBytes() call with row.getPrimaryKeyBytes() where the column is 0
14 years ago
orbiter 19b2a50578 - enhanced date formatter cache
14 years ago
orbiter f2e8ffd768 enhancement in synchronisation
14 years ago
orbiter ad7fcb9d61 Enhanced Base64Order transformation: less overhead (transformation between StringBuilder and byte[])
14 years ago
orbiter 0ce17d823a - fixed bug in ordering
14 years ago
orbiter dec4f36700 - fix for missing favicons in search widgets
14 years ago