Commit Graph

4134 Commits (56ff9d5fd4287203d1e95a7c65961efe0524cbe1)

Author SHA1 Message Date
orbiter 599c3766c4 added authentication to automated API call
15 years ago
orbiter 727dd9b193 - fixed a bug in robots.txt parser
15 years ago
orbiter 54af9e6b49 - added parsing of robots meta-tag in html headers to detect a noindexing request
15 years ago
orbiter 46c4f8b68a better look-ahead into the crawl queue: show more on crawl monitor
15 years ago
lotus 7b546415dc added svn6695 for windows
15 years ago
orbiter f175f9a2d3 changed way how number of search requests are counted:
15 years ago
orbiter 84222e3b4f fix for auto-updater: delete old libraries before copy of new one
15 years ago
orbiter 93b7ddc27d fix for http://forum.yacy-websuche.de/viewtopic.php?p=19376#p19376
15 years ago
orbiter 8030ed3319 self-healing for lost crawl profile handles
15 years ago
orbiter e3e5e05ec2 fix for problem in ranking setting which was caused by the introduction of a toString() method in serverObjects
15 years ago
orbiter e3ccfb54aa fix for display problem in Firefox on MacOS X
15 years ago
orbiter 564927ce72 redesign of CrawlResult data structures because of OOM occurrences during URL deletion processes.
15 years ago
orbiter ef62d017e5 integrated session id filtering for crawler
15 years ago
orbiter d8d9984913 added framework for session id filtering (not ready yet)
15 years ago
orbiter 2bc36de336 - fix for bug in svn 6669
15 years ago
orbiter d378ca4604 better handling of concurrency in seed
15 years ago
orbiter 6538043d89 fix for http://forum.yacy-websuche.de/viewtopic.php?p=19189#p19189
15 years ago
lotus 945e0ba5a5 allow global search if res. observer disabled index transmission
15 years ago
lotus 8faeedd99a not a fix! for:
15 years ago
lotus 11188cd7eb resource observer now uses the Java 6 method to check for free space. thus, disk observing now needs Java 6 installed.
15 years ago
orbiter be18b5d8cd fix for 'cannot switch back to default language'-bug
15 years ago
orbiter 74e736c903 missing file for last commit
15 years ago
orbiter 308a973503 refactoring of tables data organisation
15 years ago
orbiter 8a76f38d26 Added a new steering servlet that can be used to repeat actions that had been made on the yacy interface. This can be used to:
15 years ago
orbiter 840527689b more simplification of bookmark class
15 years ago
orbiter d77782a8d5 removed bookmark tags file, tags are now stored only in RAM
15 years ago
orbiter ada0ce9de3 refactoring of bookmarks: there is a big performance problem in the bookmarks code and furthermore the bookmarks
15 years ago
orbiter a131ebbcb5 one more fix for NPE, see
15 years ago
orbiter 24060885b6 - added Tables abstraction in data.Tables.java
15 years ago
orbiter 7fdf59a77f misc NPE check
15 years ago
orbiter a512aef6ad fix for http://forum.yacy-websuche.de/viewtopic.php?p=18918#p18918
15 years ago
orbiter 3889438db6 fix for bookmarks
15 years ago
orbiter 23bcca07a3 removed directly linked servlets that had been there to test memory failures that appeared in that servlets
15 years ago
orbiter 69c29acb6e no exception thread dump if parser cannot parse becuase that mime-type/extension is in the deny-set
15 years ago
orbiter 8ce936bcdd added an api recording function: it shall be possible to record
15 years ago
orbiter e80e060ca6 - increased thread priority for server threads
15 years ago
orbiter 234f733a3d - relocation of seed db is better for network switch than re-initialization because of the embedding of the peers object in other objects
15 years ago
orbiter 473b11033d fixed network switch process - crawling did not work after a switch before this fix
15 years ago
orbiter fd7b348973 some fixes for the network switch
15 years ago
orbiter f6731c6240 more logging etc.
15 years ago
orbiter d9169cc6c3 increased proxy load time-out from 30000 to 60000 milliseconds
15 years ago
orbiter 938e806182 tried to fix date problem that may have prevented that foreign peers stay in the network
15 years ago
orbiter bd05e57d3b fix for http://forum.yacy-websuche.de/viewtopic.php?p=18563#p18563
15 years ago
orbiter 5df628a2a4 - added BEncoder class
15 years ago
orbiter 82f57f79e5 more PMD enhancements
15 years ago
orbiter 5d930c96f0 more fixes to search result page navigation
15 years ago
orbiter 8c520f128d reverted a change in ranking process committed this afternoon
15 years ago
orbiter a06f7ddb33 more PMD recommendations
15 years ago
orbiter 18172451a0 better search computation:
15 years ago
orbiter d126d6c1b5 renamed the servlet WatchCrawler_p to Crawler_p
15 years ago
orbiter 66c0a8e849 more PMD recommendations
15 years ago
orbiter 909a4f91c7 added a logging output for crawl starts that shows the URL that can be used to start the crawl again
15 years ago
orbiter bc96d74813 - clean-up of robots.txt parser
15 years ago
orbiter dd459281c8 applied code changes that are recommended by PMD
15 years ago
lotus eac2daf2e8 * reenable DHT if yet enough memory is available
15 years ago
orbiter d77a8f3b3e added some modifications recommended by PMD for better performance
15 years ago
orbiter d1973bae2a code cleanup: removed unused code and unused methods
15 years ago
orbiter a3b8b7b5c5 some redesign of the main menu structure:
15 years ago
lotus ab3cf60dbe fix for npe
15 years ago
orbiter 7f20963b41 add-on to last commit
15 years ago
orbiter eeca2ded92 fix for http://forum.yacy-websuche.de/viewtopic.php?p=18500#p18500
15 years ago
orbiter bb2e03761c - fix for deadlock with 100% CPU during search
15 years ago
orbiter dff4f95c78 some patches to get the torrent parser working
15 years ago
low012 82198acc06 *) minor changes
15 years ago
low012 b75547fc60 *) minor changes
15 years ago
orbiter 57d729e377 fix for negative numbers in network statistic
15 years ago
orbiter 4ac4fe952c patch for npe in bookmarks
15 years ago
orbiter d548bd41ad fix for a npe during search
15 years ago
orbiter 37245430c3 fix for NPE during DHT RWI selection
15 years ago
orbiter a37878b7d5 url parser regex performance hack
15 years ago
orbiter b527d2ebfa fix for media search
15 years ago
orbiter 362b7a929b added extensive memory protection logic to avoid out of memory errors that may be caused by the RowCollection memory allocation function
15 years ago
orbiter 8281e29963 - more configuration for profiling graph (number of events)
15 years ago
f1ori 5f0f6b71b4 * revert last commit, something is more broken than before
15 years ago
f1ori 8c8b642eba * fix timezone problem
15 years ago
orbiter 4782d2c438 fix for search bug that appeared when looking at page 3 of results or further
15 years ago
orbiter 29fde9ed49 better control of ranking order in sort stack
15 years ago
orbiter 66923ebc6c - modified method in RequestHeader that delivers the host name of requester: no more reverse domain lookup (may have killed interface performance in some cases)
15 years ago
orbiter e34e63a039 preset of proper HashMap dimensions: should prevent re-hashing and increase performance
15 years ago
orbiter 4a5100789f replaced _all_ size() == 0 with isEmpty() and all size() > 0 with !isEmpty(). The isEmpty() method is much faster in some cases, especially when used to access badly balanced hashtables where an size() operation becomes a large iteration.
15 years ago
orbiter f4946eaf27 - better thread dump
15 years ago
orbiter 9743b70d1c disabled keep-alive of server, not really needed for speed but a cause for much trouble and memory occupancy
15 years ago
orbiter 491ba6a1ba - some refactoring in workflow
15 years ago
orbiter 969123385b added json and rss output for image search
15 years ago
orbiter d183f8d980 refactoring (moved code from ContentTransformer to TemplateEngine)
15 years ago
orbiter 23aef43786 - better synchronization in SortStack
15 years ago
orbiter 7b1f5b0430 - better media search ranking
15 years ago
orbiter 4df88a4e7a - fixes for missing or bad hashCode computation
15 years ago
orbiter dbdf2570ba added comparator and more fixes for SortStack/SortStore
15 years ago
orbiter 1dff620181 Better implementation of SortStack and SortStore and adoptions in all using classes to implement the necessary Comparable interface and hash code computation.
15 years ago
orbiter fe41a84330 some enhancements in web caching: avoid double loading of response metadata and/or content
15 years ago
orbiter 06d0dcde20 more enhancements to image search
15 years ago
orbiter 4c6312d103 enhanced image search
15 years ago
orbiter 2d8f3ee301 some performance hacks
15 years ago
orbiter 94b2a664f3 - use a static DiskFileItemFactory (one instantiation is enough)
15 years ago
orbiter 013f337d3f - avoid unnecessary host name lookups for localhost
15 years ago
orbiter 20c5d78a5c fix for a ConcurrentModificationException
15 years ago
orbiter 7144d2df6e added crawlReceipt servlet as individual class to examine OOM problem as documented in
15 years ago
orbiter 29fe436e36 - fixed post-ranking including prefer mask
15 years ago
orbiter 5399d1e2bc refactoring (reason: get more abstraction to use the blacklist class; for integration in other servlets)
15 years ago
orbiter 1fa0ac26e9 better protection against NPEs during search/ranking
15 years ago
orbiter 4c99d4683d possible fix for lost crawl profile handles: clean-up job did wrong measurement to see if crawl is still running.
15 years ago
orbiter 18b21eaffe small fixes to search default values and server logging
15 years ago
lotus 6edc168cfe option to disable dht by memory limit:
15 years ago
orbiter 4431b9767e added about 450 replacements for printStackTrace() methods to pipe such traces into the log at DATA/LOG/
15 years ago
orbiter e3025ee691 - new icon for OAI-PMH loading action
15 years ago
orbiter f0b8db93f0 - more abstraction of serverCore thread access
15 years ago
orbiter 2889b9426e missing code for last commit
15 years ago
orbiter b6a8887ff5 better handling of running sessions without explicit hashtable
15 years ago
orbiter 1dc7ea986a added a dynamic keep-alive time-out for http server sessions:
15 years ago
orbiter b0b7a4f9a5 - added function to OAI-PMH reader that can pull all records from a server using an evaluation of the resumption token to get URL to retrieve remaining records
15 years ago
lotus 79251e6f60 configurable disk space hardlimit for dht
15 years ago
orbiter a0e891c63d - some redesign in UI menu structure to make room for new 'Content Integration' main menu containing import servlets for Wikimedia Dumps, phpbb3 forum imports and OAI-PMH imports
15 years ago
orbiter 30f108f97d added stub of oai-pmh importer (not working yet)
15 years ago
orbiter 77c99e500f added more control over memory allocation
15 years ago
orbiter 52470d0de4 - fix for xls parser
15 years ago
orbiter 5e8038ac4d - refactoring of blacklists
15 years ago
orbiter 26fafd85a5 - more refactoring
15 years ago
orbiter 3528b970d6 - refactoring
15 years ago
orbiter a8ce192f63 - shifted main classes to new package net.yacy
15 years ago
orbiter b79f4f062f refactoring of yacy documents and parsers: they depend now only on the kelondro classes
15 years ago
hermens 0fd9540866 Configuration of HTTPDProxyHandler logging
15 years ago
orbiter cee7a05ff2 - de-serialized the pdf parser
15 years ago
orbiter 9db928ce53 replaced fontbox 0.7.3 with fontbox 0.8.0
15 years ago
orbiter c2272785c7 - fix for xlsx and pptx parsing
15 years ago
orbiter c864901087 - moved httpd.mime to defaults path
15 years ago
low012 8829ec5f18 *) made sure that   is replaced with a space and not just deleted in CharacterCoding.java
15 years ago
orbiter 6c347a37eb more options for DocumentIndex
15 years ago
orbiter e7f18ba24b refactoring
15 years ago
orbiter ce8dc575ca refactoring
15 years ago
orbiter bea3b99aff moved table and util classes
15 years ago
orbiter bd876eb4b7 moved io classes
15 years ago
orbiter c0e0e1f422 moved blob classes
15 years ago
orbiter 1e4f8b56ed accumulated classes from different packages into the new rwi package
15 years ago
orbiter 194da25a2f moved kelondro index
15 years ago
orbiter 4446acc8cd moved kelondro order
15 years ago
orbiter f677d534b1 start of a really extensive refactoring which will produce a hierarchical package structure with the domain yacy.net as package root
15 years ago
orbiter ea473e32b8 refactoring
15 years ago
orbiter 735e2737e3 * added index segments
15 years ago
orbiter 09de5da74a once again a performance hack
15 years ago
orbiter 2f6d88403e
15 years ago
orbiter d2615ea5a8 increased memory for scraper buffer to enhance parsing speed
15 years ago
orbiter 4bbbb74ec4 removed not necessary synchronization
15 years ago
hermens 67e5464cc2 Fix for SVN6380: x[] Arrays are unsuitable Keys for Maps without using a proper Comparator.
15 years ago
hermens aeab8c7917 Prevent failed DHT attemps from overwriting newer peer info
15 years ago
hermens 9324b5b6c5 Enhancements to DHT
15 years ago
hermens e49e2d75fe Limit the time Transmission.Chunks stay in the transmissionCloud by using a Map that iterates entires in insertion order.
15 years ago
orbiter 92db7c5d07 increased timeout for index retrieval
15 years ago
lotus 386b9f35f6 activated resource observer for windows 7
15 years ago
orbiter 6e0dc39a7d - some fixes to prevent blocking situations
15 years ago