185 Commits (c17d102bd829746091451f070e6ec4ca060eaaea)

Author SHA1 Message Date
orbiter fc5efcc05a enhanced and fixed OAI-PMH import
15 years ago
sixcooler c2098f9399 close unused connections if there are too many for DHT
15 years ago
orbiter 3aad50d38e :-(
15 years ago
orbiter 9edd38fbc5 connectionCount limit too low?
15 years ago
orbiter 7a05db0fcb fixed to prevent that too many open connections exist
15 years ago
orbiter 2bc3cba6f1 - fix for 'do not write to cache' rule.
15 years ago
orbiter 8b8107b2a3 reduced IO-load and synchronization/blocking
15 years ago
orbiter 1a8a134e0c continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775 and continued in SVN 6790
15 years ago
orbiter 55d8e686ea performance hacks
15 years ago
hermens 2f90f0ad56 Remove asserts blocking proxy use cases
15 years ago
orbiter 25aef069a6 continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775
15 years ago
orbiter a85c5bb8a7 added support for multiple (fail-over) network definition locations when http-locations are given. multiple locations can be given with a comma-separated list of urls pointing to the network definition file
15 years ago
orbiter 1e8e79b9ef redesign of reference hash (URL-hash) parameter hand-over:
15 years ago
orbiter 748abfcffa added patches to prevent yacy-protocol DoS settings
15 years ago
orbiter e820ed061a avoiding excessive DNS lookups to determine localhost
15 years ago
orbiter 3300930fc5 - (almost) fixed FTP crawler
15 years ago
orbiter 617dfbbd06 allow 'authorization by encoded password' also if requesting client is not from localhost but from the same host as yacy is running on.
15 years ago
orbiter 727dd9b193 - fixed a bug in robots.txt parser
15 years ago
orbiter 54af9e6b49 - added parsing of robots meta-tag in html headers to detect a noindexing request
15 years ago
orbiter f175f9a2d3 changed way how number of search requests are counted:
15 years ago
orbiter 8030ed3319 self-healing for lost crawl profile handles
15 years ago
orbiter ef62d017e5 integrated session id filtering for crawler
15 years ago
orbiter d8d9984913 added framework for session id filtering (not ready yet)
15 years ago
orbiter 74e736c903 missing file for last commit
15 years ago
orbiter d77782a8d5 removed bookmark tags file, tags are now stored only in RAM
15 years ago
orbiter 24060885b6 - added Tables abstraction in data.Tables.java
15 years ago
orbiter 7fdf59a77f misc NPE check
15 years ago
orbiter 69c29acb6e no exception thread dump if parser cannot parse because that mime-type/extension is in the deny-set
15 years ago
orbiter 8ce936bcdd added an api recording function: it shall be possible to record
15 years ago
orbiter 234f733a3d - relocation of seed db is better for network switch than re-initialization because of the embedding of the peers object in other objects
15 years ago
orbiter 473b11033d fixed network switch process - crawling did not work after a switch before this fix
15 years ago
orbiter fd7b348973 some fixes for the network switch
15 years ago
orbiter f6731c6240 more logging etc.
15 years ago
orbiter a06f7ddb33 more PMD recommendations
15 years ago
orbiter dd459281c8 applied code changes that are recommended by PMD
15 years ago
orbiter d77a8f3b3e added some modifications recommended by PMD for better performance
15 years ago
orbiter dff4f95c78 some patches to get the torrent parser working
15 years ago
low012 82198acc06 *) minor changes
15 years ago
orbiter 57d729e377 fix for negative numbers in network statistic
15 years ago
orbiter 362b7a929b added extensive memory protection logic to avoid out of memory errors that may be caused by the RowCollection memory allocation function
15 years ago
orbiter 8281e29963 - more configuration for profiling graph (number of events)
15 years ago
orbiter e34e63a039 preset of proper HashMap dimensions: should prevent re-hashing and increase performance
15 years ago
orbiter 4a5100789f replaced _all_ size() == 0 with isEmpty() and all size() > 0 with !isEmpty(). The isEmpty() method is much faster in some cases, especially when used to access badly balanced hashtables where a size() operation becomes a large iteration.
15 years ago
orbiter 491ba6a1ba - some refactoring in workflow
15 years ago
orbiter 1dff620181 Better implementation of SortStack and SortStore and adaptations in all using classes to implement the necessary Comparable interface and hash code computation.
15 years ago
orbiter 4c6312d103 enhanced image search
15 years ago
orbiter 4c99d4683d possible fix for lost crawl profile handles: clean-up job did wrong measurement to see if crawl is still running.
15 years ago
orbiter 4431b9767e added about 450 replacements for printStackTrace() methods to pipe such traces into the log at DATA/LOG/
15 years ago
orbiter b0b7a4f9a5 - added function to OAI-PMH reader that can pull all records from a server using an evaluation of the resumption token to get URL to retrieve remaining records
15 years ago
orbiter a0e891c63d - some redesign in UI menu structure to make room for new 'Content Integration' main menu containing import servlets for Wikimedia Dumps, phpbb3 forum imports and OAI-PMH imports
15 years ago
orbiter 52470d0de4 - fix for xls parser
15 years ago
orbiter 5e8038ac4d - refactoring of blacklists
15 years ago
orbiter 26fafd85a5 - more refactoring
15 years ago
orbiter 3528b970d6 - refactoring
15 years ago
orbiter a8ce192f63 - shifted main classes to new package net.yacy
15 years ago
orbiter b79f4f062f refactoring of yacy documents and parsers: they depend now only on the kelondro classes
15 years ago
orbiter e7f18ba24b refactoring
15 years ago
orbiter ce8dc575ca refactoring
15 years ago
orbiter bea3b99aff moved table and util classes
15 years ago
orbiter 4446acc8cd moved kelondro order
15 years ago
orbiter f677d534b1 start of a really extensive refactoring which will produce a hierarchical package structure with the domain yacy.net as package root
15 years ago
orbiter ea473e32b8 refactoring
15 years ago
orbiter 735e2737e3 * added index segments
15 years ago
orbiter 09de5da74a once again a performance hack
15 years ago
orbiter 6e0dc39a7d - some fixes to prevent blocking situations
15 years ago
orbiter 04a548a1e3 - temporarily integrated the transferURL servlet as a static class instead of as a class that is called using reflection to investigate the OOM problems in that class
15 years ago
orbiter 6aa474f529 - better logging for web cache access and fail reasons
15 years ago
orbiter 3671c37989 added experimental oai-pmh reader and integrated it with the existing dublin core parser
15 years ago
orbiter e627f75415 one more fix to badwords and stopwords
15 years ago
orbiter 721b88efbd - fixed a problem loading blacklists with new yacycore.jar
15 years ago
orbiter 68465c37af added a convenience class to add files into a YaCy index
15 years ago
orbiter 573d03c7d7 added configuration to enable ram table copy
15 years ago
orbiter 700218846c disabled or removed sleep calls
15 years ago
orbiter 67eddaec4b changed way to integrate dictionary files:
15 years ago
orbiter d656a94f55 fix for bad paths in dictionary processing
15 years ago
orbiter 3b9aaf9e9f - inserted new library tests inside DidYouMean
15 years ago
orbiter 1a9cfd8718 some performance hacks (CPU only, not IO)
15 years ago
orbiter 72e5407115 refactoring of snippet cache
15 years ago
orbiter 72ac5bd80f refactoring of search process.
15 years ago
orbiter 92edd24e70 fixed problem with switching of networks
16 years ago
orbiter c4ae2cd03f fixed bug that caused deletion of crawl profiles at every application startup
16 years ago
orbiter 161d2fd2ef redesign of access to the HTCache (now http.client.Cache):
16 years ago
f1ori ba2e6de538 fix empty version string again
16 years ago
orbiter 4da9042e8a code simplification
16 years ago
orbiter 1d8d51075c refactoring:
16 years ago