Commit Graph

116 Commits (44874f2cb9e0c324ae24c33f479f94836d2cc3df)

Author SHA1 Message Date
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed
15 years ago
f1ori e670e1ef8e add charset auto-detection for htmlParser
15 years ago
f1ori ddcd5ae78c fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2989
15 years ago
f1ori 8fe1102452 fix http://forum.yacy-websuche.de/viewtopic.php?p=20889#p18426
15 years ago
orbiter 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes
15 years ago
orbiter 84a023cbc8 fixed several search bugs
15 years ago
lotus d2a3d08c44 avoid div. by zero
15 years ago
orbiter 570ca577c6 performance hacks
15 years ago
orbiter 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements
15 years ago
orbiter c0b08ac59b slighlty changed way of pdf parser integration
15 years ago
orbiter 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
15 years ago
orbiter 24502fe3de performance hacks
15 years ago
orbiter 22047ffad5 enhanced computation speed of many replaceAll string operations
15 years ago
orbiter 3988a95fb5 added ability in rss reader to parse atom feeds
15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
15 years ago
orbiter 0010cd9db1 Support for indexing of RSS feeds!
15 years ago
orbiter 844f158686 - removed dependencies in header framework:
15 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
15 years ago
orbiter e10cd115a9 - added a new RSS reader interface. This is not finished but you can now load and look at RSS feeds. It will be used to index RSS feeds in a way that is appropriate for such kind of data.
15 years ago
orbiter 933dc1a600 removed old rss parser (will be replaced with parser from cora package)
15 years ago
orbiter 5924a0d851 - enhanced concurrency in database index access for multicore
15 years ago
orbiter 989948e1a9 fixed generic image parser
15 years ago
orbiter 27d8a8b53e removed wrong com.sun.codec class access in generic image parser
15 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
low012 d4851441b0 *) Added Android packages to parser in order to be able to create a decentralized search for direct downloads of Android apps.
15 years ago
orbiter 150cf42a1b migrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2
15 years ago
orbiter 5a4684f21f allow words with length >= 2 (you can't search for 'wm' with 3-letter words...)
15 years ago
orbiter 37b8827a7a - removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection.
15 years ago
orbiter 777195e8d1 more abstraction for access of LoaderDispatcher and cache
15 years ago
orbiter 7bcfa033c9 more abstraction of the htcache when using the LoaderDispatcher:
15 years ago
orbiter 87087f12fe - scanned remote search process and enhanced some data structure and synchronizations here and there
15 years ago
orbiter de4f30bb2e UTF-8 fix
15 years ago
orbiter 3a1cebb598 bugfixes
15 years ago
orbiter 60e71876ad - more abstraction (HashMap -> Map)
15 years ago
orbiter 2eea806005 less errors in image parser
15 years ago
orbiter 11639aef35 - added new protocol loader for 'file'-type URLs
15 years ago
orbiter 98c1d65415 - show up to 10 locations (maps) after search (instead of a max of 5)
15 years ago
orbiter 9842fab6e4 - fixes to query parameter
15 years ago
orbiter 1defd580bc - added option to localization search to distinguish between a search for a location according to the search word only or for the relation between a web search results and locations found in the metadata fields
15 years ago
orbiter bd0a9df895 fix for bad location double check
15 years ago
orbiter e43e61e502 added another geolocalization data source: GeoNames
15 years ago
orbiter 2126c03a62 - removed download-limit that can be given for the crawler for non-crawler download tasks. This was necessary because the same procedure was used for other downloads like for the download of dictionary files where a limit is not useful. The limit still stays for the indexer
15 years ago
orbiter 789c6b26ce added a location search service: using the following servlet/example:
15 years ago
orbiter f23cbd2dab more bugfixes to date parser
15 years ago
orbiter cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data
15 years ago
orbiter 6eba2cb96b fix in bmp parser
15 years ago
orbiter a7d038bb7a The oai ListFriends source list becomes configurable: just write them into defaults/oaiListFriendsSource.xml
15 years ago
orbiter 5fbf866cae - fixed resumption token generation for oai-pmh import
15 years ago
orbiter fc5efcc05a enhanced and fixed OAI-PMH import
15 years ago
orbiter 90c3e5d6f6 - cleanup, removed unused imports
15 years ago