Commit Graph

43 Commits (9b0c4b1063f3ab8aa725f00fbf93c37cf88d8a4d)

Author SHA1 Message Date
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information 17 years ago
apfelmaennchen 3768a1bd32 set encoding="UTF-8" for getpageinfo_p.xml 17 years ago
apfelmaennchen 5e8bd0f29c small fixes to getpageinfo_p.xml and htmlFilterContentScraper.java with respect to keyword extraction 17 years ago
apfelmaennchen 5b2a57bfd0 - /xml/util/getpageinfo_p.xml added <desc> and <lang> tags 17 years ago
apfelmaennchen cd1ac5bb90 - fixed security issue with /xml/util/ynetSearch.xml 17 years ago
orbiter c73cf05ddd tried to fix local search in yacy-ui 17 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once: 17 years ago
danielr 3bb870bfcd added final where possible 17 years ago
orbiter c3d461d191 - removed superfluous copyright statement 17 years ago
orbiter 3ca98fee42 removed superfluous copyright statement 17 years ago
orbiter a6719dfd2b - refactoring of robots parser 17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections. 17 years ago
danielr 7feae906aa - organize imports 17 years ago
orbiter 0bfe76d3d0 fix for compile bug 17 years ago
f1ori fd8bd5d0d1 * fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1176 (encoding issue) 17 years ago
apfelmaennchen 4f9b8a6ef8 /xml/util/getpageinfo_p.java: fixed problem with empty tags and added recognition of compound tags e.g. "DER SPIEGEL" 17 years ago
apfelmaennchen 1a3b87baaa ... 17 years ago
apfelmaennchen 4a932194a9 adjusted license text and copyright 17 years ago
apfelmaennchen 37505c0665 implemented ynetSearch.java to allow ajax cross domain search (e.g. sciencenet) for YaCy-UI... 17 years ago
orbiter 1995faef8d - refactoring of Colage back-end: move to plasma package 17 years ago
danielr 5c3c1fdf41 replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;) 17 years ago
orbiter 541b817502 refactoring of switchboard queueing 17 years ago
orbiter fa1090113d - next try to fix the networking problem: 17 years ago
fuchsi 0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects: 18 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation: 18 years ago
orbiter 511dcbb172 fixed encoding bug made in SVN 3993 18 years ago
orbiter 36a37f758b fix for oom exception during release download 18 years ago
theli 339153d40e *) favicons that are specified in the document content via html link-tags 18 years ago
allo 5fc00871a9 getpageinfo/sitemap bugfix 18 years ago
allo e7da3d2340 fixed sitemap url in getpageinfo 18 years ago
karlchenofhell 601fc7d1c5 - added source to J7Zip-modifed.jar and it's license (changelog is still to come) 18 years ago
theli 7d9259e44d *) Bugfix for umlaut problem 18 years ago
theli 6f46245a51 *) Bookmarks: Ajax icon is displayed while loading title 18 years ago
orbiter d34f10c63d some tests with reverse dns lookup 19 years ago
orbiter 5a40ea7866 refactoring of wget string list generation 19 years ago
orbiter df1629b05a - code cleanup 19 years ago
theli 92e986bb91 *) adding missing return prop (requested by allo) 19 years ago
allo f0529fe53e update for ftp urls 19 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL 19 years ago
allo 62664d7252 AJAX Check for robots.txt before crawling. 19 years ago
allo ba96cefe0c packages for xml/* 19 years ago
allo 26bab876db more del.icio.us Api 19 years ago
allo 2e2fa99501 bookmarksManager: 19 years ago