Commit Graph

426 Commits (a26f1b3cd749d336963967481b70e50659ec61e7)

Author SHA1 Message Date
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
sixcooler 7d53ac86a3 fix for Blacklist (-Administration)
11 years ago
Roland Haeder e2ee412160 Use SwitchboardConstants.LISTS_PATH_DEFAULT instead of 'DATA/LISTS'
11 years ago
Roland Haeder 59225487ea Fix for blacklist export, also applied the filename filter here
11 years ago
Michael Peter Christen 4c242f9af9 always use a default value for boolean options to have transparency for
11 years ago
orbiter 86b514cf46 added load info to status_p.xml
11 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
11 years ago
orbiter 232100301c removed double-ocurring value assignments
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Roland Haeder ebbb3bc5c1 Fixed CHMOD on many files + added missing loggers (e.g. jena) and made some noisy loggers quiet
12 years ago
Michael Peter Christen bcc623a843 refactoring of load_delay: this is a matter of client identification
12 years ago
orbiter 2be456e7fb added a postprocessing field into api/status_p.xml to show if the
12 years ago
orbiter c4efb612e2 added list of crawls to status_p.xml
12 years ago
orbiter dac88561ae minimum access time has a tight connection to ClientIdentification,
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
orbiter c8e94ad7c7 fix for citation search in case that the citation is very fresh
12 years ago
Michael Peter Christen fd1776a3b0 added a new 'Citations' function: each search result item can now be
12 years ago
Michael Peter Christen 8f2d3ce2f9 reduced locking situation in crawler: shifted synchronized location and
12 years ago
Michael Peter Christen 038f956821 fix for sitemap detection: the sitemap url was not visible if it
12 years ago
Michael Peter Christen 008288719c fix for schema export to consider also automatically generated
12 years ago
Michael Peter Christen 58e1e6fa2b fixes to schema
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Michael Peter Christen b6de1f42dc Full redesign of solr connection architecture. This was done to support
12 years ago
Michael Peter Christen dee8b24d3c better error handling for bookmarks
12 years ago
Michael Peter Christen 3834829b37 bugfixes and more logging for solr connector
12 years ago
Michael Peter Christen 99185d7048 one more fix for author_sxt
12 years ago
Michael Peter Christen b6ae6262f6 - add the copyField author_sxt only if author exists
12 years ago
Michael Peter Christen e23a596c1d added a copyField for author_sxt for automated schema generation
12 years ago
Michael Peter Christen 244b157299 fix for external solr schema definition
12 years ago
reger f301336adf fix: no results with configuration citation reference index switched off
12 years ago
Michael Peter Christen cb5cbec14d distinguishing modified query string and original query string
12 years ago
Michael Peter Christen 3de784c8dd replaced more split and replaceAll missing pattern pre-compilation with
12 years ago
Michael Peter Christen 8fc3679c66 using more pre-compile pattern for split methods
12 years ago
Michael Peter Christen 4eab3aae60 removed overhead by preventing generation of full search results when
12 years ago
Michael Peter Christen 952e143580 FINALLY YaCy can now search for full strings using double- or
12 years ago
orbiter 5dfd6359cb redesign of the QueryParams class: introduced QueryGoal which holds the
12 years ago
Michael Peter Christen 5fd3b93661 added deletion of hosts during crawl start if deleteold option was given
12 years ago
Michael Peter Christen d64445c3cb because we have the inurl:<term> - searchmodifier, we don't actually
12 years ago
Michael Peter Christen 2d9e577ad0 replaced the custom robots.txt loader by the standard http loader
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of &amp; parts inside of the
12 years ago
Michael Peter Christen abab291162 made the index schema retrieval public and allow cross-domain retrieval
12 years ago
Michael Peter Christen 1533bfd63b refactoring
12 years ago
Michael Peter Christen 872f83ebe0 refactoring
12 years ago
Michael Peter Christen 8219a445f3 refactoring
12 years ago
Michael Peter Christen 00c1c777fa refactoring
12 years ago
orbiter 563d584420 removed more dependencies in cora from kelondro
12 years ago
orbiter 63762d8f89 removed kelondro dependencies from cora
12 years ago
Michael Peter Christen b69ed96f0b - added collections to yacydoc
12 years ago
Michael Peter Christen 4d29f59a27 removed warnings
12 years ago
Michael Peter Christen 8c099d2106 Merge remote-tracking branch 'origin/master'
12 years ago
apfelmaennchen d31a632951 - added dmoz RDF dump importer
12 years ago
Michael Peter Christen 8ca842b137 added new button design to more buttons
12 years ago
Michael Peter Christen b2b516cc3e added a collection attribute to crawls and searches:
12 years ago
Michael Peter Christen a427a68bac removed many warnings
12 years ago
Michael Peter Christen 31d4d38804 - extended the solr interface by a references-by-word-count method
12 years ago
Michael Peter Christen 528d6763fa - added new solr fields:
12 years ago
Michael Peter Christen 75d5e3475d Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 316b5fe116 - added a solr type definition verifier
12 years ago
reger 2d2be546fe fix path to env/grafics to display api icon on meta data page
12 years ago
Michael Peter Christen 0cab06c47c refactoring
12 years ago
Michael Peter Christen 06a78eecb7 code simplification
12 years ago
Michael Peter Christen 18f989dfb1 - refactoring (load -> getMetadata)
12 years ago
Michael Peter Christen 136fcb1ad9 refactoring
12 years ago
Michael Peter Christen 24d9db1613 snippet retrieval loading processes may use a smaller minimum load time
12 years ago
Michael Peter Christen 1687737771 Abstraction of HandleMap and HandleSet
12 years ago
Michael Peter Christen 6f1ddb2519 Moved solr index-add method to the same method where the YaCy index is
12 years ago
orbiter 69e743d9e3 - more abstraction for the RWI index as preparation for solr integration
12 years ago
Michael Peter Christen f78ce93a80 collection of speed and memory saving hacks
13 years ago
orbiter 0cbda0b2b8 - replaced all length() == 0 and size() == 0 with isEmpty()
13 years ago
Michael Peter Christen b0c408788b made class methods static where possible
13 years ago
Michael Peter Christen 5bd3c90907 - removed unnecessary semicolons
13 years ago
Michael Peter Christen 241dd8410a removed snippet pattern filter - it was not used
13 years ago
Michael Peter Christen d3964253ae - added @SuppressWarnings to unused servlet method parameters
13 years ago
Michael Peter Christen ea10766bfd cleaned unnecessary nested code
13 years ago
Michael Peter Christen 1825f165b8 better integration of blacklist according to use case
13 years ago
Michael Peter Christen 03280fb161 removed segments-concept and the Segments class:
13 years ago
Michael Peter Christen 9116013c64 - allow lazy initialization of solr value (if using 'lazy', then no
13 years ago
cominch 011f8a5818 Auto Tagging: Add hyperlinks to tags (provisional)
13 years ago
Michael Peter Christen 52f5d40043 better abstraction of document model generation
13 years ago
Michael Peter Christen 8b7c4d3144 produce a rdf output containing the triplestore with yacydoc; ie:
13 years ago
cominch d8815db877 Merge remote-tracking branch 'original yacy/master'
13 years ago
cominch e4dab19045 Augmented Browsing: added template for document info bar
13 years ago
Michael Peter Christen b2d1c25ebb removed warnings/unused entities
13 years ago
Michael Peter Christen 64c0268b2b show triplestore metadata in yacydoc and viewfile
13 years ago
Roland 'Quix0r' Haeder edaa09b9b1 Rewrote all String blacklist types to enum 'BlacklistType', closes bug
13 years ago
cominch 87a3fbb3c2 interaction javascript
13 years ago
Michael Peter Christen 8b974905ee changed log-in text for all servlets with authentication:
13 years ago
reger b2175ea4ef Add possibility to set custom Solr field names for the YaCy default Solr attributes.
13 years ago
Michael Peter Christen c00efc2717 made the solr connection more generic
13 years ago
Michael Peter Christen 453010bd68 - solved problems with backpath normalization
13 years ago
Michael Peter Christen 0e13022147 - enhanced solr field documentation
13 years ago
Michael Peter Christen e377092198 fix to xml output format
13 years ago
Michael Christen 41be98dc9d extended webstructure api to show together with incoming links also
13 years ago
Michael Christen 8f89c8ef07 added information about inbound, outbound and citation links into
13 years ago
Michael Christen 71649a1296 added an api to retrieve the new citation.index with the
13 years ago
Michael Peter Christen 9ad1d8dde2 complete redesign of crawl queue monitoring: do not look at a
13 years ago
Michael Peter Christen e2f8f263e8 changed storage of search words: keep order
13 years ago
Michael Peter Christen c166eb68b6 fixes in solr schema file
13 years ago
Lotus 335a776351 xss hardening on Status.html
13 years ago
Michael Peter Christen ef5192f8c9 using the generic document parser for crawl starts instead of the html
13 years ago
Michael Peter Christen ce620be783 for for crawl start with smb url
13 years ago
Michael Peter Christen 7053f8ab46 added automatic generation of a solr schema.xml file
13 years ago
Michael Peter Christen 2ee8cbeb2c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 992dbdf4bb added noload statistic to servlets
13 years ago
Roland 'Quix0r' Haeder fa08ed5ae5 Fixed a lot CHMOD rights (no need for execute flag on *.java/*.html) and introduced local/remote crawl size ratio based check
13 years ago
Michael Christen 9e5894c784 Removed handling of components objects for URIMetadataRows.
13 years ago
Michael Christen c04bfaa51b refactoring
13 years ago
Michael Christen e9dc99fe15 added rules to set specific RWIs as private RWIs which are not
13 years ago
Michael Christen 204c29f010 small bugfixes for search result display and cache display
13 years ago
apfelmaennchen ff19fcdb28 bugfix for YMarks XBEL import and export; thanks to Dominic
13 years ago
orbiter 11729061f2 added an option in the bookmark import process to put everything into the crawler
13 years ago
apfelmaennchen 8f30d288e9 small change to mouse over text for crawl starts within bookmarks
13 years ago
apfelmaennchen 29e97f94f2 small optical enhancements to ymarks treeview
13 years ago
apfelmaennchen 77a080ced9 smaller fixes for YMarks
13 years ago
orbiter e22f8497c9 - tested the ARC methods
13 years ago
orbiter 5a55397f99 some last-minute performance hacks
13 years ago
apfelmaennchen dd1482aaf5 further update to YMarks
13 years ago
apfelmaennchen 564374d1fe - included YMarks in addition to old bookmarks in yacysearchitem.html; don't get confused by the old bookmark dialog, the ymark is automatically added silently beforehand.
13 years ago
orbiter 05f34a3fa7 added a full, complete, database insert, update and delete API for the tables.
13 years ago
orbiter c461c1eebf fixed xml output for table retrieval
13 years ago
orbiter c93f10417a add a bookmark automatically each time a new crawl is started
13 years ago
apfelmaennchen 6287c2b4a9 YMarks:
13 years ago
apfelmaennchen 5581be12fb YMarks:
13 years ago
apfelmaennchen a3eebfdcba YMarks:
13 years ago
apfelmaennchen 4f95f72124 YMarks:
13 years ago
orbiter 017a01714d - enhanced logging in robots.txt parser for remote debugging
13 years ago
orbiter 5a7cec59f3 moved ynetSearch to get all files out of htroot/api/util/
13 years ago
apfelmaennchen a8dfe787ed - updated to jquery flexigrid 1.1
13 years ago
cominch cef8ebc41d getpageinfo: Checks if there is a OAI repository behind the URL.
13 years ago
orbiter eb1c7c041d write info about robots.txt evaluation into getpageinfo_p.xml
13 years ago
orbiter f8b8c82421 - refactoring of getpageinfo_p.xml (moved out of util)
13 years ago
apfelmaennchen abba31f02e - bugfix for correctly sorting ymarks
13 years ago
apfelmaennchen 5f7dbe1c42 - some refactoring (ymarks)
13 years ago
apfelmaennchen 4d7ae76017 - update to jquery 1.7 (does not apply to all jquery code, old version is additionally kept for compatibility)
13 years ago
orbiter a7df70221e refactoring
13 years ago
orbiter d2ea250d99 refactoring:
13 years ago
orbiter 2d03dc1804 removed unnecessary warning
13 years ago
orbiter cf8e3b0df8 small fix for count: overXX includes the count
13 years ago
orbiter 6db8921a0f enhanced termlist
13 years ago
sixcooler d40a177c05 Generation Memory Strategy fine tuning
13 years ago
orbiter a5541751a8 - added memory computation to termlist_p.xml
13 years ago
orbiter 9bdee5c71c added a servlet that produces a list of term hashes that appear more than 10000 times
13 years ago
sixcooler 916d79111e Runtime.maxMemory() DOES change @ runtime:
14 years ago
orbiter 9ebc75db4b fix for channel authorization
14 years ago
orbiter 115abc8917 - more attributes for search progress bar
14 years ago
orbiter 4bea3f9714 hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources:
14 years ago