orbiter
d0c32c6aeb
better protection against fraud peers
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3104 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
61798f0ae6
added option to distinguish between text crawl and media crawl
...
- for each crawl start, there is now a flag for text and media
- the localCrawl flag is superfluous
- added new crawl profiles
- if an image search is done, only media links are crawled for the snippets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3100 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
be941c4475
- "javascript:"-URLs are recognized as well (as intended formerly I assume)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3097 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
a619ba3f49
- fix for String index out of range during URL parsing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3096 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
febe6b114a
design update of crawler monitor
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3094 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
40049e0635
fixed media search snippet flow
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3092 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
05d0464377
only do 16 checks if "address" starts with "172.";
...
better readably;
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3087 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
rramthun
1a525710c1
*) cursor jumps now to searchbox on searchpages again
...
*) added missing private IP-ranges for APIPA/Zeroconf and 172.16.0.0–172.31.255.255
*) Changed some seed-download-errors to warnings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3086 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
e17591acc3
- parse HTML arguments as UTF-8 strings
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3085 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
d30932c7d8
- fix for fix... sry
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3084 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
6118fb73ec
- added decode of UTF-16 escapes in url-arguments (%u0123), bugfix for http://www.yacy-forum.de/viewtopic.php?t=2762
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3083 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
782db9099d
version independent name for commons-pool lib
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3082 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7ff86d6ba6
- image search now shows thumbnails (in bad order, but it works)
...
- repaired DHT selection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3081 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
ee3d91cb6b
print-out of links that result from contraint-filtering
...
in search result
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3078 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
f4737eebd6
Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3076 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
1fc75c0f67
better logging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3075 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
e4570bffaf
-implemented a specialized snippet-fetch for media content
...
-changed search result preparation for media search presentation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3073 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
694a6e4f44
*) better text snipptes: any possible searchword (welt, linux, tag) in welt-linux-tag will be marked correctly now
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3072 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
bddc197453
reverted by-mistake removed change from low012/SVN 3068
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3070 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
1377c53aa3
extraction of media links from search results
...
these links are mixed to the snippets for testing purpose
(a final version will handle this differently)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3069 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
586add4c6c
*) Better snippets: words like GNU/Linux will not prevent Linux or GNU from being marked if they are searchword (see http://www.yacy-forum.de/viewtopic.php?t=2891 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3068 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
29922d5dd2
changed writeMap
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3066 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
5b00a04359
changed writeMap
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3065 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
52abbd4131
- fix for wrong public IP if no hostname was set and IP was from range 192.168.*.*
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3063 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
8b7c543885
NullPointer fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3061 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
937ccd4e76
fix for snippet-generation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3060 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
559f41a001
fix for http://www.yacy-forum.de/viewtopic.php?p=28607#28607
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3059 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
auron_x
c086c71f17
*) fixed ArrayIndexOutOfBoundsException
...
--> http://www.yacy-forum.de/viewtopic.php?t=3210
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3058 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c93cfdc23a
fix for http://www.yacy-forum.de/viewtopic.php?p=28564#28564
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3057 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
93a5ace330
fix for http://www.yacy-forum.de/viewtopic.php?p=28544#28544
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3056 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
bf0d820659
- added correct flagging of word properties
...
- added self-healing to database in case that wrong free-pointers exist
- added presentation of media links in snippets (does not yet work correctly)
- code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3055 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
10d888e70c
- added a media search for images, audio, video and applications
...
- new search options on search page
- new option in ViewInfo to display all links of a file
- enhanced collection data structure
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3054 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
a603c4d5e8
more code simplifications
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3052 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9a85f5abc3
cleanup
...
- removed 'deleteComplete' flag; this was used especially for WORDS indexes
- shifted methods from plasmaSwitchboard to plasmaWordIndex
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3051 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
fbe1ee402b
plasmaCrawlLURL$kiter cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3050 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
773ba1e91a
- generalized object order handling
...
- controlled object order for all database tables
- migrated DHT position computation to correct base64-decoded values
this also closed the 'gaps' in the dht positions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3049 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
15381cbf73
other bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3048 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
ad65cc9d2f
NullPointer fixes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3047 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
d33745a7ea
NullPointer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3046 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
3a4933b63c
bugfix for
...
http://www.yacy-forum.de/viewtopic.php?p=28493#28493
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3045 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
109ed0a0bb
- cleaned up code; removed methods to write the old data structures
...
- added an assortment importer. the old database structures can
be imported with
java -classpath classes yacy -migrateassortments
- modified wordmigration. The indexes from WORDS are now imported
to the collection database. The call is
java -classpath classes yacy -migratewords
(as it was)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3044 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
052f28312a
removed assortments from indexing data structures
...
removed options to switch on assortments
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3041 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
2372b4fe0c
release 0.49
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3040 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f8efb3c948
fixed a null pointer exception problem reported in the forum.
...
I cant find the forum entry any more because my girlfriend switched
off the power while the forum window was open.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3039 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
ad1e4aa88e
added selection of audio, video, image and application resources
...
to search procedure. This function can currently not used through the
search interface, but only through remote search.
added accumulation of search attributes to enable the audio, video,
image and application selection.
fixed a problem with external URL representation generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3036 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
fb7902aa68
fix for http://www.yacy-forum.de/viewtopic.php?p=26142#26142
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3033 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7cc4cec9c9
bugfix for assertion bugs documented in
...
http://www.yacy-forum.de/viewtopic.php?p=28261#28261
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3030 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
97596af843
patch for wrong old RWI entries
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3028 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
hydrox
2c69cc969a
*) more special chars removed
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3027 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
hydrox
ebb42906f8
*) removed special characters
...
*) added Copyright comments
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3026 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago