This website works better with JavaScript.
Explore
Help
Sign In
boxtec
/
yacy_search_server
mirror of
https://github.com/yacy/yacy_search_server
Watch
4
Star
1
Fork
You've already forked yacy_search_server
0
Code
Issues
Projects
Releases
Wiki
Activity
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
e039a797d2
master
Release_1.92
Release_1.90
Release_1.82
Release_1.80
Release_1.72
Release_1.7
Release_1.68
Release_1.6
Release_1.5
Release_1.4
Release_1.3
Release_1.2
Release_1.1
Release_1.04
Release_1.03
Release_1.02
Release_1.01
Release_1.0
1.0
0.99
Branches
Tags
${ item.name }
Create tag
${ searchTerm }
Create branch
${ searchTerm }
from 'e039a797d2'
${ noResults }
yacy_search_server
/
source
/
net
/
yacy
/
document
/
parser
History
luccioman
e90405b6f0
Support parsing audio URLs without file extension
...
Added also a Junit for the audio tag parser
6 years ago
..
html
Properly resolve relative URLs against document URL in html base tags
6 years ago
images
Small fix on svg parser error message
7 years ago
rdfa
Revised the RDFaParser main launcher for minimal proper operation.
7 years ago
xml
taking care of closing inputstreams, HTTPClient
6 years ago
AbstractCompressorParser.java
Added a parser for XZ compressed archives.
7 years ago
GenericXMLParser.java
Also handle text content when parsing XML within limits.
8 years ago
XZParser.java
Added a parser for XZ compressed archives.
7 years ago
apkParser.java
Properly close file output streams even on exceptions scenarios.
8 years ago
audioTagParser.java
Support parsing audio URLs without file extension
6 years ago
bzipParser.java
added a crawl filter based on <div> tag class names
7 years ago
csvParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
docParser.java
Cleaned up some Javadoc warnings.
8 years ago
dwgParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
genericParser.java
Added parsing within bounds implementation to the generic parser.
8 years ago
gzipParser.java
added a crawl filter based on <div> tag class names
7 years ago
htmlParser.java
removed transformer
7 years ago
linkScraperParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
mmParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
odtParser.java
fix delete of temp file after odt % ooxml parser
9 years ago
ooxmlParser.java
Improved parsing support for OOXML spreadsheets (.xlsx)
8 years ago
pdfParser.java
Updated pdf cache clear steps consistently with current pdfbox version
7 years ago
pptParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
psParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
rdfParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
rssParser.java
Added RSS reader support for `enclosure` feed item sub element.
7 years ago
rtfParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
sevenzipParser.java
added a crawl filter based on <div> tag class names
7 years ago
sidAudioParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
sitemapParser.java
taking care of closing inputstreams, HTTPClient
6 years ago
tarParser.java
added a crawl filter based on <div> tag class names
7 years ago
torrentParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
vcfParser.java
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
vsdParser.java
Added a basic JUnit test for the Visio parser (vsdParser)
7 years ago
xlsParser.java
refactor xlsParser to include Excel file attribute (like author) in parser result doc.
9 years ago
zipParser.java
added a crawl filter based on <div> tag class names
7 years ago