Seedlists
Aus YaCyWiki
Inhaltsverzeichnis
Working
These Seedlists should be available and fresh. If not, please move it to not working
hardcoded:
http://home.arcor.de/hermens/yacy/seed.txt http://low.audioattack.de/yacy/seed.txt http://www.lulabad.de/seed.txt http://www.yacy.net/seed.txt
dynamic:
http://www.anomic.de/yacy/seed.txt http://www.suma-lab.de/yacy/seed.txt http://www.pruefziffernberechnung.de/YaCy/seedFile.txt http://home.arcor.de/tim.stein/yacy/127001-2.txt https://www.ghcif.de/~yacy/seed.txt http://www.daburna.de/YaCy/seed.txt http://www.bachhausen.de/wohnseed.txt http://www.bachhausen.de/seed.txt
Tor(.eff.org) Seeds can be found on De:YaCy-Tor
Not Working
Seeds which are not fresh/available at the moment.
http://yacy-websuche.mxchange.org/seeds/seed.txt ==> redirects a user to a website http://www.mcmilk.de/seed.txt ==> 404 http://mxchange.org/seeds/free-search.txt ==> 404 http://yacy.kicks-ass.net:8080/repository/seed.txt ==> Server not found http://cowpuncher.drollette.com/seed.txt ==> 404 http://mirror.hfase.com/yacy/seed.txt ==> Server not found http://85.214.248.55:8090/www/seed.txt ==> Could not connect http://reezer.org/yacy/seed.txt ==> exists, but outdated (Jan, 4th 2010 as of Jan, 27th) http://projekte.newsfreak.de/yacy/seed.txt ==> exist, but really outdated (e-mail from Webmaster) http://www.krautter-consulting.de/seed/seed.txt ==> exist, but really outdated (e-mail from Webmaster) http://trumpkin.de/yacy/seed.txt ==> exist, but really outdated, seed.txt Date:30-Juni-2008
mod_rewrite
Some search engines list the seed files regardless of an exclusion in robots.txt. To force them not to index the list you can redirect any other client than YaCy on access to the seedfile. To do this, you need mod_rewrite on Apache and the following entries in your .htaccess file:
RewriteEngine on RewriteCond %{HTTP_USER_AGENT} !^yacy.*$ [NC] RewriteRule seed.txt http://www.yacy.net/ [R]
Please keep in mind that now nobody except YaCy itself can download the seedfile.
How to get a fresh list of seedfiles (in Linux)
You can parse the Network.html?page=1
page on your (or another) peer to get a really fresh list of seedfiles. (i.E. in scripts ..)
lynx --source "http://<peer>:<port>/Network.html?page=1"|grep "Type: Principal"|cut -d '"' -f 4
i.E. build a simple seedfiles link checker:
lynx --source "http://<peer>:<port>/Network.html?page=1"|grep "Type: Principal"|cut -d '"' -f 4|while read link ; do lynx --dump "$link" &> /dev/null && echo "[OK] $link" || echo "[ERROR] $link" done