git.donarmstrong.com Git - lilypond.git/commitdiff
Deny rogue crawler AhrefsBot. Fixes web load.
author Jan Nieuwenhuizen <janneke@gnu.org>
Thu, 8 Aug 2013 06:34:12 +0000 (08:34 +0200)
committer Phil Holmes <mail@philholmes.net>
Thu, 8 Aug 2013 10:07:44 +0000 (11:07 +0100)
The AhrefsBot is crawling files every second, including all
binaries.  This increases the server load to an unacceptable level.

Documentation/web/server/lilypond.org.htaccess

index b4780a94147df06f986cb21957f7670ec7b881a0..5e7dfae3d98773b58516b45122318f187bfd7716 100644
@@ -23,6 +23,10 @@ RewriteEngine On
 RewriteCond %{HTTP_USER_AGENT} httrack [NC]
 RewriteRule ^.*/source/.*$ /please-respect-robots.txt.html [L]
 
+# Deny rogue crawler
+RewriteCond %{HTTP_USER_AGENT} ^(.*)AhrefsBot(.*) [NC]
+RewriteRule .* - [F,L]
+
 # Permanent top level entry points -- ./doc
 RedirectMatch ^/music-glossary /glossary
 RedirectMatch ^/tutorial /learning
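
Reviewer note: the added rule can be sanity-checked offline. The following is a minimal Python sketch, not part of the commit, that approximates the RewriteCond match with Python's `re` module, treating the `[NC]` flag as `re.IGNORECASE`. The helper name `is_denied` and the sample User-Agent strings are assumptions made for illustration; they are not taken from the server logs.

```python
import re

# Pattern copied from the RewriteCond in the diff above.
# The [NC] (no-case) flag is approximated here with re.IGNORECASE.
DENY_PATTERN = re.compile(r"^(.*)AhrefsBot(.*)", re.IGNORECASE)


def is_denied(user_agent: str) -> bool:
    """Return True if the rule would answer this User-Agent with 403 Forbidden."""
    return DENY_PATTERN.search(user_agent) is not None


# Illustrative User-Agent strings (hypothetical, for demonstration only).
SAMPLES = [
    "Mozilla/5.0 (compatible; AhrefsBot/5.0; +http://ahrefs.com/robot/)",
    "ahrefsbot",
    "Mozilla/5.0 (X11; Linux x86_64; rv:23.0) Gecko/20100101 Firefox/23.0",
]

if __name__ == "__main__":
    for ua in SAMPLES:
        verdict = "403 Forbidden" if is_denied(ua) else "allowed"
        print(f"{verdict}: {ua}")
```

Because the match is unanchored in effect, the leading `^(.*)` and trailing `(.*)` should not change which requests are denied; a plain `AhrefsBot` pattern with `[NC]` would likely behave the same.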