admin wrote:So, did ADP record this URL to the database or not?
- Code: Select all
http://www.site.com/_frameworks_static/js/scriptaculous.js?load=effects
It receives a 200 OK code, because its there. It doesnt receive TEXT/HTML because it isnt one. ADP should just skip it, and go further.
Whats the problem with it?
No problem, other than unnecessary network traffic and time consumption. Which can be avoided, I should add.
For a 5 kB js-file, this is no issue. But this can become one, if ADP downloads 100 MB PDF's. Unlikely ? Not so. Many web analytics people add some ? with a code for tracking purposes.
So you really should strip anything after and including a '?' from an URL, and then apply the Non Follow Match.
The problem with these:
- Code: Select all
http://site.co.uk/x.oSrc;
is most likely ADP not extracting the links from that particular JavaScript. It a bit hard to come up with a regular expression that would match links from all the JavaScript junk people like to code.
Well, this can't be done with RE's - js is a programming language and thus quite unparseable. But again, in order to avoid unnecessary traffic, why
don't you apply a RE to check if you at least have a valid (or reasonable) URL before going out to the net. In this example, an URL which terminates in ';' could be skipped.
Btw: Sorry for posting this in the General Forum; please shift over to the Bugs Arena.
Happy New Year !
Alain