Thursday, August 30, 2012

[scraping related] how does a browser tell a web-server, that it doesn't want JavaScript?

Rephrasing …
If I tell a browser to not "accept" JavaScript, what does the browser do with that directive?

  • is that related to the User-Agent or Accept HTTP header fields?
  • does that "just" mean, that the browser doesn't execute the JavaScript bits?

(where to disable JavaScript in browsers? [link])

Does Michael Schrenk talk about this in his book or on the related website?

Webbots, Spiders, and Screen Scrapers:







He mentions JavaScript a couple of times.
  • "JavaScript can change a form just before submission", page 70 – nope
  • "bizarre JavaScript and cookie behavior", page 229 – nope
  • "web design techniques that hinder search engine spiders", esp. JavaScript, page 300 – nope
  • "killing spiders", "use cookies, encryption, JavaScript, and redirection", page 313 – nope

No comments: