D
Denise Enck
William Tasso said:and point the way for misbehaving crawlers at the same time.
not necessarily. there are numerous ways to block 'bad' spiders ~~
William Tasso said:and point the way for misbehaving crawlers at the same time.
PeterMcC said:And, if you really want to be safe, you could always password protect the
directory with .htaccess - dead easy and the spiders don't get past the
password protect.
Headless said:Assuming that "The majority of Web authors" use
http://www.host.com/~user url's is a very bold claim.
Again there is a risk of ambiguity here, http://www.user.host.com
should be labeled as a "sub-domain", it's not registered anywhere
and it's not portable, so you certainly can not call it "owning a
domain".
I don't see how the robots.txt convention relates to Apache
.htaccess files.
http://www.host.com/~user would resolve to
http://www.host.com/robots.txt for compliant clients looking for a
robots.txt
Indeed.
You have not provided any evidence that Atomz does not follow the
correct procedure for retrieving a robots.txt.
(all my sites use
http://www.user.host.com urls).
Jukka K. Korpela said:I made no such claim. I made the claim that most authors have no
control over what resides at http://www.foo.example/robots.txt
There is no ambiguity here. The domain host.com exists. The domain
user.host.com currently does not exist. Calling some domains subdomains
has no relevance to this, or to our topic. Either a domain name exists
or it does not, on the Internet, according to domain name servers. And
this has little to do with robots.txt.
Sorry, my typo. I meant http://www.host.com/robots.txt of course. The
point stands.
It was you who wrote an objection, based on a claim on Atomz behavior,
to my statement that said that robots.txt must reside on the server
root.
Too bad then. They do not work until you get that domain registered.
Been working for me.Headless said:Same thing since http://www.host.com/~user is the only format where a
robots.txt cannot be used by the user.
Want to reply to this thread or ask your own question?
You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.