Unless I'm reading RFC 3986 incorrectly, that's valid because you can't have an ...

ajanuary · on Oct 27, 2014

I think you're reading it incorrectly.

You can have an empty segment in the path. The BNF for a segment is:

    segment       = *pchar

Which, according to RFC 2234 section 3.6, means zero or more repetitions.
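
To make that concrete, here is a rough sketch of splitting a path into segments, where the empty segment shows up as an empty string. The helper name `path_segments` and the example.com URL are made up for illustration; only `urllib.parse.urlsplit` comes from the standard library.

```python
from urllib.parse import urlsplit


def path_segments(url):
    """Split the path of `url` on "/" and keep empty segments.

    Hypothetical helper for illustration, not part of any library.
    """
    path = urlsplit(url).path
    parts = path.split("/")
    # An absolute path starts with "/", so the first piece is the empty
    # string before that slash rather than a real segment. Drop it.
    return parts[1:] if path.startswith("/") else parts


print(path_segments("http://example.com/foo//bar"))  # ['foo', '', 'bar']
```

The middle entry is the zero-length match of `*pchar`.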

gpvos · on Oct 27, 2014

But then the server may still decide that an empty segment is so meaningless that it will refuse it.

In fact, it would not be a smart move to just treat double slashes the same as single ones, because of relative URLs: a ".." segment only removes one slash, so the hierarchy levels would get messed up. thttpd is doing the smart thing here.
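
A minimal sketch of why that matters, assuming the dot-segment removal steps from RFC 3986 section 5.2.4. This is my own transcription of those steps (the function name mirrors the RFC's, the paths are invented), not thttpd's code. I'm avoiding `urllib.parse.urljoin` here because, as far as I can tell, it drops empty segments itself.

```python
def remove_dot_segments(path):
    """Hand-written version of RFC 3986 section 5.2.4, for illustration."""
    out = []  # output buffer, one entry per "/segment"
    while path:
        if path.startswith("../"):
            path = path[3:]
        elif path.startswith("./"):
            path = path[2:]
        elif path.startswith("/./"):
            path = path[2:]
        elif path == "/.":
            path = "/"
        elif path.startswith("/../"):
            # ".." removes exactly one previously emitted segment
            path = path[3:]
            if out:
                out.pop()
        elif path == "/..":
            path = "/"
            if out:
                out.pop()
        elif path in (".", ".."):
            path = ""
        else:
            # Move the first segment (with its leading "/") to the output.
            i = path.find("/", 1)
            if i == -1:
                out.append(path)
                path = ""
            else:
                out.append(path[:i])
                path = path[i:]
    return "".join(out)


print(remove_dot_segments("/a//../c"))  # /a/c
print(remove_dot_segments("/a/../c"))   # /c
```

With the double slash kept, ".." only eats the empty segment and you land in /a/c. If the server had collapsed "//" to "/" first, the same relative reference would resolve to /c instead.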

As one of my teachers at university would say: the empty segment is also a segment.

lmm · on Oct 27, 2014

The server can of course interpret the path as it wants, but it should allow an application running under the server to give 'foo//bar' a meaning if that application wants to, IMO.

gpvos · on Oct 28, 2014

True. I was writing about the case when the URL simply mapped to a file system location. Applications should be able to apply their own interpretation.

stevekemp · on Oct 27, 2014

Agreed.

(The problem in my case was just stupid spiders that were crawling my sites.)

ajanuary · on Oct 27, 2014

Yes, it's a valid URI, but //robots.txt is a different resource from /robots.txt. It seems thttpd is probably doing the right thing.

zimpenfish · on Oct 27, 2014

The difference between `path-abempty` and `path-absolute` is bloody confusing but I think you're right.
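
For what it's worth, a quick way to see the distinction, using Python's `urllib.parse.urlsplit` as a stand-in parser (the example.com host is just a placeholder). With an authority present the path is `path-abempty` and may start with "//"; without one, a leading "//" is read as introducing the authority.

```python
from urllib.parse import urlsplit

# Authority present: everything after the host is the path.
full = urlsplit("http://example.com//robots.txt")
print(full.netloc, full.path)  # example.com //robots.txt

# No scheme or host: the leading "//" starts an authority, so
# "robots.txt" is parsed as the host and the path is empty.
bare = urlsplit("//robots.txt")
print(repr(bare.netloc), repr(bare.path))  # 'robots.txt' ''
```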