It's hard for me to see this as stealing content since: 1) Padmapper shows the o...

eli · on June 23, 2012

You expect Google, Bing and DDG to honor requests not to be indexed, don't you? How is this different?

And what Craigslist does or doesn't do to make money shouldn't change what you're allowed to do with their content.

ericd · on June 23, 2012

PadMapper obeys robots.txt files, for the record. Also, it doesn't repost their content; it reposts facts about the content, which is a pretty key difference.

They've had an informal amnesty for services using their stuff for a long time. Craig has stated that they're OK with services that interface with them as long as they don't use many server resources. But from what I've heard talking with people, they recently updated their TOU and started sending out huge waves of C&Ds a few weeks ago.

eli · on June 23, 2012

So if they add a Disallow line for PadMapper to robots.txt instead of sending a C&D, how does that change the situation in any meaningful way?

ericd · on June 23, 2012

Sure, it's effectively the same in result, without the legal threat backing it up.
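
For anyone unfamiliar with how a per-crawler Disallow rule works, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt contents, the `PadMapper` user-agent token, the `example.org` URLs, and the `allowed` helper are all assumptions made up for illustration; none of them are taken from Craigslist's actual file or PadMapper's actual crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents. "PadMapper" as a user-agent token is an
# assumption for illustration, not the crawler's real agent string.
ROBOTS_TXT = """\
User-agent: PadMapper
Disallow: /

User-agent: *
Disallow: /private/
"""


def allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The named crawler is shut out of everything by "Disallow: /".
    print(allowed("PadMapper", "http://example.org/apartments/1"))
    # Any other agent falls through to the "*" group.
    print(allowed("SomeOtherBot", "http://example.org/apartments/1"))
```

The parser only reports what the file asks for. Whether a crawler actually honors the answer is up to the crawler, which is why a Disallow line and a C&D differ mainly in what backs them up.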

dr42 · on June 22, 2012

The difference is that the search engines aggregate from the whole web, whereas PadMapper was just appropriating content from CL, which is against their TOS. It's very clearly spelled out.

All the other companies that have tried have also been told to C&D. If you don't like it, start your own network...

From http://www.craigslist.org/about/terms.of.use

"Any copying, aggregation, display, distribution, performance or derivative use of craigslist or any content posted on craigslist whether done directly or through intermediaries (including but not limited to by means of spiders, robots, crawlers, scrapers, framing, iframes or RSS feeds) is prohibited."

psc · on June 22, 2012

PadMapper gets content from more than just Craigslist. They have their own postings (padlister.com), plus sublet.com, apartments.com, apartmentfinder.com and a lot more. There's really not that much difference between them and a search engine.

dr42 · on June 23, 2012

I've heard this argument many times before; it was the same with Oodle. But PadMapper's business model is to aggregate apartment listings (Google's is not), and this contradicts the TOS, so inevitably Craig Newmark sends them a C&D letter.

Several innovative solutions have been invented over the years. The most obvious one is to do the CL scraping on the client. This way there is no indexing server to block; it's just the client reading all the search listings, parsing them, and offering faceted filters. It breaks the spirit of the TOS, but it would be impossible for CL to block. I am not advocating this, just pointing out one of the many approaches that have been attempted over the years.

_ques · on June 22, 2012

Not true. PadMapper has multiple DBs.

Google can scan apartment listings and show them as results in their format. PadMapper can't. What's the difference?

See http://www.craigslist.org/robots.txt .

pbreit · on June 23, 2012

I'm not sure why those TOUs would prohibit PadMapper but not Google.

neilc · on June 23, 2012

The immediately following text in the CL ToS reads:

"As a limited exception, general purpose Internet search engines and noncommercial public archives will be entitled to access craigslist without individual written agreements executed with CL that specifically authorize an exception to this prohibition if, in all cases and individual instances: (a) they provide a direct hyperlink to the relevant craigslist website, service, forum or content; (b) they access craigslist from a stable IP address using an easily identifiable agent; and (c) they comply with CL's robots.txt file ..."