Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
ben_jones
on March 16, 2017
|
parent
|
context
|
favorite
| on:
Web Scraping: Bypassing “403 Forbidden,” captchas,...
Well the problem is when someone scrapes ALL the good listings then pre-purchases them for resale at double the cost.
Karawebnetwork
on March 16, 2017
[–]
How is it different than paying 50+ low-wage remote workers to "scrape" the phonebook for you and then using the information acquired for profit?
wumpus
on March 16, 2017
|
parent
[–]
One difference is that Feist v. Rural Telephone says that the data in a phonebook can't be copyrighted.
https://en.wikipedia.org/wiki/Feist_Publications,_Inc.,_v._R...
.
Karawebnetwork
on March 16, 2017
|
root
|
parent
[–]
What about using those employees to "crawl" the web for you then?
wumpus
on March 16, 2017
|
root
|
parent
[–]
I suspect it's roughly the same as a crawer -- same issues of fair use, TOS/CFAA, etc -- but likely there's no expectation that humans will read and follow robots.txt.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: