Great project, some of my thoughts:
Trust
A client can, in theory, submit any data it wants and the protocol doesn't have anything builtin to make sure the data is legitimate. Any authenticity checks have to be done out-of-band. Possibly comparing answers between clients.
Introducing an additional actor solely responsible for verifying the legitimacy of submitted data could be one solution. These 'Verifiers' could use various techniques to authenticate the data, such as headless browsers or more sophisticated bots. Scrapers attempting to cheat the protocol would then be penalized by no longer receiving tasks. Naturally, this would require some form of incentive, which brings me to my next point.
Incencitive
Scraping can become costly quickly, so participants should be compensated for their work. Users could pay a small fee, which would be redistributed among all protocol actors, including scrapers and verifiers. Ulixee has already explored this concept. While their approach (a blockchain-like monetization system) doesn't necessarily need to be followed, it could provide some valuable inspiration.
Great project, some of my thoughts:
Trust
Introducing an additional actor solely responsible for verifying the legitimacy of submitted data could be one solution. These 'Verifiers' could use various techniques to authenticate the data, such as headless browsers or more sophisticated bots. Scrapers attempting to cheat the protocol would then be penalized by no longer receiving tasks. Naturally, this would require some form of incentive, which brings me to my next point.
Incencitive
Scraping can become costly quickly, so participants should be compensated for their work. Users could pay a small fee, which would be redistributed among all protocol actors, including scrapers and verifiers. Ulixee has already explored this concept. While their approach (a blockchain-like monetization system) doesn't necessarily need to be followed, it could provide some valuable inspiration.