<a class="external text" href="https://www.diggernaut.com/" rel="nofollow">diggernaut.com</a><a class="external text" href="https://listly.io/" rel="nofollow">Listly.io</a><a class="external text" href="https://www.promptcloud.com/blog/how-to-read-and-respect-robots-file" rel="nofollow">robots.txt</a><a class="external text" href="https://pdfs.semanticscholar.org/4fb4/3c5a212df751e84c3b2f8d29fabfe56c3616.pdf" rel="nofollow">"Joint Optimization of Wrapper Generation and Template Detection"</a><a class="external text" href="http://www.gooseeker.com/en/node/knowledgebase/freeformat" rel="nofollow">Semantic annotation based web scraping</a><a class="external text" href="http://www.xconomy.com/san-francisco/2012/07/25/diffbot-is-using-computer-vision-to-reinvent-the-semantic-web/" rel="nofollow">"Diffbot Is Using Computer Vision to Reinvent the Semantic Web"</a><a class="external text" href="https://web.archive.org/web/20020308222536/http://www.chillingeffects.org/linking/faq.cgi#QID596" rel="nofollow">"FAQ about linking – Are website terms of use binding contracts?"</a><a class="external text" href="http://www.chillingeffects.org/linking/faq.cgi#QID596" rel="nofollow">the original</a><a class="external text" href="http://scholarship.law.berkeley.edu/btlj/vol29/iss4/16/" rel="nofollow">"Symbiotic Relationships: Pragmatic Acceptance of Data Scraping"</a><a class="external text" href="http://www.tomwbell.com/NetLaw/Ch06.html" rel="nofollow">"Internet Law, Ch. 06: Trespass to Chattels"</a><a class="external text" href="https://web.archive.org/web/20020308222536/http://www.chillingeffects.org/linking/faq.cgi#QID460" rel="nofollow">"What are the "trespass to chattels" claims some companies or website owners have brought?"</a><a class="external text" href="http://www.chillingeffects.org/linking/faq.cgi#QID460" rel="nofollow">the original</a><a class="external text" href="http://www.tomwbell.com/NetLaw/Ch07/Ticketmaster.html" rel="nofollow">"Ticketmaster Corp. v. 
Tickets.com, Inc."</a><a class="external text" href="https://web.archive.org/web/20110723131832/http://www.fornova.net/documents/AAFareChase.pdf" rel="nofollow">"American Airlines v. FareChase"</a><a class="external text" href="http://www.fornova.net/documents/AAFareChase.pdf" rel="nofollow">the original</a><a class="external text" href="http://www.thefreelibrary.com/American+Airlines,+FareChase+Settle+Suit.-a0103213546" rel="nofollow">"American Airlines, FareChase Settle Suit"</a><a class="external text" href="http://www.imperva.com/docs/WP_Detecting_and_Blocking_Site_Scraping_Attacks.pdf" rel="nofollow">Detecting and Blocking Site Scraping Attacks</a><a class="external text" href="http://library.findlaw.com/2003/Jul/29/132944.html" rel="nofollow">"Controversy Surrounds 'Screen Scrapers': Software Helps Users Access Web Sites But Activity by Competitors Comes Under Scrutiny"</a><a class="external text" href="http://www.fornova.net/documents/Cvent.pdf" rel="nofollow">"QVC Inc. v. Resultly LLC, No. 14-06714 (E.D. Pa. filed Nov. 24, 2014)"</a><a class="external text" href="https://www.scribd.com/doc/249068700/LinkedIn-v-Resultly-LLC-Complaint?secret_password=pEVKDbnvhQL52oKfdrmT" rel="nofollow">"QVC Inc. v. Resultly LLC, No. 14-06714 (E.D. Pa. filed Nov. 24, 2014)"</a><a class="external text" href="http://newmedialaw.proskauer.com/2014/12/05/qvc-sues-shopping-app-for-web-scraping-that-allegedly-triggered-site-outage/" rel="nofollow">"QVC Sues Shopping App for Web Scraping That Allegedly Triggered Site Outage"</a><a class="external text" href="http://www.fornova.net/documents/pblog-bna-com.pdf" rel="nofollow">"Did Iqbal/Twombly Raise the Bar for Browsewrap Claims?"</a><a class="external text" href="https://www.techdirt.com/articles/20090605/2228205147.shtml" rel="nofollow">"Can Scraping Non-Infringing Content Become Copyright Infringement... Because Of How Scrapers Work? 
| Techdirt"</a><a class="external text" href="https://www.eff.org/cases/facebook-v-power-ventures" rel="nofollow">"Facebook v. Power Ventures"</a><a class="external text" href="https://web.archive.org/web/20071012005033/http://www.bvhd.dk/uploads/tx_mocarticles/S_-_og_Handelsrettens_afg_relse_i_Ofir-sagen.pdf" rel="nofollow">"UDSKRIFT AF SØ- & HANDELSRETTENS DOMBOG"</a><a class="external text" href="http://www.bvhd.dk/uploads/tx_mocarticles/S_-_og_Handelsrettens_afg_relse_i_Ofir-sagen.pdf" rel="nofollow">the original</a><a class="external text" href="http://www.bailii.org/ie/cases/IEHC/2010/H47.html" rel="nofollow">"High Court of Ireland Decisions >> Ryanair Ltd -v- Billigfluege.de GMBH 2010 IEHC 47 (26 February 2010)"</a><a class="external text" href="http://www.lkshields.ie/htmdocs/publications/newsletters/update26/update26_03.htm" rel="nofollow">"Intellectual Property: Website Terms of Use"</a><a class="external text" href="https://www.lloyds.com/~/media/5880dae185914b2487bed7bd63b96286.ashx" rel="nofollow">"Spam Act 2003: An overview for business"</a><a class="external text" href="http://www.webstartdesign.com.au/spam_business_practical_guide.pdf" rel="nofollow">"Spam Act 2003: A practical guide for business"</a><a class="external text" href="https://s3.us-west-2.amazonaws.com/research-papers-mynk/Breaking-Fraud-And-Bot-Detection-Solutions.pdf" rel="nofollow">Breaking Fraud & Bot Detection Solutions</a><a dir="ltr" href="https://en.wikipedia.org/w/index.php?title=Web_scraping&oldid=830441507">https://en.wikipedia.org/w/index.php?title=Web_scraping&oldid=830441507</a>