-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
11 changed files
with
1,010 additions
and
391 deletions.
There are no files selected for viewing
14 changes: 14 additions & 0 deletions
14
_freeze/posts/2024-03-15-ai-garbage/index/execute-results/html.json
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
{ | ||
"hash": "624c0e78207cefbaf1c18199756efc0e", | ||
"result": { | ||
"markdown": "---\ntitle: \"Shitty R help from shitty AI\"\ndate: 2024-03-15\nslug: \"ai-garbage\"\nimage: resources/beckhams.png\ncategories:\n - ai\n - r\n---\n\n\n![](resources/beckhams.png){fig.alt=\"Beckhams meme. Panels with Victoria then David. She says 'I wrote an R help website' and he responds 'be honest'. She says 'I am being honest' and he says 'really?'. She says 'well, I used AI to write an R help website and am shamelessly making money from innocent suckers as a result' and he says 'thank you'.\" width='50%'}\n\n## tl;dr\n\nThe rise of R 'help' websites written by AI is predatory and shameless. Things will only get worse.\n\n## Help\n\nRegular readers know this for sure: I'm not an R expert. I don't just 'know' stuff. I'm DuckGoGoing 'how to do x in r' every 10 minutes.\n\nIn doing this, I've noticed a trend that I want to complain about: I've found a few suspicious 'help websites' for R appearing high up the search rankings.\n\nWhy 'suspicious'? They're clearly written with an AI tool. And they're garbage. In content and ethics.\n\nI'm absolutely not going to name websites here because I do not want to send any traffic there. \n\n## You are a sucker\n\nHere's where they excel. They:\n\n* use your gullibility to make money\n* are brazen\n* have excellent SEO ([search-engine optimisation](https://en.wikipedia.org/wiki/Search_engine_optimization))\n\n### Gullibility\n\nYou might be thinking 'okay, but maybe this is an efficient way of helping people'. To which the obvious retort is 'okay, no, this is an efficient way to make money by exploiting the clicks of vulnerable learners'.\n\nHow? At least one of these sites suggests it has 'partners', which are clearly just affiliate links. They will make a commission if their visitors sign up for a course at the affiliate link. The site's KPI is conversions, not 'people helped'.\n\n### Brazenness\n\nThese sites seem to have tens (hundreds?) of pages published on the same day without any attribution to a particular human. Either they have some very efficient staff or they assume no-one will check.\n\nPerhaps most brazen though is the poor attention to detail. At least one of these sites retains the sentence:\n\n> Certainly! Here are the two sections for adding <affiliate> and <affiliate> to your webpage:\n\nClearly someone has asked a chatbot for some text and it has obliged. And then they forgot to delete this telltale line from the output before pasting it into their website. And the biggest joke is that I found this on a page full of affiliate links.\n\n### SEO\n\nI found links to these sites as the top search result.\n\nAs is well known, people will just click the top links willy-nilly. These must be the most appropriate sites if they're top of the search rankings, right?\n\nNo. The answer is no. Google is gameable as heck and easily manipulated for clicks.\n\n## Why this sucks\n\nI mean it's kind of obvious that this garbage is harmful, but for the benefit of the doubt, my concerns are as follows.\n\n1. Who are they stealing from?\n2. How much of the code is hallucinatory?\n3. Is this ruining learners' understanding?\n\nIt's pretty common knowledge that many AIs are trained on data without the consent of original creators. How much content is stolen from people without their consent? Maybe it slurped up some of _your_ material against your will.\n\nThese sites have code where the examples literally cannot be run; the syntax cannot be evaluated if copy-pasted into an R terminal. At least one of these sites was offering advice for {ggplot2} without ever showing an example plot.\n\nI'm pretty seasoned at searching for things on the internet, particularly R. I can separate the wheat from the chaff, I reckon. But not everyone can. How can a beginner user know what's wrong if they copy and paste trash from a shameless website like this?\n\n## Suck it up\n\nThis is scummy.\n\nI'm asking that you take two seconds to think 'could this be a fake help website?' Consider the telltale signs:\n\n* suspicious wording and accidentally-undeleted verbiage copied from the output of an LLM (large language model) query\n* obvious links to affiliate sites\n* code that doesn't run when you copy it to your machine\n* examples of code, but no output\n* crappy [Corporate-Memphis-style](https://en.wikipedia.org/wiki/Corporate_Memphis) AI-generated images of a generic white-guy in his 30s at a computer who is probably called Matt[^matt]\n\nMaybe I don't need to warn you about this. It's 2024. Times have changed. You're smart. I grew up with floppies and CD ROMs. \n\nDon't patronise these sites by clicking affiliate links; patronise them with condescension. It's all we can do.\n\n### Environment {.appendix}\n\n<details><summary>Session info</summary>\n\n::: {.cell}\n::: {.cell-output .cell-output-stdout}\n```\nLast rendered: 2024-03-15 22:47:51 GMT\n```\n:::\n\n::: {.cell-output .cell-output-stdout}\n```\nR version 4.3.1 (2023-06-16)\nPlatform: aarch64-apple-darwin20 (64-bit)\nRunning under: macOS Ventura 13.2.1\n\nMatrix products: default\nBLAS: /Library/Frameworks/R.framework/Versions/4.3-arm64/Resources/lib/libRblas.0.dylib \nLAPACK: /Library/Frameworks/R.framework/Versions/4.3-arm64/Resources/lib/libRlapack.dylib; LAPACK version 3.11.0\n\nlocale:\n[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8\n\ntime zone: Europe/London\ntzcode source: internal\n\nattached base packages:\n[1] stats graphics grDevices utils datasets methods base \n\nloaded via a namespace (and not attached):\n [1] htmlwidgets_1.6.2 compiler_4.3.1 fastmap_1.1.1 cli_3.6.2 \n [5] tools_4.3.1 htmltools_0.5.6.1 rstudioapi_0.15.0 yaml_2.3.8 \n [9] rmarkdown_2.25 knitr_1.45 jsonlite_1.8.7 xfun_0.41 \n[13] digest_0.6.33 rlang_1.1.3 evaluate_0.23 \n```\n:::\n:::\n\n</details>\n\n[^matt]: I can say this because it's my exact description.", | ||
"supporting": [], | ||
"filters": [ | ||
"rmarkdown/pagebreak.lua" | ||
], | ||
"includes": {}, | ||
"engineDependencies": {}, | ||
"preserve": {}, | ||
"postProcess": true | ||
} | ||
} |
Oops, something went wrong.