remove /old/ pages from search engines #350

Closed
SethTisue opened this issue Nov 4, 2015 · 11 comments

Comments

@SethTisue
Member

e.g. Googling "Scala FAQ" takes you to http://www.scala-lang.org/old/faq which (as @soc recently pointed out at https://groups.google.com/d/msg/scala-internals/nkh7_L4EpHA/9z1DM5QQBQAJ) is embarrassingly outdated.

I think it's fine to leave the old pages up so that links to them don't just go dead, but we really don't want this stuff turning up when newbies google us.

@SethTisue SethTisue self-assigned this Nov 4, 2015
@DarkDimius
Member

There is a lot of knowledge hidden in discussions in the old part of the forums that would be great to find through Google, such as http://www.scala-lang.org/old/node/9126.html
Could we mark everything but the old forums as non-indexed by search engines?
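
A rough sketch of how such an exception could be expressed in robots.txt, checked with Python's standard `urllib.robotparser`. The rule text is an assumption, not the site's actual file, and it assumes the forum archive lives under `/old/node/`, which is inferred only from the single URL above. The helper name `may_crawl` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed rules: keep the forum archive crawlable, block the rest of /old/.
# The Allow line comes first because urllib.robotparser applies the first
# matching rule; crawlers that use longest-match also pick the Allow line here.
ROBOTS_TXT = """\
User-agent: *
Allow: /old/node/
Disallow: /old/
"""


def may_crawl(url: str, agent: str = "Googlebot") -> bool:
    """Return True if a crawler honoring ROBOTS_TXT may fetch the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.scala-lang.org/old/node/9126.html",
        "http://www.scala-lang.org/old/faq",
    ):
        print(url, may_crawl(url))
```

If non-forum pages also sit under `/old/node/`, this prefix would be too broad and a narrower rule would be needed.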

@fsalvi
Contributor

fsalvi commented Nov 5, 2015

The forums are just archives of the Google Groups mailing lists.
E.g., for "Inlining Problem":
https://groups.google.com/forum/#!msg/scala-internals/a6Py8kpeiQY/VPDKvJV8SEEJ

@fsalvi
Contributor

fsalvi commented Nov 5, 2015

I thought it would be interesting to know which old pages are often requested on the Scala website.

Here is the list (based on our webserver statistics):

Searching for these keywords on Google always puts the old website in the top results...

There were other old web pages that were often requested, but I already added some redirects to point them to new pages which have the same content (e.g. the Tour of Scala).

@soc
Member

soc commented Nov 6, 2015

Thanks, good to know!

@raboof
Member

raboof commented Dec 11, 2015

I know the README is quite adamant that "Subdirectory scala-lang.org/old is a static copy of the old website. It was generated once and copied there, and it stays like that".

Still, it'd be real neat to put it up on GitHub (even in its current 'raw' form), so we can at least create PRs to add some links to more recent material.

@SethTisue SethTisue removed their assignment Apr 28, 2016
@jarrodu
Member

jarrodu commented Nov 20, 2016

It sounds like we could benefit from updating the robots.txt file in the root directory.

@adamvoss

adamvoss commented Dec 4, 2016

What @fsalvi did with redirects sounds good. I know I have recently pulled useful information out of the old website because it appeared in the search results. I did not investigate everything to see if the current site had the info. I did note that other, similarly ranked results did not look like they would have been as concise in answering the programming/language question I had.

I guess what I am trying to say is that I would be worried about delisting web pages if we are not sure there are good alternatives, as we may be making learning harder.

@fsalvi
Contributor

fsalvi commented Feb 7, 2017

Here's an updated list of old pages still requested (ordered by most viewed):

@SethTisue
Member Author

Thanks, Fabien, for compiling this list.

I think nearly all, perhaps 100%, of these results are egregiously outdated and actually quite harmful for beginners to be landing on. So I still support my original proposal:

> I think it's fine to leave the old pages up so that links to them don't just go dead, but we really don't want this stuff turning up when newbies google us.

@fsalvi
Contributor

fsalvi commented Oct 1, 2020

I updated robots.txt to stop the indexing of the /old directory.
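
For readers unfamiliar with robots.txt, here is a minimal sketch of what such a rule might look like and how a compliant crawler would read it. The actual contents of the scala-lang.org robots.txt are not shown in this thread, so the rule text below is an assumption, and the helper name `is_crawlable` is hypothetical. Python's standard `urllib.robotparser` is used only to evaluate the rule.

```python
from urllib.robotparser import RobotFileParser

# Assumed rule text; the real robots.txt on scala-lang.org may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /old/
"""


def is_crawlable(url: str, agent: str = "Googlebot") -> bool:
    """Return True if a crawler honoring ROBOTS_TXT may fetch the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.scala-lang.org/old/faq",
        "http://www.scala-lang.org/",
    ):
        print(url, is_crawlable(url))
```

A `Disallow` rule asks crawlers not to fetch matching pages. Whether URLs that are already indexed then drop out of search results is a separate question that this sketch does not address.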

@fsalvi fsalvi closed this as completed Oct 1, 2020
@SethTisue
Member Author

Thank you, Fabien!
