Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add in posts for all referenced books to "books/" #7

Closed
swamidass opened this issue Nov 23, 2021 · 5 comments
Closed

Add in posts for all referenced books to "books/" #7

swamidass opened this issue Nov 23, 2021 · 5 comments

Comments

@swamidass
Copy link
Collaborator

I can provide a list of html links to books. But it will be an ongoing project to get this done right. Long term, a script to scrape the required information would be ideal. The hard part will be scraping amazon, because it requires some specially headers to enable mining.

@swamidass
Copy link
Collaborator Author

@madroxdupe42 you have the list now. Let me know if you need anything more from me regarding it, and be sure to let me know about any ambiguous cases.

@swamidass
Copy link
Collaborator Author

Also, there were 150, so it may not be a bad idea to build a script to scrape some or all of that info. It requires some finagling to get around their anti-scraping. See here for some info: https://www.scrapehero.com/tutorial-how-to-scrape-amazon-product-details-using-python-and-selectorlib/.

Alternatively, they do have an API you could register for, and hopefully it has all the right info.

Another possibility is this site, which claims to convert asin to isbn and reverse: https://www.synccentric.com/features/isbn-to-asin/

I also reccomend the "Editions" function of https://pypi.org/project/isbnlib/ for collecting the relevant isbn editions of a book. It also has some helpful function in there too.

@madroxdupe42
Copy link
Contributor

I agree that a script is a sensible way to go given the volume. Thanks for the links to the research you've already done on the topic. I'll see what I can do.

@swamidass
Copy link
Collaborator Author

If you make a script, aim for using python, and put it into the scripts directory of the project, and making it usable enough. If it is robust enough, I'll link it into build system.

madroxdupe42 added a commit to madroxdupe42/peacefulscience.org that referenced this issue Dec 24, 2021
swamidass pushed a commit that referenced this issue Dec 24, 2021
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7
madroxdupe42 added a commit to madroxdupe42/peacefulscience.org that referenced this issue Dec 31, 2021
swamidass pushed a commit that referenced this issue Jan 3, 2022
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7

* Second batch of books for #7

* Scaffolding and first two excerpts of Flat Earths and Fake Footnotes for #17
@swamidass
Copy link
Collaborator Author

First batch looks like its done. I'll send you an updated list of books, and also get a system for you to check at any time what the unresolved books are. The goal will be to keep updating them, hopefully within a few days of a new reference being added.

madroxdupe42 added a commit to madroxdupe42/peacefulscience.org that referenced this issue Jan 5, 2022
madroxdupe42 added a commit to madroxdupe42/peacefulscience.org that referenced this issue Jan 5, 2022
swamidass pushed a commit that referenced this issue Jan 5, 2022
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7

* Second batch of books for #7

* Scaffolding and first two excerpts of Flat Earths and Fake Footnotes for #17

* Two remaining excerpts from Flat Earths for #17

* Figured out the solution for non-ASCII characters in author names for #7

* Figured out the solution for non-ASCII characters in author names for #7
madroxdupe42 added a commit to madroxdupe42/peacefulscience.org that referenced this issue Jan 6, 2022
swamidass pushed a commit that referenced this issue Jan 6, 2022
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7

* Second batch of books for #7

* Scaffolding and first two excerpts of Flat Earths and Fake Footnotes for #17

* Two remaining excerpts from Flat Earths for #17

* Figured out the solution for non-ASCII characters in author names for #7

* Figured out the solution for non-ASCII characters in author names for #7

* Trimmed obvious and easily removed marketing language from book descriptions for #7
swamidass pushed a commit that referenced this issue Jan 10, 2022
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7

* Second batch of books for #7

* Scaffolding and first two excerpts of Flat Earths and Fake Footnotes for #17

* Two remaining excerpts from Flat Earths for #17

* Figured out the solution for non-ASCII characters in author names for #7

* Figured out the solution for non-ASCII characters in author names for #7

* Trimmed obvious and easily removed marketing language from book descriptions for #7

* Update title and publication date; add headerimage; add reference section to first Flat Earths excerpt
swamidass pushed a commit that referenced this issue Jan 13, 2022
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7

* Second batch of books for #7

* Scaffolding and first two excerpts of Flat Earths and Fake Footnotes for #17

* Two remaining excerpts from Flat Earths for #17

* Figured out the solution for non-ASCII characters in author names for #7

* Figured out the solution for non-ASCII characters in author names for #7

* Trimmed obvious and easily removed marketing language from book descriptions for #7

* Update title and publication date; add headerimage; add reference section to first Flat Earths excerpt

* Update publication dates; add headerimage; add reference section to remaining Flat Earths excerpts
swamidass pushed a commit that referenced this issue Jan 24, 2022
* Typeset all remaining Faith across the Multiverse excerpts for #10; fix link in Power of Babelfish editor's note

* Add images to Faith across the Multiverse posts for #10

* Add image credits and adjust YouTube videos for #10

* Correct dates in FatM excerpts

* First batch of books for #7

* Second batch of books for #7

* Scaffolding and first two excerpts of Flat Earths and Fake Footnotes for #17

* Two remaining excerpts from Flat Earths for #17

* Figured out the solution for non-ASCII characters in author names for #7

* Figured out the solution for non-ASCII characters in author names for #7

* Trimmed obvious and easily removed marketing language from book descriptions for #7

* Update title and publication date; add headerimage; add reference section to first Flat Earths excerpt

* Update publication dates; add headerimage; add reference section to remaining Flat Earths excerpts

* Typeset 2 MTE-ETS prints
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants