Thaqalayn.net API

UPDATE AS OF 2024-04

the API now has V2 endpoints. The old endpoints are still available but will no longer be updated. All new Thaqalayn hadiths will be reflected in the V2 endpoints. All examples below use the V2 endpoints. The old endpoints can be accessed by replacing .../api/v2/... in the URL with .../api/... in the URL.

Data returned in V2 is very similar to what was returned in the original endpoints. One breaking change is that the behdudiGrading field has been changed to behbudiGrading to reflect the correct spelling. Also, because the data that is retrieved is now formatted in a different way (ex. gradings are are better formatted), it is hard to know what is an application breaking change and what isn't. So I decided to separate this update into it's own version.

Developers are encouraged to migrate to the V2 endpoints to fetch all the latest data. Migration should be relatively seemless, with the only expected change being the behdudiGrading->behbudiGrading. The old endpoints will still be available for the foreseeable future.

Introduction

https://www.thaqalayn-api.net/

A Rest + GQL API that allows for the retrieval of hadiths from thaqalayn.net in JSON format. To create it, I first built a web scraper (python) to get all the hadiths on thaqalayn.net. Afterwards I stored the data in an online database and created an API using node.js + express. I also created a simple front-end with react to showcase one of the endpoints (api/random). The front-end can be reached at https://thaqalayn-api-423621.web.app

Update as of 2024-04: The API now relies on a Go script as opposed to a python script to fetch all the data. All relevant code is found in the V2 directory.

How to use

Here is a simple example of how to fetch one of the endpoints using axios. Change url to whatever endpoint you'd like.

const url = "https://www.thaqalayn-api.net/api/v2/random"

request = axios.get(url).then(res => {
        console.log(res.data);
        //...
    })

Endpoints

All endpoints

The GraphQl endpoint can be found at: https://www.thaqalayn-api.net/graphql

A list of endpoints can be found on the Swagger UI page: https://www.thaqalayn-api.net/api-docs/

Retrieve all the available books, with minimum and maximum Id's:
- https://www.thaqalayn-api.net/api/v2/allbooks
Retrieve a random hadith from any book:
- https://www.thaqalayn-api.net/api/v2/random
Retrieve a random hadith from a given book:
- https://www.thaqalayn-api.net/api/v2/[bookId]/random
Make a query throughout the entire database. This is a very simplistic case-insensitive search mechanism that accepts both english and arabic and searches for any hadith with an exact match. Use it with query q. Use the pipe character | to separate multiple queries (where all matches that contain either query 1 OR query 2 are retrieved):
- https://www.thaqalayn-api.net/api/v2/query?q=[query]
- https://www.thaqalayn-api.net/api/v2/query?q=[query1]|[query2]
Make a query for a specific book. Same rules as above apply here:
- https://www.thaqalayn-api.net/api/v2/query/[bookId]?q=[query]
Get all the hadiths for a particular book:
- https://www.thaqalayn-api.net/api/v2/[bookId]
Return a specific hadith based on id:
- https://www.thaqalayn-api.net/api/v2/[bookId]/[id]

Examples

Retrieve a random hadith from a given book:
- https://www.thaqalayn-api.net/api/v2/Al-Amali-Mufid/random
Make a query throughout all books:
- https://www.thaqalayn-api.net/api/v2/query?q=misery%20and%20wretchedness
Make a query throughout all books with multiple queries:
- https://www.thaqalayn-api.net/api/v2/query?q=misery%20and%20wretchedness|We%20seek%20refuge%20in%20Allah%20from%20the%20Fire
Make a query for a specific book:
- https://www.thaqalayn-api.net/api/v2/query/Al-Kafi-Volume-6-Kulayni?q=misery%20and%20wretchedness
Get all the hadiths for a particular book:
- https://www.thaqalayn-api.net/api/v2/Al-Amali-Mufid
Get a specific hadith based on id:
- https://www.thaqalayn-api.net/api/v2/Uyun-akhbar-al-Rida-Volume-1-Saduq/80

Extra info

Most folders in the V1 directory are no longer used (except for the /V1/DB/models).

In the V2 directory, you'll find of relevance:

ThaqalaynData directory, which is all the data that was fetched from the website manually. This data is stored in a JSON format. This will not include any new data that has been added and fetched using the automated github actions workflow.
WebScraper directory, which contains the Go script that fetches all the data from the website. The main package is found under /WebScraper/cmd. The script has the following flags associated with it:
- -datapath: Required if using the script to scrape. Provides the path where the data will be stored after being scraped. DO NOT PUT a slash at the end. Ex. go run main.go -datapath=../ThaqalaynData
- -singlebook: The Thaqalayn ID (int) of a single book to scrape. This flag is optional and if not provided, the script will scrape all the books. Ex. go run main.go -datapath=../ThaqalaynData -singlebook=17
- -booknamesonly: This flag signifies that you only want to create the "allBooks.json" (all books combined into a single json) and "BookNames.json" (Book metadata) files. This requires that the books have already been scraped. When this flag is used, no scraping will be done. The flag value represents the path where the data (the two files mentioned) is stored. DO NOT PUT a slash at the end. Ex. go run main.go -booknamesonly=../ThaqalaynData
- -webapp-url: The URL of the webapp. This is required if you want to scrape the data. Ex. go run main.go -datapath=../ThaqalaynData -webapp-url=https://someWebAppUrl.com . I cannot make public the URL of Thaqalayn's GQL API. This can be either a flag or an environment variable, where the environment variable takes precedence.
- -webapp-api-key: The API key of the webapp. This is required if you want to scrape the data. Ex. go run main.go -datapath=../ThaqalaynData -webapp-url=https://someWebAppUrl.com -webapp-api-key=SomeAPIKey. This can be either a flag or an environment variable, where the environment variable takes precedence.

To do any scraping, the WEBAPP_URL and WEBAPP_API_KEY env variables need to be set.

A developer can also use push_books and push_hadiths (or to combine the two, push_all) to push data into their mongoDB atlas instance if they want to create one and have the data stored in their themselves. They will need the MONGODB_URI env variable set to their mongoDB atlas URI (can use .env file). If that's all done, A developer can follow the steps below to publish the data in the /V2/ThaqalaynData directory to their mongoDB atlas instance:

cd V2/Deploy
make push_all or make push_books (pushes the booknames.json) or make push_hadiths (pushes the allBooks.json)

Feel free to use any part of this project and modify as you'd like.

Developer Setup

Scraper Setup

Clone the repository
Make sure Thaqalayn API credentials are set. This can be done by setting the WEBAPP_URL and WEBAPP_API_KEY environment variables or by using the -webapp-api-key and -webapp-url flags when running the script. Environment variables take precedence. If you want to use environment variables, copy the .env.example file to V2/WebScraper/cmd folder, rename to .env and fill in the values 2 api values.
cd V2/WebScraper/cmd
To scrape all the books: go run main.go -datapath=../../ThaqalaynData. Add the -webapp-api-key and -webapp-url flags if you haven't set the environment variables.
To scrape a single book: go run main.go -datapath=../../ThaqalaynData -singlebook=17. Add the -webapp-api-key and -webapp-url flags if you haven't set the environment variables. Replace 17 with the book ID you want to scrape.

API Setup

Clone the repository
run npm install
Using the .env.example file, create a .env file at the root of the directory. You will need a value for MONGODB_URI. This is the URI of your mongoDB atlas instance that stores the data. This uses models found in /v1/models and /v2/models.
run npm start

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.github/workflows		.github/workflows
API		API
V1		V1
V2		V2
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
handler.js		handler.js
package-lock.json		package-lock.json
package.json		package.json
serverless.yml		serverless.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Thaqalayn.net API

UPDATE AS OF 2024-04

Introduction

How to use

Endpoints

All endpoints

Examples

Extra info

Developer Setup

Scraper Setup

API Setup

About

Releases

Packages

Languages

License

MohammedArab1/ThaqalaynAPI

Folders and files

Latest commit

History

Repository files navigation

Thaqalayn.net API

UPDATE AS OF 2024-04

Introduction

How to use

Endpoints

All endpoints

Examples

Extra info

Developer Setup

Scraper Setup

API Setup

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages