- dockerfile: Add tesseract-ocr and libtesseract-dev to Dockerfile.base (b534f5c)
- docker: Add new env file with project id and other configurations (994f2e5)
- bis_fetcher: Add author and timestamp to fetched data, add support for fetching pdfs (231de01)
- bis_fetcher: Added use_selenium flag into request function (f6a1727)
- bis_fetcher: Use base_url for article links, add use_selenium field and change config_name value to "bis". (a9e3655)
- bis_fetcher: Add base_url to BaseFetcher (3575281)
- fetcher: Replace selenium webdriver with chromedriver, introduce locator strategies and Response class in requests (b938086)
- fetcher: Add wait_time parameter and locator capability to ChromeWebDriver (369cb21)
- bis_fetcher: Add ChromeWebDriver class for fetching HTTP responses using Selenium (568c5cc)
- bis_fetcher: Add request method and driver method (22803da)
- pyproject.toml: Add selenium dependency (6d59443)
- docker: Add Dockerfile.app and Dockerfile.base files (5912d5c)
- fetcher: Implement _next_page_func for handling, add support for start_urls and page_placeholder (fe89717)
- bis_fetcher: Add BaseFetcher and BisFetcher classes (847dc7a)
- .copier-config: Update various configurations (8d3c95d)
- README.md: Update library description and token in codecov-image link (ad1c6aa)
- book: Add favicon and google analytics id to config (0cf569e)
- book: Update description and utility of 'bis-fetcher' library (0174a0e)
- Initial version (1ef112f)