This repository has been archived by the owner on Nov 25, 2023. It is now read-only.

Please add support for the robots.txt file #2

Closed
ygguser opened this issue Apr 3, 2023 · 2 comments
Labels
enhancement (New feature or request), good first issue (Good for newcomers)

Comments


ygguser commented Apr 3, 2023

Please make the crawler read the robots.txt file and restrict crawling based on its contents.

d47081 (Collaborator) commented Apr 3, 2023

Nice thinking! I forgot about this important point.

d47081 (Collaborator) commented Apr 7, 2023

Hello, after implementing #3, I have added basic robots.txt support.

I may have reinvented the wheel here, but it is what it is.

For now, the crawler supports only the User-agent: * section and Allow/Disallow rules (see the sketch below).
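
To make that subset concrete, here is a minimal PHP sketch of this kind of matching: only the User-agent: * section is parsed, only Allow/Disallow lines are honored, and the longest matching rule wins. The class and method names (RobotsSketch, uriAllowed) are made up for this example; this is not the code from library/robots.php. Requires PHP 8 for str_starts_with().

```php
<?php

// Minimal robots.txt matcher: reads only the "User-agent: *" section
// and honors only Allow/Disallow rules. Illustration only; names and
// behavior are assumptions, not the actual library/robots.php class.
class RobotsSketch
{
    private array $allow    = [];
    private array $disallow = [];

    public function __construct(string $robotsTxt)
    {
        $inStarSection = false;

        foreach (preg_split('/\r\n|\r|\n/', $robotsTxt) as $line) {

            // Drop comments and surrounding whitespace
            $line = trim(preg_replace('/#.*$/', '', $line));

            if ('' === $line) {
                continue;
            }

            if (preg_match('/^user-agent:\s*(.+)$/i', $line, $match)) {
                // Track whether we are inside the wildcard section
                $inStarSection = ('*' === trim($match[1]));
                continue;
            }

            if (!$inStarSection) {
                continue;
            }

            if (preg_match('/^allow:\s*(\S+)/i', $line, $match)) {
                $this->allow[] = $match[1];
            } elseif (preg_match('/^disallow:\s*(\S+)/i', $line, $match)) {
                $this->disallow[] = $match[1];
            }
        }
    }

    // A path is crawlable unless a Disallow prefix matches it and no
    // Allow prefix of equal or greater length overrides the match
    public function uriAllowed(string $uri): bool
    {
        $verdict = true;
        $matched = 0;

        foreach ($this->disallow as $rule) {
            if (str_starts_with($uri, $rule) && strlen($rule) > $matched) {
                $verdict = false;
                $matched = strlen($rule);
            }
        }

        foreach ($this->allow as $rule) {
            if (str_starts_with($uri, $rule) && strlen($rule) >= $matched) {
                $verdict = true;
                $matched = strlen($rule);
            }
        }

        return $verdict;
    }
}

// Usage example
$robots = new RobotsSketch("User-agent: *\nDisallow: /private/\nAllow: /private/public/");

var_dump($robots->uriAllowed('/private/page.html'));        // bool(false)
var_dump($robots->uriAllowed('/private/public/page.html')); // bool(true)
```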

I have also added the CRAWL_ROBOTS_DEFAULT_RULES directive to the config file, so a search provider can set its own default rules for websites that do not provide this file (illustrated below).
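
For illustration, the fallback might be wired up like this; the value format for CRAWL_ROBOTS_DEFAULT_RULES and the fetch logic shown here are assumptions for the sketch, not the project's actual config handling:

```php
<?php

// Hypothetical config excerpt: default rules applied when a website
// does not provide robots.txt (value format is an assumption)
define('CRAWL_ROBOTS_DEFAULT_RULES', "Disallow: /cgi-bin/\nDisallow: /tmp/");

// Sketch of the fallback in the crawler
$robotsTxt = @file_get_contents('https://example.com/robots.txt');

if (false === $robotsTxt) {
    $robotsTxt = "User-agent: *\n" . CRAWL_ROBOTS_DEFAULT_RULES;
}
```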

For contributors, the library class is here:
https://github.com/YGGverse/YGGo/blob/main/library/robots.php

Please reopen if I missed something.

@d47081 d47081 closed this as completed Apr 7, 2023
@d47081 d47081 added the enhancement label Apr 7, 2023
d47081 pushed a commit that referenced this issue Apr 9, 2023