Description
In code/tools/db_load_utils.py, what is INCLUDE_TYPES for if the code simply picks up every type that is not listed in SKIP_TYPES?
# Item type categorization
SKIP_TYPES = ["ItemList", "ListItem", "AboutPage", "Person"]
INCLUDE_TYPES = [
    "FAQ", "WebPage", "Question", "Answer", "Organization", "BlogPosting", "MedicalWebPage", "HealthTopicContent", "MedicalOrganization", "OverviewHealthAspect"
]
I've noticed that the scraper picks up image objects even when they are not listed in INCLUDE_TYPES.
Should I simply leave INCLUDE_TYPES as an empty array?
In the source repo you have:
# Item type categorization
SKIP_TYPES = ["ItemList", "ListItem", "AboutPage", "WebPage", "WebSite", "Person"]
INCLUDE_TYPES = [
    "Recipe", "NeurIPSPoster", "InvitedTalk", "Oral", "Movie", "LocalBusiness", "Review",
    "TVShow", "TVEpisode", "Product", "Offer", "PodcastEpisode", "Book",
    "Podcast", "TVSeries", "ProductGroup", "Event", "FoodEstablishment",
    "Apartment", "House", "Home", "RealEstateListing", "SingleFamilyResidence", "Offer",
    "AggregateOffer", "Event", "BusinessEvent", "Festival", "MusicEvent", "EducationEvent",
    "SocialEvent", "SportsEvent"
]
I can't work out why you have listed these types if everything outside SKIP_TYPES is loaded by default.
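
To make the question concrete, here is a minimal sketch of the behavior I seem to be getting versus what I expected. This is not the actual code in db_load_utils.py: the function names `should_load_skip_only` and `should_load_allowlist` are hypothetical, the INCLUDE_TYPES list is shortened, and I am assuming each item carries its schema.org type in an "@type" field (for example "ImageObject" for images).

```python
SKIP_TYPES = ["ItemList", "ListItem", "AboutPage", "Person"]
INCLUDE_TYPES = ["FAQ", "WebPage", "Question", "Answer"]  # shortened for the example


def item_types(item):
    """Return the item's "@type" as a list (it may be a string or a list)."""
    raw = item.get("@type", [])
    return [raw] if isinstance(raw, str) else list(raw)


def should_load_skip_only(item):
    """What I appear to observe (assumed): keep anything not in SKIP_TYPES."""
    return not any(t in SKIP_TYPES for t in item_types(item))


def should_load_allowlist(item):
    """What I expected (assumed): also require a type from INCLUDE_TYPES."""
    types = item_types(item)
    if any(t in SKIP_TYPES for t in types):
        return False
    return any(t in INCLUDE_TYPES for t in types)


# Hypothetical scraped item, used only for illustration.
image = {"@type": "ImageObject", "url": "https://example.com/a.png"}
print(should_load_skip_only(image))   # True: loaded even though not in INCLUDE_TYPES
print(should_load_allowlist(image))   # False: filtered out by the allowlist
```

If the loader behaves like the skip-only version, the contents of INCLUDE_TYPES have no effect, which would explain why image objects are loaded.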