Skip to content

v2

Compare
Choose a tag to compare
@KalimeroMK KalimeroMK released this 03 Feb 12:37
· 16 commits to main since this release
  1. Integration of cURL for Fetching Data:
  2. Image Extraction from Content:
  3. XPath Configuration for Content Selection:
  4. Support for Multiple div Classes:
  5. Expanded the configuration to include an additional div class, demonstrating how to target content areas with various class attributes. This showcases the method's adaptability to different webpage structures.
  6. Error Handling and Robustness:
  7. Implemented error handling in various parts of the code to gracefully manage exceptions and unexpected situations, such as HTTP errors or issues with image URLs. This increases the code's robustness and reliability.
  8. Return Structured Data:
  9. Adjusted methods to return structured data, including both textual content and arrays of image URLs, providing a comprehensive overview of the processed content.
  10. Flexible Content Extraction:
  11. Demonstrated how to extract and concatenate selected HTML content, preserving tags while optionally removing scripts and styles, thus maintaining the relevance and cleanliness of the content.