Releases · KalimeroMK/RssFeed

30 May 17:24

585d224

Code update

Latest

• Switched from SimpleXMLElement to SimplePie for more robust RSS feed parsing.
• Simplified feed parsing logic with SimplePie.
• Simplified image extraction using regex directly from feed item descriptions.
• Removed dependency on DOMDocument and DOMXPath for HTML parsing.
• Maintained robust image saving logic to download and save images to storage.
• Removed fetchContentUsingCurl and convertToAbsoluteUrl methods, reducing code complexity.
• Simplified error handling and logging for RSS feed parsing.
• Added dependency on Laravel’s Container for creating SimplePie instances, ensuring better integration with Laravel’s service container.
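The regex-based image extraction mentioned above could look roughly like the sketch below. The function name `extractImageUrls` and the exact pattern are assumptions made for illustration, not the package's actual code; the input would be an item description such as the string returned by SimplePie's `get_description()`.

```php
<?php

/**
 * Hypothetical helper (not part of the package): collect the unique
 * src values of all <img> tags in an HTML fragment.
 */
function extractImageUrls(string $description): array
{
    // Capture the quoted value of a whitespace-preceded src attribute,
    // so attributes such as data-src are not matched by mistake.
    preg_match_all('/<img\b[^>]*?\ssrc\s*=\s*(["\'])(.*?)\1/is', $description, $matches);

    // Decode entities such as &amp; that are common in feed markup.
    $urls = array_map('html_entity_decode', $matches[2]);

    // Drop duplicates while keeping first-seen order.
    return array_values(array_unique($urls));
}
```

A regex is simpler than a DOM parser for short, predictable feed descriptions, but it will miss unquoted attributes and other irregular markup.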


07 Mar 14:23

KalimeroMK

v2.4

991f899

Image return update

Find Images: The code then looks for <img> tags within each found element.
$images = $xpath->query('.//img', $element): This line finds all <img> elements within the current element and stores them in $images.
Iterate over Found Images: The loop iterates over each image found in the current element.
foreach ($images as $img): This starts another loop for each image found within the current element.
Extract and Validate Image URLs: The code extracts the URL from each image and checks whether it's a valid URL or a placeholder. If it's a placeholder, the code attempts to find a real image URL from alternative attributes commonly used with lazy-loaded images.
$src = $img->getAttribute('src'): This line gets the src attribute of the image.
If the src is a data URI or SVG (indicating a placeholder), the code checks other attributes like data-src, data-lazy-src, or data-original for an actual image URL.
Convert Relative URLs to Absolute: If a valid image URL is found and it's a relative URL, the code converts it to an absolute URL based on the page's domain.
$src = $this->convertToAbsoluteUrl($src, ...): This calls a method that converts a relative URL to an absolute one.
Store Unique URLs: Finally, if the image URL hasn't already been added to the $imageUrls array, the code adds it.
if (!in_array($src, $imageUrls)) { $imageUrls[] = $src; }: This checks if the URL is already in the array and, if not, adds it.
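The steps above can be sketched as a single function. This is a minimal illustration assuming PHP 8 (for `str_starts_with` and `str_ends_with`); the function name `collectImageUrls` and the placeholder test are assumptions, and the relative-to-absolute conversion is left out because the arguments of `convertToAbsoluteUrl` are not shown in the notes.

```php
<?php

/**
 * Hypothetical helper: return the unique image URLs found under $element,
 * falling back to lazy-load attributes when src holds a placeholder.
 */
function collectImageUrls(DOMXPath $xpath, DOMNode $element, array $imageUrls = []): array
{
    // Find every <img> below the current element.
    $images = $xpath->query('.//img', $element);

    foreach ($images as $img) {
        $src = $img->getAttribute('src');

        // Assumption: a data URI or an .svg file marks a lazy-load placeholder.
        if ($src === '' || str_starts_with($src, 'data:') || str_ends_with(strtolower($src), '.svg')) {
            $src = '';
            foreach (['data-src', 'data-lazy-src', 'data-original'] as $attribute) {
                if ($img->getAttribute($attribute) !== '') {
                    $src = $img->getAttribute($attribute);
                    break;
                }
            }
        }

        // The relative-to-absolute conversion step would go here (omitted).

        // Store each URL only once.
        if ($src !== '' && !in_array($src, $imageUrls, true)) {
            $imageUrls[] = $src;
        }
    }

    return $imageUrls;
}
```

Passing the accumulated `$imageUrls` array back in on each call keeps the result unique across several matched elements.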


07 Mar 12:20

KalimeroMK

v2.3.1

e4b0238

Add missing facade

Added the missing import: use Illuminate\Support\Facades\Log;


07 Mar 12:12

KalimeroMK

v2.3

a045b1a

Code refactor


23 Feb 13:02

KalimeroMK

v2.2

6ffcaf4

Add new feature

Dynamic XPath Configuration: The method now dynamically selects XPath queries based on the domain of the RSS feed. This allows for custom content scraping strategies for different websites.
Image Size Filtering: Added functionality to filter images by their width, ensuring that only images larger than a specified width (e.g., 600px) are considered. This helps in focusing on significant images only.
Unique Images: Updated the logic to ensure that only unique images are returned by the method, eliminating duplicates and reducing unnecessary data.
Domain-based Configuration: Shifted to a domain-based configuration approach for XPath queries, allowing for more granular and accurate content extraction tailored to each specific source or domain.
Configuration File Usage: The class now leverages a configuration file (rssfeed.php) for setting parameters like minimum image width and domain-specific XPath queries, offering a centralized place for configurations.
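A domain-based configuration of this kind might be shaped as in the sketch below. The key names (`min_image_width`, `content_xpaths`, `default`), the example domain, and both helper functions are hypothetical; the notes only state that rssfeed.php holds a minimum image width and per-domain XPath queries.

```php
<?php

// Hypothetical contents of config/rssfeed.php; all key names are assumptions.
$config = [
    'min_image_width' => 600,
    'content_xpaths' => [
        'example.com' => "//div[contains(@class, 'article-body')]",
        'default' => '//article',
    ],
];

/** Pick the XPath query for a feed URL, falling back to the default entry. */
function xpathForFeed(array $config, string $feedUrl): string
{
    $host = parse_url($feedUrl, PHP_URL_HOST) ?: '';

    // Treat www.example.com and example.com as the same domain.
    $domain = preg_replace('/^www\./', '', strtolower($host));

    return $config['content_xpaths'][$domain] ?? $config['content_xpaths']['default'];
}

/** Keep only images wider than the configured minimum. */
function isLargeEnough(array $config, int $width): bool
{
    return $width > $config['min_image_width'];
}
```

In a Laravel application the array would normally be returned from the config file and read with `config('rssfeed.min_image_width')` rather than passed around explicitly.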


03 Feb 12:53

KalimeroMK

v2.1

92b2595

v2.1

Some small updates


03 Feb 12:37

KalimeroMK

266d7d9

v2

Integration of cURL for Fetching Data:
Image Extraction from Content:
XPath Configuration for Content Selection:
Support for Multiple div Classes: Expanded the configuration to include an additional div class, demonstrating how to target content areas with various class attributes. This showcases the method's adaptability to different webpage structures.
Error Handling and Robustness: Implemented error handling in various parts of the code to gracefully manage exceptions and unexpected situations, such as HTTP errors or issues with image URLs. This increases the code's robustness and reliability.
Return Structured Data: Adjusted methods to return structured data, including both textual content and arrays of image URLs, providing a comprehensive overview of the processed content.
Flexible Content Extraction: Demonstrated how to extract and concatenate selected HTML content, preserving tags while optionally removing scripts and styles, thus maintaining the relevance and cleanliness of the content.


03 Feb 12:18

KalimeroMK

v1.2.1

9e23718

v1.2.1

Code refactor


14 Jun 06:47

KalimeroMK

v1.2

35ca80a

v1.2

Make RssFeed class implement ShouldQueue with Dispatchable so it can be used as a job


11 Jun 23:06

KalimeroMK

v1.1

78bf34b

Added return type

Added return type array to the parseRssFeeds method.
Added return type string to the saveImageToStorage method.
Added return type bool|string to the retrieveFullContent method.
Added return type string|null to the getImageWithSizeGreaterThan method.
Added type casts, such as (string), to the values read inside the foreach loops.

