Does CSS extraction support the extraction of arrays? Currently, only one item can be extracted. #892

cccmolo · 2025-03-26T10:07:01Z

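```python
import asyncio
import json
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig, CacheMode
from crawl4ai.extraction_strategy import JsonCssExtractionStrategy

async def main():
    # One output object is expected per element matching baseSelector.
    schema = {
        "name": "Example Items",
        "baseSelector": "div.item",
        "fields": [
            {"name": "title", "selector": "h2", "type": "text"},
            {"name": "link", "selector": "a", "type": "attribute", "attribute": "href"}
        ]
    }
    # The HTML tags were lost when this post was rendered; the markup below is
    # reconstructed from the schema and the output quoted after the code.
    raw_html = """
    <div class="item">
      <h2>Item 1</h2>
      <a href="https://example.com/item1">Link 1</a>
    </div>
    <div class="item">
      <h2>Item 2</h2>
      <a href="https://example.com/item2">Link 2</a>
    </div>
    """
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(
            url="raw://" + raw_html,
            config=CrawlerRunConfig(
                cache_mode=CacheMode.BYPASS,
                extraction_strategy=JsonCssExtractionStrategy(schema)
            )
        )
        data = json.loads(result.extracted_content)
        print(data)

if __name__ == "__main__":
    asyncio.run(main())
```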

Actual result: [{'title': 'Item 1', 'link': 'https://example.com/item1'}]

Expected result: [{'title': 'Item 1', 'link': 'https://example.com/item1'}, {'title': 'Item 2', 'link': 'https://example.com/item2'}]

aravindkarnam (Collaborator) · 2025-03-28T04:56:02Z

@cccmolo Yes, we support extracting a list of items from a page. Here's an example that extracts a list of items from a restaurant menu:
https://gist.github.com/aravindkarnam/021e6c4e3da0da3ca795b40d615ebe9f
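To make the expected behavior concrete, here is a minimal standard-library sketch of what a `baseSelector` is meant to do: produce one record per matching element. This is not crawl4ai's implementation. The `ItemParser` class, the `extract_items` helper, and the sample markup are hypothetical names and data introduced only for illustration, and the parser assumes the items contain no nested `<div>` elements.

```python
from html.parser import HTMLParser

# Sample markup assumed for illustration; it mirrors the schema in the question
# (baseSelector "div.item", an "h2" text field, and an "a" href field).
RAW_HTML = """
<div class="item">
  <h2>Item 1</h2>
  <a href="https://example.com/item1">Link 1</a>
</div>
<div class="item">
  <h2>Item 2</h2>
  <a href="https://example.com/item2">Link 2</a>
</div>
"""


class ItemParser(HTMLParser):
    """Hypothetical helper: collects one dict per <div class="item">."""

    def __init__(self):
        super().__init__()
        self.items = []         # one record per matched base element
        self._current = None    # record being filled, or None outside an item
        self._in_title = False  # True while inside an <h2> of the current item

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "div" and "item" in classes:
            # A new base element starts a new record.
            self._current = {"title": None, "link": None}
        elif self._current is not None:
            if tag == "h2":
                self._in_title = True
            elif tag == "a" and self._current["link"] is None:
                self._current["link"] = attrs.get("href")

    def handle_data(self, data):
        if self._in_title and self._current is not None:
            self._current["title"] = (self._current["title"] or "") + data.strip()

    def handle_endtag(self, tag):
        if tag == "h2":
            self._in_title = False
        elif tag == "div" and self._current is not None:
            # Assumes no nested <div> inside an item; enough for this sample.
            self.items.append(self._current)
            self._current = None


def extract_items(html):
    """Return a list with one {'title', 'link'} dict per div.item."""
    parser = ItemParser()
    parser.feed(html)
    return parser.items


if __name__ == "__main__":
    print(extract_items(RAW_HTML))
```

With the sample markup above, `extract_items` returns two records, which matches the shape of the expected result in the question. If a real extraction returns only one, it may be worth checking whether every item in the HTML actually matches the `baseSelector`.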
