🤖 PulsarRPA

🌟 Introduction

💖 PulsarRPA: The AI-Powered, Lightning-Fast Browser Automation Solution! 💖

✨ Key Capabilities:

🤖 AI Integration with LLMs – Smarter automation powered by large language models.
⚡ Ultra-Fast Automation – Coroutine-safe browser automation concurrency, spider-level crawling performance.
🧠 Web Understanding – Deep comprehension of dynamic web content.
📊 Data Extraction APIs – Powerful tools to extract structured data effortlessly.

Automate the browser and extract data at scale with simple text.

Go to https://www.amazon.com/dp/B0C1H26C46
After page load: scroll to the middle.

Summarize the product.
Extract: product name, price, ratings.
Find all links containing /dp/.

🎥 Demo Videos

🎬 YouTube:

📺 Bilibili: https://www.bilibili.com/video/BV1kM2rYrEFC

🚀 Quick Start Guide

▶️ Run PulsarRPA

📦 Run the Executable JAR — Best Experience

🧩 Download

# For Linux/macOS/Windows (with curl)
curl -L -o PulsarRPA.jar https://github.com/platonai/PulsarRPA/releases/download/v3.0.7/PulsarRPA.jar

🚀 Run

java -DEEPSEEK_API_KEY=${DEEPSEEK_API_KEY} -jar PulsarRPA.jar

🔍 Tip: Make sure DEEPSEEK_API_KEY is set in your environment, or AI features will not be available.

📂 Resources

▶ Run with IDE

Details

Open the project in your IDE
Run the ai.platon.pulsar.app.PulsarApplicationKt main class

🐳 Docker Users

Details

docker run -d -p 8182:8182 -e DEEPSEEK_API_KEY=${DEEPSEEK_API_KEY} galaxyeye88/pulsar-rpa:latest

🌟 For Beginners – Just Text, No Code!

Use the ai/command API to perform actions and extract data using natural language instructions.

📥 Example Request (Text-based):

curl -X POST "http://localhost:8182/api/ai/command" \
  -H "Content-Type: text/plain" \
  -d '
    Go to https://www.amazon.com/dp/B0C1H26C46
    After page load: click #title, then scroll to the middle.
    
    Summarize the product.
    Extract: product name, price, ratings.
    Find all links containing /dp/.
  '

📄 JSON-Based Version:

Details

curl -X POST "http://localhost:8182/api/ai/command" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://www.amazon.com/dp/B0C1H26C46",
    "pageSummaryPrompt": "Provide a brief introduction of this product.",
    "dataExtractionRules": "product name, price, and ratings",
    "linkExtractionRules": "all links containing `/dp/` on the page",
    "onPageReadyActions": ["click #title", "scroll to the middle"]
  }'

💡 Tip: You don't need to fill in every field — just what you need.

🎓 For Advanced Users — LLM + X-SQL: Precise, Flexible, Powerful

Harness the power of the x/e API for highly precise, flexible, and intelligent data extraction.

curl -X POST "http://localhost:8182/api/scrape/execute" -H "Content-Type: text/plain" -d "
select
  llm_extract(dom, 'product name, price, ratings') as llm_extracted_data,
  dom_base_uri(dom) as url,
  dom_first_text(dom, '#productTitle') as title,
  dom_first_slim_html(dom, 'img:expr(width > 400)') as img
from load_and_select('https://www.amazon.com/dp/B0C1H26C46', 'body');
"

The extracted data example:

{
  "llm_extracted_data": {
    "product name": "Apple iPhone 15 Pro Max",
    "price": "$1,199.00",
    "ratings": "4.5 out of 5 stars"
  },
  "url": "https://www.amazon.com/dp/B0C1H26C46",
  "title": "Apple iPhone 15 Pro Max",
  "img": "<img src=\"https://example.com/image.jpg\" />"
}

X-SQL Guide: X-SQL

👨‍💻 For Experts - Native API: Powerful!

🎮 Browser Control:

Details

val prompts = """
move cursor to the element with id 'title' and click it
scroll to middle
scroll to top
get the text of the element with id 'title'
"""

val eventHandlers = DefaultPageEventHandlers()
eventHandlers.browseEventHandlers.onDocumentActuallyReady.addLast { page, driver ->
    val result = session.instruct(prompts, driver)
}
session.open(url, eventHandlers)

📝 Example: View Kotlin Code

🤖 Complete Robotic Process Automation Capabilities:

Details

val options = session.options(args)
val event = options.eventHandlers.browseEventHandlers
event.onBrowserLaunched.addLast { page, driver ->
    warnUpBrowser(page, driver)
}
event.onWillFetch.addLast { page, driver ->
    waitForReferrer(page, driver)
    waitForPreviousPage(page, driver)
}
event.onWillCheckDocumentState.addLast { page, driver ->
    driver.waitForSelector("body h1[itemprop=name]")
    driver.click(".mask-layer-close-button")
}
session.load(url, options)

📝 Example: View Kotlin Code

🔍 Complex Data Extraction with X-SQL:

Details

select
    llm_extract(dom, 'product name, price, ratings, score') as llm_extracted_data,
    dom_first_text(dom, '#productTitle') as title,
    dom_first_text(dom, '#bylineInfo') as brand,
    dom_first_text(dom, '#price tr td:matches(^Price) ~ td') as price,
    dom_first_text(dom, '#acrCustomerReviewText') as ratings,
    str_first_float(dom_first_text(dom, '#reviewsMedley .AverageCustomerReviews span:contains(out of)'), 0.0) as score
from load_and_select('https://www.amazon.com/dp/B0C1H26C46  -i 1s -njr 3', 'body');

📚 Example Code:

📜 Documents

🔧 Proxies - Unblock Websites

Details

Set the environment variable PROXY_ROTATION_URL to the URL provided by your proxy service:

export PROXY_ROTATION_URL=https://your-proxy-provider.com/rotation-endpoint

Each time the rotation URL is accessed, it should return a response containing one or more fresh proxy IPs. Ask your proxy provider for such a URL.

✨ Features

🕷️ Web Spider

Scalable crawling
Browser rendering
AJAX data extraction

🤖 AI-Powered

Automatic field extraction
Pattern recognition
Accurate data capture

🧠 LLM Integration

Natural language web content analysis
Intuitive content description

🎯 Text-to-Action

Simple language commands
Intuitive browser control

🤖 RPA Capabilities

Human-like task automation
SPA crawling support
Advanced workflow automation

🛠️ Developer-Friendly

One-line data extraction
SQL-like query interface
Simple API integration

📊 X-SQL Power

Extended SQL for web data
Content mining capabilities
Web business intelligence

🛡️ Bot Protection

Advanced stealth techniques
IP rotation
Privacy context management

⚡ Performance

Parallel page rendering
High-efficiency processing
Block-resistant design

💰 Cost-Effective

100,000+ pages/day
Minimal hardware requirements
Resource-efficient operation

✅ Quality Assurance

Smart retry mechanisms
Precise scheduling
Complete lifecycle management

🌐 Scalability

Fully distributed architecture
Massive-scale capability
Enterprise-ready

📦 Storage Options

Local File System
MongoDB
HBase
Gora support

📊 Monitoring

Comprehensive logging
Detailed metrics
Full transparency

📞 Contact Us

💬 WeChat: galaxyeye
🌐 Weibo: galaxyeye
📧 Email: galaxyeye@live.cn, ivincent.zhang@gmail.com
🐦 Twitter: galaxyeye8
🌍 Website: platon.ai

Name		Name	Last commit message	Last commit date
Latest commit History 3,090 Commits
.github/workflows		.github/workflows
.mvn		.mvn
bin		bin
docker		docker
docs		docs
pulsar-all		pulsar-all
pulsar-app		pulsar-app
pulsar-bom		pulsar-bom
pulsar-client		pulsar-client
pulsar-common		pulsar-common
pulsar-dom		pulsar-dom
pulsar-persist		pulsar-persist
pulsar-plugins		pulsar-plugins
pulsar-python		pulsar-python
pulsar-ql-common		pulsar-ql-common
pulsar-ql		pulsar-ql
pulsar-resources		pulsar-resources
pulsar-rest		pulsar-rest
pulsar-skeleton		pulsar-skeleton
pulsar-spring-support		pulsar-spring-support
pulsar-tests		pulsar-tests
pulsar-third		pulsar-third
pulsar-tools		pulsar-tools
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
DEVPLAN.md		DEVPLAN.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README-CN.md		README-CN.md
README.md		README.md
VERSION		VERSION
application.properties		application.properties
cloc.sh		cloc.sh
docker-compose.yaml		docker-compose.yaml
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 PulsarRPA

🌟 Introduction

✨ Key Capabilities:

🎥 Demo Videos

🚀 Quick Start Guide

▶️ Run PulsarRPA

📦 Run the Executable JAR — Best Experience

🧩 Download

🚀 Run

▶ Run with IDE

🐳 Docker Users

🌟 For Beginners – Just Text, No Code!

📥 Example Request (Text-based):

📄 JSON-Based Version:

🎓 For Advanced Users — LLM + X-SQL: Precise, Flexible, Powerful

👨‍💻 For Experts - Native API: Powerful!

🎮 Browser Control:

🤖 Complete Robotic Process Automation Capabilities:

🔍 Complex Data Extraction with X-SQL:

📜 Documents

🔧 Proxies - Unblock Websites

✨ Features

📞 Contact Us

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🤖 PulsarRPA

🌟 Introduction

✨ Key Capabilities:

🎥 Demo Videos

🚀 Quick Start Guide

▶️ Run PulsarRPA

📦 Run the Executable JAR — Best Experience

🧩 Download

🚀 Run

▶ Run with IDE

🐳 Docker Users

🌟 For Beginners – Just Text, No Code!

📥 Example Request (Text-based):

📄 JSON-Based Version:

🎓 For Advanced Users — LLM + X-SQL: Precise, Flexible, Powerful

👨‍💻 For Experts - Native API: Powerful!

🎮 Browser Control:

🤖 Complete Robotic Process Automation Capabilities:

🔍 Complex Data Extraction with X-SQL:

📜 Documents

🔧 Proxies - Unblock Websites

✨ Features

📞 Contact Us

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages