Intelligent AI Agent Collection Based on NexAU Framework, Specializing in HTML Generation, Academic Poster Creation, Data Visualization, and More
NexHTML is an AI Agent development platform built on the Nexau Framework, integrating multiple specialized AI Agents designed to solve automation needs in real-world scenarios.
Professional HTML code generation and optimization AI system for creating high-quality frontend pages.
Core Features:
- πΌοΈ Smart Image Search - Integrated Unsplash API with automatic image search and annotation
- π¨ Visual Enhancement - AI-driven design suggestions and code optimization
- π Auto Annotation - VLM-powered descriptive text generation for images
- π Iterative Optimization - Multi-round conversational code improvement
Use Cases:
- Landing Page Rapid Prototyping
- Marketing Page Generation
- Course Materials and Presentation Pages
Automatically converts academic papers (PDF) into visually stunning academic posters in HTML.
Core Features:
- π PDF Parsing - MinerU-based conversion from PDF to structured Markdown with image extraction
- πΌοΈ Smart Image Annotation - VLM automatically generates titles and descriptions for paper figures
- ποΈ Logo Management - Auto-extract institutional information and match university/organization logos
- π± QR Code Integration - Extract arXiv links and generate access QR codes
- π Layout Optimization - AI-driven multi-column layout balancing and height detection
- π¨ Poster Rendering - Generate professional HTML academic posters with preview screenshots
Use Cases:
- Conference Poster Generation
- Academic Presentation Materials
- Research Showcase and Exhibition
End-to-end data visualization agent that transforms data topics into interactive dashboards with charts and insights.
Core Features:
- π Systematic Data Retrieval - Multi-source search with keyword expansion and cross-validation
- π§Ή Data Engineering - Structured extraction, cleaning, and quality assessment
- π Python Analysis - Stateful Python execution with pandas/numpy for data exploration
- π Interactive Dashboards - Plotly-based HTML dashboards with real-time CSV data loading
- π HTTP Service - Non-blocking local server for dashboard display
- π¦ Kaggle Integration - Direct dataset download from Kaggle platform
Use Cases:
- Trend Analysis and Reporting
- Dataset Exploration and Visualization
- Business Intelligence Dashboards
- Academic Data Analysis
Prompt: View Complete Prompt β
Prompt: View Complete PRD Document β
- Python: 3.13+
- System: macOS / Linux / Windows
- Tools: Git, uv (recommended) or pip
# Clone main project
git clone https://github.com/nex-agi/NexHTML.git
cd NexHTML
# Initialize submodules
git submodule update --init --recursive# One-command installation (installs everything including MinerU and Nexau)
uv pip install -e .# One-command installation
pip install -e .Copy and edit .env file:
cp .env.example .env
vim .envRequired Configuration:
# Core LLM Configuration
LLM_MODEL=your_model_name
LLM_BASE_URL=https://api.openai.com/v1
LLM_API_KEY=your_api_key_herePrerequisites:
Before launching WebDevAgent, you need to:
- Apply for Unsplash API Key - Visit Unsplash Developers to register and obtain your API key for image search functionality
- Configure VLM for Image Captioning - Set up a Vision Language Model to generate image descriptions
- Update
.envfile with your credentials:UNSPLASH_ACCESS_KEYS=your_unsplash_key IMAGE_CAPTIONER_MODEL=your_vlm_model IMAGE_CAPTIONER_BASE_URL=your_vlm_url IMAGE_CAPTIONER_API_KEY=your_vlm_key
Launch Command:
uv run python src/WebDevAgent/start.pyPrerequisites:
Before launching Paper2PosterAgent, you need to:
- Configure VLM (Vision Language Model) - Set up a Vision Language Model for image analysis and caption generation
- Update
.envfile with your VLM credentials:VLM_MODEL=your_vlm_model VLM_BASE_URL=your_vlm_url VLM_API_KEY=your_vlm_key
Launch Commands:
# First start MinerU service (PDF parsing)
# In another terminal:
uv run mineru-api
# Start Agent
uv run src/Paper2PosterAgent/start.pyPrerequisites:
Before launching DatavisSearchAgent, you need to:
- Apply for Serper API Key - Visit Serper.dev to register and obtain your API key for web search functionality
- Configure Kaggle API Credentials (Optional) - For downloading datasets from Kaggle
- Update
.envfile with your credentials:SERPER_API_KEY=your_serper_api_key # Optional: For Kaggle dataset downloads KAGGLE_USERNAME=your_kaggle_username KAGGLE_KEY=your_kaggle_api_key
Launch Command:
uv run python src/DatavisSearchAgent/start.pyTo enable observability and monitoring with Langfuse, add the following to your .env file:
LANGFUSE_SECRET_KEY=sk-lf-xxx
LANGFUSE_PUBLIC_KEY=pk-lf-xxx
LANGFUSE_HOST=http://your-langfuse-host:portSee Configuration Guide β for more configuration details.
For detailed information about each Agent (configuration, tools, workflows), please see Agent Overview Documentation β
We extend our gratitude to NexAU Framework, MinerU, Paper2Poster, Unsplash, Langfuse, and other projects for providing their codebases and service support.
β If this project helps you, please give it a Star! β



