Latest Documentation Generator

A documentation generator that crawls and processes API documentation from various sources, with dedicated support for HubSpot's API documentation.

Features

🔍 Intelligent crawling with Firecrawl technology
📚 Specialized support for HubSpot API documentation
🎯 Extracts endpoints, parameters, examples, and descriptions
📝 Generates clean, well-formatted Markdown output
🔄 Handles dynamic, JavaScript-rendered content
⚡ Efficient parallel processing of multiple documentation pages

Prerequisites

Node.js 16.x or later
npm 7.x or later

Installation

Clone the repository:

git clone https://github.com/[username]/latest-documentation.git
cd latest-documentation

Install dependencies:

npm install

Usage

Start the development server:

npm run dev

Access the documentation generator at http://localhost:3000

To generate documentation programmatically:

import { DocumentationProcessor } from './utils/documentationProcessor';

const processor = new DocumentationProcessor();

// Generate documentation for a specific URL
const markdown = await processor.processDocumentation('https://app.hubspot.com/developer-docs/...');

// Or generate documentation for all configured sources
const fullDocs = await processor.generateDocumentation();
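
When several pages need processing, a caller-side helper along these lines may be useful. This is only a sketch: `processAll`, `ProcessFn`, and `BatchResult` are hypothetical names that do not come from this project, and the stub function stands in for `processor.processDocumentation` so the snippet runs without a browser or network access. Whether the processor already parallelizes internally is not stated here.

```typescript
// Hypothetical helper: runs one async "process" call per URL concurrently and
// records failures per URL instead of rejecting the whole batch.
type ProcessFn = (url: string) => Promise<string>;

interface BatchResult {
  url: string;
  markdown?: string; // set when processing succeeded
  error?: string;    // set when processing failed
}

async function processAll(urls: string[], processOne: ProcessFn): Promise<BatchResult[]> {
  return Promise.all(
    urls.map(async (url): Promise<BatchResult> => {
      try {
        return { url, markdown: await processOne(url) };
      } catch (err) {
        return { url, error: err instanceof Error ? err.message : String(err) };
      }
    })
  );
}

// Stub standing in for processor.processDocumentation (assumption for the demo).
const stubProcess: ProcessFn = async (url) => {
  if (url.includes("missing")) throw new Error("page not found");
  return `# Docs for ${url}`;
};

processAll(["https://example.com/a", "https://example.com/missing"], stubProcess).then(
  (results) => {
    for (const r of results) {
      console.log(r.url, r.error ? `failed: ${r.error}` : "ok");
    }
  }
);
```

With the real class, the second argument would presumably be `(url) => processor.processDocumentation(url)`.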

Configuration

The documentation processor can be configured with various options:

const options = {
  waitForSelectors: ['[data-test-id="endpoint"]'], // Elements to wait for
  waitForTimeout: 2000, // Timeout in milliseconds
  headers: {            // Custom headers for requests
    'User-Agent': '...',
    'Accept': '...'
  },
  extractors: {         // Content extraction rules
    endpoints: {
      selector: '...',
      multiple: true,
      extract: {
        // Nested extraction rules
      }
    }
  }
};
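
For readers working in TypeScript, the options object above could be typed and merged with defaults roughly as follows. The interface names, the default values, and the `.endpoint` selector are assumptions made for illustration; the project's actual types may differ.

```typescript
// Hypothetical types mirroring the fields shown in the example above.
interface ExtractorRule {
  selector: string;
  multiple?: boolean;
  extract?: Record<string, ExtractorRule>; // nested extraction rules
}

interface ProcessorOptions {
  waitForSelectors: string[];
  waitForTimeout: number; // milliseconds
  headers: Record<string, string>;
  extractors: Record<string, ExtractorRule>;
}

// Assumed defaults; only the 2000 ms timeout appears in the example above.
const DEFAULT_OPTIONS: ProcessorOptions = {
  waitForSelectors: [],
  waitForTimeout: 2000,
  headers: {},
  extractors: {},
};

// Merge caller overrides onto the defaults. Headers and extractors are merged
// one level deep so that a partial override does not discard the rest.
function resolveOptions(overrides: Partial<ProcessorOptions> = {}): ProcessorOptions {
  return {
    ...DEFAULT_OPTIONS,
    ...overrides,
    headers: { ...DEFAULT_OPTIONS.headers, ...overrides.headers },
    extractors: { ...DEFAULT_OPTIONS.extractors, ...overrides.extractors },
  };
}

const resolved = resolveOptions({
  waitForSelectors: ['[data-test-id="endpoint"]'],
  extractors: { endpoints: { selector: ".endpoint", multiple: true } },
});
console.log(resolved.waitForTimeout, resolved.waitForSelectors);
```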

Architecture

The project uses a modular architecture with the following key components:

DocumentationProcessor: Main class for processing documentation
FirecrawlClient: Handles browser automation and content extraction
MarkdownGenerator: Converts extracted content to markdown format
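
One way to picture how the three components fit together is the sketch below. It does not reproduce the project's code: `PageExtractor`, `renderMarkdown`, `Pipeline`, and the data shapes are illustrative stand-ins for the roles of FirecrawlClient, MarkdownGenerator, and DocumentationProcessor, and the stub extractor replaces real browser automation.

```typescript
// Illustrative data shapes (assumptions, not the project's real types).
interface Endpoint {
  method: string;
  path: string;
  description: string;
}

interface ExtractedPage {
  title: string;
  endpoints: Endpoint[];
}

// Role of FirecrawlClient: turn a URL into structured content.
interface PageExtractor {
  extract(url: string): Promise<ExtractedPage>;
}

// Role of MarkdownGenerator: turn structured content into Markdown.
function renderMarkdown(page: ExtractedPage): string {
  const sections = page.endpoints.map(
    (e) => `## ${e.method} ${e.path}\n\n${e.description}`
  );
  return [`# ${page.title}`, ...sections].join("\n\n") + "\n";
}

// Role of DocumentationProcessor: wire extraction and rendering together.
class Pipeline {
  private readonly extractor: PageExtractor;

  constructor(extractor: PageExtractor) {
    this.extractor = extractor;
  }

  async process(url: string): Promise<string> {
    const page = await this.extractor.extract(url);
    return renderMarkdown(page);
  }
}

// Stub extractor so the sketch runs without a browser or network access.
const stubExtractor: PageExtractor = {
  async extract(url: string): Promise<ExtractedPage> {
    return {
      title: `Docs for ${url}`,
      endpoints: [{ method: "GET", path: "/items", description: "List items." }],
    };
  },
};

new Pipeline(stubExtractor)
  .process("https://example.com/docs")
  .then((markdown) => console.log(markdown));
```

Keeping the extractor behind an interface means the rendering step can be exercised with canned data, which is the main reason for the split in this sketch.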

Error Handling

The system provides detailed error information through the DocumentationError class:

FETCH_ERROR: Failed to fetch the documentation page
PARSE_ERROR: Failed to parse the content
NO_CONTENT: No content could be extracted
EXTRACTION_ERROR: Failed to extract specific content
PROCESSING_ERROR: General processing error
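
The codes above lend themselves to a small handling layer like the following sketch. The class here is a stand-in written for illustration: the README names `DocumentationError` and its codes but not its fields, so the `code` property, the constructor signature, and the retry policy are all assumptions.

```typescript
// The five codes listed above, as a string-literal union.
type DocumentationErrorCode =
  | "FETCH_ERROR"
  | "PARSE_ERROR"
  | "NO_CONTENT"
  | "EXTRACTION_ERROR"
  | "PROCESSING_ERROR";

// Stand-in for the project's DocumentationError (assumed shape).
class DocumentationError extends Error {
  readonly code: DocumentationErrorCode;

  constructor(code: DocumentationErrorCode, message: string) {
    super(message);
    this.name = "DocumentationError";
    this.code = code;
    // Keeps instanceof working when compiled to older JavaScript targets.
    Object.setPrototypeOf(this, DocumentationError.prototype);
  }
}

// Descriptions taken from the list above.
const CODE_DESCRIPTIONS: Record<DocumentationErrorCode, string> = {
  FETCH_ERROR: "Failed to fetch the documentation page",
  PARSE_ERROR: "Failed to parse the content",
  NO_CONTENT: "No content could be extracted",
  EXTRACTION_ERROR: "Failed to extract specific content",
  PROCESSING_ERROR: "General processing error",
};

// Hypothetical policy: only fetch failures are worth retrying.
function isRetryable(err: unknown): boolean {
  return err instanceof DocumentationError && err.code === "FETCH_ERROR";
}

function describeError(err: unknown): string {
  if (err instanceof DocumentationError) {
    return `${err.code}: ${CODE_DESCRIPTIONS[err.code]} (${err.message})`;
  }
  return `Unexpected error: ${String(err)}`;
}

try {
  throw new DocumentationError("NO_CONTENT", "page body was empty");
} catch (err) {
  console.log(describeError(err), "retry:", isRetryable(err));
}
```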

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

