Web fetch

Asset Crawler for common Web pages

Usage

As a Node.js package

import { savePage } from 'web-fetch';

savePage({
    source: 'http://URL.to/one/of/your/old/posts/'
});
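To archive several posts in one run, a thin wrapper can call the save function once per URL. The following is only a sketch: `saveAll`, `SaveFn`, and `SaveResult` are hypothetical names, and only the `savePage({ source })` call shape comes from the example above. Whether `savePage` returns a promise is not stated here, so the sketch uses `await`, which works either way.

```typescript
// Hypothetical type: mirrors the call shape savePage({ source }) shown above.
// The real return type of savePage is not documented in this ReadMe.
type SaveFn = (options: { source: string }) => unknown;

interface SaveResult {
    source: string;
    error?: unknown;
}

// Save each URL in turn, recording failures instead of stopping at the first one.
async function saveAll(urls: string[], save: SaveFn): Promise<SaveResult[]> {
    const results: SaveResult[] = [];

    for (const source of urls) {
        try {
            await save({ source });
            results.push({ source });
        } catch (error) {
            results.push({ source, error });
        }
    }
    return results;
}
```

With the package installed, this would presumably be called as `saveAll(urls, savePage)`. Passing the save function as a parameter keeps the helper testable without network access.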

Docker example

https://github.com/kaiyuanshe/KYS-service

As a Command Line tool

web-fetch http://URL.to/one/of/your/old/posts/

In GitHub Actions

name: Fetch Web pages
on:
    issues:
        types:
            - opened
jobs:
    Fetch-and-Save:
        runs-on: ubuntu-latest
        permissions:
            contents: write
            issues: write
        steps:
            - uses: actions/checkout@v3

            - uses: pnpm/action-setup@v2
              with:
                  version: 9
            - uses: actions/setup-node@v3
              with:
                  node-version: 18
                  cache: pnpm
            - uses: browser-actions/setup-chrome@v1

            - name: Setup Web-fetch
              run: pnpm i web-fetch -g

            - name: Fetch first URL in Issue Body
              run: web-fetch $(echo "${{ github.event.issue.body }}" | grep -Eo "https?://\S+" | head -n 1)

            - uses: stefanzweifel/git-auto-commit-action@v5
              with:
                  commit_message: '${{ github.event.issue.title }}'
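The URL extraction in the workflow relies on the pattern `https?://\S+`. As a rough illustration of what that pattern picks out of an issue body, here is the same match in TypeScript. `firstURL` and `URL_PATTERN` are hypothetical names used only for this sketch and are not part of web-fetch.

```typescript
// Same pattern as the workflow's grep: "http" or "https", then "://",
// then everything up to the next whitespace character.
const URL_PATTERN = /https?:\/\/\S+/;

// Return the first URL found in the text, or undefined if there is none.
function firstURL(text: string): string | undefined {
    return text.match(URL_PATTERN)?.[0];
}

console.log(
    firstURL('Please archive https://example.com/post/1 and http://example.org/2')
);
// prints "https://example.com/post/1"
```

Because the pattern stops only at whitespace, a URL written as a Markdown link, such as `[post](https://example.com/)`, would be captured with the trailing `)` attached. Pasting the bare URL into the issue body avoids that.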

Supported Structure

Web-fetch/source/config.ts

Lines 10 to 29 in d4c574a

export const body_tag = [
    'article',
    'main',
    '.article',
    '.content',
    '.post',
    '.blog',
    '.main',
    '.container',
    'body'
];

export const meta_tag = {
    title: ['h1', 'h2', '.title', 'title'].map(likeOf),
    authors: ['.author', '.publisher', '.creator', '.editor'].map(likeOf),
    date: ['.date', '.time', '.publish', '.create'].map(likeOf),
    updated: ['.update', '.edit', '.modif'].map(likeOf),
    categories: ['.breadcrumb', '.categor'].map(likeOf),
    tags: ['.tag', '.label'].map(likeOf)
};
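The lists above read like priority-ordered selectors, with `body` as a last resort, and partial names such as `.categor` and `.modif` suggest that `likeOf` does some kind of fuzzy class matching. The definition of `likeOf` is not shown in this excerpt, so the sketch below uses a stand-in that is purely an assumption, along with a hypothetical `firstMatch` helper, to illustrate how such a fallback could work.

```typescript
// ASSUMPTION: the real `likeOf` lives in source/config.ts and is not shown here.
// This stand-in turns ".categor" into a case-insensitive substring match on
// class names, so it would also match "category" or "post-categories".
const likeOf = (selector: string): string =>
    selector.startsWith('.') ? `[class*="${selector.slice(1)}" i]` : selector;

// Shortened copy of the list above, for illustration only.
const body_tag = ['article', 'main', '.article', '.content', 'body'];

// Return the first selector, in list order, that the page satisfies.
// `exists` is a hypothetical predicate, e.g. sel => !!document.querySelector(sel).
function firstMatch(
    selectors: string[],
    exists: (selector: string) => boolean
): string | undefined {
    return selectors.find(exists);
}

console.log(likeOf('.categor')); // prints [class*="categor" i]
console.log(likeOf('h1')); // prints h1

// A page that has a <main> element and a <body>, but no <article>:
const present = new Set(['main', 'body']);
console.log(firstMatch(body_tag, sel => present.has(sel))); // prints main
```

Taking the predicate as a parameter keeps the sketch independent of any particular renderer.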

Renderer

Puppeteer (default)
JSDOM

Wrapper

Hexo plugin



TechQuery/Web-fetch

Folders and files

Latest commit

History

Repository files navigation

Web fetch

Usage

As a Node.js package

Docker example

As a Command Line tool

In GitHub actions

Supported Structure

Renderer

Wrapper

About

Topics

Resources

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 2

Languages

Packages