
A tool that uses regular expressions to gather and save structured data from HTML or text retrieved over HTTP.

joeries/ZofX.HtmlCollector


Project Introduction

A tool that uses regular expressions to gather structured data from HTML or text retrieved over HTTP and save it to a database.

UI

Database Config

  • Database type: OleDb (Excel or Access), MsSql, MySql, PgSql, or SQLite;
  • Connection String: the connection string for the chosen database, in the format used by C#;
  • Table Name: the name of the data table.
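
As a rough illustration of how a table name and a unique key field might be used when saving gathered rows, here is a minimal Python sketch using the standard `sqlite3` module. It is not the tool's own code (the tool takes C# connection strings). The function `save_rows`, the `books` table, and the sample rows are hypothetical, and replacing a row that has the same key is an assumption about how duplicates could be handled.

```python
import sqlite3

def save_rows(conn, table, primary_key, rows):
    """Create the table if needed, then insert rows keyed on primary_key.

    Assumption: a row whose key already exists replaces the stored row.
    """
    if not rows:
        return 0
    columns = list(rows[0].keys())
    # Identifiers cannot be bound as SQL parameters, so quote them explicitly.
    quoted = ['"{}"'.format(c) for c in columns]
    column_defs = ", ".join(
        q + (" TEXT PRIMARY KEY" if c == primary_key else " TEXT")
        for c, q in zip(columns, quoted)
    )
    column_list = ", ".join(quoted)
    placeholders = ", ".join("?" for _ in columns)
    conn.execute('CREATE TABLE IF NOT EXISTS "{}" ({})'.format(table, column_defs))
    conn.executemany(
        'INSERT OR REPLACE INTO "{}" ({}) VALUES ({})'.format(
            table, column_list, placeholders
        ),
        [tuple(row[c] for c in columns) for row in rows],
    )
    conn.commit()
    return len(rows)

# Hypothetical usage with an in-memory database and made-up rows.
conn = sqlite3.connect(":memory:")
save_rows(conn, "books", "Id", [{"Id": "1", "Title": "Dune"},
                                {"Id": "2", "Title": "Emma"}])
# Saving a row with an existing key overwrites it instead of duplicating it.
save_rows(conn, "books", "Id", [{"Id": "2", "Title": "Emma (2nd ed.)"}])
stored = conn.execute('SELECT "Id", "Title" FROM "books" ORDER BY "Id"').fetchall()
```

Storing every field as text keeps the sketch independent of what the regexes capture; a real table would likely use more specific column types.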

Gathering Config

  • Website URL: the URL the application requests;
  • Detail Page Base URL: the URL of the page containing detail data, without dynamic parameters;
  • PageIndex Parameter Name: the name of the page index parameter;
  • PageIndex Parameter Position: whether the page index parameter is sent in GET or POST;
  • Data Index: the index of the first data row;
  • GET: fixed parameters in the URL;
  • POST: fixed parameters in the HTTP request body;
  • Primary Key: the field name that uniquely identifies each row;
  • Header Regex: the regex that fetches all field names;
  • Attachment Header: the additional field names, each written as source field index (from 0)-field name, separated by semicolons;
  • PageIndex Regex: the regex that fetches the maximum page number;
  • Data Regex: the regex that fetches the data list;
  • Attachment Regex: the regex that fetches the detail data.
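
The settings above could fit together roughly as in the following Python sketch. It is an illustration under stated assumptions, not the tool's implementation. The `config` keys, the example URL, the sample HTML, and all three regexes are hypothetical. Reading the Attachment Header as `index-name` pairs is one interpretation of the format described above. Python's `re` syntax also differs from .NET regex syntax in places, such as named groups.

```python
import re
from urllib.parse import urlencode

# Hypothetical settings mirroring the fields above; every value is made up.
config = {
    "website_url": "http://example.com/list",
    "page_index_name": "page",
    "get_params": {"category": "books"},
    "header_regex": r"<th>(.*?)</th>",
    "page_index_regex": r"page=(\d+)",
    "data_regex": r"<tr><td>(.*?)</td><td>(.*?)</td></tr>",
    "attachment_header": "0-Author;1-Publisher",
}

def build_page_url(cfg, page):
    """Append the fixed GET parameters and the page index to the website URL."""
    params = dict(cfg["get_params"])
    params[cfg["page_index_name"]] = page
    return cfg["website_url"] + "?" + urlencode(params)

def parse_attachment_header(spec):
    """Split 'index-name;index-name' into a list of (index, name) pairs."""
    pairs = []
    for item in spec.split(";"):
        if item.strip():
            index, name = item.split("-", 1)
            pairs.append((int(index), name.strip()))
    return pairs

def extract(cfg, html):
    """Apply the header, page index, and data regexes to one page of HTML."""
    headers = re.findall(cfg["header_regex"], html)
    pages = [int(p) for p in re.findall(cfg["page_index_regex"], html)]
    max_page = max(pages) if pages else 1
    # Pair each captured row with the field names found by the header regex.
    rows = [dict(zip(headers, row))
            for row in re.findall(cfg["data_regex"], html)]
    return headers, max_page, rows

# A made-up list page: a header row, two data rows, and pager links.
sample_html = (
    "<table><tr><th>Id</th><th>Title</th></tr>"
    "<tr><td>1</td><td>Dune</td></tr>"
    "<tr><td>2</td><td>Emma</td></tr></table>"
    '<a href="?page=1">1</a><a href="?page=3">3</a>'
)

headers, max_page, rows = extract(config, sample_html)
page_url = build_page_url(config, 2)
extra_fields = parse_attachment_header(config["attachment_header"])
```

In this sketch the largest number captured by the page index regex is treated as the last page, so a gathering loop would request `build_page_url(config, n)` for each page up to `max_page`.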
