Skip to content
master
Switch branches/tags
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
src
 
 
 
 
 
 
 
 
 
 
 
 
 
 

README.md

Strip Html Tags filter plugin for Embulk

This plugin strips HTML tags from values of specified columns.

Overview

  • Plugin type: filter

Configuration

  • columns: column names (array, default: [])

Example

This settings strips tags on column foo and bar, leaves other columns untouched.

in:
  type: file
  path_prefix: ./test.csv
  parser:
    type: csv
    charset: UTF-8
    delimiter: ","
    columns:
      - {name: foo, type: string}
      - {name: bar, type: string}
      - {name: baz, type: string}

filters:
  - type: strip_html_tags
    columns:
      - foo
      - bar

out:
  type: stdout

it converts a CSV record like this:

<a>foo</a>,<div>bar</div>,<p>baz</p>

into:

foo,bar,<p>baz</p>

Build

$ ./gradlew gem

About

embulk filter to strip HTML tags into plain texts

Resources

License

Packages

No packages published