saxophonist

Extract elements from large XML files with node.js streams

Usage

'use strict'

var fs = require('fs')
var p = require('path')
var saxophonist = require('./')
var count = 0

console.time('parsing time')

fs.createReadStream(p.join(__dirname, 'wikipedia', '1.xml'))
  .pipe(saxophonist('page'))
  .on('data', function () {
    count++
  })
  .on('end', function () {
    console.timeEnd('parsing time')
    console.log('read', count, 'pages')
  })

The data format is:

{
  path: ['a', 'path', 'to', 'page'], // the path in the XML document
  children: null, // or an array with elements like this
  attributes: {}, // object with all element attribute
  text: null // or string, containing the element text
}

Acknowledgements

saxophonist is sponsored by nearForm.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
test		test
.gitignore		.gitignore
.npmrc		.npmrc
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
example.js		example.js
package.json		package.json
saxophonist.js		saxophonist.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

saxophonist

Usage

Acknowledgements

License

About

Releases

Sponsor this project

Packages

Contributors 2

Languages

License

mcollina/saxophonist

Folders and files

Latest commit

History

Repository files navigation

saxophonist

Usage

Acknowledgements

License

About

Resources

License

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 2

Languages

Packages