Walk over nested Archives #164

alixaxel · 2019-04-28T09:47:00Z

What would you like to have changed?

I'd like to have the ability to Walk over io.ReadClosers and not just file paths.

Why is this feature a useful, necessary, and/or important addition to this project?

Although I understand this is somewhat of a niche need, it's conceivable that a ZIP file could contain more ZIP files nested within it. By providing a such a method, it would be possible to inspect the contents of the inner ZIP/Tar/Rar, something along the lines of:

err := archiver.Walk(path, func(file archiver.File) error {
  if (filepath.Ext(file.Name()) == ".zip") {
    err := archiver.WalkDeep(file.ReadCloser, func(file archiver.File) error {
      return nil
    })
  } else {
    // non-archive file
  }

  return nil
}

I think (but might be wrong) it's not possible to read just the inner ZIP EOCD and correctly map each entry to the correct byte offsets (due to the potential several layers of compression).

The text was updated successfully, but these errors were encountered:

torgabor · 2019-06-18T18:15:53Z

I think this is a great idea, not just for the nested reads you mentioned, but for any case where you have the archive not as a file on disk, but abstracted away as an io.Reader. The only difficulty with the implementation seems to be that right now the code uses the file extension to infer the type of file, so to implement this feature, a header-based format autodetection would need to be implemented.

mholt · 2022-01-02T08:41:02Z

I think #302, which will soon become v4 of this package, allows this because every file that you walk can be handled with an arbitrary callback function, and that function could be simply starting another walk. I rewrote the entire thing and got rid of the reliance on file extensions as well (except for optionally matching unknown files to formats, which can use extension or peeking the stream, or both). We can reopen this issue if it remains unresolved and needs more discussion.

alixaxel added the feature request label Apr 28, 2019

ferhatelmas mentioned this issue Nov 13, 2019

Prevent arbitrary file overwrite via path traversal [CVE-2019-10743] #169

Closed

mholt closed this as completed Jan 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Walk over nested Archives #164

Walk over nested Archives #164

alixaxel commented Apr 28, 2019

torgabor commented Jun 18, 2019

mholt commented Jan 2, 2022 •

edited

Loading

Walk over nested Archives #164

Walk over nested Archives #164

Comments

alixaxel commented Apr 28, 2019

What would you like to have changed?

Why is this feature a useful, necessary, and/or important addition to this project?

torgabor commented Jun 18, 2019

mholt commented Jan 2, 2022 • edited Loading

mholt commented Jan 2, 2022 •

edited

Loading