uvgrep

UniVersal Grep (uvgrep)

The uvgrep tool lets you grep text, PDF, Microsoft Office (docx, xlsx, pptx) and LibreOffice/OpenOffice (ODF) files simultaneously. It requires that bash (for getopts), grep, pdfgrep, sed, mktemp, xmllint, and unzip (for unpacking docx, xlsx, pptx, odt, odp and ods files) are installed.

Installation

Get the file uvgrep, make sure it is executable (chmod a+x uvgrep), and move it to a folder which is in the PATH variable, e. g. /usr/local/bin. Check that the required tools (see above) are installed.

Usage

uvgrep [options] [files]

Options

-i: ignore case

-n: output line numbers (text) or page numbers (PDF)

-x: remove XML tags from output lines

Limitations

The current and initial version of uvgrep relies solely on filename extensions and should be modified to use the output of the file program.

uvgrep cannot detect on which pages, tables or slides, of a LibreOffice document a search term was found; it cannot detect on which pages of a Microsoft Word document it was found. (However it will display information about slides or tables in Microsoft documents.)

uvgrep uses English or German error messages for the three problems that can occur (file not found, unsupported file type, invalid option). In order to add further languages, add a code block that checks LANG and sets the three message variables accordingly.

Example

[esser@quad:~]$ uvgrep -in libreoffice *.sh *.pdf *.odt *.???x
uvgrep.sh:5:# uvgrep: grep txt, PDF and LibreOffice files
uvgrep.pdf:1:   5 # uvgrep: grep txt, PDF and LibreOffice files
test.odt:<text:p text:style-name="Standard">This test file contains the word "LibreOffice".</text:p>
test2.pptx[/slide9.xml]:<a:t>This is not a LibreOffice but a Microsoft Office file.</a:t>
[esser@quad:~]$ uvgrep -inx libreoffice *.sh *.pdf *.odt *.???x
uvgrep.sh:5:# uvgrep: grep txt, PDF and LibreOffice files
uvgrep.pdf:1:   5 # uvgrep: grep txt, PDF and LibreOffice files
test.odt:This test file contains the word "LibreOffice".
test2.pptx[/slide9.xml]:This is not a LibreOffice but a Microsoft Office file.

Name Choice

uvgrep was meant to be named "ugrep" (universal grep), but a different project already uses that name, so I picked "uvgrep".

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
CHANGES		CHANGES
LICENSE.txt		LICENSE.txt
README.md		README.md
uvgrep		uvgrep

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

uvgrep

Installation

Usage

Options

Limitations

Example

Name Choice

Author and Copyright

About

Uh oh!

Releases 1

Packages

Languages

License

hgesser/uvgrep

Folders and files

Latest commit

History

Repository files navigation

uvgrep

Installation

Usage

Options

Limitations

Example

Name Choice

Author and Copyright

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages