Releases: philippe554/excelextract

ExcelExtract 1.0.0

01 Jul 15:16

Changes

  • Make the exports field optional in the config file
  • Add a proper logging system
  • Add a mode that runs without a config file, taking the xlsx file as a command-line argument
  • Allow writing the output to the console
  • Add an order field to the config file (allows ordering columns)
  • Add export to xlsx (to export both formats in JSON mode: "output": "file.csv|xlsx")
  • Add file tokens

ExcelExtract 0.6.0

11 May 12:27

Pre-release

Changes

  • Add automatic type detection
  • Add more types: integer, bool, time, date, datetime, and timedelta
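
As a rough illustration of what automatic type detection over these types can involve, the sketch below maps a Python cell value to one of the type names listed above. The helper name `detect_type` and the fallback names "float" and "string" are assumptions made for this example; this is not ExcelExtract's actual implementation.

```python
import datetime as dt


def detect_type(value):
    """Return a type name for a cell value (hypothetical helper, illustration only)."""
    # bool must be checked before int, because bool is a subclass of int.
    if isinstance(value, bool):
        return "bool"
    if isinstance(value, int):
        return "integer"
    if isinstance(value, float):
        return "float"  # assumed name, not taken from the release notes
    # datetime must be checked before date, because datetime is a subclass of date.
    if isinstance(value, dt.datetime):
        return "datetime"
    if isinstance(value, dt.date):
        return "date"
    if isinstance(value, dt.time):
        return "time"
    if isinstance(value, dt.timedelta):
        return "timedelta"
    return "string"  # assumed fallback name
```

The order of the checks matters: testing `int` before `bool`, or `date` before `datetime`, would misclassify the subclass values.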

ExcelExtract 0.5.0

09 May 15:46

Pre-release

Changes

  • Column names can contain tokens, generating dynamic columns
  • Columns are implicit tokens, and can be used for calculations
  • Intrarow property is added to loops
  • Basic table added
  • Missing end/count field for a loop results in looping to the end of the sheet
  • The command-line parameter specifying the config file can be a glob pattern

ExcelExtract 0.4.0

05 May 20:53

Pre-release

Changes

  • Add Excel style formulas using the Formulas package
  • Add test matrix using GitHub Actions over multiple Python versions
  • Properly select minimum version of dependencies

ExcelExtract 0.3.0

05 May 14:30

Pre-release

Changes

  • Add findcell, which searches for both a row and a column in a full sheet
  • Add unique property, which fails the export if more than one match is found
  • Make config file keys case insensitive
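
One common way to get case-insensitive keys in a JSON config is to lowercase every key once, right after loading. The sketch below shows that approach; the function name `normalize_keys` is hypothetical, and the release notes do not say how ExcelExtract implements this.

```python
import json


def normalize_keys(node):
    """Recursively lowercase all dictionary keys (hypothetical helper, illustration only)."""
    if isinstance(node, dict):
        return {str(key).lower(): normalize_keys(value) for key, value in node.items()}
    if isinstance(node, list):
        return [normalize_keys(item) for item in node]
    # Strings, numbers, booleans and None are returned unchanged.
    return node


config = normalize_keys(json.loads('{"Exports": [{"Output": "result.csv"}]}'))
```

Only keys are lowercased; values such as file names keep their original case.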

ExcelExtract 0.2.0

05 May 12:23

Pre-release

Welcome to the first functional release of ExcelExtract (0.2.0)! The previous version (0.1.0) was an empty package. This release introduces the core capabilities for extracting structured data from Excel files based on a JSON configuration.

Changes

  • Introduced Core Extraction Logic: The package now provides the fundamental ability to read data from .xlsx files based on a user-defined configuration.
  • JSON Configuration: Implemented support for defining extraction rules using a JSON configuration file (config.json).
  • Input File Selection: Added functionality to specify input Excel files using file paths with support for glob patterns (*, ?, **) for selecting multiple files.
  • Lookup Operations: Introduced various lookup operations (loopsheets, findRow, findColumn, loopRows, loopColumns) to dynamically locate data within workbooks and sheets.
  • Token System: Added support for defining and using tokens (e.g., %%ROW%%, %%SHEET_NAME%%) as placeholders for dynamic values found during lookups.
  • Cell Content Matching: Implemented the findRow and findColumn operations with a match property for finding cells based on specific text content (exact string or list of alternatives) and a select property for handling multiple matches.
  • Configurable Row Triggering: Developed a trigger system (defaulting to nonempty, with options for never and nonzero) on column definitions to control when a new row is created in the output CSV based on cell content.
  • Multi-File and Multi-Sheet Extraction: Enabled combining data from multiple sheets and multiple input files into a single output CSV.
  • Built-in %%FILE_NAME%% Token: Included a pre-defined token to easily reference the name of the currently processed input file.
  • CSV Output: Added the capability to output extracted data into standard, UTF-8 encoded CSV files.
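
To make the token idea concrete, the sketch below replaces `%%NAME%%` placeholders in a string with values collected during lookups. The function name `substitute_tokens`, the example templates, and the choice to raise an error on an undefined token are assumptions for illustration; they are not taken from ExcelExtract's code.

```python
import re

# Matches placeholders such as %%ROW%% or %%SHEET_NAME%%.
TOKEN_PATTERN = re.compile(r"%%([A-Z0-9_]+)%%")


def substitute_tokens(template, tokens):
    """Replace every %%NAME%% in template with tokens[NAME] (hypothetical helper)."""

    def replace(match):
        name = match.group(1)
        if name not in tokens:
            # Assumed behaviour: fail loudly rather than leave the placeholder in place.
            raise KeyError(f"undefined token: {name}")
        return str(tokens[name])

    return TOKEN_PATTERN.sub(replace, template)


tokens = {"ROW": 7, "SHEET_NAME": "Sheet1", "FILE_NAME": "report.xlsx"}
cell_reference = substitute_tokens("B%%ROW%%", tokens)
label = substitute_tokens("%%FILE_NAME%% / %%SHEET_NAME%%", tokens)
```

Here `cell_reference` becomes "B7" and `label` becomes "report.xlsx / Sheet1".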