The goal of this project is to provide a comprehensive toolkit for mapping various industry and classification codes to standardized economic utilities and purposes.
This project is currently in the development phase. All mappings are still being validated and may be subject to change. Everything is provided as is and no guarantees are made about the accuracy or completeness of the mappings.
###TODO:
- Income labels for retail need to be mapped upon Canberra Group Handbook income classifications.
- All labels should eventually be consistent and mappeable to SNA standards.
- Multi-organisational validation of the mappings, ideally through a standardised method.
This project provides mapping tables and tools to convert different types of industry and classification codes into standardized economic utilities as defined by international standards, including:
- System of National Accounts (SNA)
- Classification of Individual Consumption According to Purpose (COICOP)
- Canberra Group Handbook on Household Income Statistics
- And other international economic classification standards
The primary goals of this project are to:
- Standardize transaction classifications across different coding systems
- Enable consistent economic analysis and reporting
- Facilitate international comparability of economic data
- Support automated transaction categorization systems
- NACE (Statistical Classification of Economic Activities in the European Community)
- ISIC (International Standard Industrial Classification)
- NAICS (North American Industry Classification System)
- Custom merchant category codes (MCCs)
- SNA (System of National Accounts)
- COICOP (Classification of Individual Consumption According to Purpose)
- Canberra Group Handbook income classifications
The project is organized per classification system. In each folder, you will find the collected documentation and codes. If available, mappings from the literature are also provided.
- COICOP/: Contains the COICOP codes and documentation of the codes.
- Household Income statistics/: Contains the documentation of the household income classification system. TODO: income labels for retail need to be mapped upon this.
- MCC/: Contains the documentation of the merchant category codes, mappings from the literature, python code to extract and collect the MCC descriptions from multiple sources, as well as our own mapping table.
- NACE/: Contains the NACE codes and documentation of the codes, mappings from the literature, as well as our own mapping table.
- SNA/: Contains the SNA codes and documentation of the codes. TODO: All labels should eventually be consistent and mappeable to SNA standards.
If you want to run the python code that collects the MCC descriptions in the MCC/python-code folder, you need to install the dependencies. This has been made easy with the use of uv.
- Python 3.12 or higher
- uv for dependency management
# Clone the repository
git clone [repository-url]
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh
# Create virtual environment and install dependencies from pyproject.toml
uv sync[Add specific usage instructions once implemented]
Contributions are welcome! Please feel free to submit a Pull Request.
See LICENSE.md file.