Skip to content

TLP-COI/awesome-tlp

Repository files navigation

Awesome Technical Language Processing

Awesome Lint Awesome List

A curated list of awesome TLP Resources

The links and information below are provided as a convenience to the user community. Anyone who has a tool, technique, resource, or dataset that can be of benefit to the TLP COI is welcome to submit information and links to the webmaster for inclusion in this list. Any mention of computer hardware, software or services here does not constitute endorsement by NIST, nor does it indicate that the products are necessarily those best suited for the intended purpose.

Technical Language Processing (TLP) is a set of tools, techniques, and guidelines meant to tailor Natural Language Processing (NLP) tools to engineering (and other) expert-driven text-based data.

Contents

Legend: 📃 paper - 🖥️ software tool - 🗄️ dataset - 🏷️ model - 📘 standard - 🔌 library

What is TLP

TLP Support Tools

  • 🖥️ Nestor - Nestor Graphical User Interface (GUI) is a free toolkit that helps maintainers annotate their Maintenance Work Order (MWO) data through a process called "tagging".
  • 📃 Hybrid Datafication Paper - A paper describing the tagging methodology that is used in Nestor.
  • 🔌 Nestor GUI repository - The GitHub repository containing the open-source code for Nestor.
  • 🔌 Redcoat - A web-based annotation tool that supports collaborative hierarchical entity typing.
  • 🔌 MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources - MaintNet is a resource of technical language tools and data and includes tools such as technical language spellchecker, POS, etc.
  • 🗄️ MaintNet Datasets - The datasets in MaintNet spans maintenance records in aviation, automotive and facility industries.
  • 📃 MaintNet Paper - Paper that describes the MaintNet library.
  • 🖥️ Puggle - A Python package for working with the outputs of Information Extraction models and tools such as SPERT and QuickGraph. Also available on GitHub (link).
  • 🖥️ Mudlark - A Python package for automatically cleaning the short text present in maintenance work orders and strategies. Also available on GitHub (link).

TLP Datasets

TLP Learning Resources

TLP Benchmark Datasets

  • 🗄️📃 (and leaderboard)DesignQA - DesignQA is a benchmark for evaluating proficiency of multimodel LLMs (MLLMs) in comprehending and applying engineering requirements in technical documentation. Two (of the 6) benchmarks are also applicable to LLM's.
  • 🗄️ FMC-MWO2KG - FMC-MWO2KG (The MWO2KG Failure Mode Classification Dataset) comprises 502 observation and label pairs for training, 62 pairs for validation and 62 pairs for testing.

TLP Resources

Standards

  • 📘 ISO 15926-4:2019 - Reference data for recording information about process plants.
  • 📘 ISO 14224:2016 - Bases for the collection of reliability and maintenance (RM) data for equipment in oil and gas industry.

Ontologies

TLP Research

Human Centric TLP Research

TLP Representations & Embeddings

Follow

  • TLP COI - The TLP COI will bring together interested participants to discuss ongoing and future directions for text analysis of technical data.
  • IOF Maintenance WG - Industrial Ontologies Foundry (IOF) maintenance management ontology Working Group (WG).

Contributing

Please follow the guidelines before contributing!

Contributors

Thanks goes to these contributors!

About

A curated list of awesome Technical Language Processing

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks