Preprocessing

This directory contains an example of data preprocessing using Beautiful Soup.

What is it?

  1. preprocess_openmp_faq.py: Python script that reads the OpenMP FAQ from an HTML file and writes the extracted question-answer pairs to standard output in JSONL format, one pair per line.
  2. openmp_faq.html: HTML file containing the OpenMP FAQ.
  3. openmp_faq.jsonl: JSONL file containing the OpenMP FAQ in a structured format.
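
To illustrate the "one question-answer pair per line" layout described above, here is a minimal sketch of writing and reading JSONL with the Python standard library. The field names `question` and `answer`, the helper names `write_jsonl` and `read_jsonl`, and the sample pairs are assumptions made for illustration. They are not taken from preprocess_openmp_faq.py or openmp_faq.jsonl, and the real script may structure its records differently.

```python
import io
import json


def write_jsonl(pairs, out):
    """Write each (question, answer) pair as one JSON object per line."""
    for question, answer in pairs:
        # Field names are assumed for illustration only.
        record = {"question": question, "answer": answer}
        # json.dumps escapes embedded newlines, so a record never spans lines.
        out.write(json.dumps(record) + "\n")


def read_jsonl(lines):
    """Parse an iterable of JSONL lines into a list of dicts, skipping blanks."""
    return [json.loads(line) for line in lines if line.strip()]


# Hypothetical sample data, not copied from the actual FAQ.
sample_pairs = [
    ("What is OpenMP?", "An API for shared-memory\nparallel programming."),
    ("Which languages does it support?", "C, C++, and Fortran."),
]

buffer = io.StringIO()
write_jsonl(sample_pairs, buffer)

# Read the records back, one line at a time.
buffer.seek(0)
records = read_jsonl(buffer)
```

Because every record occupies exactly one line, a file in this layout can be processed incrementally, line by line, without loading the whole document into memory.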