GitHub repo for the paper Semantic Science: Publication Beyond the PDF, presented at IEEE SoutheastCon 2024.
- /media contains an image
- article.html contains the unformatted, Pandoc-output HTML from the Markdown file
- article.md contains the source Markdown file
- article.pdf contains the Pandoc-output PDF from the Markdown file
- article_styled.html contains a Pandoc-output HTML with its CSS swapped for the style from this user
- article_annotated.html contains the GPT RDFa annotated article.html file
-
Write the Markdown file
-
Run the following to get an HTML
pandoc article.md -f markdown -t html -s -o article.html --metadata title="Semantic Science: Publication Beyond the PDF - An Example Scientific Article Written in Markdown"
- Run the following to get a PDF
pandoc article.md -f markdown -t pdf -s -o article.pdf
-
Give the content between the tags of the article.html to GPT-4, followed by the prompt
Annotate the given HTML with RDFa, and also annotate it with semantic HTML elements, such as <article>, <section>, <figure> and <figcaption>. Return to me the entire annotated HTML.
-
Paste the results back into the HTML file.