-
Notifications
You must be signed in to change notification settings - Fork 1
Expand file tree
/
Copy pathtalk.Rmd
More file actions
51 lines (35 loc) · 1.9 KB
/
talk.Rmd
File metadata and controls
51 lines (35 loc) · 1.9 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
title: "Getting started"
description: "Introduction to talk: HuggingFace transformers in R."
author: ""
opengraph:
image:
src: "http://r-text.org/articles/text_files/figure-html/unnamed-chunk-5-1.png"
twitter:
card: summary_large_image
creator: "@oscarkjell"
output: github_document #rmarkdown::html_vignette
vignette: >
%\VignetteIndexEntry{text}
%\VignetteEngine{knitr::rmarkdown}
%\VignetteEncoding{UTF-8}
---
<a rel="me" href="https://mastodon.online/@oscarkjell">Oscar Kjell</a>
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
evaluate = FALSE
```
The *talk*-package uses Hugging Face transformers language models, natural language processing and machine learning methods to analyse audio recordings of talk and numerical variables.
This Getting Started tutorial is going through some central *talk* functions.
In short the workflow includes to first transform audio recordings into text (speech-to-text) as well as embeddings (which can be used in downstream tasks, for example using textTrain functions in the text-package.
### talkEmbed(): mapping audio recordings to numbers using a HuggingFace language model
The `talkEmbed()` function transforms audio recording files to embeddings. The example data that will be used in this tutorial comes from participants that have described their harmony in life and satisfaction with life with a text response, 10 descriptive words or rating scales.
```{r setup, eval = evaluate, warning=FALSE, message=FALSE}
library(talk)
# work in progress
```
### Articles related to the talk-package
[WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher Learning]() *Rajath Rao, Adithya V Ganesan, Oscar Kjell, Jonah Luby, Akshay Raghavan, Scott M. Feltman, Whitney Ringwald, Ryan L. Boyd, Benjamin J. Luft, Camilo J. Ruggero, Neville Ryant, ROMAN KOTOV, H. Schwartz*