Stress Pattern Occurrence in English Words

Update May 6th, 2024

The stress pattern was based on The CMU Pronouncing Dictionary.

The cmudict module from the NLTK library was used to extract the stress pattern from the dataset.

The English words dataset was based on the SubtlexUS dataset.

Disclaimers

According to what is mentioned on the CMU Pronouncing Dictionary website, "Stress is difficult to get right and people disagree about it."

Visualizations

Instagram
Facebook

To Run the ETL Process

main.py

Execute the script

`stress_pattern_finder` Package

eng_stress_pattern_finder.py

Find the stress pattern of the English words with the given dataset

`stress_pattern_etl` Package

extract_and_transform_syllable_data.py

Extract data from SubtlexUS dataset
Transform data to find a syllable count and stress pattern of each English word
- Words that aren't in the dictionary will be filtered out

load_to_sqlite.py

Load data to SQLite database tables

Sources

The CMU Pronouncing Dictionary: http://www.speech.cs.cmu.edu/cgi-bin/cmudict
“SubtlexUS” dataset: http://www.lexique.org/?page_id=241

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
graph		graph
stress_pattern_finder		stress_pattern_finder
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SUBTLEXus74286wordstextversion.tsv		SUBTLEXus74286wordstextversion.tsv
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

graph

graph

stress_pattern_finder

stress_pattern_finder

tests

tests

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

SUBTLEXus74286wordstextversion.tsv

SUBTLEXus74286wordstextversion.tsv

main.py

main.py

requirements.txt

requirements.txt

Repository files navigation

Stress Pattern Occurrence in English Words

Disclaimers

Visualizations

To Run the ETL Process

`stress_pattern_finder` Package

`stress_pattern_etl` Package

Sources

About

Releases 5

Packages

Languages

License

sakan811/Stress-Pattern-Occurrence-in-English-Words

Folders and files

Latest commit

History

Repository files navigation

Stress Pattern Occurrence in English Words

Disclaimers

Visualizations

To Run the ETL Process

stress_pattern_finder Package

stress_pattern_etl Package

Sources

About

Topics

Resources

License

Stars

Watchers

Forks

Languages

`stress_pattern_finder` Package

`stress_pattern_etl` Package