RegulatoryComplexity

This is the code repository for the research project "Measuring Regulatory Complexity" by Jean-Edouard Colliard and Co-Pierre Georg. Use this code at your own risk. The code provides a simple dashboard that allows users to classify words in large regulatory texts (in our case the Dodd-Frank Act) in various categories, e.g. as operators or operands. This is useful when measuring the complexity of the regulatory text using the Halstead (1977) measures. The dashboard is still work in progress.

Source of raw data https://www.fdic.gov/regulations/laws/important/

Pdf to txt It will parse pdf documents located in 001_raw_data to txt. Input: 001_raw_data/pdf/.pdf Run the shell 100_code/shells/001_totxt.sh Output: 001_raw_data/txt/.txt

Pdf to xlm (maximum 100 pages per document) It will parse pdf documents located in 001_raw_data to xlm. Input: 001_raw_data/pdf/.pdf Run the shell 100_code/shells/001_toxlm.sh Output: 001_raw_data/xlm/.xlm

Clean data for Dodd-Frank Input:001_raw_data/txt/DODDFRANK.txt Run 100_code/python/001_clean_data

Halstead Measures.

Features of the text such as bullets, definitions, references... Input: 010_cleaned_data/DODDFRANK.txt Run 100_code/python/002_regex_op

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
001_raw_data		001_raw_data
010_cleaned_data		010_cleaned_data
020_auxiliary_data/Sections		020_auxiliary_data/Sections
050_results/DoddFrank		050_results/DoddFrank
100_code		100_code
200_analysis		200_analysis
300_experiments_code		300_experiments_code
310_experiments_extras		310_experiments_extras
700_complementary_exercises		700_complementary_exercises
710_tax		710_tax
Documentation		Documentation
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

001_raw_data

001_raw_data

010_cleaned_data

010_cleaned_data

020_auxiliary_data/Sections

020_auxiliary_data/Sections

050_results/DoddFrank

050_results/DoddFrank

100_code

100_code

200_analysis

200_analysis

300_experiments_code

300_experiments_code

310_experiments_extras

310_experiments_extras

700_complementary_exercises

700_complementary_exercises

710_tax

710_tax

Documentation

Documentation

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

RegulatoryComplexity

About

Releases

Packages

Contributors 3

Languages

License

cogeorg/RegulatoryComplexity

Folders and files

Latest commit

History

Repository files navigation

RegulatoryComplexity

About

Topics

Resources

License

Stars

Watchers

Forks

Languages