No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
archives
assets
classifiers
close-reading
conclusion
crowdsourcing
cyborg-readers
data-cleaning
introduction
issues
reading-at-scale
sentiment-analysis
styles
topic-modeling
.gitignore
GLOSSARY.md
README.md
SUMMARY.md
acknowledgements.md
archives.md
book.json
classifiers.md
close-reading.md
conclusion.md
contexts-and-claims.md
cover.jpg
crowdsourcing.md
cyborg-readers.md
data-cleaning.md
introduction.md
issues-in-digital-text-analysis.md
reading-at-scale.md
schedule.md
sentiment-analysis.md
topic-modeling.md

README.md

Note: We have moved platforms since releasing this book. To see the new and improved version of the book running off Jekyll, check it out here. Materials here may be slightly out of date.

Preface

(Note: We welcome feedback on this book! If you find an error, want clarification on a particular issue, or find deep problems with particular explanations, drop us a line on our GitHub issues page. We'll be grateful and list you in our acknowledgements!)

This workbook provides a brief introduction to digital text analysis through a series of three-part units. Each unit introduces students to a concept, a tool for or method of digital text analysis, and a series of exercises for practicing the new skills. In some cases, studies of particular projects are presented instead of tools in the third section of each unit.

The materials here are meant to form the basis for a digital text analysis course that does not require extensive training in programming and is written with student readers in mind. Originally developed for use in a course titled "Scandal, Crime, and Spectacle in the Nineteenth Century," this workbook draws from these course materials for its datasets and prompts. The book is intended to be modularized enough that it could be used in conjunction with other courses either in whole or in part, as all of its materials are openly available on GitHub. The tripartite structure of each chapter means that sections can be easily removed and replaced with different tools or content. In particular, we envision our course-specific exercises in the third section of each chapter to be removable. For more guidance on how to remix the work for your own ends, see Adapting This Book.

The book is best viewed online using either Chrome or Firefox. You can also download it to read as a PDF here.

Introduction to Text Analysis: A Coursebook by Brandon Walsh and Sarah Horowitz is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Creative Commons License