Skip to content
Switch branches/tags

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time


Code and resource documents for the General Knowledge Project

This is a collaborative research project exploring the relationship between generalized encyclopeic knowledge formation and the racialization of peoples, cultures, and practices. The project is seeking to develop a classifier capable of discovering racialized language from sources of 'universal knowledge.'"

In March of 2012, the Encyclopedia Britannica ceased printing paper editions of its handsomely bound reference books. The Encyclopedia Britannica, first published in Edinburgh, Scotland in 1768, remains the oldest English language encyclopedia in continuous production, but it will only be updated through its online offering in the following years. In the era of community based online encyclopedias like Wikipedia, now is an interesting time to reflect on the content of the complete print run of the Encyclopedia Britannica. This also represents an interesting moment to reflect on how past systems of defining general knowledge has worked to shape societal prejudices, beliefs, and assumptions.

At this early stage of our research, this project will use techniques related to large scale text analysis through Natural Language Processing (NLP) to chart and track the evolution of popular conceptions of race and racialization across all 15 editions of the encyclopedia released over its 244 year history. We will generate a project site to host our findings and store a detailed description of our methods. In addition to a wiki for community based discussion on these materials, our site will also host interactive visualizations that will allow users to explore this extremely large dataset alongside our analysis.


Code and resource documents for the General Knowledge Project




No releases published


No packages published