Code and resource documents for the General Knowledge Project
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.


Code and resource documents for the General Knowledge Project

This is a collaborative research project exploring the relationship between generalized encyclopeic knowledge formation and the racialization of peoples, cultures, and practices. The project is seeking to develop a classifier capable of discovering racialized language from sources of 'universal knowledge.'"

In March of 2012, the Encyclopedia Britannica ceased printing paper editions of its handsomely bound reference books. The Encyclopedia Britannica, first published in Edinburgh, Scotland in 1768, remains the oldest English language encyclopedia in continuous production, but it will only be updated through its online offering in the following years. In the era of community based online encyclopedias like Wikipedia, now is an interesting time to reflect on the content of the complete print run of the Encyclopedia Britannica. This also represents an interesting moment to reflect on how past systems of defining general knowledge has worked to shape societal prejudices, beliefs, and assumptions.

At this early stage of our research, this project will use techniques related to large scale text analysis through Natural Language Processing (NLP) to chart and track the evolution of popular conceptions of race and racialization across all 15 editions of the encyclopedia released over its 244 year history. We will generate a project site to host our findings and store a detailed description of our methods. In addition to a wiki for community based discussion on these materials, our site will also host interactive visualizations that will allow users to explore this extremely large dataset alongside our analysis.