Skip to content

Special Topics in AI: Artificial Intelligence as an Archival Science

Notifications You must be signed in to change notification settings

dasmiq/cs7180-sp2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 

Repository files navigation

Artificial Intelligence as an Archival Science

CS7180: Special Topics in AI: Spring 2024

Class meeting: Mondays and Thursdays, 11:45am – 1:25pm

Instructor: David Smith (Office hours: TBA; WVH 356 or via zoom)

Course Description

A common mode for understanding artificial intelligence systems, from popular fiction to textbooks in computer science, has been the metaphor of an agent that perceives, forms beliefs about, and intervenes in the world. In the past year, some scholars have instead framed large pretrained language and vision models as cultural technologies (Gopnik; Farrell and Shalizi, 2023). Other researchers have pointed out that the builders of large AI models must take on some curatorial tasks in order to be successful and should learn from archival practice (Jo and Gebru, 2020).

In this seminar, we will read and discuss papers addressing large language and vision models as tools to investigate human language, history, and culture; analyzing and auditing corpus creation for model training; and exploring and mitigating biases and gaps in the archives of the past. Students will take turns presenting and leading discussion of papers along with the relevant background material. All students will write short reviews of the papers we read and work on writing research papers on a topic of their choice.

Prerequisites

There are no official prerequisites; however, it is expected that students have some background either in NLP, computer vision, or other machine learning field, or in working computationally with large collections of text and images in the humanities or social sciences.

Syllabus

Each week, we will read about two papers on a common theme. The papers could be tied together by methodology—e.g., model or inference method—, by subject matter, or by media or archive type.

A list of the first few sets of papers is forthcoming. Further readings will be added by input from seminar participants. General topics include:

  • Computational models as archives
  • Archival documentation for models and datasets, “collections as data”
  • Text and Natural Language Processing
    • Literary and narrative archives
    • Documentary archives
  • Vision
    • OCR and textual archives: e.g., manuscripts, typewritten records, government archives
    • OCR and visual archives: e.g., text found on images, maps, photographs
    • Image recognition for journalistic and documentary collections
    • Image recognition for art archives
    • Action recognition and audiovisual archives
  • Sound
    • Speech recognition: oral history, radio archives
    • Sound classification: music and ambient sound
  • Generative Models: Abundance and Loss
    • Missing data
    • Bias, error, and inference
    • Text correction and restoration
    • Image inpainting and video generation
    • Narrative generation
    • Critical fabulation

Readings scheduled so far are as follows:

About

Special Topics in AI: Artificial Intelligence as an Archival Science

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published