Skip to content

Commit

Permalink
unrounded gb
Browse files Browse the repository at this point in the history
  • Loading branch information
kddubey committed Jun 23, 2024
1 parent b484f27 commit b670c80
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 10 deletions.
10 changes: 4 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -77,12 +77,10 @@ score is valid. Who is right, Alice or Bob?
## Usage

Reproduce the experiment results by running [`./experiment.sh`](./experiment.sh) on a T4
GPU. Batch sizes were set to safely avoid OOMs across the 135,000 pretraining and
GPU. Batch sizes were set to safely avoid OOMs across the many pretraining and
finetuning runs that will occur. But they were not set too low; GPU utilization hovers
from 50-80%. The experiment will take ~5 days to finish, as 81,000 models will be
trained and evaluated.

I ran experiments in parallel through [Google
from 50-80%. The experiment will take ~5 days to finish. I ran experiments in parallel
through [Google
Cloud](https://github.com/kddubey/pretrain-on-test/tree/main/cloud_scripts/gcp).

After finishing the experiment, move all the `run-*` directories to
Expand All @@ -95,7 +93,7 @@ To analyze the accuracy data, see [`analysis/`](./analysis/).
The full set of accuracy data used in the paper, including observation-level per-class
probability scores, can be downloaded at [this Google Drive
link](https://drive.google.com/file/d/1n7N4uTKgcUJZ7hjAbZYGTpGEPoYxQVGx/view?usp=sharing)
(~3.3 GB unzipped, just a bunch of CSVs).
(3.29 GB unzipped, just a bunch of CSVs).


<details>
Expand Down
4 changes: 0 additions & 4 deletions cloud.py
Original file line number Diff line number Diff line change
Expand Up @@ -77,10 +77,6 @@ class UploadGCP:
Upload to a GCP bucket.
"""

# Alternate design is to make this one function whose `bucket_name` argument can be
# partial'd out. Partial objects are opaque, and I'm not sure if re-creating the
# bucket is bad, so I decided to make it a class.

def __init__(self, bucket_name: str) -> None:
from google.cloud.storage import Client

Expand Down

0 comments on commit b670c80

Please sign in to comment.