update base code #1

avalanchesiqi · 2024-04-20T23:10:32Z

No description provided.

Initial Topic Models

…guide Topic model rating filter guide update

Topic seed token prefix adjustments and model updates

Guide updates for new thresholds at which writing ability is locked and unlocked.

This commit splits the monolithic scoring binary into two separate scoring binaries (that may still be run sequentially): 1. Prescoring: do expensive pre-computation to learn user and note parameters 2. [Final] Scoring: ingest prescoring outputs in order to save computation time, then run scoring like it is today. In this commit, the final result of scoring is the same. In the future though, this unlocks much work to simplify the final scorer.

…scorer Split scoring binary into separate prescoring and final scoring binaries

…udoraters Final scoring and prescoring each run about ~10mins faster now when run in parallel on one large CPU machine, due to sharing large dataframes in memory across multiple processes instead of re-reading them. Also, the core scorer itself now runs about ~10mins faster due to cleanup of unused pseudorater computations (uncertainty estimation)

…_memory_optimization Optimize scorer: used shared memory across processes, and streamline uncertainty estimation

Brad Miller and others added 15 commits March 22, 2024 15:05

topic modelilng

88bac22

Merge pull request twitter#209 from twitter/bradm/topic_models

17048d4

Initial Topic Models

Guide update for Topic Models (twitter#210)

82b704c

prefix adjustments and topic model updates

878cc3e

Topic model rating filter guide update

cd73aa9

Merge pull request twitter#213 from twitter/bradm/topic_model_update_…

c6cc6d5

…guide Topic model rating filter guide update

Merge pull request twitter#212 from twitter/bradm/topic_model_threshold

6a1aee6

Topic seed token prefix adjustments and model updates

Updated thresholds for writing lock/unlock (twitter#214)

25b4ed5

Guide updates for new thresholds at which writing ability is locked and unlocked.

Describe prescoring vs. scoring split in ranking-notes.md

6a8362b

Update note-ranking-code.md with a note about prescoring.

0436479

Update ranking-notes.md

d4e765a

Merge pull request twitter#216 from twitter/jbaxter/2024_04_11_split_…

b539096

…scorer Split scoring binary into separate prescoring and final scoring binaries

Merge pull request twitter#218 from twitter/jbaxter/2024_04_17_shared…

c445258

…_memory_optimization Optimize scorer: used shared memory across processes, and streamline uncertainty estimation

avalanchesiqi merged commit 1e86c32 into code_comments Apr 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update base code #1

update base code #1

avalanchesiqi commented Apr 20, 2024

update base code #1

update base code #1

Conversation

avalanchesiqi commented Apr 20, 2024