Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Post record language tagging, lexicon language format #1219

Merged
merged 10 commits into from
Jun 23, 2023
Merged

Conversation

devinivy
Copy link
Collaborator

  • Adds the language format to lexicon, validated as well-formed BCP 47 language tags.
  • Adds the langs field to the post record, an array containing up to three languages used within the contents of the post.

Copy link
Collaborator

@dholms dholms left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yup looks great!

should we start indexing supplied languages on posts now? that way we don't have to do any backfill for old posts?

we may like to use it as input to feeds

@pfrazee
Copy link
Collaborator

pfrazee commented Jun 22, 2023

Looks great and tests on my end work. Good to merge and publish a new api version

@devinivy devinivy merged commit 3da0324 into main Jun 23, 2023
@devinivy devinivy deleted the post-languages branch June 23, 2023 20:23
mloar pushed a commit to mloar/atproto that referenced this pull request Sep 26, 2023
…#1219)

* Add languages field to post record

* helper for parsing bcp47

* add language format to lexicon

* codegen for post record langs field

* re-export language parsing in api package

* tests and tidy for lexicon language format

* index post langs, in-progress

* update snapshots, fixes record-with-media embed issue

* index post langs on bsky appview

* don't bother indexing post langs in pds appview, tidy
mloar pushed a commit to mloar/atproto that referenced this pull request Nov 15, 2023
…#1219)

* Add languages field to post record

* helper for parsing bcp47

* add language format to lexicon

* codegen for post record langs field

* re-export language parsing in api package

* tests and tidy for lexicon language format

* index post langs, in-progress

* update snapshots, fixes record-with-media embed issue

* index post langs on bsky appview

* don't bother indexing post langs in pds appview, tidy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants