Remove fields `[author, title, thumbnail]` from `_blog.yml` #3151

mishig25 · 2025-10-28T10:33:03Z

These fields already exist in the frontmatter of each individual post, so keeping them in `_blog.yml` is redundant. Example from `ai-action-wh-2025.md`:

---
title: "AI Policy @🤗: Response to the White House AI Action Plan RFI"
thumbnail: /blog/assets/151_policy_ntia_rfc/us_policy_thumbnail.png
authors:
- user: yjernite
- user: evijit
- user: irenesolaiman
---

# AI Policy @🤗: Response to the White House AI Action Plan RFI

Python script that was used:

#!/usr/bin/env python3
"""
Script to remove 'title', 'author', and 'thumbnail' fields from _blog.yml entries
while preserving formatting and blank lines.
"""

import re


def remove_fields_from_blog_yml(file_path):
    """
    Remove 'title', 'author', and 'thumbnail' fields from all entries in the blog YAML file.
    Preserves blank lines and formatting.
    
    Args:
        file_path: Path to the blog YAML file
    """
    # Fields to remove
    fields_to_remove = ["title", "author", "thumbnail"]
    
    # Read the file
    print(f"Reading {file_path}...")
    with open(file_path, "r", encoding="utf-8") as f:
        lines = f.readlines()
    
    # Process lines
    print("Processing lines...")
    new_lines = []
    removed_counts = {field: 0 for field in fields_to_remove}
    
    for line in lines:
        # Check if this line contains one of the fields to remove
        should_remove = False
        for field in fields_to_remove:
            # Match patterns like "  title:" or "  author:" or "  thumbnail:"
            if re.match(rf'^\s*{field}:\s*', line):
                should_remove = True
                removed_counts[field] += 1
                break
        
        # Keep the line if it's not one we want to remove
        if not should_remove:
            new_lines.append(line)
    
    # Write back to the file
    print(f"Writing changes back to {file_path}...")
    with open(file_path, "w", encoding="utf-8") as f:
        f.writelines(new_lines)
    
    # Print summary
    print("✓ Done!")
    print("\nSummary:")
    for field, count in removed_counts.items():
        print(f"  - Removed '{field}' from {count} entries")


if __name__ == "__main__":
    import sys
    
    files = sys.argv[1:] if len(sys.argv) > 1 else ["_blog.yml"]
    
    for file_path in files:
        remove_fields_from_blog_yml(file_path)
        print()
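
To illustrate what the line filter above does, here is a minimal sketch that applies the same regex shape to an in-memory string instead of a file. The `strip_fields` helper and the sample entries (including the `local`, `date`, and `tags` keys) are hypothetical and only serve the illustration; they are not taken from the real `_blog.yml`.

```python
import re

FIELDS = ["title", "author", "thumbnail"]

# Same pattern shape as the script: optional indentation, field name, colon.
alternation = "|".join(re.escape(field) for field in FIELDS)
PATTERN = re.compile(rf"^\s*(?:{alternation}):\s*")


def strip_fields(text: str) -> str:
    """Drop every line whose key is one of FIELDS; keep all other lines as-is."""
    kept = [
        line
        for line in text.splitlines(keepends=True)
        if not PATTERN.match(line)
    ]
    return "".join(kept)


# Hypothetical sample entries, for illustration only.
sample = (
    "- local: example-post\n"
    '  title: "An example post"\n'
    "  author: someone\n"
    "  thumbnail: /blog/assets/example/thumbnail.png\n"
    "  date: January 1, 2025\n"
    "  tags:\n"
    "    - example\n"
    "\n"
    "- local: another-post\n"
    "  title: Another post\n"
    "  date: January 2, 2025\n"
)

print(strip_fields(sample))
```

Two properties of this approach are worth keeping in mind. The pattern requires the colon directly after the field name, so a key such as `authors:` is left alone. It also matches at any indentation and works one line at a time, so a nested key with the same name would be removed too, and a value that spans several lines would leave its continuation lines behind.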

1. Removed fields from `_blog.yml` files ✓

Removed `title`, `author`, and `thumbnail` from:
- `_blog.yml` (648 entries)
- `zh/_blog.yml` (218 entries)
- `fr/_blog.yml` (3 entries)

Preserved blank lines between entries for readability.

2. Updated validation script ✓

- Removed `title`, `author`, and `thumbnail` from the `_blog.yml` schema validation
- Added frontmatter validation for all blog post `.md` files
- Validates required fields: `title`, `thumbnail` (with extension check), and the `authors` array
- Handles both Windows (`\r\n`) and Unix (`\n`) line endings
- Skips special files such as `README.md`
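
The frontmatter checks listed above could look roughly like the following. This is a Python sketch under stated assumptions, not the code from `validate-yaml.ts`: the function name `validate_frontmatter`, the error messages, and the full extension list are hypothetical (the PR only confirms that `.webp` is accepted), and the parsing is a simple line-based check rather than a real YAML parser.

```python
import re

# Assumed extension list; the PR only confirms that .webp is accepted.
ALLOWED_EXTENSIONS = (".png", ".jpg", ".jpeg", ".gif", ".webp")
SKIPPED_FILES = {"README.md"}

# Frontmatter must start at the very first character of the file.
# "\r?\n" accepts both Windows and Unix line endings.
FRONTMATTER_RE = re.compile(
    r"\A---\r?\n(.*?)\r?\n---[ \t]*(?:\r?\n|\Z)", re.DOTALL
)


def validate_frontmatter(filename: str, content: str) -> list[str]:
    """Return a list of problems in one post's frontmatter (empty if valid)."""
    if filename in SKIPPED_FILES:
        return []

    match = FRONTMATTER_RE.match(content)
    if match is None:
        return ["missing frontmatter block"]

    lines = match.group(1).splitlines()  # splitlines() handles \r\n and \n

    # Collect top-level "key: value" pairs (non-indented lines only).
    fields = {}
    for line in lines:
        m = re.match(r"^(\w+):\s*(.*)$", line)
        if m:
            fields[m.group(1)] = m.group(2).strip().strip("\"'")

    errors = []
    if not fields.get("title"):
        errors.append("title is required")

    thumbnail = fields.get("thumbnail", "")
    if not thumbnail:
        errors.append("thumbnail is required")
    elif not thumbnail.lower().endswith(ALLOWED_EXTENSIONS):
        errors.append("thumbnail has an unsupported extension")

    # The authors array needs at least one "- user: <name>" item.
    author_items = [l for l in lines if re.match(r"^\s*-\s*user:\s*\S+", l)]
    if "authors" not in fields or not author_items:
        errors.append("authors must be a non-empty list")

    return errors


post = (
    "---\n"
    'title: "AI Policy @🤗: Response to the White House AI Action Plan RFI"\n'
    "thumbnail: /blog/assets/151_policy_ntia_rfc/us_policy_thumbnail.png\n"
    "authors:\n"
    "- user: yjernite\n"
    "---\n"
    "\n"
    "# AI Policy @🤗: Response to the White House AI Action Plan RFI\n"
)
print(validate_frontmatter("ai-action-wh-2025.md", post))
```

Because the pattern is anchored at the start of the file, a post whose frontmatter is preceded by whitespace is reported as having no frontmatter block, which is the kind of problem the "Removed leading whitespace" fixes in this PR address.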

3. Fixed markdown files ✓

Fixed 19 files in root directory:
- Added missing thumbnails (9 files)
- Added placeholder authors (6 files)
- Fixed invalid thumbnail extensions (2 files)
- Removed leading whitespace (3 files)
Fixed 2 files in zh/ directory:
- Added missing authors and thumbnails

julien-c

lgtm on principle

mishig25 marked this pull request as ready for review October 28, 2025 10:37

julien-c approved these changes Nov 3, 2025

mishig25 added 5 commits November 7, 2025 13:52

Remove fields [author, title, thumbnail] from _blog.yml

2040555

Remove author, title, and thumbnail fields from YAML validation…

515b4c6

… schema in `validate-yaml.ts`

fix upstream

9aac297

remove duplicate fields

77bce5a

Revert "remove duplicate fields"

7b20b58

This reverts commit 55f0366.

mishig25 force-pushed the remove_duplicate_fields branch from 09b5a57 to 7b20b58 Compare November 7, 2025 12:52

mishig25 added 5 commits November 7, 2025 13:53

run script

8b5df8f

more validation

68799ec

fixes

9627f92

fix

cdac5af

Update thumbnail validation to include .webp format in YAML schema

9c7644b

mishig25 merged commit b93ef3e into main Nov 7, 2025
1 check passed

mishig25 deleted the remove_duplicate_fields branch November 7, 2025 13:30
