I recently got familiar with the Apple ecosystem after receiving a new iPhone from my mom. (Thanks, mom!) I have had a MacBook Pro since college, and now I have an iPhone that works seamlessly alongside it. Watching Apple's WWDC 2024 was the natural next step to keep track of their newest offerings.

Goal: NLP/Sentiment Analysis on WWDC 2024 - what people liked, what were the most talked about releases, etc.

Performing this analysis will provide insight into how Apple is still in the game, given that many people have seen it as behind on the AI front.

In [1]:
!pip install praw duckdb jupysql bertopic groq --quiet

In [2]:
import os
from pprint import pprint

# APIs
import googleapiclient.discovery #YouTube
import googleapiclient.errors
import praw # Reddit
from praw.models import MoreComments
from kaggle_secrets import UserSecretsClient

# Data Manipulations
import re
import numpy as np 
import pandas as pd
pd.set_option('display.max_colwidth', None)
import json
from datetime import datetime 

import nltk
from nltk.tokenize import sent_tokenize
nltk.download('punkt')

import duckdb
%load_ext sql
conn = duckdb.connect()
%sql conn --alias duckdb

## Generate different embeddings
import tensorflow
import tensorflow_hub as hub

# Topic Modeling
from bertopic import BERTopic
from sentence_transformers import SentenceTransformer, util
from umap import UMAP
from hdbscan import HDBSCAN
from bertopic.vectorizers import ClassTfidfTransformer
from bertopic.representation import MaximalMarginalRelevance

from sklearn.feature_extraction.text import CountVectorizer

# Fast Language Model Inference
from groq import Groq

# Sentiment Analysis
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

[nltk_data] Downloading package punkt to /usr/share/nltk_data...
[nltk_data]   Package punkt is already up-to-date!


2024-06-28 22:53:57.971656: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2024-06-28 22:53:57.971810: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
2024-06-28 22:53:58.134772: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered


# Get the data (YouTube and Reddit)
If there is not enough data, augment data with synonyms

API Links:
* [YouTube](https://console.cloud.google.com/apis/credentials?project=festive-zoo-404500)
* [Reddit](https://ssl.reddit.com/prefs/apps/)

YouTube Videos:
1. https://www.youtube.com/watch?v=qkYoBNdcXBU
1. https://www.youtube.com/watch?v=p2dhZ3AoDDs


Reddit posts:
1. https://www.reddit.com/r/apple/comments/1dct23m/wwdc_2024_postevent_megathread/
1. https://www.reddit.com/r/iOSProgramming/comments/1dcmmsm/wwdc_2024_megathread/
1. https://www.reddit.com/r/apple/comments/1de4qkn/what_are_your_biggest_takeaways_from_wwdc_2024_so/

In [3]:
user_secrets = UserSecretsClient()

In [4]:
# YouTube credentials
api_service_name = "youtube"
api_version = "v3"
DEVELOPER_KEY = user_secrets.get_secret("youtube_apikey")

youtube = googleapiclient.discovery.build(
    api_service_name, api_version, developerKey=DEVELOPER_KEY)

In [5]:
def getcomments(video):
    request = youtube.commentThreads().list(
        part="snippet",
        videoId=video,
        maxResults=100
    )

    comments = []

    response = request.execute()

    for item in response['items']:
        comment = item['snippet']['topLevelComment']['snippet']
        comments.append([
            comment['authorDisplayName'],
            comment['publishedAt'],
            comment['likeCount'],
            comment['textOriginal'],
            comment['videoId']
        ])

    while True:
        # Stop once the API no longer returns a token for another page.
        nextPageToken = response.get('nextPageToken')
        if nextPageToken is None:
            break
        # Create a new request object with the next page token.
        nextRequest = youtube.commentThreads().list(part="snippet", videoId=video, maxResults=100, pageToken=nextPageToken)
        # Execute the next request.
        response = nextRequest.execute()
        # Get the comments from the next response.
        for item in response['items']:
            comment = item['snippet']['topLevelComment']['snippet']
            published_at = comment['publishedAt']
            comments.append([
                comment['authorDisplayName'],
                published_at,
                comment['likeCount'],
                comment['textOriginal'],
                comment['videoId']
            ])

    df0 = pd.DataFrame(comments, columns = ['author', 'published_at', 'like_count', 'text', 'video_id'])
    return df0
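
The paging logic in `getcomments` follows a common token-based pattern: request a page, collect its items, and repeat while the response carries a `nextPageToken`. Below is a minimal sketch of that pattern in isolation. `iterate_pages`, `fake_fetch`, and the fake page data are hypothetical names invented for illustration; they are not part of the YouTube client library.

```python
from typing import Callable, Iterator, Optional


def iterate_pages(fetch_page: Callable[[Optional[str]], dict]) -> Iterator[str]:
    """Yield every item from a token-paginated API.

    `fetch_page` is a hypothetical callable: it takes a page token (None for
    the first page) and returns a dict with an 'items' list and, when more
    pages exist, a 'nextPageToken' key.
    """
    token = None
    while True:
        response = fetch_page(token)
        yield from response.get("items", [])
        token = response.get("nextPageToken")
        if token is None:
            break


# Fake two-page API, used only to exercise the loop without network access.
_FAKE_PAGES = {
    None: {"items": ["a", "b"], "nextPageToken": "p2"},
    "p2": {"items": ["c"]},
}


def fake_fetch(token: Optional[str]) -> dict:
    return _FAKE_PAGES[token]


all_items = list(iterate_pages(fake_fetch))
print(all_items)  # ['a', 'b', 'c']
```

In the notebook, the role of `fetch_page` is played by the `youtube.commentThreads().list(...).execute()` call, and the item-parsing code would run on each yielded item.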

In [6]:
# Get all WWDC 2024 comments from YouTube

# vids = [MKBHD, The Verge]
vids = ["qkYoBNdcXBU", "p2dhZ3AoDDs"]

df0 = pd.DataFrame()

for v in vids:
    df1 = getcomments(v)
    df0 = pd.concat([df0, df1]) # Combine all df's together into one big dataframe

df0.sample(5)

Unnamed: 0,author,published_at,like_count,text,video_id
472,@thetpyoutuber7672,2024-06-10T19:25:41Z,10,Are all the freaking features just for iphone 15 pro?,p2dhZ3AoDDs
6012,@justgolfswing6385,2024-06-11T12:17:28Z,0,"@Mkbhbdqwabcdefgh, u think u know more than researchers. \nWhen things are designed better you should respect it. \nThen again, u have to stay in the middle. Subscribers, hey?",qkYoBNdcXBU
1436,@adamcheslyn694,2024-06-12T11:50:40Z,0,"You need to sync up with Elon Musk about Apple and AI. He is freaking out about this on X, and going to ban Apple phones from all of his facilities.",qkYoBNdcXBU
6126,@Shayzare,2024-06-11T12:04:58Z,1,Introducing Apple newest innovation. The Android OS fresh with AI.,qkYoBNdcXBU
1211,@matteouccellini9781,2024-06-12T15:31:34Z,0,"something that was honestly unclear but makes a huge difference: are some of the cool new features that actually use AI in the OS presentations before they talked about actual Apple AI still available on all devices that support the OS, although not of the newest models? For example: iPad calculator with handwritten math, automatic mail sorting, web pages summarisation on Mac ecc?",qkYoBNdcXBU


In [7]:
# Proper date formats
df0['published_at'] = pd.to_datetime(df0['published_at'], format='%Y-%m-%dT%H:%M:%SZ')
df0['published_at_date'] = df0['published_at'].dt.date
df0['published_at_month'] = df0['published_at'].dt.month

In [8]:
# Rename video id's to appropriate video maker
df0['video_id'] = df0['video_id'].map({'qkYoBNdcXBU' : 'MKBHD', 'p2dhZ3AoDDs' : 'The Verge'})
df0.rename(columns = {'video_id':'youtube_channel'}, inplace=True)
df0.sample(5)

Unnamed: 0,author,published_at,like_count,text,youtube_channel,published_at_date,published_at_month
380,@bloodops3893,2024-06-10 20:24:36,3,"So, google gemini lost the battle about apple 😂",The Verge,2024-06-10,6
9575,@BoxerboyTech,2024-06-11 07:31:47,7,The setup and background is very good 🔥🔥,MKBHD,2024-06-11,6
9637,@warisatilola,2024-06-11 07:29:59,1,I have been waiting for this 🤗🤗😊,MKBHD,2024-06-11,6
3947,@masoncu13,2024-06-11 16:36:49,1,that math thing was dope!!,MKBHD,2024-06-11,6
7526,@pimpel19,2024-06-11 09:30:58,0,"Thanks MKBHD! But one question that bothers me, what about all Apple devices before the A17 Pro Chip? How is Apple considering those regarding the new Apple Intelligence, is Siri gonna stay dumb? Do they adopt some features? I can’t imagine how this works. Me for example with the 14 Pro, having a neural engine on the A16 chip, wouldn’t be able to use those features anyway, because the neural engine is too slow to process on device..? I’d like to know how they plan on doing things in this regard, will they leave all users with older devices behind..",MKBHD,2024-06-11,6


In [9]:
df0['youtube_channel'].value_counts()

youtube_channel
MKBHD        9829
The Verge     483
Name: count, dtype: int64

In [10]:
df0['app'] = 'YouTube'

In [11]:
len(df0)

10312

In [12]:
# Reddit credentials
reddit = praw.Reddit(client_id=user_secrets.get_secret("reddit_client_id"),
                     client_secret=user_secrets.get_secret("reddit_client_secret"),
                     user_agent=user_secrets.get_secret("reddit_user_agent"))

In [13]:
# get all-level Reddit comments for all Reddit posts
def reddit_comments(url):

    all_level_comments = []

    submission = reddit.submission(url=url)

    submission.comments.replace_more(limit=None)
    for comment in submission.comments.list():
        # created_utc is a Unix timestamp in UTC; fromtimestamp() would use the local timezone
        published_at = pd.to_datetime(comment.created_utc, unit='s')
        all_level_comments.append({
            "author": str(comment.author) if comment.author else None,  # Redditor object -> username
            "published_at": published_at, 
            "published_at_date": published_at.date(),
            "published_at_month": published_at.month,
            "like_count": comment.score,
            "text": comment.body
        })

    all_comments_df = pd.DataFrame(all_level_comments)
    return all_comments_df

In [14]:
# Define function to get dataframe for all Reddit comments
def combo_dataframe(links):
    df = pd.DataFrame()
   
    for l in links:
        df2 = reddit_comments(l)
        df = pd.concat([df, df2]) # Combine all df's together into one big dataframe
    return df

In [15]:
# get WWDC 2024 posts' all-level Reddit comments

reddit_links = ['https://www.reddit.com/r/apple/comments/1dct23m/wwdc_2024_postevent_megathread/',
                 'https://www.reddit.com/r/iOSProgramming/comments/1dcmmsm/wwdc_2024_megathread/',
                 'https://www.reddit.com/r/apple/comments/1de4qkn/what_are_your_biggest_takeaways_from_wwdc_2024_so/'] 

reddit_df = combo_dataframe(reddit_links)

In [16]:
reddit_df.head()

Unnamed: 0,author,published_at,published_at_date,published_at_month,like_count,text
0,BCDragon3000,2024-06-10 18:45:13,2024-06-10,6,813,never forgetting the yay sound effect after the calculator app reveal 😭😭😭
1,Oulixonder,2024-06-10 18:45:23,2024-06-10,6,526,Every email and text in the future will just be AI talking to one another
2,hammerheadtiger,2024-06-10 18:44:49,2024-06-10,6,876,"Tim Cook is shouting from the roof, Craig and the exec team skydiving, Mike Rockwell in a Vision Pro, Apple video team budget at an all time high. Seat belts on folks, its a big one this year\n\nImpressions:\n\n**visionOS 2**\n\n- 2k native apps, 1.5m compatible apps\n- Photos: ML to turn 2D photos into 3D spatial photos, Shareplay\n- Quick access hand gesture menu\n- Mac Virtual Display supports different screen sizes including ultra wide which simulates two 4k monitors side by side\n- Travel mode adds train support\n- Cannon will sell a spatial lens for their cameras\n- Other - rearrange home screen, mouse support, new APIs, available in 8 more countries \n\n**iOS 18**\n\n- Home Screen\n\t- App icons and widgets can be placed anywhere\n\t- Adds ability to tint all apps by color, dark mode darkens app icons\n\t- Lock or hide apps\n- Control Center\n\t- Multiple pages including full screen home and music widgets\n\t- developers can build for control center too\n- Swap lock screen controls for other actions - finally!\n- Secure Bluetooth pairing for apps that looks like how AirPods pair\n- Messages: \n\t- Tapbacks support any emoji or sticker now\n\t- Scheduled messages\n\t- Text formatting\n\t- Text effects \n- iMessage and SMS via satellites - I feel like this would have been a huge feature any other year\n- Mail: \n\t- On device categorization\n\t- AI powered digest of emails\n- Photos App has been redesigned entirely with filters for screenshots, pinning, grouped people photos\n- Other\n\t- Maps - Topographic maps\n\t- Tap to Cash - pay each other by tapping phones\n\t- Better event tickets\n\t- Journal app adds features like stats and streaks\n\t- Game mode in iPhone\n\t- Reminders in calendar\n\t- RCS support launching\n\n**AirPods**\n\n- Nod yes or shake for no to respond to Siri\n- Voice isolation for Windy or loud environment \n\n**tvOS**\n\n- Insight in video to identify actors and music\n- Adjust 
voice in video to make them clear - now we can finally know what Michael Caine said in Interstellar\n- Supports 21:9 projectors for those of you with full theater set ups in this economy\n\n**watchOS 11**\n\n- Training Mode - measure intensity, duration, effort, training load\n- Fitness app - cards can be reorganized\n- Vitals App - check key metrics like heart rate and insights over time\n- Cycle Tracking app - supports pregnancy and gestational metrics - Now Apple can sell a million watches to anxious first time parents\n- Widget stack sorts itself based on context like weather changes and ongoing Uber rides\n- New watch faces created by an AI selecting your good photos and reframing them\n\n**iPad OS**\n\n- Redesigned apps with a more Vision Pro like animated tab bar \n- Shareplay - draw on or even remotely control other peoples screen - big day for \n- HOLD THE PRESSES - CALCULATOR APP IS HERE\n\t- Math Notes - handwrite expressions with variables with Apple Pencil and it will automatically solve them. supports graphs as well\n\t- Math in Notes app too\n\t- Now what will this subreddit complain about anymore?\n- Notes Smart Script \n\t- makes your unreadable handwriting look good. 
I know some people who need this badly\n\t- Spell check for handwriting\n\t- Automatically shifting words around as you write\n\n**macOS Sequoia**\n\n- Were not even halfway in and absolutely blowing through these platforms, hmmm I wonder what they are saving time for, what a mystery\n- Continuity\n\t- iPhone Mirroring - a lot of Android manufacturers have been doing this for a while now and its a very welcome addition here, sometimes you just want to quickly access a thin on your phone\n\t- iPhone notifications can go to mac now and automatically trigger iPhone Mirroring\n- Tiling and snapping to edges/corners - finally!\n- Presenter preview for screens sharing, background replacement\n- Passwords App - surfaces iCloud Keychain features - imo much needed as 1Password is one of my most used apps because it is surfaced\n- Safari\n\t- Highlights - identify and surface key info from a webpage\n\t- Reader - summarizes websites - still no mention of AI\n\t- Automatic picture in picture\n- Gaming\n\t- Metal 3\n\t- Game Porting Toolkit 2 with better support for Windows Games - we got a mention of MS Windows before AI\n\t- Coming to Mac: Frostpunk 2, Control, Assassins Creed Shadows, \n\n**Apple Intelligence**\n\n- ""AI for the rest of us"" - Craig\n- Writing tools available across the system\n\t- Proofreading\n\t- Write emails and notes\n\t- Summarization for emails as well as email snippets\n\t- Asks you questions to generate a response\n\t- Inbox summaries\n- Notification summaries, selective surface only important notifications to reduce interruptions\n- Can use your OS for you to do tasks like pulling up apps to play music and creating folders\n- Understands personal context from aggregated data across the system\n- GenMoji to create a custom emoji\n- Image Playground\n\t- Create images across the system including of people in iMessages\n\t- Makes it easier to create images based on traits and styles\n\t- Happens entirely on device\n\t- Image Wand can convert a rough 
sketch into a better image\n- Can create videos based on concepts and find photos from a long time period to tell a story\n- Record and transcribe audio in notes and phone\n- Privacy\n\t- On device processing on A17 and all M chips\n\t- Private Cloud Compute for large server based models\n\t- Data is never stored or shared with Apple\n\t- Verifiable software for independent researchers\n\n**Siri**\n\n- Better language understanding that understands corrections and context\n- Type to Siri\n- Siri can help with tech support for Apple products\n- On-screen awareness - can ask for things to be done based on info on the screen\n- Command Siri to take photos, take notes, or search for something across the system including actions in videos\n- Can do compound understanding like finding a drivers license number and typing it into a form for you or finding a persons flight number and finding its live status to determine if it works with existing lunch plans\n- Rest in peace standalone AI gadgets like Rabbit R1 and Humane Ai Pin, we hardly knew ye\n\n**ChatGPT 4o**\n- Siri can go ask ChatGPT for things like answer general knowledge questions and generate more complex images\n- Free and info not logged, chatGPT subscribers can access paid features\n- Other AI models will be added in the future - sure, throw Google a bone too I guess\n\nWhew! What a day, this was probably the most jam packed WWDC I have watched in a long time. They definitely went deeper on integration that I thought they would. Hats off to the developers working crazy hours to make the features announced a reality."
3,ConflictedRedbird186,2024-06-10 18:45:04,2024-06-10,6,133,I’m here to overreact. And also download a beta I have no business downloading.
4,BeefIsForDinner,2024-06-10 18:44:24,2024-06-10,6,96,Gimme the betaaaaaaaaaaaaa


In [17]:
len(reddit_df)

1344

In [18]:
reddit_df['app'] = 'Reddit'

In [19]:
# Before concatenating the YouTube and Reddit DataFrames, check that their columns match.
list(reddit_df)

['author',
 'published_at',
 'published_at_date',
 'published_at_month',
 'like_count',
 'text',
 'app']

In [20]:
# drop(..., inplace=True) returns None, so keep the returned copy instead
youtube_df = df0.drop(columns=['youtube_channel'])

In [21]:
list(youtube_df)

['author',
 'published_at',
 'like_count',
 'text',
 'published_at_date',
 'published_at_month',
 'app']
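
Comparing the two column lists by eye works for seven columns, but the same check can be done programmatically. The sketch below uses empty toy frames whose column names are copied from the two lists above; `column_mismatch` is a hypothetical helper, not something defined in this notebook.

```python
import pandas as pd


def column_mismatch(left: pd.DataFrame, right: pd.DataFrame) -> dict:
    """Return the columns present in one frame but not the other."""
    left_cols, right_cols = set(left.columns), set(right.columns)
    return {
        "only_in_left": sorted(left_cols - right_cols),
        "only_in_right": sorted(right_cols - left_cols),
    }


# Empty toy frames standing in for youtube_df and reddit_df.
yt = pd.DataFrame(columns=["author", "published_at", "like_count", "text",
                           "published_at_date", "published_at_month", "app"])
rd = pd.DataFrame(columns=["author", "published_at", "published_at_date",
                           "published_at_month", "like_count", "text", "app"])

mismatch = column_mismatch(yt, rd)
print(mismatch)  # {'only_in_left': [], 'only_in_right': []}
```

The column order differs between the two frames, which is fine here: `pd.concat` aligns on column names, not positions.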

In [22]:
# Combine YouTube df and Reddit df together
df = pd.concat([youtube_df, reddit_df])
len(df)

11656

In [23]:
df.head()

Unnamed: 0,author,published_at,like_count,text,published_at_date,published_at_month,app
0,@rmac02,2024-06-28 21:03:24,0,Bros gobbling on apples knob,2024-06-28,6,YouTube
1,@nignam,2024-06-28 16:20:26,0,Sellout,2024-06-28,6,YouTube
2,@powerplay862,2024-06-28 14:48:30,0,THE MATH NOTE IS PRETTY SICK!\nIT'S ALMOST LIKE CHEATING!!\nGOOD JOB APPLE,2024-06-28,6,YouTube
3,@sufferer21,2024-06-28 14:28:28,0,u still a apple propagandist dude i will never forget this bs from you...,2024-06-28,6,YouTube
4,@jacksmith-mu3ee,2024-06-28 14:20:27,0,Louis rossman destroyed you mate,2024-06-28,6,YouTube


# With only 12,000 samples, augment data using synonyms?
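
To make the synonym idea concrete, here is a minimal sketch of synonym-replacement augmentation. It is not run in this notebook. The `SYNONYMS` table is a tiny hand-written placeholder invented for illustration, and `augment_with_synonyms` is a hypothetical helper; a real run would need a proper lexical resource and a check that replacements do not flip the sentiment of a comment.

```python
import random

# Hypothetical, hand-written synonym table used only for illustration.
SYNONYMS = {
    "great": ["excellent", "fantastic"],
    "new": ["latest", "fresh"],
    "feature": ["capability", "function"],
}


def augment_with_synonyms(text: str, rng: random.Random, prob: float = 0.5) -> str:
    """Replace each known word with a random synonym with probability `prob`."""
    words = []
    for word in text.split():
        options = SYNONYMS.get(word.lower())
        if options and rng.random() < prob:
            words.append(rng.choice(options))
        else:
            words.append(word)
    return " ".join(words)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
original = "The new calculator feature is great"
augmented = augment_with_synonyms(original, rng, prob=1.0)
print(augmented)
```

With `prob=1.0` every word found in the table is swapped; lower values keep more of the original wording.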

# Perform BERTopic Modeling

In [24]:
# No text pre-processing needed
docs = df['text']

# Step 1 - Extract embeddings
embedding_model = SentenceTransformer("all-MiniLM-L6-v2")

# Step 2 - Reduce dimensionality
umap_model = UMAP(n_neighbors=15, n_components=5, min_dist=0.0, metric='cosine')

# Step 3 - Cluster reduced embeddings
hdbscan_model = HDBSCAN(min_cluster_size=15, metric='euclidean', cluster_selection_method='eom', prediction_data=True)

# Step 4 - Tokenize topics
vectorizer_model = CountVectorizer(stop_words="english")

# Step 5 - Create topic representation
ctfidf_model = ClassTfidfTransformer()

# All steps together

representation_model = MaximalMarginalRelevance(diversity=0.5)
topic_model = BERTopic(
  embedding_model=embedding_model,    # Step 1 - Extract embeddings
  umap_model=umap_model,              # Step 2 - Reduce dimensionality
  hdbscan_model=hdbscan_model,        # Step 3 - Cluster reduced embeddings
  vectorizer_model=vectorizer_model,  # Step 4 - Tokenize topics
  ctfidf_model=ctfidf_model,          # Step 5 - Extract topic words
  calculate_probabilities=True,
  verbose=True,
  representation_model=representation_model  # Diversify topic words (MMR)
)

topics, probs = topic_model.fit_transform(docs)
topic_model.get_topic_info()

2024-06-28 22:55:46,403 - BERTopic - Embedding - Transforming documents to embeddings.



2024-06-28 22:57:44,980 - BERTopic - Embedding - Completed ✓
2024-06-28 22:57:44,982 - BERTopic - Dimensionality - Fitting the dimensionality reduction algorithm
2024-06-28 22:58:32,874 - BERTopic - Dimensionality - Completed ✓
2024-06-28 22:58:32,877 - BERTopic - Cluster - Start clustering the reduced embeddings
2024-06-28 22:58:46,105 - BERTopic - Cluster - Completed ✓
2024-06-28 22:58:46,117 - BERTopic - Representation - Extracting topics from clusters using representation models.
2024-06-28 22:59:08,209 - BERTopic - Representation - Completed ✓


Unnamed: 0,Topic,Count,Name,Representation,Representative_Docs
0,-1,2944,-1_features_android_ai_ios,"[features, android, ai, ios, google, video, really, screen, icons, phones]","[My biggest Takeaway from all of this is that we’re about to enter a new era of iOS/iPadOS/macOS. We’re finally going to see bumps in ram and processing power and all of these hyper powerful Apple Silicon chips are finally going to be put to actual work. They’ve been impressive so far, But now we’re going to really see what they’re made of and I’m pretty excited for that.\n\nAs a former android diehard user, I’ve come to really appreciate that while Apple “takes a long time“ to implement things that are available on android, all those features are fairly irrelevant in the grand scheme of the stability of the devices.\n\nAnother take away I’m so excited that I ended up getting a 24 GB M3 MacBook Air, instead of the 8 GB model. I had a feeling it would shift this year. \n\nAside from all of that? I’m really excited for RCS messaging. I have a lot of android friends still and I really don’t want to use WhatsApp so we’ve all just been using SMS., I'm really impressed by the continuity features, I'm most of the way into the Apple ecosystem, I use a personal Mac, iPad, Apple TV, and even a work iPhone (plus everyone else in my life has an iPhone so...) \n\nHowever, I've used an Android as my personal phone for nearly 15 years now, and do really enjoy android. And since the majority of the day I'm on my work PC, I like that it has always been a pretty good middle man between the Mac and Windows world. \n\nBut hot damn, just the iPhone Mirroring feature alone makes me think I'm finally going to make the switch to the iPhone side. And there's ton of other really impressive stuff they announced. \nGG Apple., as always, the video i needed for a chill summary of what was actaully important about apple news...100% - looking forward to the iphone 16 launch with ai. 
it seemed like there was some ai stuff that was useless to most people, and others that were really creative and useful that i would have never thought of. i think wether ai gets to super intelligence or not, it will get better, and at least in the next few years we can expect devices that can connect to other devices and literaly answer emails quickly by voice etc. and make daily life way more efficient by many leaps. the fact apple talked about chat gpt and its connection with them is interesting. but considering, every single major tech company just randomly so happened to release ai software all within the same couple months (more or less) im sure we will be seeing alot more it. the biggest thing i predict is that companies will merge technology and we will live in a city, or state, or possibly world, that all runs off the same network and systems by the same company that uses all the same products etc. so no more apple vs samsung vs whatever. the only devices youll be able to get are the ones that exist which will eventually only be who ever creates the most cost affordable and best product. i understand company diversity creates sustainability i guess but though im sure there will be greed involved in any new societal change as such, i think it would be beneficial. around that time i think a universal basic income is likely also. but such ideas have to be slide in under the rug as not to upset a nation enough to cause issues. anyways im gonna go smoke agian.]"
1,0,996,0_calculator_ipad_handwriting_maths,"[calculator, ipad, handwriting, maths, note, students, high, pencil, homework, lol]","[The calculator 🤣🤣🤣🤣, Math notes🤯, You had me at calculator for iPad😂]"
2,1,536,1_siri_typing_voice_accessibility,"[siri, typing, voice, accessibility, chatgpt, settings, able, ask, new, button]","[AI as in Siri?, What have they done to Siri..., Siri at last.]"
3,2,314,2_ai_apple_tech_marketing,"[ai, apple, tech, marketing, news, naming, rebranding, thing, invented, means]","[AI = Artificial Intelligence ❌❌\nAI = Apple Intelligence 🤡🤡, AI = Artificial Intelligence ❌\nAI = Apple Intelligence ✅ \n\n😇, Others:AI= Artificial intelligence \nApple:AI=Apple intelligence]"
4,3,272,3_apple_catch_customers_tech,"[apple, catch, customers, tech, companies, stocks, innovative, finally, lost, say]","[Only apple can do..., well done Apple!, Apple ... what ?]"
...,...,...,...,...,...
120,119,16,119_homepods_siri_updated_holster,"[homepods, siri, updated, holster, iplus, magsafe, nail, standby, peripherals, pair]","[My #1 question after all of this... What about homepod? What about home? Will existing homepods be trash? Or will they be partially upgraded, will there be a new one for a better siri experience?, So how about homepods in combination with AI Siri? If I have a ‘smart’ Siri on my Mac and iPhone, will I still have dumb siri on my homepods? (And will my mac and iPhone laugh in her face?), what about siri on HomePods?]"
121,120,16,120_copying_samsung_telegram_claimed,"[copying, samsung, telegram, claimed, invented, saw, writing, products, microsoft, just]","[So apple have copied Samsung. Same old., Apple copied everything from Android😂😂😂😂😂, Almost everything is copied from Samsung. 😂]"
122,121,16,121_advantage_overrate_unlucky_iphone,"[advantage, overrate, unlucky, iphone, pins, generation, yk, latest, accessible, sync]","[I’m actually pretty excited about getting a new iPhone to take advantage of the new AI features and to finally dump my lighting cables., But then you find ways to make it as accessible as possible. I expect to at least get these features through cloud AI, you just don’t talk for 40 minutes about awesome new AI for the people, when the ”people” isn’t even everyone on the latest gen iPhone, Yes it is a big feature. And it's unlucky that it supports the last four generations of M chips, but only one generation of iPhone chip. Well it will actually be two by the time it's ready.\n\nBut we were happily living without it until now. Newer iPhones having it doesn't suddenly make your iPhone any worse than it currently is.\n\nAnd I think some people might overrate how often would they actually use it. Let's use our brains while we still can. We will be looking back at these ""good old times"" one day thinking how good it was when everything wasn't done by AI yet. It will just be boring when every message, email etc. will be generated with one click.]"
123,122,16,122_video_hurtful_weve_exclamation,"[video, hurtful, weve, exclamation, club, exited, verge, pride, seconds, morning]","[The only video we were waiting for, Been waiting for a video…, No lie the verge video was next to your video. I clicked theirs and exited because I saw that it was a similar length and immediately clicked yours.]"


* The first topic is -1 and contains the most records. This is the outliers topic and should typically be ignored during analysis.
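
Ignoring the outlier topic can be as simple as filtering the topic table before inspecting it. The sketch below uses a toy stand-in for `topic_model.get_topic_info()`, with counts and names copied from the first five rows of the table above; the variable names are illustrative only.

```python
import pandas as pd

# Toy stand-in for topic_model.get_topic_info(); values copied from the
# first five rows of the table above.
topic_info = pd.DataFrame({
    "Topic": [-1, 0, 1, 2, 3],
    "Count": [2944, 996, 536, 314, 272],
    "Name": [
        "-1_features_android_ai_ios",
        "0_calculator_ipad_handwriting_maths",
        "1_siri_typing_voice_accessibility",
        "2_ai_apple_tech_marketing",
        "3_apple_catch_customers_tech",
    ],
})

# Keep only the real topics; -1 is the label for unclustered documents.
real_topics = topic_info[topic_info["Topic"] != -1].reset_index(drop=True)
print(real_topics[["Topic", "Count"]])
```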

Let's see whether different embeddings produce clearer topics.
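
One rough way to compare runs is the share of comments that land in the outlier topic. This is only a sketch of one possible yardstick, not the notebook's method: a lower share does not by itself prove the topics are better. `outlier_share` is a hypothetical helper, and the numbers come from the all-MiniLM-L6-v2 run above (2,944 outliers out of 11,656 comments).

```python
def outlier_share(outlier_count: int, total_docs: int) -> float:
    """Fraction of documents assigned to the outlier topic (-1)."""
    if total_docs <= 0:
        raise ValueError("total_docs must be positive")
    return outlier_count / total_docs


# Counts from the all-MiniLM-L6-v2 run: 2,944 outliers out of 11,656 comments.
minilm_share = outlier_share(2944, 11656)
print(f"{minilm_share:.1%}")  # 25.3%
```

The same function can be applied to the `Count` of topic -1 from any other embedding run over the same comments.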


In [25]:
%%time

#load the universal sentence encoder model
use4 = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")

#generate embeddings
use4_embeddings = use4(df['text'])
use= np.array(use4_embeddings)

#create list from np arrays to store the embeddings in the dataframe
df['use4'] = use.tolist()

#pass the embeddings into BERTopic
topic_model.fit_transform(docs, use)

#get topic info
topic_model.get_topic_info()

2024-06-28 22:59:41,387 - BERTopic - Dimensionality - Fitting the dimensionality reduction algorithm
2024-06-28 22:59:55,266 - BERTopic - Dimensionality - Completed ✓
2024-06-28 22:59:55,269 - BERTopic - Cluster - Start clustering the reduced embeddings
2024-06-28 23:00:05,107 - BERTopic - Cluster - Completed ✓
2024-06-28 23:00:05,119 - BERTopic - Representation - Extracting topics from clusters using representation models.
2024-06-28 23:00:24,093 - BERTopic - Representation - Completed ✓


CPU times: user 2min 5s, sys: 26.8 s, total: 2min 32s
Wall time: 1min 15s


Unnamed: 0,Topic,Count,Name,Representation,Representative_Docs
0,-1,4849,-1_apple_ai_like_features,"[apple, ai, like, features, android, phone, ios, 15, users, make]","[So I can’t use my iPhone 15 with ai💀💀, as always, the video i needed for a chill summary of what was actaully important about apple news...100% - looking forward to the iphone 16 launch with ai. it seemed like there was some ai stuff that was useless to most people, and others that were really creative and useful that i would have never thought of. i think wether ai gets to super intelligence or not, it will get better, and at least in the next few years we can expect devices that can connect to other devices and literaly answer emails quickly by voice etc. and make daily life way more efficient by many leaps. the fact apple talked about chat gpt and its connection with them is interesting. but considering, every single major tech company just randomly so happened to release ai software all within the same couple months (more or less) im sure we will be seeing alot more it. the biggest thing i predict is that companies will merge technology and we will live in a city, or state, or possibly world, that all runs off the same network and systems by the same company that uses all the same products etc. so no more apple vs samsung vs whatever. the only devices youll be able to get are the ones that exist which will eventually only be who ever creates the most cost affordable and best product. i understand company diversity creates sustainability i guess but though im sure there will be greed involved in any new societal change as such, i think it would be beneficial. around that time i think a universal basic income is likely also. but such ideas have to be slide in under the rug as not to upset a nation enough to cause issues. anyways im gonna go smoke agian., I ditched iPhone a couple years ago to use Samsung because I use Windows PC's and don't like MacOS. But I still like to watch videos like these to keep up with what they're doing. 
Very cool what they are doing. I can't believe it took them until 2024 to allow people to customize icons on the home screen, though. Btw, I like how Marques reads his list off his Ultra phone. Samsung phones really are great. Can you multitask yet on an iPhone or is it still one app at a time kind of thing? Vision Pro looks really cool. I might be enticed to come back to Apple.]"
1,0,687,0_samsung_s24_ultra_notes,"[samsung, s24, ultra, notes, read, marques, mkbhd, updates, funny, ios]","[Reading notes of a Samsung.., reading notes about apple on samsung😂😂, As Marques is reading his notes of a Samsung Ultra. 😂]"
2,1,569,1_siri_type_accessibility_voice,"[siri, type, accessibility, voice, settings, chatgpt, carplay, bixby, button, homepods]","[How to get the ai Siri 😭, Siri at last., What have they done to Siri...]"
3,2,234,2_calculator_app_notes_pencil,"[calculator, app, notes, pencil, students, graphs, feature, ipads, robots, lol]","[iPad calculator app!!!, Calculator on iPad 😂❤, Calculator App to iPad. 😂😂]"
4,3,216,3_naming_ai_chatgpt_insult,"[naming, ai, chatgpt, insult, lol, clever, peak, rebranding, appled, brzzy]","[Apple Intelligence., Apple intelligence 😂😅, Apple Intelligence ✅]"
...,...,...,...,...,...
106,105,17,105_gpt_chat_integration_premium,"[gpt, chat, integration, premium, fuckload, device, cancel, generic, free, enhanced]","[So what will be the difference between my current paid chat gpt account versus the integration with apple? Would i need to keep both? Could i potentially cancel my plan with gpt? Would the capabilities on both be different (Siri AI vs gpt)?, so many questions, biggest one is if you pay for a premium chat gpt account do you get enhanced features? or more importantly is the version incorprated into ios just standard chat gpt 3,0? because if the iphone can use chat gpt 4.0 as standard whats the point in getting a premium account with open ai?, chat gpt 4 yes finally!!]"
107,106,17,106_photos_updates_features_mac,"[photos, updates, features, mac, apps, new, airpods, voice, announcements, productivity]","[üéØ Key points for quick navigation:\n\n00:02 *üì° Overview and AI Expectations*\n- WWDC 2024 keynote summary and initial impressions,\n- Apple‚Äôs approach to AI, termed Apple Intelligence,\n- Overview of the keynote structure.\n00:56 *üñ•Ô∏è Vision OS 2.0 Updates*\n- Introduction of Vision OS 2.0,\n- New wrist turn gesture for quick access to time and battery,\n- Immersive media features and improved Mac mirroring.\n02:18 *üì± iOS 18 Home Screen Customization*\n- New home screen customization features,\n- Theme engine for icon and widget color matching,\n- Potential for both aesthetic and unattractive setups.\n05:08 *üì≤ iOS 18 Additional Features*\n- Hidden apps and scheduled text messages,\n- Redesigned Control Center and various app updates,\n- Features like game mode and new categorization in Mail.\n06:03 *üéß Minor Updates for AirPods and Apple TV*\n- Voice isolation and spatial audio for AirPods,\n- New Apple TV features like real-time actor and song info,\n- Boosting voice clarity during shows.\n06:46 *üìù iPad OS 18 and Key New Feature*\n- Full home screen customization and control center updates,\n- SharePlay for remote control and a new calculator app,\n- Handwriting recognition for equations with Apple Pencil.\n09:31 *üíª Mac OS Sequoia*\n- Introduction of Mac OS Sequoia,\n- Features like automatic window snapping and new password app,\n- iPhone mirroring and continuity features.\n10:36 *ü§ñ Apple Intelligence Overview*\n- Rebranding of AI to Apple Intelligence,\n- New generative models and functionalities,\n- Supported devices and examples like writing tools and cleanup tools.\n14:51 *‚òÅÔ∏è On-Device vs. Cloud-Based AI*\n- Explanation of on-device AI vs. 
cloud-based models,\n- Apple‚Äôs partnership with OpenAI and privacy measures,\n- Secure and private integration of AI features.\n17:11 *üöÄ Conclusion and Future Prospects*\n- Reflections on WWDC announcements and Apple Intelligence,\n- Comparison with competitors and the potential impact,\n- Upcoming videos and content plans.\n\nyou're welcome\n\n-Made with HARPA AI, Tim Cook is shouting from the roof, Craig and the exec team skydiving, Mike Rockwell in a Vision Pro, Apple video team budget at an all time high. Seat belts on folks, its a big one this year\n\nImpressions:\n\n**visionOS 2**\n\n- 2k native apps, 1.5m compatible apps\n- Photos: ML to turn 2D photos into 3D spatial photos, Shareplay\n- Quick access hand gesture menu\n- Mac Virtual Display supports different screen sizes including ultra wide which simulates two 4k monitors side by side\n- Travel mode adds train support\n- Cannon will sell a spatial lens for their cameras\n- Other - rearrange home screen, mouse support, new APIs, available in 8 more countries \n\n**iOS 18**\n\n- Home Screen\n\t- App icons and widgets can be placed anywhere\n\t- Adds ability to tint all apps by color, dark mode darkens app icons\n\t- Lock or hide apps\n- Control Center\n\t- Multiple pages including full screen home and music widgets\n\t- developers can build for control center too\n- Swap lock screen controls for other actions - finally!\n- Secure Bluetooth pairing for apps that looks like how AirPods pair\n- Messages: \n\t- Tapbacks support any emoji or sticker now\n\t- Scheduled messages\n\t- Text formatting\n\t- Text effects \n- iMessage and SMS via satellites - I feel like this would have been a huge feature any other year\n- Mail: \n\t- On device categorization\n\t- AI powered digest of emails\n- Photos App has been redesigned entirely with filters for screenshots, pinning, grouped people photos\n- Other\n\t- Maps - Topographic maps\n\t- Tap to Cash - pay each other by tapping phones\n\t- Better event tickets\n\t- 
Journal app adds features like stats and streaks\n\t- Game mode in iPhone\n\t- Reminders in calendar\n\t- RCS support launching\n\n**AirPods**\n\n- Nod yes or shake for no to respond to Siri\n- Voice isolation for Windy or loud environment \n\n**tvOS**\n\n- Insight in video to identify actors and music\n- Adjust voice in video to make them clear - now we can finally know what Michael Caine said in Interstellar\n- Supports 21:9 projectors for those of you with full theater set ups in this economy\n\n**watchOS 11**\n\n- Training Mode - measure intensity, duration, effort, training load\n- Fitness app - cards can be reorganized\n- Vitals App - check key metrics like heart rate and insights over time\n- Cycle Tracking app - supports pregnancy and gestational metrics - Now Apple can sell a million watches to anxious first time parents\n- Widget stack sorts itself based on context like weather changes and ongoing Uber rides\n- New watch faces created by an AI selecting your good photos and reframing them\n\n**iPad OS**\n\n- Redesigned apps with a more Vision Pro like animated tab bar \n- Shareplay - draw on or even remotely control other peoples screen - big day for \n- HOLD THE PRESSES - CALCULATOR APP IS HERE\n\t- Math Notes - handwrite expressions with variables with Apple Pencil and it will automatically solve them. supports graphs as well\n\t- Math in Notes app too\n\t- Now what will this subreddit complain about anymore?\n- Notes Smart Script \n\t- makes your unreadable handwriting look good. 
I know some people who need this badly\n\t- Spell check for handwriting\n\t- Automatically shifting words around as you write\n\n**macOS Sequoia**\n\n- Were not even halfway in and absolutely blowing through these platforms, hmmm I wonder what they are saving time for, what a mystery\n- Continuity\n\t- iPhone Mirroring - a lot of Android manufacturers have been doing this for a while now and its a very welcome addition here, sometimes you just want to quickly access a thin on your phone\n\t- iPhone notifications can go to mac now and automatically trigger iPhone Mirroring\n- Tiling and snapping to edges/corners - finally!\n- Presenter preview for screens sharing, background replacement\n- Passwords App - surfaces iCloud Keychain features - imo much needed as 1Password is one of my most used apps because it is surfaced\n- Safari\n\t- Highlights - identify and surface key info from a webpage\n\t- Reader - summarizes websites - still no mention of AI\n\t- Automatic picture in picture\n- Gaming\n\t- Metal 3\n\t- Game Porting Toolkit 2 with better support for Windows Games - we got a mention of MS Windows before AI\n\t- Coming to Mac: Frostpunk 2, Control, Assassins Creed Shadows, \n\n**Apple Intelligence**\n\n- ""AI for the rest of us"" - Craig\n- Writing tools available across the system\n\t- Proofreading\n\t- Write emails and notes\n\t- Summarization for emails as well as email snippets\n\t- Asks you questions to generate a response\n\t- Inbox summaries\n- Notification summaries, selective surface only important notifications to reduce interruptions\n- Can use your OS for you to do tasks like pulling up apps to play music and creating folders\n- Understands personal context from aggregated data across the system\n- GenMoji to create a custom emoji\n- Image Playground\n\t- Create images across the system including of people in iMessages\n\t- Makes it easier to create images based on traits and styles\n\t- Happens entirely on device\n\t- Image Wand can convert a rough 
sketch into a better image\n- Can create videos based on concepts and find photos from a long time period to tell a story\n- Record and transcribe audio in notes and phone\n- Privacy\n\t- On device processing on A17 and all M chips\n\t- Private Cloud Compute for large server based models\n\t- Data is never stored or shared with Apple\n\t- Verifiable software for independent researchers\n\n**Siri**\n\n- Better language understanding that understands corrections and context\n- Type to Siri\n- Siri can help with tech support for Apple products\n- On-screen awareness - can ask for things to be done based on info on the screen\n- Command Siri to take photos, take notes, or search for something across the system including actions in videos\n- Can do compound understanding like finding a drivers license number and typing it into a form for you or finding a persons flight number and finding its live status to determine if it works with existing lunch plans\n- Rest in peace standalone AI gadgets like Rabbit R1 and Humane Ai Pin, we hardly knew ye\n\n**ChatGPT 4o**\n- Siri can go ask ChatGPT for things like answer general knowledge questions and generate more complex images\n- Free and info not logged, chatGPT subscribers can access paid features\n- Other AI models will be added in the future - sure, throw Google a bone too I guess\n\nWhew! What a day, this was probably the most jam packed WWDC I have watched in a long time. They definitely went deeper on integration that I thought they would. 
Hats off to the developers working crazy hours to make the features announced a reality., Summarized by AskTube\n\nWWDC Keynote Highlights üì±\n00:02 Apple's WWDC keynote was filled with exciting announcements, including a surprise introduction to AI in the second half.\n\n00:25 The author live-tweeted the event and summarizes all the key announcements and thoughts.\n\nVision OS 2.0 üìä\n00:47 Vision OS 2.0 brings new features like wrist-turn gestures to see time and battery percentage, and quick access to the control center.\n\n01:05 New Immersive Media Features: Turn old photos into spatial photos using advanced machine learning, and create spatial videos on non-iPhone cameras.\n\niOS Updates üì±\n02:29 iOS got a major update with customizable home screens, icon placement, and a new theme engine.\n\n02:54 Customize your home screen with different colors, and change icon and widget colors to match your wallpaper.\n\nOther Updates üìä\n03:37 Mac mirroring gets a higher resolution, and new tools are available for creating spatial and immersive videos.\n\n04:00 New features include scheduling text messages, text formatting, a game mode that minimizes background activity, and automatic categorization in the new mail app.\n\nAirPods and Apple TV Updates üéß\n06:05 AirPods got minor updates, including voice isolation during phone calls, spatial audio during games, and nodding or shaking for responding to prompts.\n\n06:25 Apple TV updates include showing actors' and characters' names on screen and boosting people's voices over the music.\n\niPad OS Updates üìä\n06:41 iPad OS allows remote control of someone's iPad with their permission, perfect for family tech support.\n\n07:03 The calculator app on iPad OS is impressive, letting you write equations in handwriting and getting real-time answers.\n\nMac OS Updates üìä\n09:40 Mac OS got a new version called Sequoia, featuring automatic window snapping, built-in backgrounds for webcam feeds, and a new password 
app.\n\nApple Intelligence AI ü§ñ\n10:40 Apple rebranded AI as Apple Intelligence, bringing new generative models and large language models to supported devices.\n\n11:01 Apple Intelligence features include writing tools, a cleanup tool in Photos, and generative emojis.\n\nHow Apple Handles Complex AI Tasks üíª\n15:10 Apple handles most AI tasks on-device, using Apple-built models for faster processing. However, for complex tasks, Apple has two alternative approaches.\n\n15:32 One approach uses a larger server-based model on Apple's Private Cloud compute, ensuring data is not stored or sent to Apple, but may have downsides like slower processing and internet dependence.\n\n15:54 The other approach asks the user for permission to use ChatGPT, allowing access to its capabilities and generating more realistic and varied images.\n\nPrivacy and Security of AI Features üîí\n16:18 Apple ensures privacy and security by not storing requests, obscuring IP addresses, and preventing OpenAI from forming a profile. All this is done without an account and for free.\n\nInsights on Apple's AI Announcements ü§î\n17:01 The author believes Apple's AI features are superior, with personalization and existing user info making them more effective.\n\n17:25 The AI announcements overshadow other features, marking a new age for Apple, and the author is excited to explore its capabilities and what it can bring to users.\n\nConclusion and Future Content üìπ\n18:06 The author will create more videos about Apple's AI features and encourages viewers to subscribe for updates.]"
108,107,17,107_ai_market_buzzword_feedback,"[ai, market, buzzword, feedback, launched, letters, branding, bs, conflated, pandemic]","[I always respected Apple for not calling everything AI, because absolutely NOTHING being marketing as ""Artificial Intelligence"" is actually intelligence. It's just a misleading, trendy marketing term for machine learning, large language models, and a few other standard technologies put together., I actually appreciate that Apple refuses to call this technology ""A.I."", because it definitively, unequivocably is *not* ""artificial intelligence"". I'm not saying the technology isn't interesting and potentially useful (if implemented ethically, which currently none of it is), but it's not ""A.I."" It shows a streak of integrity from Apple that they would not succumb to marketing hype to literally lie to people about the tech.\n\nI'm also impressed that they managed to find a marketing spin that allowed them to draw in all the folks who've fallen for the false terminology while still creating a phrase that doesn't, technically, call it ""artificial intelligence"".\n\nI genuinely wish Silicon Valley would just drop ""A.I."" from its spin machine and get another name for this tech, so it stops getting conflated with the actual science of developing true A.I., It is mildly pleasing that Apple seems to know the difference between artificial intelligence and machine ""learning"". However, being clever at marketing it uses its brand name, beginning with 'A' and links it the broad term ""intelligence"".\nThus, the craze for muddled ideas about what AI is, in strict definitions, can continue and Apple gets to claim a slice of the whole shebang with clever electronics seeming to be new when they are really extensions/evolution of machine ""learning"". \nThere is no dishonesty, merely more colorful and responsive fog and a significant step forward to the demise of x-86.]"
109,108,16,108____,"[, , , , , , , , , ]","[♥, 😅, ]"


In [26]:
topic_model.get_topic(0) # top topic

[('samsung', 0.0913764546566378),
 ('s24', 0.05135511138494995),
 ('ultra', 0.047984932163022534),
 ('notes', 0.03590359332701904),
 ('read', 0.029591736637538092),
 ('marques', 0.029525735677828693),
 ('mkbhd', 0.028744697942352293),
 ('updates', 0.019278260120341353),
 ('funny', 0.014830624944973949),
 ('ios', 0.012662653926299538)]
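
To read the output above: each pair is a word and its weight within the topic (BERTopic derives these weights from c-TF-IDF). The names in the table, such as `0_samsung_s24_ultra_notes`, look like the topic id joined with the four highest-weighted words. Below is a minimal sketch of that naming, assuming this is how the label is built; `make_label` is a hypothetical helper, not a BERTopic function, and the scores are copied from the output above.

```python
# Scores copied from the get_topic(0) output above (first five entries).
topic_words = [
    ("samsung", 0.0913764546566378),
    ("s24", 0.05135511138494995),
    ("ultra", 0.047984932163022534),
    ("notes", 0.03590359332701904),
    ("read", 0.029591736637538092),
]

def make_label(topic_id, words, top_n=4):
    """Hypothetical helper: join the topic id with its top_n highest-weighted words."""
    ranked = sorted(words, key=lambda pair: pair[1], reverse=True)
    return "_".join([str(topic_id)] + [word for word, _ in ranked[:top_n]])

label = make_label(0, topic_words)
print(label)
```

This reproduces the `Name` column for topic 0 in the table, which is a quick way to check which words drive a topic's label.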

These topics seem more interpretable due to better embeddings.

#### BERTopic Visualizations 
Reference: https://maartengr.github.io/BERTopic/getting_started/visualization/visualization.html#visualize-probablities-or-distribution

In [27]:
topic_model.visualize_topics()

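BERTopic labels the outlier topic -1. Per the table above, the biggest topic after it is topic 0 (687 comments, about Marques reading his notes off a Samsung S24 Ultra), followed by topic 1 (569 comments), which is about Siri. I used the slider to highlight the bubble. When you hover over a bubble, you see the words associated with the topic.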

In [28]:
topic_model.visualize_barchart(top_n_topics=8)

Some of the other large topics criticize Apple for being behind in development, including on the AI frontier. A huge win, though, was the showcase of the interactive/predictive calculator.

In [29]:
topic_model.visualize_heatmap()

* I am not sure what the benchmark is for good separation between topics.
* I only consider cells off the diagonal (a topic compared with itself always scores 1) and not associated with topic 0.
* Most topic pairs have a similarity score of 0.4 or below, which suggests that the better embeddings and the BERTopic model discovered reasonably distinct topics.
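
The "0.4 or below" reading can be turned into a number. The heatmap is, as far as I understand it, a cosine-similarity matrix between topic embeddings, so the sketch below computes the share of distinct topic pairs at or under a threshold. The 2-D vectors are made up for illustration; `share_below` is a hypothetical helper, and in the notebook the real topic embeddings would take the place of `toy_topics`.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def share_below(vectors, threshold=0.4):
    """Fraction of distinct topic pairs (off-diagonal cells) with similarity <= threshold."""
    scores = [
        cosine(vectors[i], vectors[j])
        for i in range(len(vectors))
        for j in range(i + 1, len(vectors))
    ]
    return sum(s <= threshold for s in scores) / len(scores)

# Toy 2-D "topic embeddings", made up for illustration only.
toy_topics = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fraction = share_below(toy_topics)
print(fraction)
```

Only pairs with `i < j` are counted, so the diagonal is skipped and each pair is counted once.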

# Perform Sentiment and Emotion Classification

In [30]:
# Using a Hugging Face sentiment classifier distilled from a zero-shot natural language inference (NLI) teacher model
# Info pg: https://huggingface.co/lxyuan/distilbert-base-multilingual-cased-sentiments-student

# Set up the inference pipeline using a model from the 🤗 Hub

sentiment_analysis = pipeline(model="lxyuan/distilbert-base-multilingual-cased-sentiments-student")

I need to handle long comments. I will use an LLM to summarize them down to the necessary number of sentences, and a tokenizer to determine the maximum number of sentences at which to cut off the summary.
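
The cut-off idea can be sketched as follows: walk through a comment sentence by sentence and keep whole sentences until the next one would exceed a token budget. This is a minimal sketch under stated assumptions, not the notebook's implementation. `count_tokens` is a whitespace stand-in for the real tokenizer, the regex split is a stand-in for `sent_tokenize`, and `leading_sentences_within` is a hypothetical helper name.

```python
import re

def count_tokens(text):
    """Stand-in token counter: whitespace-separated words (an assumption, not BERT's tokenizer)."""
    return len(text.split())

def leading_sentences_within(text, max_tokens, counter=count_tokens):
    """Keep whole sentences from the start of text until the next one would exceed max_tokens."""
    # Naive sentence split on ., ! or ? followed by whitespace (stand-in for nltk's sent_tokenize).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    kept, used = [], 0
    for sentence in sentences:
        cost = counter(sentence)
        if used + cost > max_tokens:
            break
        kept.append(sentence)
        used += cost
    return " ".join(kept)

sample = "the math note is pretty sick! it's almost like cheating!! good job apple"
trimmed = leading_sentences_within(sample, max_tokens=10)
print(trimmed)
```

Passing a real subword counter as `counter` would make the budget line up with the model's 512-token limit, because a subword tokenizer usually produces more tokens than there are words.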

In [31]:
# The tokenizer used below for counting tokens is uncased (it treats upper and lower case as the same),
# so normalize the text by lower-casing all letters
df['text'] = df['text'].str.lower()

In [32]:
# Make the comment text resemble the text the sentiment model was trained on:
# remove HTML tags and stray markup (emoji and slang expansion is left commented out below)
TAG_RE = re.compile(r'<.*?>')  # non-greedy match of anything that looks like an HTML tag

def cleanhtml(x):
    """Strip HTML tags, then flatten newlines to spaces and drop tabs."""
    cleantext = TAG_RE.sub('', x)
    return cleantext.replace('\n\n', ' ').replace('\n', ' ').replace('\t', '')

df['processed_text'] = df['text'].apply(cleanhtml)

# # put spaces in between emojis
# def space_emojis(s):
#     return ''.join((' '+c+' ') if c in emoji.UNICODE_EMOJI['en'] else c for c in s)

# df['processed_text'] = df.apply(lambda row: space_emojis(row['processed_text']), axis=1)

# # translating emoji's to English word/slang
# emo = Translator(exact_match_only=False, randomize=True)

# def emoji_trans(x): 
#     return emo.demojify(x)

# df['processed_text'] = df.apply(lambda row: emoji_trans(row['processed_text']), axis=1) # from spaced out emojis into emojis as English words/slang

# Examine a sample of comments to check that the cleaning went through
df.head(5)

Unnamed: 0,author,published_at,like_count,text,published_at_date,published_at_month,app,use4,processed_text
0,@rmac02,2024-06-28 21:03:24,0,bros gobbling on apples knob,2024-06-28,6,YouTube,"[0.05477922037243843, 0.05739085376262665, -0.057579558342695236, 0.03466566652059555, 0.059173405170440674, -0.06090722605586052, 0.013141455128788948, -0.05374596267938614, -0.06175786629319191, 0.04992125928401947, -0.06701109558343887, -0.053923316299915314, -0.05231587216258049, -0.019310804083943367, -0.09462444484233856, -0.05315953120589256, -0.006064518820494413, -0.05956262722611427, 0.03411610424518585, -0.07466941326856613, -0.04750526323914528, 0.06314964592456818, -0.029949961230158806, -0.056469257920980453, -0.06058188155293465, 0.08473767340183258, -0.011972262524068356, 0.050325170159339905, 0.007746669463813305, -0.0002209304948337376, -0.032340142875909805, -0.020155634731054306, -0.04089115560054779, -0.04084071144461632, -0.07144267857074738, 0.053870849311351776, -0.013872222974896431, 0.07515692710876465, -0.048982277512550354, 0.01804482378065586, 0.016927745193243027, -0.03408455103635788, 0.05538281053304672, -0.05294008180499077, -0.0659094974398613, 0.07467073202133179, 0.005771309603005648, -0.04508092626929283, -0.055480584502220154, -0.015562505461275578, 6.442357698688284e-05, -0.038493718951940536, 0.03730695694684982, -0.024685755372047424, -0.06456898152828217, -0.06593665480613708, 0.00540491659194231, 0.02285761572420597, 0.028475819155573845, 0.03391097113490105, 0.05469725653529167, 0.020981773734092712, -0.05814001336693764, 0.05057353526353836, -0.028311334550380707, 0.017940057441592216, 0.03160861134529114, -0.03107207827270031, 0.015672560781240463, -0.05345136299729347, -0.05766218155622482, 0.0014843306271359324, -0.03433616831898689, 0.03097318671643734, 0.03025241196155548, -0.06495372951030731, -0.0031826209742575884, 0.030640998855233192, -0.006702883169054985, -0.009099913761019707, 0.0381338968873024, 0.03157615661621094, 0.03686027601361275, -0.06495168060064316, 0.017992926761507988, 0.02654251642525196, 0.02002965286374092, 
-0.04300713539123535, 0.06323833018541336, -0.04885079711675644, 0.04157043620944023, -0.04514974728226662, 0.061278991401195526, 0.006096941418945789, 0.004772309213876724, -0.01325955055654049, -0.03772035986185074, 0.028910456225275993, -0.01382940448820591, 0.04684141278266907, ...]",bros gobbling on apples knob
1,@nignam,2024-06-28 16:20:26,0,sellout,2024-06-28,6,YouTube,"[0.04043683782219887, -0.062102310359478, 0.030066149309277534, 0.024012491106987, 0.0419841893017292, 0.032178182154893875, 0.03933561220765114, -0.04045035317540169, -0.000740361399948597, -0.07841067016124725, -0.06352648884057999, 0.07465454936027527, 0.006306588649749756, -0.06371334195137024, -0.02874443680047989, -0.042030975222587585, 0.0387619324028492, -0.03888664022088051, 0.024344077333807945, -0.024612504988908768, -0.05909457057714462, 0.02856578677892685, 0.03158183395862579, -0.030660105869174004, -0.03620532900094986, 0.06799277663230896, 0.02207118272781372, -0.03963794931769371, -0.0028048695530742407, 0.03869693726301193, 0.00813886895775795, -0.02876434102654457, 0.003967220429331064, 0.005340253468602896, 0.026797736063599586, 0.009630238637328148, 0.07118301093578339, -0.0035471287555992603, -0.049167487770318985, 0.0007122739916667342, 0.00839848443865776, 0.013733753934502602, 0.01954125612974167, -0.0750945657491684, 0.004887750837951899, 0.009187932126224041, -0.06003612279891968, 0.08184372633695602, -0.04337496683001518, 0.06912688910961151, 0.08141285181045532, -0.025697143748402596, 0.05083417892456055, 0.016065392643213272, -0.004814372397959232, -0.020158816128969193, -0.003619680181145668, 0.059899184852838516, 0.04908784106373787, 0.03904520347714424, 0.051913417875766754, -0.04419468343257904, -0.022200653329491615, -0.01652652584016323, -0.03392167016863823, -0.017891179770231247, 0.017054114490747452, 0.0021550932433456182, -0.0032344937790185213, 0.0339474231004715, -0.07973552495241165, -0.016104428097605705, -0.033158790320158005, -0.010757718235254288, -0.00916349422186613, -0.06124299019575119, -0.06177451089024544, 0.03903641179203987, 0.017293311655521393, -0.06074698641896248, -0.06614050269126892, 0.027284126728773117, -0.06088270992040634, 0.04599722474813461, -0.07630457729101181, 0.015295431017875671, 0.037164460867643356, -0.047454070299863815, 
0.010072089731693268, -0.07355521619319916, 0.002272245241329074, 0.049412939697504044, -0.019192064180970192, 0.017052778974175453, -0.06694582104682922, -0.055513545870780945, -0.062470439821481705, -0.03687966987490654, -0.029649846255779266, 0.06645850837230682, ...]",sellout
2,@powerplay862,2024-06-28 14:48:30,0,the math note is pretty sick!\nit's almost like cheating!!\ngood job apple,2024-06-28,6,YouTube,"[-0.04484812915325165, -0.08339715749025345, 0.028410248458385468, 0.02350887469947338, 0.019891755655407906, -0.07675128430128098, -0.04276067018508911, 0.016214335337281227, 0.04771573469042778, 0.03324521332979202, -0.05124563351273537, -0.08289879560470581, -0.08337297290563583, -0.016939319670200348, -0.10783673822879791, -0.027178674936294556, -0.0031233325134962797, -0.07559150457382202, -0.004530211444944143, -0.0873049795627594, -0.029898440465331078, -0.031007125973701477, -0.06296266615390778, 0.01691541261970997, -0.046601809561252594, 0.10261912643909454, -0.07777878642082214, 0.0030107228085398674, -0.05971807241439819, -0.027513397857546806, -0.05511348322033882, 0.026892488822340965, 0.002918386133387685, -0.011210213415324688, -0.03358110412955284, 0.0633760467171669, -0.07829649746417999, 0.01263357698917389, 0.05649981275200844, -0.0005581594887189567, 0.08254978060722351, 0.01887155883014202, -0.009742951020598412, 0.055481988936662674, 0.043272316455841064, 0.0036250390112400055, 0.029089979827404022, 0.028992608189582825, -0.008456387557089329, -0.06390393525362015, -0.008203987963497639, -0.0926123857498169, 0.03289870172739029, -0.02783939801156521, -0.06851879507303238, -0.01116293016821146, 0.010663305409252644, -0.010535347275435925, 0.03597768023610115, -0.011324161663651466, 0.0320604033768177, -0.028241926804184914, -0.024132058024406433, 0.042381785809993744, 0.02700016275048256, 0.0013019214384257793, -0.032187320291996, -0.04821299389004707, 0.040269751101732254, 0.013788800686597824, -0.04829113557934761, -0.024460766464471817, -0.03788619861006737, 0.07359050214290619, 0.06845364719629288, -0.100361667573452, 0.0366378016769886, -0.02680542878806591, 0.04399305209517479, 0.008447336032986641, 0.0644969493150711, 0.0012731784954667091, 0.014431880787014961, -0.031423840671777725, 
-0.06775195896625519, -0.061923880130052567, 0.04468032345175743, -0.060472521930933, -0.016677843406796455, 0.010662239044904709, 0.01482613105326891, 0.039219263941049576, 0.0015086608473211527, 0.05221906676888466, 0.025590790435671806, 0.07358177751302719, 0.01995600201189518, 0.00725716445595026, -0.024525752291083336, -0.06380206346511841, ...]",the math note is pretty sick! it's almost like cheating!! good job apple
3,@sufferer21,2024-06-28 14:28:28,0,u still a apple propagandist dude i will never forget this bs from you...,2024-06-28,6,YouTube,"[0.05666888505220413, 0.010458385571837425, -0.033825065940618515, -0.0067219543270766735, -0.01516518834978342, -0.04141934588551521, -0.05267658084630966, -0.005851018242537975, -0.0030651285778731108, 0.02550555393099785, -0.035765424370765686, -0.05095531418919563, -0.006403287872672081, -0.03502433001995087, -0.10303778201341629, -0.09512665867805481, -0.009217601269483566, -0.032310813665390015, 0.013986262492835522, -0.07801925390958786, -0.0587272085249424, -0.05497516691684723, 0.02942510135471821, 0.018861180171370506, -0.021144066005945206, 0.09561331570148468, 0.011882743798196316, -0.08729132264852524, 0.05493587255477905, -0.034596096724271774, -0.05147751420736313, -0.07142045348882675, 0.002075251191854477, 0.029187403619289398, -0.0012725176056846976, 0.03767875209450722, -0.02800467051565647, -0.009133628569543362, -0.004804712720215321, -0.003876924281939864, 0.08600848913192749, -0.0603964626789093, -0.037792399525642395, 0.06426748633384705, 0.040405161678791046, 0.01649405248463154, -0.10902475565671921, 0.042139675468206406, -0.05832066759467125, -0.015078152529895306, -0.032140590250492096, -0.0373297855257988, 0.055961839854717255, 0.038518693298101425, -0.07769394665956497, -0.03157364949584007, 0.05991525948047638, 0.050904739648103714, -0.012691210955381393, -0.015931904315948486, 0.04207828268408775, -0.004601995460689068, -0.019944461062550545, 0.017156554386019707, -0.0032405646052211523, 0.05003989860415459, -0.06756819039583206, -0.01767357811331749, 0.04108899086713791, -0.0032624301966279745, -0.033062487840652466, -0.004690384026616812, -0.0027620354667305946, -0.025275500491261482, 0.0418987013399601, -0.07120652496814728, -0.009716051630675793, 0.05177084356546402, -0.06764917820692062, -0.09619075059890747, 0.04137483239173889, 0.034443993121385574, 0.010458813048899174, -0.004793345928192139, 
0.020855017006397247, 0.04962741956114769, -0.031056342646479607, -0.05709494650363922, 0.0285166185349226, -0.031397104263305664, -0.0053161634132266045, -0.02676873654127121, 0.0895150825381279, 0.06786532700061798, -0.020296381786465645, 0.07596215605735779, 0.015962820500135422, 0.04965193569660187, 0.026264160871505737, 0.033387187868356705, ...]",u still a apple propagandist dude i will never forget this bs from you...
4,@jacksmith-mu3ee,2024-06-28 14:20:27,0,louis rossman destroyed you mate,2024-06-28,6,YouTube,"[0.020549798384308815, 0.0355159156024456, 0.016873575747013092, -0.008573861792683601, 0.04231791943311691, -0.019414560869336128, 0.01687905378639698, -0.00014053743507247418, -0.013180333189666271, 0.06381138414144516, -0.008507497608661652, -0.02672072872519493, 0.009557360783219337, -0.007814082317054272, 0.02942066453397274, -0.05242370441555977, -0.004958118312060833, -0.019180765375494957, -0.06764046102762222, -0.042346514761447906, -0.006155482959002256, -0.02123323641717434, 0.050891004502773285, 0.07698062807321548, -0.008480637334287167, 0.008382931351661682, 0.08787365257740021, 0.027966145426034927, -0.03228944540023804, -0.04416720196604729, 0.03351699933409691, -0.030592504888772964, -0.04249642789363861, 0.047148849815130234, 0.018946170806884766, -0.06083386391401291, 0.026951033622026443, -0.01074810791760683, -0.051041897386312485, -0.029206374660134315, 0.006735512521117926, -0.06066974997520447, -0.007557410281151533, -0.0424201600253582, -0.07438575476408005, 0.05175406113266945, -0.058469925075769424, 0.043441906571388245, 0.004223366267979145, -0.03042132593691349, 0.050565071403980255, -0.05296187102794647, -0.03434000909328461, 0.01839311420917511, -0.07409023493528366, 0.02765659987926483, 0.05005473643541336, -0.033215053379535675, 0.0027998671866953373, -0.00543176056817174, 0.09956444799900055, -0.0811036005616188, -0.09872432798147202, 0.07307009398937225, 0.07892110198736191, 0.0026390617713332176, -0.03928713500499725, -0.02382984198629856, 0.05972551181912422, -0.024074863642454147, 0.004046234767884016, 0.00725543824955821, 0.05154480040073395, -0.03743629902601242, 0.06690283864736557, 0.017501669004559517, 0.02907414361834526, 0.03398468345403671, -0.0030850523617118597, -0.03865328058600426, -0.036977656185626984, -0.003919669892638922, -0.057712115347385406, -0.002121160039678216, -0.04135245457291603, 0.033589959144592285, 
-0.048458345234394073, 0.06866581737995148, 0.04254477098584175, -0.013634305447340012, 0.07313109189271927, -0.018714558333158493, 0.04880201444029808, -0.07337287813425064, 0.004200661554932594, -0.02541457675397396, -0.029409093782305717, 0.0249997116625309, -0.03148101642727852, 0.02268814854323864, ...]",louis rossman destroyed you mate


In [33]:
# %%time
# Due to a transformer error:
# If you are running a transformer model with Hugging Face, there is a chance that one of the input sentences is longer than 512 tokens.
# Either truncate or split your sentences. I suspect the shorter sentences are padded to 512 tokens.

# 1. Get the token count per comment; cap at about 500 words per comment, since the tokenizer can split one word into several subword tokens
# 2. Rather than eliminating comments longer than the max (which loses richness in the dataset), SUMMARIZE them using Groq

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
# sentence = "Sentence to check whether it exceeds 512 tokens"

# tokenized_sentence = tokenizer.tokenize(sentence)

# print(len(sentence.split()))  # here is the default length of the sentence
# print(len(tokenized_sentence)) # here is the tokenized length

## Example applied to df

df['num_tokens'] = df['processed_text'].apply(lambda text: len(tokenizer.tokenize(text)))
df.head()

Token indices sequence length is longer than the specified maximum sequence length for this model (553 > 512). Running this sequence through the model will result in indexing errors


Unnamed: 0,author,published_at,like_count,text,published_at_date,published_at_month,app,use4,processed_text,num_tokens
0,@rmac02,2024-06-28 21:03:24,0,bros gobbling on apples knob,2024-06-28,6,YouTube,"[0.05477922037243843, 0.05739085376262665, -0.057579558342695236, 0.03466566652059555, 0.059173405170440674, -0.06090722605586052, 0.013141455128788948, -0.05374596267938614, -0.06175786629319191, 0.04992125928401947, -0.06701109558343887, -0.053923316299915314, -0.05231587216258049, -0.019310804083943367, -0.09462444484233856, -0.05315953120589256, -0.006064518820494413, -0.05956262722611427, 0.03411610424518585, -0.07466941326856613, -0.04750526323914528, 0.06314964592456818, -0.029949961230158806, -0.056469257920980453, -0.06058188155293465, 0.08473767340183258, -0.011972262524068356, 0.050325170159339905, 0.007746669463813305, -0.0002209304948337376, -0.032340142875909805, -0.020155634731054306, -0.04089115560054779, -0.04084071144461632, -0.07144267857074738, 0.053870849311351776, -0.013872222974896431, 0.07515692710876465, -0.048982277512550354, 0.01804482378065586, 0.016927745193243027, -0.03408455103635788, 0.05538281053304672, -0.05294008180499077, -0.0659094974398613, 0.07467073202133179, 0.005771309603005648, -0.04508092626929283, -0.055480584502220154, -0.015562505461275578, 6.442357698688284e-05, -0.038493718951940536, 0.03730695694684982, -0.024685755372047424, -0.06456898152828217, -0.06593665480613708, 0.00540491659194231, 0.02285761572420597, 0.028475819155573845, 0.03391097113490105, 0.05469725653529167, 0.020981773734092712, -0.05814001336693764, 0.05057353526353836, -0.028311334550380707, 0.017940057441592216, 0.03160861134529114, -0.03107207827270031, 0.015672560781240463, -0.05345136299729347, -0.05766218155622482, 0.0014843306271359324, -0.03433616831898689, 0.03097318671643734, 0.03025241196155548, -0.06495372951030731, -0.0031826209742575884, 0.030640998855233192, -0.006702883169054985, -0.009099913761019707, 0.0381338968873024, 0.03157615661621094, 0.03686027601361275, -0.06495168060064316, 0.017992926761507988, 0.02654251642525196, 0.02002965286374092, 
-0.04300713539123535, 0.06323833018541336, -0.04885079711675644, 0.04157043620944023, -0.04514974728226662, 0.061278991401195526, 0.006096941418945789, 0.004772309213876724, -0.01325955055654049, -0.03772035986185074, 0.028910456225275993, -0.01382940448820591, 0.04684141278266907, ...]",bros gobbling on apples knob,6
1,@nignam,2024-06-28 16:20:26,0,sellout,2024-06-28,6,YouTube,"[0.04043683782219887, -0.062102310359478, 0.030066149309277534, 0.024012491106987, 0.0419841893017292, 0.032178182154893875, 0.03933561220765114, -0.04045035317540169, -0.000740361399948597, -0.07841067016124725, -0.06352648884057999, 0.07465454936027527, 0.006306588649749756, -0.06371334195137024, -0.02874443680047989, -0.042030975222587585, 0.0387619324028492, -0.03888664022088051, 0.024344077333807945, -0.024612504988908768, -0.05909457057714462, 0.02856578677892685, 0.03158183395862579, -0.030660105869174004, -0.03620532900094986, 0.06799277663230896, 0.02207118272781372, -0.03963794931769371, -0.0028048695530742407, 0.03869693726301193, 0.00813886895775795, -0.02876434102654457, 0.003967220429331064, 0.005340253468602896, 0.026797736063599586, 0.009630238637328148, 0.07118301093578339, -0.0035471287555992603, -0.049167487770318985, 0.0007122739916667342, 0.00839848443865776, 0.013733753934502602, 0.01954125612974167, -0.0750945657491684, 0.004887750837951899, 0.009187932126224041, -0.06003612279891968, 0.08184372633695602, -0.04337496683001518, 0.06912688910961151, 0.08141285181045532, -0.025697143748402596, 0.05083417892456055, 0.016065392643213272, -0.004814372397959232, -0.020158816128969193, -0.003619680181145668, 0.059899184852838516, 0.04908784106373787, 0.03904520347714424, 0.051913417875766754, -0.04419468343257904, -0.022200653329491615, -0.01652652584016323, -0.03392167016863823, -0.017891179770231247, 0.017054114490747452, 0.0021550932433456182, -0.0032344937790185213, 0.0339474231004715, -0.07973552495241165, -0.016104428097605705, -0.033158790320158005, -0.010757718235254288, -0.00916349422186613, -0.06124299019575119, -0.06177451089024544, 0.03903641179203987, 0.017293311655521393, -0.06074698641896248, -0.06614050269126892, 0.027284126728773117, -0.06088270992040634, 0.04599722474813461, -0.07630457729101181, 0.015295431017875671, 0.037164460867643356, -0.047454070299863815, 
0.010072089731693268, -0.07355521619319916, 0.002272245241329074, 0.049412939697504044, -0.019192064180970192, 0.017052778974175453, -0.06694582104682922, -0.055513545870780945, -0.062470439821481705, -0.03687966987490654, -0.029649846255779266, 0.06645850837230682, ...]",sellout,2
2,@powerplay862,2024-06-28 14:48:30,0,the math note is pretty sick!\nit's almost like cheating!!\ngood job apple,2024-06-28,6,YouTube,"[-0.04484812915325165, -0.08339715749025345, 0.028410248458385468, 0.02350887469947338, 0.019891755655407906, -0.07675128430128098, -0.04276067018508911, 0.016214335337281227, 0.04771573469042778, 0.03324521332979202, -0.05124563351273537, -0.08289879560470581, -0.08337297290563583, -0.016939319670200348, -0.10783673822879791, -0.027178674936294556, -0.0031233325134962797, -0.07559150457382202, -0.004530211444944143, -0.0873049795627594, -0.029898440465331078, -0.031007125973701477, -0.06296266615390778, 0.01691541261970997, -0.046601809561252594, 0.10261912643909454, -0.07777878642082214, 0.0030107228085398674, -0.05971807241439819, -0.027513397857546806, -0.05511348322033882, 0.026892488822340965, 0.002918386133387685, -0.011210213415324688, -0.03358110412955284, 0.0633760467171669, -0.07829649746417999, 0.01263357698917389, 0.05649981275200844, -0.0005581594887189567, 0.08254978060722351, 0.01887155883014202, -0.009742951020598412, 0.055481988936662674, 0.043272316455841064, 0.0036250390112400055, 0.029089979827404022, 0.028992608189582825, -0.008456387557089329, -0.06390393525362015, -0.008203987963497639, -0.0926123857498169, 0.03289870172739029, -0.02783939801156521, -0.06851879507303238, -0.01116293016821146, 0.010663305409252644, -0.010535347275435925, 0.03597768023610115, -0.011324161663651466, 0.0320604033768177, -0.028241926804184914, -0.024132058024406433, 0.042381785809993744, 0.02700016275048256, 0.0013019214384257793, -0.032187320291996, -0.04821299389004707, 0.040269751101732254, 0.013788800686597824, -0.04829113557934761, -0.024460766464471817, -0.03788619861006737, 0.07359050214290619, 0.06845364719629288, -0.100361667573452, 0.0366378016769886, -0.02680542878806591, 0.04399305209517479, 0.008447336032986641, 0.0644969493150711, 0.0012731784954667091, 0.014431880787014961, -0.031423840671777725, 
-0.06775195896625519, -0.061923880130052567, 0.04468032345175743, -0.060472521930933, -0.016677843406796455, 0.010662239044904709, 0.01482613105326891, 0.039219263941049576, 0.0015086608473211527, 0.05221906676888466, 0.025590790435671806, 0.07358177751302719, 0.01995600201189518, 0.00725716445595026, -0.024525752291083336, -0.06380206346511841, ...]",the math note is pretty sick! it's almost like cheating!! good job apple,18
3,@sufferer21,2024-06-28 14:28:28,0,u still a apple propagandist dude i will never forget this bs from you...,2024-06-28,6,YouTube,"[0.05666888505220413, 0.010458385571837425, -0.033825065940618515, -0.0067219543270766735, -0.01516518834978342, -0.04141934588551521, -0.05267658084630966, -0.005851018242537975, -0.0030651285778731108, 0.02550555393099785, -0.035765424370765686, -0.05095531418919563, -0.006403287872672081, -0.03502433001995087, -0.10303778201341629, -0.09512665867805481, -0.009217601269483566, -0.032310813665390015, 0.013986262492835522, -0.07801925390958786, -0.0587272085249424, -0.05497516691684723, 0.02942510135471821, 0.018861180171370506, -0.021144066005945206, 0.09561331570148468, 0.011882743798196316, -0.08729132264852524, 0.05493587255477905, -0.034596096724271774, -0.05147751420736313, -0.07142045348882675, 0.002075251191854477, 0.029187403619289398, -0.0012725176056846976, 0.03767875209450722, -0.02800467051565647, -0.009133628569543362, -0.004804712720215321, -0.003876924281939864, 0.08600848913192749, -0.0603964626789093, -0.037792399525642395, 0.06426748633384705, 0.040405161678791046, 0.01649405248463154, -0.10902475565671921, 0.042139675468206406, -0.05832066759467125, -0.015078152529895306, -0.032140590250492096, -0.0373297855257988, 0.055961839854717255, 0.038518693298101425, -0.07769394665956497, -0.03157364949584007, 0.05991525948047638, 0.050904739648103714, -0.012691210955381393, -0.015931904315948486, 0.04207828268408775, -0.004601995460689068, -0.019944461062550545, 0.017156554386019707, -0.0032405646052211523, 0.05003989860415459, -0.06756819039583206, -0.01767357811331749, 0.04108899086713791, -0.0032624301966279745, -0.033062487840652466, -0.004690384026616812, -0.0027620354667305946, -0.025275500491261482, 0.0418987013399601, -0.07120652496814728, -0.009716051630675793, 0.05177084356546402, -0.06764917820692062, -0.09619075059890747, 0.04137483239173889, 0.034443993121385574, 0.010458813048899174, -0.004793345928192139, 
0.020855017006397247, 0.04962741956114769, -0.031056342646479607, -0.05709494650363922, 0.0285166185349226, -0.031397104263305664, -0.0053161634132266045, -0.02676873654127121, 0.0895150825381279, 0.06786532700061798, -0.020296381786465645, 0.07596215605735779, 0.015962820500135422, 0.04965193569660187, 0.026264160871505737, 0.033387187868356705, ...]",u still a apple propagandist dude i will never forget this bs from you...,20
4,@jacksmith-mu3ee,2024-06-28 14:20:27,0,louis rossman destroyed you mate,2024-06-28,6,YouTube,"[0.020549798384308815, 0.0355159156024456, 0.016873575747013092, -0.008573861792683601, 0.04231791943311691, -0.019414560869336128, 0.01687905378639698, -0.00014053743507247418, -0.013180333189666271, 0.06381138414144516, -0.008507497608661652, -0.02672072872519493, 0.009557360783219337, -0.007814082317054272, 0.02942066453397274, -0.05242370441555977, -0.004958118312060833, -0.019180765375494957, -0.06764046102762222, -0.042346514761447906, -0.006155482959002256, -0.02123323641717434, 0.050891004502773285, 0.07698062807321548, -0.008480637334287167, 0.008382931351661682, 0.08787365257740021, 0.027966145426034927, -0.03228944540023804, -0.04416720196604729, 0.03351699933409691, -0.030592504888772964, -0.04249642789363861, 0.047148849815130234, 0.018946170806884766, -0.06083386391401291, 0.026951033622026443, -0.01074810791760683, -0.051041897386312485, -0.029206374660134315, 0.006735512521117926, -0.06066974997520447, -0.007557410281151533, -0.0424201600253582, -0.07438575476408005, 0.05175406113266945, -0.058469925075769424, 0.043441906571388245, 0.004223366267979145, -0.03042132593691349, 0.050565071403980255, -0.05296187102794647, -0.03434000909328461, 0.01839311420917511, -0.07409023493528366, 0.02765659987926483, 0.05005473643541336, -0.033215053379535675, 0.0027998671866953373, -0.00543176056817174, 0.09956444799900055, -0.0811036005616188, -0.09872432798147202, 0.07307009398937225, 0.07892110198736191, 0.0026390617713332176, -0.03928713500499725, -0.02382984198629856, 0.05972551181912422, -0.024074863642454147, 0.004046234767884016, 0.00725543824955821, 0.05154480040073395, -0.03743629902601242, 0.06690283864736557, 0.017501669004559517, 0.02907414361834526, 0.03398468345403671, -0.0030850523617118597, -0.03865328058600426, -0.036977656185626984, -0.003919669892638922, -0.057712115347385406, -0.002121160039678216, -0.04135245457291603, 0.033589959144592285, 
-0.048458345234394073, 0.06866581737995148, 0.04254477098584175, -0.013634305447340012, 0.07313109189271927, -0.018714558333158493, 0.04880201444029808, -0.07337287813425064, 0.004200661554932594, -0.02541457675397396, -0.029409093782305717, 0.0249997116625309, -0.03148101642727852, 0.02268814854323864, ...]",louis rossman destroyed you mate,6
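
The comments in the cell above mention two ways to deal with inputs over the 512-token limit: truncate them or split them. Below is a minimal sketch of the splitting option only. It is not what this notebook does (the plan here is to summarize long comments with Groq). `count_tokens` is a hypothetical whitespace-based stand-in for a real tokenizer, the regex sentence splitter is a simplified stand-in for `sent_tokenize`, and `split_into_chunks` is an illustrative name rather than a library function.

```python
import re
from typing import Callable, List


def count_tokens(text: str) -> int:
    """Hypothetical stand-in for a real tokenizer: counts whitespace-separated words."""
    return len(text.split())


def split_into_chunks(
    text: str,
    max_tokens: int = 500,
    token_counter: Callable[[str], int] = count_tokens,
) -> List[str]:
    """Greedily pack whole sentences into chunks of at most max_tokens tokens.

    A single sentence that exceeds max_tokens on its own is kept as its own
    chunk, so it would still need truncation downstream.
    """
    # Naive sentence split: break after '.', '!' or '?' followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current: List[str] = []
    for sentence in sentences:
        candidate = " ".join(current + [sentence])
        if current and token_counter(candidate) > max_tokens:
            # Adding this sentence would overflow the budget: close the chunk.
            chunks.append(" ".join(current))
            current = [sentence]
        else:
            current.append(sentence)
    if current:
        chunks.append(" ".join(current))
    return chunks


example = "one two three. four five six! seven eight nine? ten."
chunks = split_into_chunks(example, max_tokens=6)
```

To count real subword tokens instead of words, one could pass something like `token_counter=lambda s: len(tokenizer.tokenize(s))` with the tokenizer loaded above. Each chunk would then be scored separately, which leaves open how to combine per-chunk sentiment back into one label per comment.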


In [34]:
# Now let us look at the 10 longest comments by token count (roughly 500 tokens and above)

query = """
select author
, num_tokens
, processed_text
from df
order by num_tokens desc
limit 10
"""

duckdb.query(query).df()

Unnamed: 0,author,num_tokens,processed_text
0,hammerheadtiger,1375,"tim cook is shouting from the roof, craig and the exec team skydiving, mike rockwell in a vision pro, apple video team budget at an all time high. seat belts on folks, its a big one this year impressions: **visionos 2** - 2k native apps, 1.5m compatible apps - photos: ml to turn 2d photos into 3d spatial photos, shareplay - quick access hand gesture menu - mac virtual display supports different screen sizes including ultra wide which simulates two 4k monitors side by side - travel mode adds train support - cannon will sell a spatial lens for their cameras - other - rearrange home screen, mouse support, new apis, available in 8 more countries **ios 18** - home screen - app icons and widgets can be placed anywhere - adds ability to tint all apps by color, dark mode darkens app icons - lock or hide apps - control center - multiple pages including full screen home and music widgets - developers can build for control center too - swap lock screen controls for other actions - finally! 
- secure bluetooth pairing for apps that looks like how airpods pair - messages: - tapbacks support any emoji or sticker now - scheduled messages - text formatting - text effects - imessage and sms via satellites - i feel like this would have been a huge feature any other year - mail: - on device categorization - ai powered digest of emails - photos app has been redesigned entirely with filters for screenshots, pinning, grouped people photos - other - maps - topographic maps - tap to cash - pay each other by tapping phones - better event tickets - journal app adds features like stats and streaks - game mode in iphone - reminders in calendar - rcs support launching **airpods** - nod yes or shake for no to respond to siri - voice isolation for windy or loud environment **tvos** - insight in video to identify actors and music - adjust voice in video to make them clear - now we can finally know what michael caine said in interstellar - supports 21:9 projectors for those of you with full theater set ups in this economy **watchos 11** - training mode - measure intensity, duration, effort, training load - fitness app - cards can be reorganized - vitals app - check key metrics like heart rate and insights over time - cycle tracking app - supports pregnancy and gestational metrics - now apple can sell a million watches to anxious first time parents - widget stack sorts itself based on context like weather changes and ongoing uber rides - new watch faces created by an ai selecting your good photos and reframing them **ipad os** - redesigned apps with a more vision pro like animated tab bar - shareplay - draw on or even remotely control other peoples screen - big day for - hold the presses - calculator app is here - math notes - handwrite expressions with variables with apple pencil and it will automatically solve them. supports graphs as well - math in notes app too - now what will this subreddit complain about anymore? 
- notes smart script - makes your unreadable handwriting look good. i know some people who need this badly - spell check for handwriting - automatically shifting words around as you write **macos sequoia** - were not even halfway in and absolutely blowing through these platforms, hmmm i wonder what they are saving time for, what a mystery - continuity - iphone mirroring - a lot of android manufacturers have been doing this for a while now and its a very welcome addition here, sometimes you just want to quickly access a thin on your phone - iphone notifications can go to mac now and automatically trigger iphone mirroring - tiling and snapping to edges/corners - finally! - presenter preview for screens sharing, background replacement - passwords app - surfaces icloud keychain features - imo much needed as 1password is one of my most used apps because it is surfaced - safari - highlights - identify and surface key info from a webpage - reader - summarizes websites - still no mention of ai - automatic picture in picture - gaming - metal 3 - game porting toolkit 2 with better support for windows games - we got a mention of ms windows before ai - coming to mac: frostpunk 2, control, assassins creed shadows, **apple intelligence** - ""ai for the rest of us"" - craig - writing tools available across the system - proofreading - write emails and notes - summarization for emails as well as email snippets - asks you questions to generate a response - inbox summaries - notification summaries, selective surface only important notifications to reduce interruptions - can use your os for you to do tasks like pulling up apps to play music and creating folders - understands personal context from aggregated data across the system - genmoji to create a custom emoji - image playground - create images across the system including of people in imessages - makes it easier to create images based on traits and styles - happens entirely on device - image wand can convert a rough sketch into a 
better image - can create videos based on concepts and find photos from a long time period to tell a story - record and transcribe audio in notes and phone - privacy - on device processing on a17 and all m chips - private cloud compute for large server based models - data is never stored or shared with apple - verifiable software for independent researchers **siri** - better language understanding that understands corrections and context - type to siri - siri can help with tech support for apple products - on-screen awareness - can ask for things to be done based on info on the screen - command siri to take photos, take notes, or search for something across the system including actions in videos - can do compound understanding like finding a drivers license number and typing it into a form for you or finding a persons flight number and finding its live status to determine if it works with existing lunch plans - rest in peace standalone ai gadgets like rabbit r1 and humane ai pin, we hardly knew ye **chatgpt 4o** - siri can go ask chatgpt for things like answer general knowledge questions and generate more complex images - free and info not logged, chatgpt subscribers can access paid features - other ai models will be added in the future - sure, throw google a bone too i guess whew! what a day, this was probably the most jam packed wwdc i have watched in a long time. they definitely went deeper on integration that i thought they would. hats off to the developers working crazy hours to make the features announced a reality."
1,@flowting5163,1246,"my take on wwdc's ipad calculator app: it's nothing more than a gimmick wwdc just dropped yesterday and one of the highlight features was the calculator app feature. mkbhd, the gold standard reviewer to go to when it comes to technology, called it the only “incredible feature” that came to the ipad. but like almost every other hyped software/ai update these days, they hardly get used by us in the real world. ok, let’s dive into the main math notes features that everyone is so excited about and i will explain why they are all useless gimmicks: it’s able to give you answers as you write down a maths question. you can adjust the information in your math question and the answer will update accordingly. it can support variables and equations, and you can manipulate them all you want. you can add graphs in the notes and demonstrate the function as you manipulate the numbers. the features listed above are pretty cool, but if we were to put them in a real-world context, they instantly become obsolete and can easily be replaced with better tools and software. what is the real-world context? in this case, apple is targeting students in classroom settings, right? you are writing down equations, doing maths questions, playing with graphs, variables, functions etc. so how does it compare to classroom tools? the first feature is being able to give you answers as you write down a maths question. this feature is pretty useless beyond 4th grade. the maths that we do after 4th grade becomes more complex, we need to manipulate equations and find x. it’s not a matter of 89 * 34 =? in maths exams or practices, you need to show your working out and you need to do the mental arithmetic. in this case, how does the ipad even come in handy? you may argue that, in some cases, you might need to do complex calculations that you cannot do in your head. yeah right, that’s what the physical calculator that comes with the exams is for. 
the physical calculator is faster, easier and more representative of what you would face in exams. a physical calculator is just better in every way and form than doing maths on an ipad and hoping that the ai will read your handwriting and then provide an answer. anything beyond primary school, the feature becomes inferior to a physical calculator. anything beyond middle school, you would not even bother using the maths notes app. there is a reason why we need to buy a 200 dollar calculator for our exams like the ti-nspire that calculator simply blows the ipad out of the water in so many ways. the functions on the calculator go beyond solving questions, it can solve integrals, binomial theorem, graph trigonometric functions and show the gradient, the concavity, local max and local min… simply put, anything beyond primary school, the feature on your ipad can easily be replaced by physical calculators that not only better resemble exam conditions, but they have way more features and are way more powerful than the maths notes feature on the ipad. the second feature, adjusting numbers and you get new a answer. i don’t even know what’s so worth showing off about this feature. the cool factor that comes from these features is not because of their applications but instead because of their novelty and difficulty in designing something that could easily be done by something else. you can do just the same with the calculator shown above, or even if you have a crappy one, how hard is it for you to retype the numbers using physical buttons? finally, for the third and fourth features, the graphs and functions. does apple not know free amazing software like geogebra and desmos exist? they have way more features than the maths notes and are free. they are just better in so many ways that this blog won’t be able to fit it all. you don’t have to spend 1000 dollars on an ipad to get access to those features. 
all of the software features that you need for school work are already there on the internet for you. the physical calculator is just as good, if not, better than the ipad calculator when it comes to doing maths. the features on the ipad are a complete gimmick and they are not worth the hype. this reminds me of the humanoid robots that some companies are designing these days. sure, they are cool, but suck at what they do. there are robots everywhere around us, but they simply don’t take the shape of humans. there are robots in the manufacturing factories to build cars, make clothes, and seal boxes. these robots are good at what they do. but why do we not find them impressive? instead, we think that humanoid robots are impressive. the reason does not lie within their real-life applications and how pragmatic they are, the reason is that they are a marvel piece of engineering (that is good at nothing). it’s easy to mix a product’s usefulness with its engineering complexity with all the marketing on the internet these days. in the future, when we see another new feature or product marketed to us, rather than blindly following the hype, perhaps we should stop and reflect on how this feature would be translated to real-world use. the chances are, they are pretty damn useless and easily replaceable with much cheaper and effective alternatives. other examples include the samsung ai features. i got my ai update on my z fold 4 not long ago and i have rarely used them. i tested the translations and interpretations features and they are all wrong and hardly make any sense. i used the circle to search function maybe twice after all these months of having the feature. don't forget that the circle to search function was considered a highly useful function by reviewers, but to be honest, none of the circle of people i know use them and they don’t come close to altering the way we interact with our tech by 1%."
2,@mndlessdrwer,763,"i will never understand these chucklefucks trying to sell vr as a productivity tool at this stage in its development. like, how heavy and cumbersome is apple vision or an oculus headset? three pounds? four pounds? heavy enough that wearing it for the duration of its battery life can become legitimately uncomfortable to bear. once they figure out how to use nanostructure lensing to create something roughly the size of a pair of bulky glasses, then it'll be viable as a productivity tool. go ahead and make it wired with a belt clip or harnessed compute module, i really don't mind, but don't try to normalize people trying to get actual work done with a vr toaster strapped to their face. ==================================== my requirements for a phone are actually quite simple: -a camera with acceptable image quality -no distracting hole-punch or notch in the display -stereo front-facing speakers -a headphone jack as backup for when i need it -expandable storage for when i need it -the ability to make my home screen look like palm's webos thus far, the number of devices that fit that bill are very, very small and have never included the iphone. ==================================== okay, i will admit that the new calculator app on ipad os 18 is legitimately one of the best hands-on applications of ocr and ai interpretation that i've seen in a while. it wouldn't benefit me at all because my handwriting is atrocious, but it's still ridiculously cool to see. ==================================== i honestly wish i could care about all of these ai features, but i really don't. an enhanced spell-checker and grammar+syntax checker would be nice, but ai has repeatedly demonstrated that the english language is quite a challenge for it. 
photo editing is a good use case for it, but even then, that relies on the photo being taken well in the first place and the camera app can be a bit flaky about adjusting white balance, gamma, exposure, focus, stabilization, etc. i'd be most interested in them applying ai to properly optimize the shot in the first place. couple that with some stupid fast sensors so it can make a few tweaks to the settings, take a few photos in one go, then use ai and it's image processor to merge the image into something like what the light camera could do. the ability to shift focus while making adjustments to all of the usual corrections in exposure, but with the data from a bunch of different images taken with different settings to draw from so you never lose detail would be an awesome feature that even google can't offer yet. ==================================== i'm a bit disappointed that apple didn't make any announcements about their totl mac products, so there's no official news yet about an m3 or m4 ultra chip or the next generation of mac pro. content creators who rely heavily on mac should also be feeling impatient to know when the next generation of these totl apple silicon powered devices will be available."
3,@ThuanSuBa,707,"summarized by asktube wwdc keynote highlights 📱 00:02 apple's wwdc keynote was filled with exciting announcements, including a surprise introduction to ai in the second half. 00:25 the author live-tweeted the event and summarizes all the key announcements and thoughts. vision os 2.0 📊 00:47 vision os 2.0 brings new features like wrist-turn gestures to see time and battery percentage, and quick access to the control center. 01:05 new immersive media features: turn old photos into spatial photos using advanced machine learning, and create spatial videos on non-iphone cameras. ios updates 📱 02:29 ios got a major update with customizable home screens, icon placement, and a new theme engine. 02:54 customize your home screen with different colors, and change icon and widget colors to match your wallpaper. other updates 📊 03:37 mac mirroring gets a higher resolution, and new tools are available for creating spatial and immersive videos. 04:00 new features include scheduling text messages, text formatting, a game mode that minimizes background activity, and automatic categorization in the new mail app. airpods and apple tv updates 🎧 06:05 airpods got minor updates, including voice isolation during phone calls, spatial audio during games, and nodding or shaking for responding to prompts. 06:25 apple tv updates include showing actors' and characters' names on screen and boosting people's voices over the music. ipad os updates 📊 06:41 ipad os allows remote control of someone's ipad with their permission, perfect for family tech support. 07:03 the calculator app on ipad os is impressive, letting you write equations in handwriting and getting real-time answers. mac os updates 📊 09:40 mac os got a new version called sequoia, featuring automatic window snapping, built-in backgrounds for webcam feeds, and a new password app. 
apple intelligence ai 🤖 10:40 apple rebranded ai as apple intelligence, bringing new generative models and large language models to supported devices. 11:01 apple intelligence features include writing tools, a cleanup tool in photos, and generative emojis. how apple handles complex ai tasks 💻 15:10 apple handles most ai tasks on-device, using apple-built models for faster processing. however, for complex tasks, apple has two alternative approaches. 15:32 one approach uses a larger server-based model on apple's private cloud compute, ensuring data is not stored or sent to apple, but may have downsides like slower processing and internet dependence. 15:54 the other approach asks the user for permission to use chatgpt, allowing access to its capabilities and generating more realistic and varied images. privacy and security of ai features 🔒 16:18 apple ensures privacy and security by not storing requests, obscuring ip addresses, and preventing openai from forming a profile. all this is done without an account and for free. insights on apple's ai announcements 🤔 17:01 the author believes apple's ai features are superior, with personalization and existing user info making them more effective. 17:25 the ai announcements overshadow other features, marking a new age for apple, and the author is excited to explore its capabilities and what it can bring to users. conclusion and future content 📹 18:06 the author will create more videos about apple's ai features and encourages viewers to subscribe for updates."
4,kasakka1,557,"my takeaway was that across macos, ios and ipados i found very few things that i could tell i would include in my daily use. i already use 1password so apple passwords is not useful for me. i can already do corner splits with bettertouchtool's snap functions. i could have used something like improvements to macos virtual desktop handling instead, e.g naming desktops, arranging them in some other manner than a horizontal line, separating desktops vs fullscreen apps etc. it's one feature i use every day that is woefully forgotten by apple. and microsoft for that matter! i just shipped off my 2017 ipad pro 12.9"" in a trade-in for a samsung galaxy tab s9+ 12.4"". that way i got most of what the 2024 ipad pro 12.9"" offers at about 1/3 of the cost thanks to steep sales + trade-in value. the s9+ 16:10 format is not ideal, but it is more compact than the old ipad pro. this came at a good time right when apple announced my ipad pro would not have gotten ios 18 anyway. not that it matters much, because there were exactly zero features showcased that made me go ""i want that!"" i already moved away from ios to the samsung galaxy fold 4. with the discontinuation of the iphone mini line, i was left to pick from what i wanted to get away from - phones larger than i'd like! i like the fold because when folded it is similar width to the iphone 12 mini i was using, obviously just taller, heavier and much thicker. it has not been the most reliable phone, but when it works i really do love the foldable thing and apple has nothing like that in their lineup. apple is facing a dilemma where they make great hardware, but then have very few incentives for people to upgrade. i could easily be using something as old as the iphone xs and not feel like i'm getting a lesser experience vs the latest and greatest iphone pro unless i'm really into photography. great for a consumer, but not good business. 
we have gone from ""worth upgrading every 1-2 years"" to ""why bother upgrading if it still works"" as phone/tablet costs have crept up to very pricy levels. softwarewise, samsung oneui allows so many things that finetune its features to my liking, so it feels more like i can make it mine instead of adapting to what apple thinks is good. i'm not talking about launchers or widgets here, but just a variety of toggles that alter how its vast array of features works. so, i'm out, except for mac."
5,KLM_SpitFire,555,"**lots** of great updates. there's so much to process. there are a handful of things that i didn't really like at first glance: 1. all of the generative features. i didn't like how microsoft started baking them into their software, and i don't really like them here. can't place my finger on *why.* i feel like the siri features are more essential for me. i'm **really** excited to play around with an enhanced siri. 2. the color-tinted icons. the effect reminds me of those cheap icon packs that circulated on the play store years ago. a developer would take official icons, add a tint, then dump them on the store en masse. i feel as though apple heard the folks that have been wanting custom icon support, and this is some kind of stopgap. 3. the passwords app. i get this is going to be useful for many, many users that don't have good password management and hygiene. that said, i use bitwarden. i really hope apple doesn't begin to shove their password solution down my throat. 4. how blatantly chatgpt branding is baked into the user-experience. like, *whaaaat*? it feels very un-apple to me. i love how native search engine support with auto-suggestions isn't supported on ios ‚Äî if you've ever tried to google something from spotlight, you probably know my pain ‚Äî but i **can** have an os-level supported chatgpt conversation with chatgpt branding? wild. 5. (honestly, i'm glad they brought window snapping to mac os. that feature has been missing for so, so long. stage manager is not for me and never will be. that said, the 'old man' in me has really grown accustomed to my swish + alt-tab setup. hopefully i don't have to fight with these new os features to continue using those tools.) 6. (i find it ironic that the settings app has become so complicated to navigate that a major sales pitch from today‚Äôs presentation was using the raw power of apple‚Äôs flagship llm to‚Ä¶ navigate through settings. not a complaint about the ai feature. seems good. 
i’ll use it. i just feel that the settings app has some other problems that seeing this feature triggered in my mind.) i get that there will be people that will use or benefit from these features, but as both a software engineer and consumer, feature bloat always makes me a little uncomfortable. that said, i’m very interested to get my hands on this release."
6,@johnpearson1258,553,"romans 3:23 states: ""for all have sinned, and come short of the glory of god;"". we are all sinners, that have committed at least one sin in our lives, which earns us an eternal punishment in hell. revelation states 21:8 ""but the fearful, and unbelieving, and the abominable, and murderers, and whoremongers, and sorcerers, and idolaters, and all liars, shall have their part in the lake which burneth with fire and brimstone: which is the second death."". god sent his son, jesus, to earth as a perfect sinless sacrifice. to take on all the sins of the world; both past, present, and future sins. shedding his blood on the cross as a blood atonement for all sins. after dying on the cross, he rose from the dead on the 3rd day and rose to heaven. to show life after death. to show everlasting life through jesus's sacrifice. john 14:6 states: ""jesus saith unto him, i am the way, the truth, and the life: no man cometh unto the father, but by me."". romans 10:13 states ""for whosoever shall call upon the name of the lord shall be saved."". \r \r eternal life in heaven is just one prayer away. all you have to do is believe that jesus christ, the son of god, died on the cross for your sins. as an atonement for your sins. and rose from the grave on the 3rd day. ask jesus to save you of your sins and give you eternal life in heaven. salvation is a free and everlasting gift from god. please don't let pride drag you into eternal punishment in hell, but ask jesus to give you eternal life in heaven. if you're getting a strong feeling of resistance while reading this, it's because satan doesn't want you to get saved. he wants you to resist, and think it's not real. he wants you in hell for all eternity with him. hell is not a fun place, like worldly people like to imagine. it's everlasting torture with no relief. i'm only taking the time to text this, because i want to get the message out. 
there are so many distractions in life keeping people from witnessing to other people. keeping people from thinking about salvation. keeping you from thinking about life after death. please, ask jesus into your hearts to save you of your sins and cleanse you from all unrighteousness. ask him for a place in heaven, and he will save you from eternal damnation in hell."
7,@lellmajar,544,"not only that, for most of the presentation they did say that iphone would have apple intelligence and in the end they said only iphone 15 pro models, is like... üòÇ okay apple, not all people can afford a 1100 and 1300 dollars usd smartphones, thanks, steve jobs said that apple is for the humanity and to give them technology to improve people's lifes, apple had vision, tim cook did it until 2018, then everything has go slowly downhill, not all of it of course, but very down, not only that, but jobs said that according to our efforts we decide the price of our products, we try the most to price it correctly, well that was in 2009, and for your knowledge, apple intelligence is only available for 2 iphones (of more than 24 iphones eligible for the ios 18 update *oh is because they don't have 6gb of ram, well darling the android phone of my granny costs 200 dollars and has 8gb of ddr4 ram and on screen ai analysis to summarize websites with gemini for free -well you exchange your data even when they say no but i don't believe any company in terms of 100% privacy so it's meh for me, is logic, that's why usa gov wants to ban tiktok cuz they don't have control over their info and the excuse is that they are ""spies""-, apple does not use a ""unique physical ram"" to justify their price, like i say.. 
thefts*) but hey you have good animations :d and ""the ecosystem"" that windows and android are reaching more fast than ever to the point of apple, apple= no moral and respectful for people's money and time (so contrary to steve's apple), yeah they have good and incredible things like the integration of apple intelligence offline but it doesn't help the fact that whenever they do something great they put it like god's new technology to justify prices in this times when they are not the unique smartphone company they were in 2007-2018, others are in the same level or little lower or little higher than them in terms of technology, i hope that the apple directive is changed but that will not happen because money runs the world and if they are still selling, nothing will change. p.d. they said thar a17 pro chipset is the only capable of running apple intelligence , really? they always said new processors of apple silicon are the most fastest in the world all the time, the processor of the iphone 14 pro max is slow? gosh they are really inmoral capitalists (i agree with capitalism and liberal economies)"
8,@hilal_younus,469,"my first impression is definitely that they‚Äôre playing catch-up, in terms of qol features. they could‚Äôve done this back in ios 14 at once, there‚Äôs no reason to take 4 years to incorporate all these features. summarization was already available on mac, i‚Äôve sent numerous feature requests, asking them to bring it system-wide to ipads and iphones. this was also the case for increasing options for tapback in imessage (like in whatsapp, and telegram). and, google photos already does the features which apple just adopted. and amazon already has insight features called x-ray. i predict that within the next 5 years, we will definitely see ipad becoming more powerful, and hopefully files app being reformed. this years wwdc just proves that. the only original features i see is the image and emoji generation, most of them look like dall-e/ midjourney mock-ups, with some additional twists like ‚Äúsketch‚Äù. this isn‚Äôt to criticize apple for finally adopting these features, but i miss the times when we actually got feature-packed updates for macos x. nowadays, it‚Äôs just incremental features with increasing bugs. there were definitely years when apple wouldn‚Äôt have added much features, and instead focused on improving reliability. last time we got that was in 2018, with ios 12. since then, ipads particularly have become more buggier, and have felt like they‚Äôre being neglected. furthermore, there aren‚Äôt many new features that are specific to a platform, whatever updates we received on an ipad can most likely be done better on an iphone or the mac. - why would i pay $1000 for the ipad pro when the $300 ipad does the same thing? another thing, is the fact that i want to highlight is all the features macs got: - better app organizing into windows (windows had this for so many years, and 3rd party apps have brought this to mac many years ago) - other features which are ported from the iphone, it‚Äôs obvious that it‚Äôs made with iphone in mind. 
a few days ago, quinn from snazzy labs, highlighted this issue in a perfect manner highlighting how apple failed to address the glaring issues with ipads."
9,@madboyreadynow28,466,"i want to thank you for always giving us a well thought videos. others spend time babbling and trying to be funny and you sound like a tech guy giving us tech news. i have a suggestion to take you to a new level. i have been writing apple for years. i honestly believe they have used some of my ideas. i‚Äôll share them in a moment. by the way i am a software engineer. i am not one who thinks i am so d smart or the smarted person in the room. i love to always learn and ai find i am always learning new things from you. so my idea for you is a real tech news channel. this will make you something like the wal mart of tech news. this idea could put some tech influencers out of business with this idea. i tried to float this to apple but they don‚Äôt seem to like the idea. we have cnn, nbc news, abc news, cbs news and from time to time we get tech news but mostly when something new like a new iphone is released. tech news is happening every day all day. i say join forces with some of the best like yourself, brian tong, possibly some from engadget and create a real tech news channel. it‚Äôs not based around politics although t today politics and tech seem to collide. evs and republicans wanting to k evs, tictok, twitter. with social media companies being as influential as they have become. they‚Äôre not on the hot seat enough with cameras in their face or reporters running up on them to ask difficult questions. a real tech news channel will do this. what you and others do is journalism. i want the news to still be fun but i want it just like real news every day 24/7 just like a cnn. you will hire anchors do some in-depth interviews like we see you do often with tech ceo‚Äôs. it‚Äôs a militia billion. dollar idea i am giving to you. this is something new of its kind and needed. tech is no longer just mobile. we have stepped into the world of ai and humans need to understand how these new pieces of tech will i pack them. 
i would love to work with you as a producer or something. i have a huge idea and it will work."


Among these comments, 469 is the token count closest to 500 without exceeding it. Next, let's look at how many sentences each comment contains.


In [35]:
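# Rough sentence count: a word character followed by '.', '!' or '?'
# (a heuristic; decimals such as "12.9" also match, and dash-separated lists are undercounted)
df['sent_count'] = df['processed_text'].str.count(r'\w[.!?]')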
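
To get a feel for what this regex counts, here is a minimal sketch using only Python's `re` module. The helper name `rough_sentence_count` and the sample strings are made up for illustration (they are not part of the notebook's data); the pattern is the same one used in the cell above.

```python
import re

# Same heuristic as the cell above: a word character followed by '.', '!' or '?'
SENT_END = re.compile(r"\w[.!?]")

def rough_sentence_count(text: str) -> int:
    """Count sentence-like endings; hypothetical helper for illustration only."""
    return len(SENT_END.findall(text))

# Made-up sample strings, written in the style of the comments in the dataset
samples = {
    "plain prose": "great updates. so much to process! will i use it?",
    "decimal": "i traded in my ipad pro 12.9 for a tablet.",
    "dash list": "ios 18 - home screen - lock or hide apps - control center",
}

for label, text in samples.items():
    print(label, rough_sentence_count(text))
```

The decimal in "12.9" is counted as a sentence ending, while the dash-separated list gets a count of zero. That is likely why a long, bullet-style comment can end up with a very small `sent_count` relative to its token count, so the column is best read as an approximation.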

In [36]:
query = """
select author
, num_tokens
, processed_text
, sent_count
from df
order by num_tokens desc
limit 10
"""

duckdb.query(query).df()

Unnamed: 0,author,num_tokens,processed_text,sent_count
0,hammerheadtiger,1375,"tim cook is shouting from the roof, craig and the exec team skydiving, mike rockwell in a vision pro, apple video team budget at an all time high. seat belts on folks, its a big one this year impressions: **visionos 2** - 2k native apps, 1.5m compatible apps - photos: ml to turn 2d photos into 3d spatial photos, shareplay - quick access hand gesture menu - mac virtual display supports different screen sizes including ultra wide which simulates two 4k monitors side by side - travel mode adds train support - cannon will sell a spatial lens for their cameras - other - rearrange home screen, mouse support, new apis, available in 8 more countries **ios 18** - home screen - app icons and widgets can be placed anywhere - adds ability to tint all apps by color, dark mode darkens app icons - lock or hide apps - control center - multiple pages including full screen home and music widgets - developers can build for control center too - swap lock screen controls for other actions - finally! 
- secure bluetooth pairing for apps that looks like how airpods pair - messages: - tapbacks support any emoji or sticker now - scheduled messages - text formatting - text effects - imessage and sms via satellites - i feel like this would have been a huge feature any other year - mail: - on device categorization - ai powered digest of emails - photos app has been redesigned entirely with filters for screenshots, pinning, grouped people photos - other - maps - topographic maps - tap to cash - pay each other by tapping phones - better event tickets - journal app adds features like stats and streaks - game mode in iphone - reminders in calendar - rcs support launching **airpods** - nod yes or shake for no to respond to siri - voice isolation for windy or loud environment **tvos** - insight in video to identify actors and music - adjust voice in video to make them clear - now we can finally know what michael caine said in interstellar - supports 21:9 projectors for those of you with full theater set ups in this economy **watchos 11** - training mode - measure intensity, duration, effort, training load - fitness app - cards can be reorganized - vitals app - check key metrics like heart rate and insights over time - cycle tracking app - supports pregnancy and gestational metrics - now apple can sell a million watches to anxious first time parents - widget stack sorts itself based on context like weather changes and ongoing uber rides - new watch faces created by an ai selecting your good photos and reframing them **ipad os** - redesigned apps with a more vision pro like animated tab bar - shareplay - draw on or even remotely control other peoples screen - big day for - hold the presses - calculator app is here - math notes - handwrite expressions with variables with apple pencil and it will automatically solve them. supports graphs as well - math in notes app too - now what will this subreddit complain about anymore? 
- notes smart script - makes your unreadable handwriting look good. i know some people who need this badly - spell check for handwriting - automatically shifting words around as you write **macos sequoia** - were not even halfway in and absolutely blowing through these platforms, hmmm i wonder what they are saving time for, what a mystery - continuity - iphone mirroring - a lot of android manufacturers have been doing this for a while now and its a very welcome addition here, sometimes you just want to quickly access a thin on your phone - iphone notifications can go to mac now and automatically trigger iphone mirroring - tiling and snapping to edges/corners - finally! - presenter preview for screens sharing, background replacement - passwords app - surfaces icloud keychain features - imo much needed as 1password is one of my most used apps because it is surfaced - safari - highlights - identify and surface key info from a webpage - reader - summarizes websites - still no mention of ai - automatic picture in picture - gaming - metal 3 - game porting toolkit 2 with better support for windows games - we got a mention of ms windows before ai - coming to mac: frostpunk 2, control, assassins creed shadows, **apple intelligence** - ""ai for the rest of us"" - craig - writing tools available across the system - proofreading - write emails and notes - summarization for emails as well as email snippets - asks you questions to generate a response - inbox summaries - notification summaries, selective surface only important notifications to reduce interruptions - can use your os for you to do tasks like pulling up apps to play music and creating folders - understands personal context from aggregated data across the system - genmoji to create a custom emoji - image playground - create images across the system including of people in imessages - makes it easier to create images based on traits and styles - happens entirely on device - image wand can convert a rough sketch into a 
better image - can create videos based on concepts and find photos from a long time period to tell a story - record and transcribe audio in notes and phone - privacy - on device processing on a17 and all m chips - private cloud compute for large server based models - data is never stored or shared with apple - verifiable software for independent researchers **siri** - better language understanding that understands corrections and context - type to siri - siri can help with tech support for apple products - on-screen awareness - can ask for things to be done based on info on the screen - command siri to take photos, take notes, or search for something across the system including actions in videos - can do compound understanding like finding a drivers license number and typing it into a form for you or finding a persons flight number and finding its live status to determine if it works with existing lunch plans - rest in peace standalone ai gadgets like rabbit r1 and humane ai pin, we hardly knew ye **chatgpt 4o** - siri can go ask chatgpt for things like answer general knowledge questions and generate more complex images - free and info not logged, chatgpt subscribers can access paid features - other ai models will be added in the future - sure, throw google a bone too i guess whew! what a day, this was probably the most jam packed wwdc i have watched in a long time. they definitely went deeper on integration that i thought they would. hats off to the developers working crazy hours to make the features announced a reality.",11
1,@flowting5163,1246,"my take on wwdc's ipad calculator app: it's nothing more than a gimmick wwdc just dropped yesterday and one of the highlight features was the calculator app feature. mkbhd, the gold standard reviewer to go to when it comes to technology, called it the only ‚Äúincredible feature‚Äù that came to the ipad. but like almost every other hyped software/ai update these days, they hardly get used by us in the real world. ok, let‚Äôs dive into the main math notes features that everyone is so excited about and i will explain why they are all useless gimmicks: it‚Äôs able to give you answers as you write down a maths question. you can adjust the information in your math question and the answer will update accordingly. it can support variables and equations, and you can manipulate them all you want. you can add graphs in the notes and demonstrate the function as you manipulate the numbers. the features listed above are pretty cool, but if we were to put them in a real-world context, they instantly become obsolete and can easily be replaced with better tools and software. what is the real-world context? in this case, apple is targeting students in classroom settings, right? you are writing down equations, doing maths questions, playing with graphs, variables, functions etc. so how does it compare to classroom tools? the first feature is being able to give you answers as you write down a maths question. this feature is pretty useless beyond 4th grade. the maths that we do after 4th grade becomes more complex, we need to manipulate equations and find x. it‚Äôs not a matter of 89 * 34 =? in maths exams or practices, you need to show your working out and you need to do the mental arithmetic. in this case, how does the ipad even come in handy? you may argue that, in some cases, you might need to do complex calculations that you cannot do in your head. yeah right, that‚Äôs what the physical calculator that comes with the exams is for. 
the physical calculator is faster, easier and more representative of what you would face in exams. a physical calculator is just better in every way and form than doing maths on an ipad and hoping that the ai will read your handwriting and then provide an answer. anything beyond primary school, the feature becomes inferior to a physical calculator. anything beyond middle school, you would not even bother using the maths notes app. there is a reason why we need to buy a 200 dollar calculator for our exams like the ti-nspire that calculator simply blows the ipad out of the water in so many ways. the functions on the calculator go beyond solving questions, it can solve integrals, binomial theorem, graph trigonometric functions and show the gradient, the concavity, local max and local min‚Ä¶ simply put, anything beyond primary school, the feature on your ipad can easily be replaced by physical calculators that not only better resemble exam conditions, but they have way more features and are way more powerful than the maths notes feature on the ipad. the second feature, adjusting numbers and you get new a answer. i don‚Äôt even know what‚Äôs so worth showing off about this feature. the cool factor that comes from these features is not because of their applications but instead because of their novelty and difficulty in designing something that could easily be done by something else. you can do just the same with the calculator shown above, or even if you have a crappy one, how hard is it for you to retype the numbers using physical buttons? finally, for the third and fourth features, the graphs and functions. does apple not know free amazing software like geogebra and desmos exist? they have way more features than the maths notes and are free. they are just better in so many ways that this blog won‚Äôt be able to fit it all. you don‚Äôt have to spend 1000 dollars on an ipad to get access to those features. 
all of the software features that you need for school work are already there on the internet for you. the physical calculator is just as good, if not, better than the ipad calculator when it comes to doing maths. the features on the ipad are a complete gimmick and they are not worth the hype. this reminds me of the humanoid robots that some companies are designing these days. sure, they are cool, but suck at what they do. there are robots everywhere around us, but they simply don‚Äôt take the shape of humans. there are robots in the manufacturing factories to build cars, make clothes, and seal boxes. these robots are good at what they do. but why do we not find them impressive? instead, we think that humanoid robots are impressive. the reason does not lie within their real-life applications and how pragmatic they are, the reason is that they are a marvel piece of engineering (that is good at nothing). it‚Äôs easy to mix a product‚Äôs usefulness with its engineering complexity with all the marketing on the internet these days. in the future, when we see another new feature or product marketed to us, rather than blindly following the hype, perhaps we should stop and reflect on how this feature would be translated to real-world use. the chances are, they are pretty damn useless and easily replaceable with much cheaper and effective alternatives. other examples include the samsung ai features. i got my ai update on my z fold 4 not long ago and i have rarely used them. i tested the translations and interpretations features and they are all wrong and hardly make any sense. i used the circle to search function maybe twice after all these months of having the feature. don't forget that the circle to search function was considered a highly useful function by reviewers, but to be honest, none of the circle of people i know use them and they don‚Äôt come close to altering the way we interact with our tech by 1%.",51
2,@mndlessdrwer,763,"i will never understand these chucklefucks trying to sell vr as a productivity tool at this stage in its development. like, how heavy and cumbersome is apple vision or an oculus headset? three pounds? four pounds? heavy enough that wearing it for the duration of its battery life can become legitimately uncomfortable to bear. once they figure out how to use nanostructure lensing to create something roughly the size of a pair of bulky glasses, then it'll be viable as a productivity tool. go ahead and make it wired with a belt clip or harnessed compute module, i really don't mind, but don't try to normalize people trying to get actual work done with a vr toaster strapped to their face. ==================================== my requirements for a phone are actually quite simple: -a camera with acceptable image quality -no distracting hole-punch or notch in the display -stereo front-facing speakers -a headphone jack as backup for when i need it -expandable storage for when i need it -the ability to make my home screen look like palm's webos thus far, the number of devices that fit that bill are very, very small and have never included the iphone. ==================================== okay, i will admit that the new calculator app on ipad os 18 is legitimately one of the best hands-on applications of ocr and ai interpretation that i've seen in a while. it wouldn't benefit me at all because my handwriting is atrocious, but it's still ridiculously cool to see. ==================================== i honestly wish i could care about all of these ai features, but i really don't. an enhanced spell-checker and grammar+syntax checker would be nice, but ai has repeatedly demonstrated that the english language is quite a challenge for it. 
photo editing is a good use case for it, but even then, that relies on the photo being taken well in the first place and the camera app can be a bit flaky about adjusting white balance, gamma, exposure, focus, stabilization, etc. i'd be most interested in them applying ai to properly optimize the shot in the first place. couple that with some stupid fast sensors so it can make a few tweaks to the settings, take a few photos in one go, then use ai and it's image processor to merge the image into something like what the light camera could do. the ability to shift focus while making adjustments to all of the usual corrections in exposure, but with the data from a bunch of different images taken with different settings to draw from so you never lose detail would be an awesome feature that even google can't offer yet. ==================================== i'm a bit disappointed that apple didn't make any announcements about their totl mac products, so there's no official news yet about an m3 or m4 ultra chip or the next generation of mac pro. content creators who rely heavily on mac should also be feeling impatient to know when the next generation of these totl apple silicon powered devices will be available.",18
3,@ThuanSuBa,707,"summarized by asktube wwdc keynote highlights üì± 00:02 apple's wwdc keynote was filled with exciting announcements, including a surprise introduction to ai in the second half. 00:25 the author live-tweeted the event and summarizes all the key announcements and thoughts. vision os 2.0 üìä 00:47 vision os 2.0 brings new features like wrist-turn gestures to see time and battery percentage, and quick access to the control center. 01:05 new immersive media features: turn old photos into spatial photos using advanced machine learning, and create spatial videos on non-iphone cameras. ios updates üì± 02:29 ios got a major update with customizable home screens, icon placement, and a new theme engine. 02:54 customize your home screen with different colors, and change icon and widget colors to match your wallpaper. other updates üìä 03:37 mac mirroring gets a higher resolution, and new tools are available for creating spatial and immersive videos. 04:00 new features include scheduling text messages, text formatting, a game mode that minimizes background activity, and automatic categorization in the new mail app. airpods and apple tv updates üéß 06:05 airpods got minor updates, including voice isolation during phone calls, spatial audio during games, and nodding or shaking for responding to prompts. 06:25 apple tv updates include showing actors' and characters' names on screen and boosting people's voices over the music. ipad os updates üìä 06:41 ipad os allows remote control of someone's ipad with their permission, perfect for family tech support. 07:03 the calculator app on ipad os is impressive, letting you write equations in handwriting and getting real-time answers. mac os updates üìä 09:40 mac os got a new version called sequoia, featuring automatic window snapping, built-in backgrounds for webcam feeds, and a new password app. 
apple intelligence ai 🤖 10:40 apple rebranded ai as apple intelligence, bringing new generative models and large language models to supported devices. 11:01 apple intelligence features include writing tools, a cleanup tool in photos, and generative emojis. how apple handles complex ai tasks 💻 15:10 apple handles most ai tasks on-device, using apple-built models for faster processing. however, for complex tasks, apple has two alternative approaches. 15:32 one approach uses a larger server-based model on apple's private cloud compute, ensuring data is not stored or sent to apple, but may have downsides like slower processing and internet dependence. 15:54 the other approach asks the user for permission to use chatgpt, allowing access to its capabilities and generating more realistic and varied images. privacy and security of ai features 🔒 16:18 apple ensures privacy and security by not storing requests, obscuring ip addresses, and preventing openai from forming a profile. all this is done without an account and for free. insights on apple's ai announcements 🤔 17:01 the author believes apple's ai features are superior, with personalization and existing user info making them more effective. 17:25 the ai announcements overshadow other features, marking a new age for apple, and the author is excited to explore its capabilities and what it can bring to users. conclusion and future content 📹 18:06 the author will create more videos about apple's ai features and encourages viewers to subscribe for updates.",26
4,kasakka1,557,"my takeaway was that across macos, ios and ipados i found very few things that i could tell i would include in my daily use. i already use 1password so apple passwords is not useful for me. i can already do corner splits with bettertouchtool's snap functions. i could have used something like improvements to macos virtual desktop handling instead, e.g naming desktops, arranging them in some other manner than a horizontal line, separating desktops vs fullscreen apps etc. it's one feature i use every day that is woefully forgotten by apple. and microsoft for that matter! i just shipped off my 2017 ipad pro 12.9"" in a trade-in for a samsung galaxy tab s9+ 12.4"". that way i got most of what the 2024 ipad pro 12.9"" offers at about 1/3 of the cost thanks to steep sales + trade-in value. the s9+ 16:10 format is not ideal, but it is more compact than the old ipad pro. this came at a good time right when apple announced my ipad pro would not have gotten ios 18 anyway. not that it matters much, because there were exactly zero features showcased that made me go ""i want that!"" i already moved away from ios to the samsung galaxy fold 4. with the discontinuation of the iphone mini line, i was left to pick from what i wanted to get away from - phones larger than i'd like! i like the fold because when folded it is similar width to the iphone 12 mini i was using, obviously just taller, heavier and much thicker. it has not been the most reliable phone, but when it works i really do love the foldable thing and apple has nothing like that in their lineup. apple is facing a dilemma where they make great hardware, but then have very few incentives for people to upgrade. i could easily be using something as old as the iphone xs and not feel like i'm getting a lesser experience vs the latest and greatest iphone pro unless i'm really into photography. great for a consumer, but not good business. 
we have gone from ""worth upgrading every 1-2 years"" to ""why bother upgrading if it still works"" as phone/tablet costs have crept up to very pricy levels. softwarewise, samsung oneui allows so many things that finetune its features to my liking, so it feels more like i can make it mine instead of adapting to what apple thinks is good. i'm not talking about launchers or widgets here, but just a variety of toggles that alter how its vast array of features works. so, i'm out, except for mac.",25
5,KLM_SpitFire,555,"**lots** of great updates. there's so much to process. there are a handful of things that i didn't really like at first glance: 1. all of the generative features. i didn't like how microsoft started baking them into their software, and i don't really like them here. can't place my finger on *why.* i feel like the siri features are more essential for me. i'm **really** excited to play around with an enhanced siri. 2. the color-tinted icons. the effect reminds me of those cheap icon packs that circulated on the play store years ago. a developer would take official icons, add a tint, then dump them on the store en masse. i feel as though apple heard the folks that have been wanting custom icon support, and this is some kind of stopgap. 3. the passwords app. i get this is going to be useful for many, many users that don't have good password management and hygiene. that said, i use bitwarden. i really hope apple doesn't begin to shove their password solution down my throat. 4. how blatantly chatgpt branding is baked into the user-experience. like, *whaaaat*? it feels very un-apple to me. i love how native search engine support with auto-suggestions isn't supported on ios - if you've ever tried to google something from spotlight, you probably know my pain - but i **can** have an os-level supported chatgpt conversation with chatgpt branding? wild. 5. (honestly, i'm glad they brought window snapping to mac os. that feature has been missing for so, so long. stage manager is not for me and never will be. that said, the 'old man' in me has really grown accustomed to my swish + alt-tab setup. hopefully i don't have to fight with these new os features to continue using those tools.) 6. (i find it ironic that the settings app has become so complicated to navigate that a major sales pitch from today’s presentation was using the raw power of apple’s flagship llm to… navigate through settings. not a complaint about the ai feature. seems good. 
i’ll use it. i just feel that the settings app has some other problems that seeing this feature triggered in my mind.) i get that there will be people that will use or benefit from these features, but as both a software engineer and consumer, feature bloat always makes me a little uncomfortable. that said, i’m very interested to get my hands on this release.",37
6,@johnpearson1258,553,"romans 3:23 states: ""for all have sinned, and come short of the glory of god;"". we are all sinners, that have committed at least one sin in our lives, which earns us an eternal punishment in hell. revelation states 21:8 ""but the fearful, and unbelieving, and the abominable, and murderers, and whoremongers, and sorcerers, and idolaters, and all liars, shall have their part in the lake which burneth with fire and brimstone: which is the second death."". god sent his son, jesus, to earth as a perfect sinless sacrifice. to take on all the sins of the world; both past, present, and future sins. shedding his blood on the cross as a blood atonement for all sins. after dying on the cross, he rose from the dead on the 3rd day and rose to heaven. to show life after death. to show everlasting life through jesus's sacrifice. john 14:6 states: ""jesus saith unto him, i am the way, the truth, and the life: no man cometh unto the father, but by me."". romans 10:13 states ""for whosoever shall call upon the name of the lord shall be saved."". \r \r eternal life in heaven is just one prayer away. all you have to do is believe that jesus christ, the son of god, died on the cross for your sins. as an atonement for your sins. and rose from the grave on the 3rd day. ask jesus to save you of your sins and give you eternal life in heaven. salvation is a free and everlasting gift from god. please don't let pride drag you into eternal punishment in hell, but ask jesus to give you eternal life in heaven. if you're getting a strong feeling of resistance while reading this, it's because satan doesn't want you to get saved. he wants you to resist, and think it's not real. he wants you in hell for all eternity with him. hell is not a fun place, like worldly people like to imagine. it's everlasting torture with no relief. i'm only taking the time to text this, because i want to get the message out. 
there are so many distractions in life keeping people from witnessing to other people. keeping people from thinking about salvation. keeping you from thinking about life after death. please, ask jesus into your hearts to save you of your sins and cleanse you from all unrighteousness. ask him for a place in heaven, and he will save you from eternal damnation in hell.",28
7,@lellmajar,544,"not only that, for most of the presentation they did say that iphone would have apple intelligence and in the end they said only iphone 15 pro models, is like... 😂 okay apple, not all people can afford a 1100 and 1300 dollars usd smartphones, thanks, steve jobs said that apple is for the humanity and to give them technology to improve people's lifes, apple had vision, tim cook did it until 2018, then everything has go slowly downhill, not all of it of course, but very down, not only that, but jobs said that according to our efforts we decide the price of our products, we try the most to price it correctly, well that was in 2009, and for your knowledge, apple intelligence is only available for 2 iphones (of more than 24 iphones eligible for the ios 18 update *oh is because they don't have 6gb of ram, well darling the android phone of my granny costs 200 dollars and has 8gb of ddr4 ram and on screen ai analysis to summarize websites with gemini for free -well you exchange your data even when they say no but i don't believe any company in terms of 100% privacy so it's meh for me, is logic, that's why usa gov wants to ban tiktok cuz they don't have control over their info and the excuse is that they are ""spies""-, apple does not use a ""unique physical ram"" to justify their price, like i say.. 
thefts*) but hey you have good animations :d and ""the ecosystem"" that windows and android are reaching more fast than ever to the point of apple, apple= no moral and respectful for people's money and time (so contrary to steve's apple), yeah they have good and incredible things like the integration of apple intelligence offline but it doesn't help the fact that whenever they do something great they put it like god's new technology to justify prices in this times when they are not the unique smartphone company they were in 2007-2018, others are in the same level or little lower or little higher than them in terms of technology, i hope that the apple directive is changed but that will not happen because money runs the world and if they are still selling, nothing will change. p.d. they said thar a17 pro chipset is the only capable of running apple intelligence , really? they always said new processors of apple silicon are the most fastest in the world all the time, the processor of the iphone 14 pro max is slow? gosh they are really inmoral capitalists (i agree with capitalism and liberal economies)",7
8,@hilal_younus,469,"my first impression is definitely that they’re playing catch-up, in terms of qol features. they could’ve done this back in ios 14 at once, there’s no reason to take 4 years to incorporate all these features. summarization was already available on mac, i’ve sent numerous feature requests, asking them to bring it system-wide to ipads and iphones. this was also the case for increasing options for tapback in imessage (like in whatsapp, and telegram). and, google photos already does the features which apple just adopted. and amazon already has insight features called x-ray. i predict that within the next 5 years, we will definitely see ipad becoming more powerful, and hopefully files app being reformed. this years wwdc just proves that. the only original features i see is the image and emoji generation, most of them look like dall-e/ midjourney mock-ups, with some additional twists like “sketch”. this isn’t to criticize apple for finally adopting these features, but i miss the times when we actually got feature-packed updates for macos x. nowadays, it’s just incremental features with increasing bugs. there were definitely years when apple wouldn’t have added much features, and instead focused on improving reliability. last time we got that was in 2018, with ios 12. since then, ipads particularly have become more buggier, and have felt like they’re being neglected. furthermore, there aren’t many new features that are specific to a platform, whatever updates we received on an ipad can most likely be done better on an iphone or the mac. - why would i pay $1000 for the ipad pro when the $300 ipad does the same thing? another thing, is the fact that i want to highlight is all the features macs got: - better app organizing into windows (windows had this for so many years, and 3rd party apps have brought this to mac many years ago) - other features which are ported from the iphone, it’s obvious that it’s made with iphone in mind. 
a few days ago, quinn from snazzy labs, highlighted this issue in a perfect manner highlighting how apple failed to address the glaring issues with ipads.",16
9,@madboyreadynow28,466,"i want to thank you for always giving us a well thought videos. others spend time babbling and trying to be funny and you sound like a tech guy giving us tech news. i have a suggestion to take you to a new level. i have been writing apple for years. i honestly believe they have used some of my ideas. i’ll share them in a moment. by the way i am a software engineer. i am not one who thinks i am so d smart or the smarted person in the room. i love to always learn and ai find i am always learning new things from you. so my idea for you is a real tech news channel. this will make you something like the wal mart of tech news. this idea could put some tech influencers out of business with this idea. i tried to float this to apple but they don’t seem to like the idea. we have cnn, nbc news, abc news, cbs news and from time to time we get tech news but mostly when something new like a new iphone is released. tech news is happening every day all day. i say join forces with some of the best like yourself, brian tong, possibly some from engadget and create a real tech news channel. it’s not based around politics although t today politics and tech seem to collide. evs and republicans wanting to k evs, tictok, twitter. with social media companies being as influential as they have become. they’re not on the hot seat enough with cameras in their face or reporters running up on them to ask difficult questions. a real tech news channel will do this. what you and others do is journalism. i want the news to still be fun but i want it just like real news every day 24/7 just like a cnn. you will hire anchors do some in-depth interviews like we see you do often with tech ceo’s. it’s a militia billion. dollar idea i am giving to you. this is something new of its kind and needed. tech is no longer just mobile. we have stepped into the world of ai and humans need to understand how these new pieces of tech will i pack them. 
i would love to work with you as a producer or something. i have a huge idea and it will work.",31


## Try an open-source LLM served through GroqCloud (fast language model inference) to summarize a Pandas text column
Reference: https://console.groq.com/docs/quickstart

In [37]:
# Read the Groq API key from Kaggle secrets
# (user_secrets is assumed to be the UserSecretsClient instance created earlier in the notebook)
client = Groq(
    api_key=user_secrets.get_secret("groq_key"),
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": '''
            
            Please summarize the following comment in 15 sentences or less as if you are the author trying to write a condensed version of your original comment.
            Here is the original comment:
            i want to thank you for always giving us a well thought videos. others spend time babbling and trying to be funny and you sound like a tech guy giving us tech news. i have a suggestion to take you to a new level. i have been writing apple for years. i honestly believe they have used some of my ideas. i’ll share them in a moment. by the way i am a software engineer. i am not one who thinks i am so d smart or the smarted person in the room. i love to always learn and ai find i am always learning new things from you. so my idea for you is a real tech news channel. this will make you something like the wal mart of tech news. this idea could put some tech influencers out of business with this idea. i tried to float this to apple but they don’t seem to like the idea. we have cnn, nbc news, abc news, cbs news and from time to time we get tech news but mostly when something new like a new iphone is released. tech news is happening every day all day. i say join forces with some of the best like yourself, brian tong, possibly some from engadget and create a real tech news channel. it’s not based around politics although t today politics and tech seem to collide. evs and republicans wanting to k evs, tictok, twitter. with social media companies being as influential as they have become. they’re not on the hot seat enough with cameras in their face or reporters running up on them to ask difficult questions. a real tech news channel will do this. what you and others do is journalism. i want the news to still be fun but i want it just like real news every day 24/7 just like a cnn. you will hire anchors do some in-depth interviews like we see you do often with tech ceo’s. it’s a militia billion. dollar idea i am giving to you. this is something new of its kind and needed. tech is no longer just mobile. we have stepped into the world of ai and humans need to understand how these new pieces of tech will i pack them. 
i would love to work with you as a producer or something. i have a huge idea and it will work.
            
            ''',
        }
    ],
    model="llama3-8b-8192",
)

chat_completion.choices[0].message.content

"Here is a condensed version of the original comment:\n\nI want to thank you for creating well-researched and informative videos. As a software engineer, I appreciate your tech-heavy content and find it both enjoyable and educational. I have a suggestion to take your content to the next level: creating a 24/7 real tech news channel. This would differentiate you from other tech influencers and put some out of business. I've been writing to Apple for years and believe they've used some of my ideas. I'd love to collaborate with you and others like Brian Tong and Engadget to make this channel a reality. The idea would focus on tech news, not politics, and would be a departure from the typical tech news we see today. We could hire anchors and conduct in-depth interviews with tech CEOs. I believe this has the potential to be a billion-dollar idea and would love to work with you as a producer."

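The cell above hard-codes a single comment inside the prompt. A minimal sketch of building the same message list for any comment could look like the following. `build_summary_messages` and its `max_sentences` parameter are hypothetical names introduced here for illustration (they are not part of the Groq SDK), and the wording simply mirrors the prompt used above.

```python
from typing import Dict, List


def build_summary_messages(comment: str, max_sentences: int = 15) -> List[Dict[str, str]]:
    """Return a chat `messages` list asking the model to condense one comment.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    prompt = (
        f"Please summarize the following comment in {max_sentences} sentences or less "
        "as if you are the author trying to write a condensed version of your "
        "original comment.\n"
        "Here is the original comment:\n"
        f"{comment.strip()}"  # strip stray whitespace picked up when copying text
    )
    return [{"role": "user", "content": prompt}]


# Example with a short stand-in comment
messages = build_summary_messages("  i have a huge idea and it will work.  ")
print(messages[0]["content"])
```

The returned list could then be passed as `messages=` to `client.chat.completions.create(...)` with the same `model` as in the cell above.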
In [38]:
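# Drop the chatbot's intro line ("Here is a condensed version...") by splitting once on the first blank line
# and keeping only the summary that follows it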
spl_word = '\n\n'
res = chat_completion.choices[0].message.content.split(spl_word, 1)
res[1]

"I want to thank you for creating well-researched and informative videos. As a software engineer, I appreciate your tech-heavy content and find it both enjoyable and educational. I have a suggestion to take your content to the next level: creating a 24/7 real tech news channel. This would differentiate you from other tech influencers and put some out of business. I've been writing to Apple for years and believe they've used some of my ideas. I'd love to collaborate with you and others like Brian Tong and Engadget to make this channel a reality. The idea would focus on tech news, not politics, and would be a departure from the typical tech news we see today. We could hire anchors and conduct in-depth interviews with tech CEOs. I believe this has the potential to be a billion-dollar idea and would love to work with you as a producer."

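Splitting on the first blank line and taking `res[1]` works for the response above, but it would raise an `IndexError` if the model returned the summary without an intro. A slightly more defensive sketch is shown below. `strip_intro` is a hypothetical helper, and it assumes that any intro is a single first paragraph ending with a colon, as in the response above.

```python
def strip_intro(response_text: str, separator: str = "\n\n") -> str:
    """Drop a leading "Here is ..." style preamble if the model added one.

    Hypothetical helper. Assumes a preamble is a single first paragraph
    that ends with a colon; otherwise the text is returned unchanged
    (apart from surrounding whitespace).
    """
    head, sep, tail = response_text.partition(separator)
    if sep and head.rstrip().endswith(":") and tail.strip():
        return tail.strip()
    return response_text.strip()


# Example with a response shaped like the one above
print(strip_intro("Here is a condensed version of the original comment:\n\nI want to thank you."))
```

Checking for the trailing colon avoids cutting off the first paragraph of a summary that happens to contain a blank line but no intro.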
In [39]:
# Check that this condensed version from Groq stays under the token limit (512, or 500 to leave a margin)
len(tokenizer.tokenize(res[1]))

182

In [40]:
# Next steps:
# 1. Write a function that loops through the rows of the Pandas text column with more than 500 tokens
# 2. Collect the Groq responses in a list
# 3. Add the summaries to df
# 4. Re-run the tokenizer
# 5. Check that every row now has 500 tokens or fewer
# 6. After verification, perform sentiment analysis
# 7. Possibly perform sentiment analysis using both Hugging Face (HF) and Groq
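The plan above can be sketched as a small function that takes the tokenizer and the summarizer as arguments, so the loop can be tried without an API key. `condense_long_texts`, `fake_count`, and `fake_summarize` are hypothetical names. In the notebook the stand-ins would be replaced by something like `lambda s: len(tokenizer.tokenize(s))` and a function wrapping the Groq call.

```python
from typing import Callable, List


def condense_long_texts(
    texts: List[str],
    count_tokens: Callable[[str], int],
    summarize: Callable[[str], str],
    max_tokens: int = 500,
) -> List[str]:
    """Summarize only the texts whose token count exceeds `max_tokens`.

    Raises ValueError if a summary is still over the limit, so that an
    over-long row cannot slip through to the sentiment model unnoticed.
    """
    condensed = []
    for text in texts:
        if count_tokens(text) > max_tokens:
            text = summarize(text)
            if count_tokens(text) > max_tokens:
                raise ValueError("summary still exceeds the token limit")
        condensed.append(text)
    return condensed


# Stand-ins so the sketch runs without an API key or a model download:
# whitespace splitting replaces the real tokenizer, truncation replaces Groq.
def fake_count(text: str) -> int:
    return len(text.split())


def fake_summarize(text: str) -> str:
    return " ".join(text.split()[:5])


print(condense_long_texts(
    ["short comment", "one two three four five six seven"],
    fake_count,
    fake_summarize,
    max_tokens=5,
))
```

The result is a plain list in the original row order, so it could be written back with `df["text"] = condense_long_texts(df["text"].tolist(), ...)`.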

In [41]:
# %%time

# df = (
# df.assign(sentiment = lambda x: x['text'].apply(lambda s: sentiment_analysis(s)))
#     .assign(
#          label = lambda x: x['sentiment'].apply(lambda s: (s[0]['label'])),
#          score = lambda x: x['sentiment'].apply(lambda s: (s[0]['score']))
#     )
# )

# df.head()
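The commented-out cell above reads `s[0]['label']` and `s[0]['score']` because a Hugging Face text-classification pipeline returns a list with one dictionary per input. A small sketch of that unpacking step follows. `unpack_sentiment` is a hypothetical helper, and the example result is hand-written rather than real model output.

```python
from typing import Dict, List, Tuple


def unpack_sentiment(result: List[Dict[str, object]]) -> Tuple[str, float]:
    """Return (label, score) from a one-item pipeline result such as
    [{"label": "positive", "score": 0.98}]."""
    top = result[0]
    return str(top["label"]), float(top["score"])


# Example with a hand-written result (not real model output)
label, score = unpack_sentiment([{"label": "positive", "score": 0.98}])
print(label, score)
```

Returning both values from one call means the pipeline only has to run once per row before the `label` and `score` columns are filled.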

# Aspect-Based Sentiment Analysis (ABSA) Using PyABSA

# Named Entity Recognition in order to isolate products?

# Radar Graph between 2 similar, close in proximity restaurants 