# Cluster Analysis
<br>
## Task

* Group the Facebook posts into clusters so that each cluster contains posts with similar content.
* This task has no unique solution: the data is not labeled, so there is no known correct answer and you will not be able to verify directly how good your model is.

## Data
* We prepared two datasets about Facebook pages
* The first dataset contains a list of pages
* The second dataset contains 100 randomly selected posts for each page
* You will not actually need the pages dataset unless you come up with some interesting features based on it

## Notes
* Create a vector representation for the texts
* Split the text into words
* Compute IDF
* Use KMeans algorithm to train a model
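
The first three steps above can be illustrated without Spark. The following is a minimal pure-Python sketch, not the notebook's pipeline: the three sample documents and the helper names (`tokenize`, `compute_idf`, `tfidf`) are made up for illustration, and the smoothed formula log((N + 1) / (df + 1)) is assumed to match the one described in the Spark IDF documentation.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a simplification of a real tokenizer)."""
    return text.lower().split()


def compute_idf(docs_tokens):
    """Smoothed IDF per term: log((N + 1) / (df + 1)), df = number of docs containing the term."""
    n_docs = len(docs_tokens)
    doc_freq = Counter()
    for tokens in docs_tokens:
        doc_freq.update(set(tokens))  # count each term at most once per document
    return {term: math.log((n_docs + 1) / (df + 1)) for term, df in doc_freq.items()}


def tfidf(tokens, idf):
    """Multiply the raw term frequency by the IDF weight of each term."""
    term_freq = Counter(tokens)
    return {term: freq * idf[term] for term, freq in term_freq.items()}


# Hypothetical sample documents, for illustration only
docs = [
    "new jeep wrangler for sale",
    "new ford truck for sale",
    "study abroad in tokyo",
]
tokens = [tokenize(d) for d in docs]
idf = compute_idf(tokens)
vectors = [tfidf(t, idf) for t in tokens]
```

Terms that occur in many documents (here `new`, `for`, `sale`) receive a lower weight than terms that occur in only one (`jeep`, `tokyo`), which is the reason for computing IDF before clustering.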

## About K-Means
* K-means is an unsupervised learning algorithm that can be used to cluster data into groups
* For details see <a target="_blank" href="https://en.wikipedia.org/wiki/K-means_clustering">wiki</a>
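
To make the idea concrete, here is a minimal pure-Python sketch of the basic k-means iteration (assign each point to its nearest center, then move each center to the mean of its points). It is an illustration only and not how Spark implements `KMeans`: the function names and the six sample points are hypothetical, and the initial centers are simply the first `k` points, which is a simplification.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def nearest_center(point, centers):
    """Index of the center closest to the given point."""
    return min(range(len(centers)), key=lambda i: squared_distance(point, centers[i]))


def kmeans(points, k, max_iterations=20):
    # Simplified initialization: use the first k points as starting centers
    centers = list(points[:k])
    for _ in range(max_iterations):
        # Assignment step: attach every point to its nearest center
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest_center(p, centers)].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center)
        new_centers = [mean(c) if c else centers[i] for i, c in enumerate(clusters)]
        if new_centers == centers:
            break  # converged: no center moved
        centers = new_centers
    labels = [nearest_center(p, centers) for p in points]
    return centers, labels


# Hypothetical sample data: two well-separated groups
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9), (4.9, 5.2)]
centers, labels = kmeans(points, k=2)
```

The result depends on the initial centers, which is why the `KMeans` stage later in this notebook is given a fixed `seed`.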

## Documentation
<br>
* PySpark documentation of the DataFrame API is <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.sql.html">here</a>

* PySpark documentation of the ML Pipelines library is <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html">here</a>

* Presentation slides are available <a target="_blank" href = "https://docs.google.com/presentation/d/1XNKIfE5Atj_Mzse0wjmbwLecmVs2YkWm9cqOLqDVWPo/edit?usp=sharing">here</a>

### Import functions

In [4]:
from pyspark.sql.functions import (
    col, count, desc, row_number, collect_list, length, array_contains, size,
    broadcast, explode, first, when,
)

from pyspark.sql import Window

from pyspark.ml.feature import Tokenizer, StopWordsRemover, HashingTF, IDF, Normalizer, SQLTransformer

from pyspark.ml.clustering import KMeans

from pyspark.ml import Pipeline

import random

### Load the data

In [6]:
pages = spark.table('mlprague.facebook_pages')

posts = spark.table('mlprague.facebook_posts')

### You may want to do some exploratory analytics first

hint:
* see how many records you have
* what is the schema of the dataset
* see some records
* you can use printSchema(), show(), count(), or the proprietary function display()

In [8]:
pages.count()

In [9]:
posts.count()

In [10]:
display(pages)

page_id,name,description
1404160529823912,The Cannabist,"The Cannabist is The Denver Post's home for ideas, people, art, food and news, centered around the culture of cannabis. www.thecannabist.co"
474226972647323,Alibabuy,"Réalisez vos rêves de voyage avec le comparateur Alibabuy. Bons plans, actualités et évasion, on vous aide à organiser vos futures vacances."
87076260498,ICING,Icing is the one-stop shop for trendsetting jewelry & accessories at unbeatable prices!
54971236771,NASA - National Aeronautics and Space Administration,Explore the universe and discover our home planet with the official NASA page on Facebook. Visit us at www.nasa.gov
152967154570,ARB 4x4 Accessories - USA Office,"ARB is a the manufacturer of quality off road accessories including air locking differentials, air compressors, winch bumpers, suspension systems, roof racks, skid plates, side rails, rear bumpers, roof top tents and recovery gear."
156195289363,Ram Trucks,Welcome to the official Ram Trucks Facebook page. Visit us at www.RamTrucks.com Standard operating hours: Mon – Fri 7:30 AM to 6:00 PM EST.
244456705578510,Nelnet Careers,"Catch a glimpse of what LifeAtNelnet is like. Like and subscribe to our page to receive job postings, company updates, and day-to-day shenanigans right in your feed."
307808202665938,Young Ford Morgan,"Young Ford in Morgan Utah is proud to be your wasatch front Ford dealer and meet all of your service, new car sales and used car sales needs!"
104983900783,Já chci na pláž ...,"www.jachcinaplaz.cz, největší Facebooková stránka v ČR. Užijte si s námi dovolenou. Stránka pro všechny, co nechtějí být na pláži sami."
108380356656,Atlanta Marriott Marquis,"Experience bustling energy and a stylish vibe at Atlanta Marriott Marquis. Our Atlanta hotel's location in Georgia is ideal, in Peachtree Center in the heart of downtown near the MARTA leading to top Atlanta attractions."


In [11]:
display(posts)

page_id,message
240229921942,"So, I think it's great you guys are making waves in the industry; including Bixby. However, I noticed you are extremely bias towards anything remotely conservative. This to me seems like a short fall to what could potentially be a great brand. It makes me wonder who you are, Like, who your team is. I wonder how a group of writers with so much going for them can be so bias and still feel like they are creative? I mean, how is it that I can be exposed to Flipboard for less then 3 minutes, and already know: your entire team does not support Trump; your entire team (much like AP) has an extremely progressive agenda; your entire team ONLY values progressive views; your entire team will NOT expose any good truths about any opposing views without constructing the narrative in such a way your paper still reads as if the opposing view is less good then your progressive agenda. I cant scroll down your news feed, which you managed to pitch of to Samsung's Bixby, without feeling a extremely bias narrative that onlky captures one side of the story. It's kind of like, have you ever noticed that AP ONLY posts pictures of Trump making a duck face. You see, it's not so much I am a huge Trump supporter, rather I bring it up as a metaphorical similarity to your Flipboard camp, and it scares me because, based on your boards, there is no doubt in my mind that your entire camp only has one point of view, which is damaging for any news business who wants to be deemed as credible. All of your stories, all of the news orginizations you post on, your entire flip board spells - Damn the president, loss faith in the system, fear big business, there are only good Democrates, the only good republicans are ones who side with Democrares, focus on black athletes, black talent, black calture, love imigrants, demand change and feel guilty if you happen to miss our message. I hear what you are saying, and a lot of what the progressives push for definetly has a place in our society. 
However, to completely disregard a very serious narrative told through a conservitives lense is extremely damaging. I mean the world isn't build so plainly, and relationships at the top arnt so black and white. Your demographic is a very young and vulnerable generation, your corrupting them and the youth by not capturing the story. Your helping to destroy due process, and frankly your making talk smack about your company to everyone I know. Which, i know it's like who the hell is this guy. You never know though."
6510408962,"A quick throwback to Graduation last week... As well as celebrating the achievements of our students, we also formally welcomed our new chancellor, the incredibly inspiring Margaret Casely-Hayford."
66189885427,"TAKE A SNACK BREAK! Popcorn, cookies & Candy! PopArt - 161 Valley Rd, Clifton. Opens 12pm M-F. Mention to Lisa you are from Montclaire State to receive a special deal!"
422179114514548,"If you know Simone, our office manager, then you'll know her beautiful handmade hats! Get your orders in ASAP!"
106079664342,"Three Film Fest Finalists are in the running for our $1,500 Grand Prize with less than a week left to win the public vote. This week we'll be sharing their messages to you, asking you to pick their film! First up is Jake Hatfield ( IES Abroad Tokyo | Penn State), and his film ""Sapporo""! Vote for Jake's film here:"
446653042113846,TRIFIX i-SIZE je dokonale přizpůsobenaý potřebám dětí s výškou od 76 cm do 105 cm a je perfektním nástupcem jakékoliv dětské autosedačky. bit.ly/trifix-i-size
153527691354397,"Cardiff University Global Opportunities offer a range of opportunities for our students to study, work or volunteer abroad. Find out more about the reasons to study at cardiffuni ️"
190377228672,Lee ripped off his shirt and hopped up on the grooming table to show his dog that it wasn't that bad!
98888988071,"Awesome! Congratulations to Donna and David on your new 2016 JEEP WRANGLER! Thank you again, Kunes Country Ford Lincoln of Delavan and DANE ANDERSEN."
126364554047791,NAGEZ. ROULEZ. VOLEZ. Décollez avec l'ultra rapide chaussure Skechers Go Run 5!


### Extract the features & construct the pipeline

hint
* create a vector representation of the texts
 * use: 
 * <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.feature.Tokenizer">Tokenizer</a> 
 * <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.feature.StopWordsRemover">StopWordsRemover</a> 
 * <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.feature.HashingTF">HashingTF</a> to compute term frequency and reduce the space or use the <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.feature.CountVectorizer">CountVectorizer</a>
 * <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.feature.IDF">IDF</a> 
 * <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.feature.Normalizer">Normalizer</a> 
 * <a target="_blank" href="https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.clustering.KMeans">KMeans</a> 
* See slides 83, 84, 85, and 101 in the presentation
* After you apply StopWordsRemover, it is a good idea to filter out rows with no (or very few) words. You can use the SQLTransformer defined below, which filters out all rows that have fewer than 10 words. This transformer assumes that the output column of StopWordsRemover is named 'noStopWords'. Add this SQLTransformer to the pipeline right after the StopWordsRemover.
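
The idea behind HashingTF (the "hashing trick") and the short-row filter can be sketched in plain Python. This is an illustration under stated assumptions, not Spark's implementation: `hashing_tf` and `keep_row` are hypothetical helpers, and `zlib.crc32` stands in for the hash function Spark actually uses, so the indices will not match the ones in the notebook output.

```python
import zlib


def hashing_tf(tokens, num_features=1000):
    """Map tokens to a sparse {index: count} vector with a fixed number of slots.

    Each token is hashed and the hash is reduced modulo num_features, so the
    vector width is fixed no matter how large the vocabulary is.
    Different tokens may collide on the same index.
    """
    vector = {}
    for token in tokens:
        index = zlib.crc32(token.encode("utf-8")) % num_features  # stand-in hash
        vector[index] = vector.get(index, 0.0) + 1.0
    return vector


def keep_row(tokens, min_tokens=10):
    """Mirror of the filter `size(noStopWords) >= 10`: drop rows with too few tokens."""
    return len(tokens) >= min_tokens


# Hypothetical example row
tokens = "vote for the film and vote for the director".split()
vector = hashing_tf(tokens, num_features=1000)
```

A smaller `num_features` reduces the dimensionality but increases the chance that unrelated words collide on the same index; that trade-off is what you tune when you change `numFeatures`.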

Note
* You may want to play with some input parameters: 
 * number of clusters for KMeans (try 4-8)
 * distanceMeasure for KMeans (default is 'euclidean', but you can also try 'cosine')
 * numFeatures for HashingTF (try 1000)
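
When comparing the two distance measures, it helps to know how they relate once vectors are normalized to unit length. For unit vectors the squared Euclidean distance equals 2 · (1 − cosine similarity). The sketch below checks this numerically in plain Python; the two vectors and the helper names are made up for illustration.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def squared_euclidean(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


# Hypothetical feature vectors
a = [3.0, 0.0, 4.0]
b = [1.0, 2.0, 2.0]

unit_a, unit_b = l2_normalize(a), l2_normalize(b)
lhs = squared_euclidean(unit_a, unit_b)        # distance between the normalized vectors
rhs = 2.0 * (1.0 - cosine_similarity(a, b))    # same quantity via cosine similarity
```

Because of this identity, both measures rank pairs of normalized vectors the same way. The clusterings can still differ, since cluster centers are averages and are generally not unit length, so it remains worth trying both settings.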

In [13]:
# add this to the pipeline to remove empty or short messages

emptyRowsRemover = SQLTransformer(statement='SELECT * FROM __THIS__ where size(noStopWords) >= 10')

In [14]:
tokenizer = Tokenizer(inputCol='message', outputCol='words')

stopWordsRemover = StopWordsRemover(inputCol='words', outputCol='noStopWords')

hashingTF = HashingTF(numFeatures=1000, inputCol='noStopWords', outputCol='hashingTF')

idf = IDF(inputCol='hashingTF', outputCol='idf')

normalizer = Normalizer(inputCol='idf', outputCol='features')

kmeans = KMeans(featuresCol='features', predictionCol='prediction', k=5, seed=1)

pipeline = Pipeline(stages=[tokenizer, stopWordsRemover, emptyRowsRemover, hashingTF, idf, normalizer, kmeans])

model = pipeline.fit(posts)

### Apply the model on the data

hint
* just call transform, since the model is a transformer
* pass the training data as argument to the transform function

In [16]:
predictions = model.transform(posts)

### See how many posts are in your clusters

hint
* you can simply group by the column prediction and count
* the column with the cluster is called prediction by default

In [18]:
display(
  predictions
  .groupBy('prediction')
  .agg(count('*').alias('cnt'))
  .orderBy(desc('cnt'))
)

prediction,cnt
1,288200
0,28766
2,21562
3,6768
4,3754
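
Raw counts are easier to judge as shares of the total. The following plain-Python sketch uses the counts printed above (copied by hand into a dictionary) to compute the fraction of posts in each cluster; in the notebook you would compute the same thing on the DataFrame instead.

```python
# Cluster sizes copied from the output above: {prediction: cnt}
cluster_counts = {1: 288200, 0: 28766, 2: 21562, 3: 6768, 4: 3754}

total = sum(cluster_counts.values())
shares = {cluster: cnt / total for cluster, cnt in cluster_counts.items()}

# Print clusters from largest to smallest share
for cluster, share in sorted(shares.items(), key=lambda item: item[1], reverse=True):
    print(f"cluster {cluster}: {share:.1%}")

largest_cluster = max(shares, key=shares.get)
```

Here one cluster holds more than 80% of the posts. Such an imbalance may be a reason to experiment with the number of clusters, the distance measure, or the number of features.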


### See what posts are in your clusters

hint
* just filter the result for specific cluster:
 * filter(col('prediction') == 0) and so on for other clusters

In [20]:
display(
  predictions
  .filter(col('prediction') == 0)
)

page_id,message,words,noStopWords,hashingTF,idf,features,prediction
5882213990,"Je veux faire partie de vos étudiants. S'il vous plaît, comment dois-je m'y prendre ?","List(je, veux, faire, partie, de, vos, étudiants., s'il, vous, plaît,, comment, dois-je, m'y, prendre, ?)","List(je, veux, faire, partie, de, vos, étudiants., s'il, vous, plaît,, comment, dois-je, m'y, prendre, ?)","List(0, 1000, List(39, 145, 175, 228, 281, 294, 573, 640, 667, 681, 745, 755, 779, 841, 976), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(39, 145, 175, 228, 281, 294, 573, 640, 667, 681, 745, 755, 779, 841, 976), List(3.091922368003184, 3.1960780135991707, 4.140699654446765, 3.9583480897007006, 2.050513023711846, 3.3676482761238886, 4.111074428044981, 3.738239898122811, 4.19167080525024, 3.7151518798351018, 2.6219660525186645, 3.8696751780955942, 2.6643302550033408, 2.8823962213604877, 3.100411490887389))","List(0, 1000, List(39, 145, 175, 228, 281, 294, 573, 640, 667, 681, 745, 755, 779, 841, 976), List(0.23223863609667092, 0.24006191307325356, 0.31101377259213353, 0.2973170901899588, 0.1540169160950969, 0.2529487967077935, 0.30878858018600963, 0.28078445446315475, 0.3148422873910098, 0.27905028094935613, 0.19693955651349646, 0.2906567431311191, 0.20012159132335436, 0.21650083264259637, 0.2328762660516423))",0
579407188770431,[Maxi Banque Populaire IX] Le Maxi Banque Populaire IX sera mis à l'eau lundi à Lorient ! Ne ratez rien en restant connectés sur les réseaux sociaux de Voile Banque Populaire ! PassionVoile ️,"List([maxi, banque, populaire, ix], le, maxi, banque, populaire, ix, sera, mis, à, l'eau, lundi, à, lorient, !, ne, ratez, rien, en, restant, connectés, sur, les, réseaux, sociaux, de, voile, banque, populaire, !, passionvoile, , ️)","List([maxi, banque, populaire, ix], le, maxi, banque, populaire, ix, sera, mis, à, l'eau, lundi, à, lorient, !, ne, ratez, rien, en, restant, connectés, sur, les, réseaux, sociaux, de, voile, banque, populaire, !, passionvoile, , ️)","List(0, 1000, List(2, 3, 22, 25, 35, 38, 53, 56, 259, 281, 284, 372, 377, 410, 420, 451, 460, 489, 556, 669, 738, 772, 810, 825, 847, 871, 883, 951), List(1.0, 1.0, 2.0, 3.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(2, 3, 22, 25, 35, 38, 53, 56, 259, 281, 284, 372, 377, 410, 420, 451, 460, 489, 556, 669, 738, 772, 810, 825, 847, 871, 883, 951), List(3.8291772783786517, 2.3574385638229733, 6.421067435701964, 6.586143824010502, 3.378763680358356, 4.91144421661795, 3.967088350283624, 3.43618413769408, 3.2711450220488736, 2.050513023711846, 3.5870173771991576, 0.692932335326463, 3.752792665261161, 3.475023971010344, 2.5919043690326036, 4.328509778496269, 2.7421472773601483, 4.515537611646647, 3.909879486175017, 4.120381721502938, 4.430664970094392, 2.4905117702516324, 11.306030293608941, 3.6562172665938975, 3.845260565182122, 3.2619055196995856, 3.993310814201236, 3.9581980634458245))","List(0, 1000, List(2, 3, 22, 25, 35, 38, 53, 56, 259, 281, 284, 372, 377, 410, 420, 451, 460, 489, 556, 669, 738, 772, 810, 825, 847, 871, 883, 951), List(0.1660131115748169, 0.10220621373070282, 0.2783839209147222, 0.28554077025292696, 0.14648553228900774, 0.2129345489776138, 0.17199221478762428, 0.14897498318076402, 
0.141819749790416, 0.08889952661978266, 0.155514324036184, 0.030041924083248236, 0.16270147401453072, 0.15065887533647282, 0.11237142548533513, 0.18766155875510987, 0.11888517266657474, 0.19577010800084677, 0.16951193747219215, 0.178638214094997, 0.19209047389478887, 0.10797557238411407, 0.4901703765974236, 0.1585144695306016, 0.16671039882282984, 0.14141917326368555, 0.17312908375754682, 0.1716067784201128))",0
214574018572873,Notre découverte du jour : un restaurant situé à Chelsea NYC A essayer si vous allez à New-York.,"List(notre, découverte, du, jour, :, un, restaurant, situé, à, chelsea, nyc, a, essayer, si, vous, allez, à, new-york.)","List(notre, découverte, du, jour, :, un, restaurant, situé, à, chelsea, nyc, essayer, si, vous, allez, à, new-york.)","List(0, 1000, List(127, 157, 203, 236, 308, 425, 489, 607, 621, 687, 719, 745, 786, 926, 940, 957), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(127, 157, 203, 236, 308, 425, 489, 607, 621, 687, 719, 745, 786, 926, 940, 957), List(4.021836899412498, 2.1630689403498136, 2.83850736339219, 3.74791817352314, 4.159785937730411, 3.968299920930088, 4.515537611646647, 3.061601953899516, 3.5179417105473965, 3.4426133052987775, 2.872824334216562, 2.6219660525186645, 3.5141817639630744, 2.8541997653481443, 3.83195010645748, 3.60024880427899))","List(0, 1000, List(127, 157, 203, 236, 308, 425, 489, 607, 621, 687, 719, 745, 786, 926, 940, 957), List(0.28929612833394214, 0.15559245325289417, 0.20417764594001847, 0.26959278658740504, 0.2992189878856541, 0.28544514158708495, 0.32480868346165326, 0.22022513938646915, 0.2530502708139445, 0.24763179748023498, 0.20664611172914332, 0.18860153869677884, 0.2527798128076177, 0.20530642148304976, 0.2756373163538591, 0.2589707305805868))",0
225154230895978,"100 000 fans ! Pour fêter ça, le guideMICHELIN donne l'opportunité au 100 000e fan et à 6 d'entre vous d'assiter à un atelier cuisineétoilée avec le chef Stéphanie Le Quellec : il se tiendra en septembre prochain, dans les cuisines du restaurant ""La Scène"", Prince de Galles, a Luxury Collection Hotel ! TalentsContemporains RestaurantLaScene 3x2 places à gagner : tentez votre chance en tagguant un ami avec qui vous aimeriez y aller !! Fin du jeu le 13/07/2017 à 23h59, les 3 gagnants seront tirés au sort parmi vos commentaires.","List(100, 000, fans, !, pour, fêter, ça,, le, guidemichelin, donne, l'opportunité, au, 100, 000e, fan, et, à, 6, d'entre, vous, d'assiter, à, un, atelier, cuisineétoilée, avec, le, chef, stéphanie, le, quellec, :, il, se, tiendra, en, septembre, prochain,, dans, les, cuisines, du, restaurant, ""la, scène"",, prince, de, galles,, a, luxury, collection, hotel, !, , talentscontemporains, restaurantlascene, , , 3x2, places, à, gagner, :, tentez, votre, chance, en, tagguant, un, ami, avec, qui, vous, aimeriez, y, aller, !!, , fin, du, jeu, le, 13/07/2017, à, 23h59,, les, 3, gagnants, seront, tirés, au, sort, parmi, vos, commentaires.)","List(100, 000, fans, !, pour, fêter, ça,, le, guidemichelin, donne, l'opportunité, au, 100, 000e, fan, et, à, 6, d'entre, vous, d'assiter, à, un, atelier, cuisineétoilée, avec, le, chef, stéphanie, le, quellec, :, il, se, tiendra, en, septembre, prochain,, dans, les, cuisines, du, restaurant, ""la, scène"",, prince, de, galles,, luxury, collection, hotel, !, , talentscontemporains, restaurantlascene, , , 3x2, places, à, gagner, :, tentez, votre, chance, en, tagguant, un, ami, avec, qui, vous, aimeriez, y, aller, !!, , fin, du, jeu, le, 13/07/2017, à, 23h59,, les, 3, gagnants, seront, tirés, au, sort, parmi, vos, commentaires.)","List(0, 1000, List(3, 32, 35, 38, 66, 115, 127, 157, 164, 171, 203, 206, 219, 224, 230, 238, 243, 246, 256, 258, 259, 270, 281, 287, 302, 311, 313, 315, 339, 342, 362, 
368, 369, 372, 373, 390, 411, 418, 420, 450, 462, 489, 522, 552, 563, 566, 590, 598, 602, 607, 613, 641, 673, 675, 678, 707, 709, 720, 745, 772, 786, 816, 831, 841, 851, 855, 876, 879, 926, 935, 937, 949, 956, 970), List(2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 4.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 32, 35, 38, 66, 115, 127, 157, 164, 171, 203, 206, 219, 224, 230, 238, 243, 246, 256, 258, 259, 270, 281, 287, 302, 311, 313, 315, 339, 342, 362, 368, 369, 372, 373, 390, 411, 418, 420, 450, 462, 489, 522, 552, 563, 566, 590, 598, 602, 607, 613, 641, 673, 675, 678, 707, 709, 720, 745, 772, 786, 816, 831, 841, 851, 855, 876, 879, 926, 935, 937, 949, 956, 970), List(4.714877127645947, 4.002833952286883, 3.378763680358356, 4.91144421661795, 3.6083570991096887, 2.8646490767342616, 4.021836899412498, 4.326137880699627, 3.5139893267451225, 4.069308987781493, 5.67701472678438, 3.717861429705104, 3.7453685454801544, 3.556339812268653, 3.0377157086680824, 3.9581980634458245, 3.8461989659480826, 6.139672998585678, 3.8236545818960006, 2.9474430485365235, 3.2711450220488736, 3.8253607310072972, 2.050513023711846, 4.039416579623079, 4.111598949431253, 3.6887963751501984, 3.8049478789518165, 2.962626528696895, 3.5653130034947016, 3.822082254528048, 2.9524786366265463, 3.5821647529433647, 3.9255824667688115, 2.771729341305852, 4.8574857089698975, 3.0677407085978405, 3.7031070636921615, 2.996623657996016, 5.183808738065207, 3.9159010097457028, 3.0401087699357547, 9.031075223293294, 4.225977503601093, 4.1958477621490635, 3.9371196520602996, 2.3862685596183986, 5.905176776079411, 3.354684137657673, 3.792922660486507, 3.061601953899516, 3.4087923803616538, 4.101852962090629, 
3.9495348277850035, 3.498523624690295, 2.5009375814569688, 3.616744895804571, 3.750230594683798, 3.9701200336728166, 5.243932105037329, 9.96204708100653, 3.5141817639630744, 3.921525078214651, 3.4152215945143665, 2.8823962213604877, 6.305835348419539, 2.867014036198173, 4.1599694744841615, 3.8947008133156995, 5.708399530696289, 3.800069194220581, 3.3075713637221864, 4.083831363911255, 3.7661975268712133, 2.896253493969596))","List(0, 1000, List(3, 32, 35, 38, 66, 115, 127, 157, 164, 171, 203, 206, 219, 224, 230, 238, 243, 246, 256, 258, 259, 270, 281, 287, 302, 311, 313, 315, 339, 342, 362, 368, 369, 372, 373, 390, 411, 418, 420, 450, 462, 489, 522, 552, 563, 566, 590, 598, 602, 607, 613, 641, 673, 675, 678, 707, 709, 720, 745, 772, 786, 816, 831, 841, 851, 855, 876, 879, 926, 935, 937, 949, 956, 970), List(0.13195868745881933, 0.11203021842545884, 0.09456391087676701, 0.13746015321415223, 0.10098970848874753, 0.08017501240476824, 0.1125620676958973, 0.12107876007110858, 0.09834856916687125, 0.11389070348056501, 0.15888673060815955, 0.10405448564947119, 0.1048243472588374, 0.09953386293628048, 0.08501875381618186, 0.11078096141502967, 0.10764636140271644, 0.17183548338283622, 0.1070153953672207, 0.08249222737193405, 0.09155190939462257, 0.10706314660044279, 0.057389195921915215, 0.11305408296241017, 0.11507430332440066, 0.10324102087697577, 0.10649183729765356, 0.08291717844887124, 0.09978500215039984, 0.10697138950023553, 0.08263316202981653, 0.10025664429046569, 0.10986812504375953, 0.07757440034337278, 0.13594997730633834, 0.08585908527645476, 0.10364154450155902, 0.08386868077612694, 0.145083016713561, 0.10959716308083427, 0.08508573015837304, 0.252759255552998, 0.11827549891721675, 0.11743223597995577, 0.11019102462031628, 0.06678622974183626, 0.16527246744451252, 0.09388997924220557, 0.10615527878252635, 0.08568727549420768, 0.09540434589368296, 0.11480153530465356, 0.11053849715222944, 0.09791571934098339, 0.06999555486407107, 0.10122446384133946, 
0.1049604249579806, 0.1111146302467904, 0.14676573301272516, 0.2788150404820687, 0.09835395504692955, 0.10975456796610462, 0.095584284975448, 0.08067171461940005, 0.1764859896411185, 0.08024120224138587, 0.1164280843084796, 0.10900381779970938, 0.15976511988920444, 0.1063552940079283, 0.09257139985132018, 0.11429715175927624, 0.10540730307530012, 0.08105954816325531))",0
720989584583157,"Milé zákaznice Kalhotky (brazilky) s černou krajkou po obvodu od vaší oblíbené značky Victoria's Secret ! Cena 290,- Velikosti XS, S, M, L ! Pouze do vyprodání zásob ! Doba dodání 1-3 dny . Objednávky do zpráv!","List(milé, zákaznice, , kalhotky, (brazilky), s, černou, krajkou, po, obvodu, od, vaší, oblíbené, značky, victoria's, secret, !, cena, 290,-, , velikosti, xs,, s,, m,, l, !, , pouze, do, vyprodání, zásob, !, doba, dodání, 1-3, dny, ., objednávky, do, zpráv!)","List(milé, zákaznice, , kalhotky, (brazilky), černou, krajkou, po, obvodu, od, vaší, oblíbené, značky, victoria's, secret, !, cena, 290,-, , velikosti, xs,, s,, m,, l, !, , pouze, vyprodání, zásob, !, doba, dodání, 1-3, dny, ., objednávky, zpráv!)","List(0, 1000, List(6, 7, 38, 39, 56, 95, 102, 120, 206, 266, 309, 354, 372, 381, 395, 445, 451, 488, 508, 547, 571, 572, 590, 633, 650, 678, 693, 719, 745, 847, 854, 959, 993), List(1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(6, 7, 38, 39, 56, 95, 102, 120, 206, 266, 309, 354, 372, 381, 395, 445, 451, 488, 508, 547, 571, 572, 590, 633, 650, 678, 693, 719, 745, 847, 854, 959, 993), List(4.0210378582193655, 3.422306688661753, 7.367166324926925, 3.091922368003184, 3.43618413769408, 4.171972203752553, 4.043982797602661, 3.7706645897653606, 3.717861429705104, 3.947751766391347, 4.364338467021407, 3.7541373265480553, 2.078797005979389, 3.9727042108348534, 3.71092552933192, 3.723895879774548, 4.328509778496269, 3.629946053439997, 3.3184307716663834, 3.600458579819932, 3.7586735937518743, 3.6350367765921425, 2.9525883880397057, 3.6373197585045207, 3.3184307716663834, 2.5009375814569688, 3.742704460591343, 2.872824334216562, 2.6219660525186645, 3.845260565182122, 4.194706860713271, 2.4718452465817693, 3.6414641640439425))","List(0, 1000, List(6, 7, 38, 39, 56, 95, 102, 120, 206, 266, 309, 354, 372, 381, 
395, 445, 451, 488, 508, 547, 571, 572, 590, 633, 650, 678, 693, 719, 745, 847, 854, 959, 993), List(0.18644673974023718, 0.15868488360235197, 0.34159940563577623, 0.14336568452488765, 0.15932835059252215, 0.19344523555941132, 0.18751064644601667, 0.17483747833377325, 0.17238911117376438, 0.18304862378823059, 0.2023648388775389, 0.17407114524954315, 0.09638927511074405, 0.1842056141173645, 0.17206750862801867, 0.17266891543854476, 0.20070354087433168, 0.16831266726658764, 0.153868384294412, 0.1669453975433247, 0.1742814820482912, 0.168548712976204, 0.13690507231102353, 0.16865456985929173, 0.153868384294412, 0.11596301124182239, 0.1735410282353104, 0.13320658741529176, 0.12157483700443568, 0.17829633072576667, 0.1944993919287114, 0.11461406323880255, 0.16884673688885773))",0
277102898996105,"Toujours plus avec la Bugaboo Donkey², et même plus d'enfants. Disponible à partir de Novembre 2017. En savoir plus : bit.ly/bugaboodonkey2 A la recherche d'une poussette pour jumeaux ? bit.ly/Donkey2_Twin","List(toujours, plus, avec, la, bugaboo, donkey²,, et, même, plus, d'enfants., , disponible, à, partir, de, novembre, 2017., , en, savoir, plus, :, bit.ly/bugaboodonkey2, a, la, recherche, d'une, poussette, pour, jumeaux, ?, , bit.ly/donkey2_twin)","List(toujours, plus, avec, la, bugaboo, donkey²,, et, même, plus, d'enfants., , disponible, à, partir, de, novembre, 2017., , en, savoir, plus, :, bit.ly/bugaboodonkey2, la, recherche, d'une, poussette, pour, jumeaux, ?, , bit.ly/donkey2_twin)","List(0, 1000, List(3, 213, 240, 281, 284, 331, 372, 423, 489, 515, 529, 566, 590, 639, 678, 735, 759, 766, 779, 785, 849, 860, 926, 975, 976), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 3.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 213, 240, 281, 284, 331, 372, 423, 489, 515, 529, 566, 590, 639, 678, 735, 759, 766, 779, 785, 849, 860, 926, 975, 976), List(2.3574385638229733, 3.5059402223453446, 3.219523504421303, 2.050513023711846, 3.5870173771991576, 3.886289155699058, 2.078797005979389, 4.020558739788097, 2.2577688058233236, 2.8731284088185403, 3.9375604072279433, 2.3862685596183986, 2.9525883880397057, 3.753281423824166, 5.0018751629139375, 3.4598714702560156, 3.699510146198087, 7.595533039086304, 2.6643302550033408, 4.786980323858369, 3.194399221934601, 8.951237961474199, 2.8541997653481443, 3.849423041819835, 3.100411490887389))","List(0, 1000, List(3, 213, 240, 281, 284, 331, 372, 423, 489, 515, 529, 566, 590, 639, 678, 735, 759, 766, 779, 785, 849, 860, 926, 975, 976), List(0.11794286948448038, 0.17540251373249277, 0.16107305883229492, 0.10258735631255846, 0.17945881129198848, 0.19443140606228965, 0.10400240656255204, 0.20114892577837848, 0.11295638226920414, 0.14374288014696454, 
0.1969965115204658, 0.11938523684179714, 0.14771827025991985, 0.18777701692923515, 0.2502442772296663, 0.17309768980267065, 0.1850868349340862, 0.380005220768623, 0.13329668919126428, 0.23949306854725733, 0.15981608865470737, 0.4478312634805764, 0.14279587836265215, 0.1925869909736495, 0.15511387377362587))",0
1728156394122761,"Énormément d'émotions sur la ligne d'arrivée aujourd'hui. Les Amazones se sont attendues et soutenues à l'arrivée Si vous voulez nous rejoindre en mars, c'est possible : RaidAmazones","List(énormément, d'émotions, sur, la, ligne, d'arrivée, aujourd'hui., les, amazones, se, sont, attendues, et, soutenues, à, l'arrivée, , si, vous, voulez, nous, rejoindre, en, mars,, c'est, possible, :, , raidamazones)","List(énormément, d'émotions, sur, la, ligne, d'arrivée, aujourd'hui., les, amazones, se, sont, attendues, et, soutenues, à, l'arrivée, , si, vous, voulez, nous, rejoindre, en, mars,, c'est, possible, :, , raidamazones)","List(0, 1000, List(3, 66, 100, 190, 212, 218, 232, 372, 373, 420, 460, 489, 507, 553, 566, 570, 592, 599, 624, 719, 745, 764, 775, 785, 806, 818, 926, 932), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 66, 100, 190, 212, 218, 232, 372, 373, 420, 460, 489, 507, 553, 566, 570, 592, 599, 624, 719, 745, 764, 775, 785, 806, 818, 926, 932), List(2.3574385638229733, 3.6083570991096887, 3.604979430709536, 3.9992326018040463, 3.7958516660825876, 3.693045104054072, 3.5213281005089145, 1.385864670652926, 2.4287428544849488, 2.5919043690326036, 2.7421472773601483, 2.2577688058233236, 4.023597040678187, 3.7650838721127937, 2.3862685596183986, 4.3781693189430175, 3.959098558779075, 3.4520638954156313, 3.094385656790193, 2.872824334216562, 2.6219660525186645, 3.7046193073352085, 2.979339698252861, 2.3934901619291846, 3.957748119681203, 3.9209467928146986, 2.8541997653481443, 3.66636171566567))","List(0, 1000, List(3, 66, 100, 190, 212, 218, 232, 372, 373, 420, 460, 489, 507, 553, 566, 570, 592, 599, 624, 719, 745, 764, 775, 785, 806, 818, 926, 932), List(0.13528707677882595, 0.20707393668874072, 0.2068801013578026, 0.22950523350193083, 0.21783374704686975, 0.21193395416829558, 0.20207957586154104, 0.07953105670742895, 
0.13937903878940872, 0.14874244052750624, 0.15736447810095663, 0.1295672966708943, 0.23090344330608942, 0.2160680658668875, 0.13694155291854937, 0.25125139543064773, 0.22720204840795366, 0.19810468889045346, 0.1775784940306545, 0.16486368393200718, 0.1504675999205401, 0.21259816665382036, 0.17097631501119634, 0.13735598130759766, 0.22712455032986054, 0.22501262087822824, 0.1637948698737574, 0.21040266607068894))",0
175807419122004,Vous êtes né(e) entre le 18 et le 22 novembre ? Joyeux anniversaire ! Mihael est votre ange gardien !,"List(vous, êtes, né(e), entre, le, 18, et, le, 22, novembre, ?, joyeux, anniversaire, !, , , mihael, est, votre, ange, gardien, !)","List(vous, êtes, né(e), entre, le, 18, et, le, 22, novembre, ?, joyeux, anniversaire, !, , , mihael, est, votre, ange, gardien, !)","List(0, 1000, List(38, 46, 53, 166, 284, 301, 342, 372, 429, 566, 607, 656, 679, 693, 745, 772, 790, 886, 976), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0))","List(0, 1000, List(38, 46, 53, 166, 284, 301, 342, 372, 429, 566, 607, 656, 679, 693, 745, 772, 790, 886, 976), List(4.91144421661795, 4.273562511909724, 3.967088350283624, 3.874630453204228, 3.5870173771991576, 3.588882083012448, 3.822082254528048, 1.385864670652926, 3.6154662595088958, 2.3862685596183986, 3.061601953899516, 4.014192332800399, 4.342731656973723, 3.742704460591343, 2.6219660525186645, 4.981023540503265, 3.2019760787241576, 3.833405614488173, 3.100411490887389))","List(0, 1000, List(38, 46, 53, 166, 284, 301, 342, 372, 429, 566, 607, 656, 679, 693, 745, 772, 790, 886, 976), List(0.3053134354056337, 0.26566036269274046, 0.24660880168090446, 0.24086127876449248, 0.22298219220059015, 0.2230981091718041, 0.23759468947739065, 0.08615044474549832, 0.22475067935189447, 0.14833922968573127, 0.19032043715918148, 0.24953695846925272, 0.26996017113444204, 0.23266027387724275, 0.16299105267302935, 0.3096387420713898, 0.19904660901455132, 0.23829859117662264, 0.1927329807088319))",0
172742172763985,Les ASTUCES de Noël par Cuisineaddict.com ! Utilisez des emporte-pièces pour réaliser vos pancakes... ou encore vos brownies ! Retrouver-les ICI : bit.ly/2B1QnuP et LÀ : bit.ly/2yShtzT,"List(les, astuces, de, noël, par, cuisineaddict.com, !, , , utilisez, des, emporte-pièces, pour, réaliser, vos, pancakes..., ou, encore, vos, brownies, !, , , retrouver-les, ici, :, bit.ly/2b1qnup, et, là, :, bit.ly/2yshtzt)","List(les, astuces, de, noël, par, cuisineaddict.com, !, , , utilisez, des, emporte-pièces, pour, réaliser, vos, pancakes..., ou, encore, vos, brownies, !, , , retrouver-les, ici, :, bit.ly/2b1qnup, et, là, :, bit.ly/2yshtzt)","List(0, 1000, List(38, 106, 118, 237, 252, 265, 281, 293, 303, 345, 372, 404, 419, 420, 566, 596, 678, 731, 735, 760, 799, 822, 841, 861, 926), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0))","List(0, 1000, List(38, 106, 118, 237, 252, 265, 281, 293, 303, 345, 372, 404, 419, 420, 566, 596, 678, 731, 735, 760, 799, 822, 841, 861, 926), List(4.91144421661795, 3.92253788304694, 2.936096872149222, 3.6558845801478017, 3.648373222618163, 3.545458746589327, 2.050513023711846, 3.342290990860326, 4.005504180842761, 4.104975254306251, 2.771729341305852, 3.402662423761032, 3.844724731277808, 2.5919043690326036, 2.3862685596183986, 3.65046631068609, 2.5009375814569688, 3.700669029164731, 3.4598714702560156, 3.9071673297769483, 3.6800118796671573, 3.7271053921878825, 5.764792442720975, 3.8522526603621507, 5.708399530696289))","List(0, 1000, List(38, 106, 118, 237, 252, 265, 281, 293, 303, 345, 372, 404, 419, 420, 566, 596, 678, 731, 735, 760, 799, 822, 841, 861, 926), List(0.26226361300488704, 0.2094575265409056, 0.15678308453892023, 0.1952186478010446, 0.19481755279163185, 0.18932207709784227, 0.10949426083119791, 0.17847322952602288, 0.21388780001198554, 0.2191994032727416, 0.1480061096617916, 0.1816969717526472, 0.20530242318999692, 
0.13840373104221695, 0.12742309317654027, 0.19492932049473935, 0.13354624364730516, 0.1976100689162173, 0.1847518473795704, 0.2086367624933709, 0.19650700871179003, 0.19902173028818232, 0.3078311037587986, 0.2057044030950978, 0.3048198084649582))",0
126259790016,Petit sondage du mardi ... qui n'a pas ses chaussons à la maison ?,"List(petit, sondage, du, mardi, ..., qui, n'a, pas, ses, chaussons, à, la, maison, ?)","List(petit, sondage, du, mardi, ..., qui, n'a, pas, ses, chaussons, à, la, maison, ?)","List(0, 1000, List(59, 129, 136, 203, 259, 489, 600, 683, 782, 785, 882, 909, 947, 976), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(59, 129, 136, 203, 259, 489, 600, 683, 782, 785, 882, 909, 947, 976), List(3.661666871361462, 4.220892415389494, 3.4639812527427645, 2.83850736339219, 3.2711450220488736, 2.2577688058233236, 3.906027566154489, 3.874354515012632, 3.5366611808276285, 2.3934901619291846, 2.9993227149651807, 3.7518158642066934, 4.044146263070945, 3.100411490887389))","List(0, 1000, List(59, 129, 136, 203, 259, 489, 600, 683, 782, 785, 882, 909, 947, 976), List(0.2853873714100343, 0.32897296060814896, 0.26997991326456405, 0.22123098130587904, 0.25495041251429884, 0.17596868513216654, 0.3044326474585849, 0.3019640753225977, 0.2756445284136471, 0.18654669848619274, 0.23376480048717774, 0.29241351141872296, 0.31519750762755777, 0.2416435783916162))",0


In [21]:
# Show the posts that the model assigned to cluster 1
display(
  predictions
  .filter(col('prediction') == 1)
)

page_id,message,words,noStopWords,hashingTF,idf,features,prediction
6510408962,"A quick throwback to Graduation last week... As well as celebrating the achievements of our students, we also formally welcomed our new chancellor, the incredibly inspiring Margaret Casely-Hayford.","List(, a, quick, throwback, to, graduation, last, week..., , as, well, as, celebrating, the, achievements, of, our, students,, we, also, formally, welcomed, our, new, chancellor,, the, incredibly, inspiring, margaret, casely-hayford.)","List(, quick, throwback, graduation, last, week..., , well, celebrating, achievements, students,, also, formally, welcomed, new, chancellor,, incredibly, inspiring, margaret, casely-hayford.)","List(0, 1000, List(25, 39, 48, 54, 69, 84, 114, 120, 155, 157, 178, 263, 372, 612, 632, 789, 792, 959, 989), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(25, 39, 48, 54, 69, 84, 114, 120, 155, 157, 178, 263, 372, 612, 632, 789, 792, 959, 989), List(2.1953812746701673, 3.091922368003184, 3.8674806908618775, 4.047749280398138, 3.6294060091432407, 3.5300889859024323, 3.6702909938055592, 3.7706645897653606, 3.5984674866811264, 2.1630689403498136, 3.4226578739155817, 3.5180383055728277, 1.385864670652926, 3.963160852787954, 4.025841710532011, 3.8965327027873364, 2.7000064552320593, 2.4718452465817693, 2.690460552150776))","List(0, 1000, List(25, 39, 48, 54, 69, 84, 114, 120, 155, 157, 178, 263, 372, 612, 632, 789, 792, 959, 989), List(0.15134453662628009, 0.21315001793489216, 0.26661522525626663, 0.27904252727729595, 0.25020290416977586, 0.243356216974751, 0.25302142099420344, 0.2599409458828048, 0.24807007357676922, 0.14911699860295333, 0.23595016316055023, 0.24252547078430192, 0.09553832349153613, 0.2732111955954985, 0.27753226978888085, 0.2686180041510535, 0.18613223614948324, 0.17040332709952422, 0.18547416354260393))",1
66189885427,"TAKE A SNACK BREAK! Popcorn, cookies & Candy! PopArt - 161 Valley Rd, Clifton. Opens 12pm M-F. Mention to Lisa you are from Montclaire State to receive a special deal!","List(take, a, snack, break!, popcorn,, cookies, &, candy!, popart, -, 161, valley, rd,, clifton., opens, 12pm, m-f., mention, to, lisa, you, are, from, montclaire, state, to, receive, a, special, deal!)","List(take, snack, break!, popcorn,, cookies, &, candy!, popart, -, 161, valley, rd,, clifton., opens, 12pm, m-f., mention, lisa, montclaire, state, receive, special, deal!)","List(0, 1000, List(3, 146, 148, 181, 313, 323, 334, 349, 429, 438, 499, 508, 562, 604, 609, 684, 743, 787, 817, 855, 860, 894), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 146, 148, 181, 313, 323, 334, 349, 429, 438, 499, 508, 562, 604, 609, 684, 743, 787, 817, 855, 860, 894), List(2.3574385638229733, 3.3485499017618596, 3.7000894198052023, 3.8818317029920415, 3.8049478789518165, 4.184120750510544, 3.4400188059681924, 3.605822778851446, 3.6154662595088958, 3.859022250254804, 2.2295712966342998, 3.3184307716663834, 2.610557519489445, 3.6748005843115346, 3.922248405551794, 4.2780967323738155, 3.700205314804463, 8.42895056213772, 3.2818427455557644, 2.867014036198173, 2.9837459871580663, 3.818553531007595))","List(0, 1000, List(3, 146, 148, 181, 313, 323, 334, 349, 429, 438, 499, 508, 562, 604, 609, 684, 743, 787, 817, 855, 860, 894), List(0.1302641570208009, 0.18502964908138725, 0.20445454510207242, 0.2144970147883141, 0.21024867225212762, 0.23120049480934235, 0.19008391428384158, 0.19924568634008194, 0.19977855277311082, 0.21323664084749602, 0.12319864022367394, 0.18336536685902277, 0.14425066249834273, 0.203057168776694, 0.21673030636557955, 0.23639329272377418, 0.20446094907145398, 0.4657555689452644, 0.18134363511530097, 0.15842159041740517, 0.1648718069458584, 0.2110004750023508))",1
422179114514548,"If you know Simone, our office manager, then you'll know her beautiful handmade hats! Get your orders in ASAP!","List(if, you, know, simone,, our, office, manager,, then, you'll, know, her, beautiful, handmade, hats!, get, your, orders, in, asap!)","List(know, simone,, office, manager,, know, beautiful, handmade, hats!, get, orders, asap!)","List(0, 1000, List(192, 414, 488, 534, 726, 779, 798, 857, 872, 959), List(1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(192, 414, 488, 534, 726, 779, 798, 857, 872, 959), List(3.6366669459473364, 3.880720437424147, 3.629946053439997, 3.3942626143658856, 3.9558006993968076, 5.3286605100066815, 3.2989989920251186, 3.8829442048450673, 3.993310814201236, 2.4718452465817693))","List(0, 1000, List(192, 414, 488, 534, 726, 779, 798, 857, 872, 959), List(0.3019639321559086, 0.3222284636728356, 0.3014058752430052, 0.281836335589077, 0.32846261474290406, 0.44245549692656083, 0.273926296418322, 0.322413110098973, 0.33157719793963786, 0.20524511082064087))",1
106079664342,"Three Film Fest Finalists are in the running for our $1,500 Grand Prize with less than a week left to win the public vote. This week we'll be sharing their messages to you, asking you to pick their film! First up is Jake Hatfield ( IES Abroad Tokyo | Penn State), and his film ""Sapporo""! Vote for Jake's film here:","List(three, film, fest, finalists, are, in, the, running, for, our, $1,500, grand, prize, with, less, than, a, week, left, to, win, the, public, vote., this, week, we'll, be, sharing, their, messages, to, you,, asking, you, to, pick, their, film!, first, up, is, jake, hatfield, (, ies, abroad, tokyo, |, penn, state),, and, his, film, ""sapporo""!, , vote, for, jake's, film, here:)","List(three, film, fest, finalists, running, $1,500, grand, prize, less, week, left, win, public, vote., week, sharing, messages, you,, asking, pick, film!, first, jake, hatfield, (, ies, abroad, tokyo, |, penn, state),, film, ""sapporo""!, , vote, jake's, film, here:)","List(0, 1000, List(3, 5, 59, 122, 168, 183, 189, 195, 214, 219, 224, 257, 333, 372, 434, 492, 498, 565, 590, 619, 653, 662, 694, 696, 713, 716, 736, 806, 845, 851, 886, 950, 961, 963), List(3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 5, 59, 122, 168, 183, 189, 195, 214, 219, 224, 257, 333, 372, 434, 492, 498, 565, 590, 619, 653, 662, 694, 696, 713, 716, 736, 806, 845, 851, 886, 950, 961, 963), List(7.07231569146892, 4.40033089074587, 3.661666871361462, 6.778648665450606, 3.2346884397586577, 3.0531312003027793, 3.4799402649438864, 3.9504275521412717, 3.4272345409429934, 3.7453685454801544, 3.556339812268653, 4.09615395694146, 3.8150372156048395, 1.385864670652926, 3.96557594830145, 4.030507473963632, 3.2876567154211838, 4.270892831712348, 2.9525883880397057, 4.15484313590517, 3.4517022317588864, 3.5089031874760015, 3.7579365835232057, 
3.278568173844731, 3.730325238747619, 3.0032984051432297, 3.653005924484114, 3.957748119681203, 4.014192332800399, 3.1529176742097693, 3.833405614488173, 4.328509778496269, 4.141960802258275, 3.3468390373582304))","List(0, 1000, List(3, 5, 59, 122, 168, 183, 189, 195, 214, 219, 224, 257, 333, 372, 434, 492, 498, 565, 590, 619, 653, 662, 694, 696, 713, 716, 736, 806, 845, 851, 886, 950, 961, 963), List(0.30901499684494826, 0.19226633759587444, 0.1599914407194681, 0.2961836246215646, 0.1413352120596305, 0.1334023210200928, 0.15205114942612136, 0.17260843701217526, 0.14974824612734852, 0.16364866894457475, 0.15538932137791878, 0.17897574957061715, 0.16669274457948177, 0.06055342910468849, 0.17327032511161208, 0.17610741780839884, 0.1436495871880601, 0.18661072164930284, 0.12900924268908256, 0.18154009161125603, 0.15081732784401974, 0.15331664403995457, 0.16419781188525334, 0.14325247600565622, 0.1629913725815038, 0.13122494635091925, 0.15961301269267536, 0.1729282990282309, 0.1753945504089468, 0.13776235219866884, 0.16749532621901847, 0.18912821399627566, 0.18097721596133942, 0.14623547642497772))",1
446653042113846,TRIFIX i-SIZE je dokonale přizpůsobenaý potřebám dětí s výškou od 76 cm do 105 cm a je perfektním nástupcem jakékoliv dětské autosedačky. bit.ly/trifix-i-size,"List(trifix, i-size, je, dokonale, přizpůsobenaý, potřebám, dětí, s, výškou, od, 76, cm, do, 105, cm, a, je, perfektním, nástupcem, jakékoliv, dětské, autosedačky., bit.ly/trifix-i-size)","List(trifix, i-size, je, dokonale, přizpůsobenaý, potřebám, dětí, výškou, od, 76, cm, 105, cm, je, perfektním, nástupcem, jakékoliv, dětské, autosedačky., bit.ly/trifix-i-size)","List(0, 1000, List(39, 95, 106, 112, 188, 200, 263, 289, 324, 345, 539, 590, 622, 664, 712, 721, 767, 844), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0))","List(0, 1000, List(39, 95, 106, 112, 188, 200, 263, 289, 324, 345, 539, 590, 622, 664, 712, 721, 767, 844), List(6.183844736006368, 4.171972203752553, 3.92253788304694, 4.105670422912627, 3.2762899225555397, 3.4705922400011175, 3.5180383055728277, 4.288270182518224, 3.2652759307094916, 4.104975254306251, 3.821297016953346, 2.9525883880397057, 3.4986183620310887, 3.769670183219776, 3.0070581342445175, 7.1925428147343125, 3.574469644946501, 4.248584058230007))","List(0, 1000, List(39, 95, 106, 112, 188, 200, 263, 289, 324, 345, 539, 590, 622, 664, 712, 721, 767, 844), List(0.3510381471784658, 0.23683023345621218, 0.22267060210690282, 0.2330664820532765, 0.18598506158098202, 0.1970150159896736, 0.19970838551304193, 0.24343211767700965, 0.18535983060321132, 0.23302701943576645, 0.21692346459465175, 0.16760950531036165, 0.19860597410230443, 0.21399276551791083, 0.1707015879226051, 0.4082988837809527, 0.2029118218317434, 0.24117925653097977))",1
153527691354397,"Cardiff University Global Opportunities offer a range of opportunities for our students to study, work or volunteer abroad. Find out more about the reasons to study at cardiffuni ️","List(cardiff, university, global, opportunities, offer, a, range, of, opportunities, for, our, students, to, study,, work, or, volunteer, abroad., find, out, more, about, the, reasons, to, study, at, cardiffuni, ️)","List(cardiff, university, global, opportunities, offer, range, opportunities, students, study,, work, volunteer, abroad., find, reasons, study, cardiffuni, ️)","List(0, 1000, List(22, 82, 140, 146, 296, 369, 382, 412, 510, 523, 527, 845, 849, 857, 870, 934), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(22, 82, 140, 146, 296, 369, 382, 412, 510, 523, 527, 845, 849, 857, 870, 934), List(3.210533717850982, 3.700553080413949, 3.639171711754565, 3.3485499017618596, 3.6917800817474853, 3.9255824667688115, 7.724576007199031, 3.1273652149329827, 2.9397743410062183, 3.0933743426277713, 3.367897642319437, 4.014192332800399, 3.194399221934601, 3.8829442048450673, 3.938883839521687, 3.739083017715062))","List(0, 1000, List(22, 82, 140, 146, 296, 369, 382, 412, 510, 523, 527, 845, 849, 857, 870, 934), List(0.20411962239439646, 0.2352741206935912, 0.231371610116256, 0.21289442864781316, 0.234716350138151, 0.24958106072508948, 0.4911138384809406, 0.19883192729362903, 0.18690525661651913, 0.19667085233552517, 0.21412452116322236, 0.25521470733475426, 0.20309382185672759, 0.24687020119571076, 0.2504267366849923, 0.2377237807638325))",1
98888988071,"Awesome! Congratulations to Donna and David on your new 2016 JEEP WRANGLER! Thank you again, Kunes Country Ford Lincoln of Delavan and DANE ANDERSEN.","List(awesome!, congratulations, to, donna, and, david, on, your, new, 2016, jeep, wrangler!, , thank, you, again,, kunes, country, ford, lincoln, of, delavan, and, dane, andersen.)","List(awesome!, congratulations, donna, david, new, 2016, jeep, wrangler!, , thank, again,, kunes, country, ford, lincoln, delavan, dane, andersen.)","List(0, 1000, List(25, 84, 121, 210, 220, 309, 313, 322, 333, 372, 490, 504, 605, 676, 722, 769, 808, 836), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(25, 84, 121, 210, 220, 309, 313, 322, 333, 372, 490, 504, 605, 676, 722, 769, 808, 836), List(2.1953812746701673, 3.5300889859024323, 4.057973643872748, 4.177747720505446, 3.5304801456961936, 4.364338467021407, 3.8049478789518165, 3.2794048348556966, 3.8150372156048395, 0.692932335326463, 3.931991811361012, 3.4049891084247648, 2.697324404197966, 4.079418593682047, 4.017211322937998, 4.453296426325783, 4.208869776877175, 4.049884454076199))","List(0, 1000, List(25, 84, 121, 210, 220, 309, 313, 322, 333, 372, 490, 504, 605, 676, 722, 769, 808, 836), List(0.1405338861393792, 0.22597310514148494, 0.2597648128844013, 0.2674319623869303, 0.2259981446216668, 0.2793762761278205, 0.24356776572544264, 0.2099259527189784, 0.24421361877373868, 0.04435697572837031, 0.2517002836337842, 0.21796503285794133, 0.17266498766027613, 0.2611375776836353, 0.257155476895653, 0.28507078025259985, 0.26942419197230155, 0.2592469960974294))",1
126364554047791,NAGEZ. ROULEZ. VOLEZ. Décollez avec l'ultra rapide chaussure Skechers Go Run 5!,"List(nagez., roulez., volez., , décollez, avec, l'ultra, rapide, chaussure, skechers, go, run, 5!)","List(nagez., roulez., volez., , décollez, avec, l'ultra, rapide, chaussure, skechers, go, run, 5!)","List(0, 1000, List(40, 77, 223, 364, 372, 470, 495, 515, 590, 603, 641, 654, 674), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(40, 77, 223, 364, 372, 470, 495, 515, 590, 603, 641, 654, 674), List(4.052024196477655, 3.0915439384811654, 4.5102666457458644, 3.8138678527742598, 0.692932335326463, 3.480405316340446, 3.442075959772683, 2.8731284088185403, 2.9525883880397057, 4.441308515178509, 4.101852962090629, 3.8855913676587432, 3.708001037571024))","List(0, 1000, List(40, 77, 223, 364, 372, 470, 495, 515, 590, 603, 641, 654, 674), List(0.3129222931901304, 0.23874808535657974, 0.34831060064069297, 0.2945303918598703, 0.053512507547323805, 0.268778358669996, 0.26581832941742817, 0.2218806914081975, 0.22801708095304318, 0.34298522860318165, 0.31677037771440336, 0.30006932392568053, 0.2863547036162192))",1
189010391223,"We created a giant Boston Red Sox hat for their biggest fan at the Museum of Science, Boston.","List(we, created, a, giant, boston, red, sox, hat, for, their, biggest, fan, at, the, museum, of, science,, boston.)","List(created, giant, boston, red, sox, hat, biggest, fan, museum, science,, boston.)","List(0, 1000, List(8, 235, 245, 262, 291, 355, 447, 463, 466, 563, 628), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(8, 235, 245, 262, 291, 355, 447, 463, 466, 563, 628), List(3.6917800817474853, 3.6414641640439425, 3.3859327744463292, 3.6928149809371895, 3.695003291629054, 3.464438939601411, 4.04349256146244, 3.9762111138724183, 3.1228655497061975, 3.9371196520602996, 3.7622434873689525))","List(0, 1000, List(8, 235, 245, 262, 291, 355, 447, 463, 466, 563, 628), List(0.302225376660501, 0.29810629403823, 0.2771873690861337, 0.3023100980660257, 0.3024892428169194, 0.28361422951898346, 0.33101810924595226, 0.32551015362836494, 0.2556515274802909, 0.32230995439960025, 0.30799382137637615))",1
22156858944,This warmhearted police officer was captured distracting a young girl after she lost her father to a horrific car accident. He deserves endless shares! ♥,"List(this, warmhearted, police, officer, was, captured, distracting, a, young, girl, after, she, lost, her, father, to, a, horrific, car, accident., he, deserves, endless, shares!, ♥)","List(warmhearted, police, officer, captured, distracting, young, girl, lost, father, horrific, car, accident., deserves, endless, shares!, ♥)","List(0, 1000, List(29, 107, 233, 245, 317, 320, 351, 372, 410, 521, 555, 706, 714, 862, 900, 942), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(29, 107, 233, 245, 317, 320, 351, 372, 410, 521, 555, 706, 714, 862, 900, 942), List(4.272534868206092, 3.5622793273891324, 3.9535583784034594, 3.3859327744463292, 3.842183433849135, 3.5776431133414945, 4.116507795193133, 0.692932335326463, 3.475023971010344, 4.0597986132718304, 4.263740456274265, 3.2888082144161124, 3.4741915240307644, 3.862424308377209, 3.4171409809001703, 3.945231192738668))","List(0, 1000, List(29, 107, 233, 245, 317, 320, 351, 372, 410, 521, 555, 706, 714, 862, 900, 942), List(0.29151276917871594, 0.2430524134613589, 0.2697491738626843, 0.23102038246120887, 0.262150120957169, 0.24410067636059712, 0.2808671254276669, 0.04727840267089614, 0.2370990271583148, 0.27699788827228244, 0.2909127311557093, 0.2243936256709101, 0.23704222974607658, 0.263531145001744, 0.23314970169793636, 0.26918101443801284))",1
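
The output above hints at how the vector columns relate. The `idf` column appears to hold the term counts from `hashingTF` multiplied by the per-bucket IDF weights (bucket 779 has a count of 2.0 and roughly twice the weight of its neighbours). The `features` column looks like the same vector scaled to unit L2 length, which is what a `Normalizer` with its default `p=2.0` would produce. The sketch below checks this in plain Python, without Spark, on the values copied from the row for page 422179114514548. The helper `l2_normalize` is a hypothetical name introduced only for this illustration and is not part of the notebook's pipeline.

```python
import math

# Values copied from the "idf" column of the row for page 422179114514548
tfidf = [
    3.6366669459473364, 3.880720437424147, 3.629946053439997,
    3.3942626143658856, 3.9558006993968076, 5.3286605100066815,
    3.2989989920251186, 3.8829442048450673, 3.993310814201236,
    2.4718452465817693,
]

# Values copied from the "features" column of the same row
features = [
    0.3019639321559086, 0.3222284636728356, 0.3014058752430052,
    0.281836335589077, 0.32846261474290406, 0.44245549692656083,
    0.273926296418322, 0.322413110098973, 0.33157719793963786,
    0.20524511082064087,
]


def l2_normalize(values):
    """Scale a vector so that its Euclidean (L2) norm equals 1."""
    norm = math.sqrt(sum(v * v for v in values))
    return [v / norm for v in values]


normalized = l2_normalize(tfidf)

# Largest element-wise gap between our result and the notebook's output
max_diff = max(abs(a - b) for a, b in zip(normalized, features))
print("largest difference:", max_diff)
```

If the assumption holds, the printed difference should be close to zero. Unit-length vectors matter here because KMeans uses Euclidean distance, and for unit vectors that distance depends only on the angle between them, so long and short posts are compared by content rather than by length.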


In [22]:
# Show the posts that the model assigned to cluster 2
display(
  predictions
  .filter(col('prediction') == 2)
)

page_id,message,words,noStopWords,hashingTF,idf,features,prediction
110427862329490,Une heure de sommeil gagnée ce week-end avec le passage à l'heure d'hiver : la raison parfaite pour venir passer cette longue nuit dans une de nos chambres et profitez de nos jardins colorés… One more hour of sleep this weekend to enter Winter time: the perfect reason to spend this long night in our of our rooms and enjoy our colourful gardens…,"List(une, heure, de, sommeil, gagnée, ce, week-end, avec, le, passage, à, l'heure, d'hiver, :, la, raison, parfaite, pour, venir, passer, cette, longue, nuit, dans, une, de, nos, chambres, et, profitez, de, nos, jardins, colorés…, , one, more, hour, of, sleep, this, weekend, to, enter, winter, time:, the, perfect, reason, to, spend, this, long, night, in, our, of, our, rooms, and, enjoy, our, colourful, gardens…)","List(une, heure, de, sommeil, gagnée, ce, week-end, avec, le, passage, à, l'heure, d'hiver, :, la, raison, parfaite, pour, venir, passer, cette, longue, nuit, dans, une, de, nos, chambres, et, profitez, de, nos, jardins, colorés…, , one, hour, sleep, weekend, enter, winter, time:, perfect, reason, spend, long, night, rooms, enjoy, colourful, gardens…)","List(0, 1000, List(18, 44, 73, 76, 115, 134, 184, 215, 268, 281, 372, 384, 390, 394, 400, 410, 489, 527, 559, 566, 578, 590, 618, 678, 688, 690, 694, 700, 717, 772, 785, 786, 792, 813, 831, 835, 881, 889, 904, 908, 926, 931, 948, 953, 961, 998), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(18, 44, 73, 76, 115, 134, 184, 215, 268, 281, 372, 384, 390, 394, 400, 410, 489, 527, 559, 566, 578, 590, 618, 678, 688, 690, 694, 700, 717, 772, 785, 786, 792, 813, 831, 835, 881, 889, 904, 908, 926, 931, 948, 953, 961, 998), List(3.7668167602800646, 2.475925497057267, 3.059950930807626, 3.564604322118547, 2.8646490767342616, 3.3148766866552717, 
3.9289276811457117, 4.270687766603457, 4.072499318755467, 6.151539071135538, 0.692932335326463, 4.063291921323997, 3.0677407085978405, 3.3382476406970167, 4.185061395862566, 3.475023971010344, 2.2577688058233236, 3.367897642319437, 4.209448201649882, 4.772537119236797, 4.283274335324853, 2.9525883880397057, 4.211958582421762, 2.5009375814569688, 3.3145613784388805, 3.6131263307662893, 3.7579365835232057, 3.338005558791457, 4.104627851141388, 2.4905117702516324, 2.3934901619291846, 3.5141817639630744, 5.400012910464119, 3.1006023730818417, 3.4152215945143665, 7.239897315244388, 4.055655762040296, 3.919213940395525, 3.1862552309372094, 3.5672391027266888, 2.8541997653481443, 4.18186680515362, 4.021197615388808, 3.698236924196119, 4.141960802258275, 3.700205314804463))","List(0, 1000, List(18, 44, 73, 76, 115, 134, 184, 215, 268, 281, 372, 384, 390, 394, 400, 410, 489, 527, 559, 566, 578, 590, 618, 678, 688, 690, 694, 700, 717, 772, 785, 786, 792, 813, 831, 835, 881, 889, 904, 908, 926, 931, 948, 953, 961, 998), List(0.1467209411460437, 0.09643955155087475, 0.1191878737407516, 0.13884458263788838, 0.11158068877243348, 0.12911746394931597, 0.153035309057503, 0.16634717543583583, 0.15862755501278963, 0.23960804558662946, 0.026990345126866634, 0.158268918502807, 0.11949129267547405, 0.13002778388009775, 0.16301195036343558, 0.13535534643150296, 0.08794214987403522, 0.13118267842896342, 0.1639618386442753, 0.18589466447441116, 0.1668374337430823, 0.11500600498492436, 0.16405962026013968, 0.09741379500275842, 0.12910518240731272, 0.14073455903657145, 0.146375050179154, 0.1300183545695476, 0.1598789905907053, 0.09700770016735576, 0.09322862021988468, 0.13688057810611176, 0.21033541763395325, 0.12077128441586327, 0.13302598943840954, 0.2820005897537722, 0.15797148314872772, 0.1526569500144127, 0.1241075411854794, 0.1389472097405965, 0.11117367858360093, 0.16288751814779517, 0.15662930697489152, 0.14404969411327595, 0.16133314301494522, 0.14412636471898332))",2
129932183799426,"[COUP DE ], En plein coeur du bocage normand, située à 1h30 de la Capitale, cette maison de caractère a été entièrement rénovée avec goût et matériaux de qualité. Amoureux d'histoire, vous tomberez sous le charme de cette magnifique maison et de ses caves voutées du XI et XIIième siècle. Pour plus d'infos, n'hésitez pas à contacter Corinne Simao, conseillère en Immobilier Capifrance ! Lien direct vers le bien sélection :","List([coup, de, ],, en, plein, coeur, du, bocage, normand,, située, à, 1h30, de, la, capitale,, cette, maison, de, caractère, a, été, entièrement, rénovée, avec, goût, et, matériaux, de, qualité., amoureux, d'histoire,, vous, tomberez, sous, le, charme, de, cette, magnifique, maison, et, de, ses, caves, voutées, du, xi, et, xiiième, siècle., pour, plus, d'infos,, n'hésitez, pas, à, contacter, corinne, simao,, conseillère, en, immobilier, capifrance, !, lien, direct, vers, le, bien, sélection, :)","List([coup, de, ],, en, plein, coeur, du, bocage, normand,, située, à, 1h30, de, la, capitale,, cette, maison, de, caractère, été, entièrement, rénovée, avec, goût, et, matériaux, de, qualité., amoureux, d'histoire,, vous, tomberez, sous, le, charme, de, cette, magnifique, maison, et, de, ses, caves, voutées, du, xi, et, xiiième, siècle., pour, plus, d'infos,, n'hésitez, pas, à, contacter, corinne, simao,, conseillère, en, immobilier, capifrance, !, lien, direct, vers, le, bien, sélection, :)","List(0, 1000, List(3, 21, 38, 45, 59, 71, 134, 136, 151, 195, 203, 274, 280, 281, 290, 305, 312, 323, 330, 345, 399, 411, 467, 477, 478, 489, 521, 560, 566, 572, 585, 589, 590, 674, 675, 678, 683, 691, 698, 745, 772, 785, 787, 793, 809, 811, 816, 844, 852, 860, 899, 909, 926, 944, 969, 995), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 6.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 
1.0, 2.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 21, 38, 45, 59, 71, 134, 136, 151, 195, 203, 274, 280, 281, 290, 305, 312, 323, 330, 345, 399, 411, 467, 477, 478, 489, 521, 560, 566, 572, 585, 589, 590, 674, 675, 678, 683, 691, 698, 745, 772, 785, 787, 793, 809, 811, 816, 844, 852, 860, 899, 909, 926, 944, 969, 995), List(4.714877127645947, 3.9587983035598846, 2.455722108308975, 4.100641365231033, 3.661666871361462, 3.460600864890833, 6.629753373310543, 3.4639812527427645, 4.435730714855718, 3.9504275521412717, 5.67701472678438, 4.115278322833114, 3.61631850202346, 12.303078142271076, 3.7619968782431648, 3.7183334061150455, 3.530871458555814, 4.184120750510544, 2.8402225035965354, 4.104975254306251, 3.49975591073656, 3.7031070636921615, 3.247061845049806, 2.6612908638271366, 3.4452145535700565, 4.515537611646647, 4.0597986132718304, 3.7817945739770997, 7.158805678855195, 3.6350367765921425, 3.6556628506568707, 3.8012225229830117, 2.9525883880397057, 3.708001037571024, 3.498523624690295, 2.5009375814569688, 7.748709030025264, 3.5696773849465853, 4.205790482681114, 2.6219660525186645, 4.981023540503265, 2.3934901619291846, 4.21447528106886, 4.167708595477117, 3.514085540725088, 3.6847938539396656, 3.921525078214651, 4.248584058230007, 3.3097652061753875, 2.9837459871580663, 3.2392831311369683, 3.7518158642066934, 5.708399530696289, 3.563289530525856, 4.107758832999898, 3.747675071540662))","List(0, 1000, List(3, 21, 38, 45, 59, 71, 134, 136, 151, 195, 203, 274, 280, 281, 290, 305, 312, 323, 330, 345, 399, 411, 467, 477, 478, 489, 521, 560, 566, 572, 585, 589, 590, 674, 675, 678, 683, 691, 698, 745, 772, 785, 787, 793, 809, 811, 816, 844, 852, 860, 899, 909, 926, 944, 969, 995), List(0.1450413113216852, 0.1217824519412388, 0.07554410119031502, 0.12614589622826688, 0.1126419523281626, 0.1064566634115721, 0.20394765271103277, 0.10656065252381114, 0.13645407550074792, 0.121524946871725, 0.17463904956022358, 0.12659616533749088, 0.11124692404279815, 0.37847319001908025, 
0.11572835211527675, 0.11438515544591155, 0.1086183611136734, 0.12871392911257531, 0.08737228674561034, 0.12627921740240874, 0.10766116970952529, 0.11391675539817779, 0.0998876736759788, 0.08186790583249616, 0.10598334232388486, 0.13890913353294954, 0.12488947190547509, 0.11633737832585615, 0.22022261345264374, 0.11182274458183235, 0.1124572537638903, 0.11693503021026808, 0.0908289399703351, 0.1140673061696501, 0.10762326153520073, 0.07693504125934919, 0.23836950324221542, 0.10981212763152545, 0.12938054380579392, 0.08065817712796332, 0.1532286349102559, 0.07362969221110749, 0.12964770973878018, 0.12820905052863402, 0.10810198465931557, 0.11335339565727671, 0.12063583510798942, 0.1306969802994113, 0.10181658454167346, 0.09178742497671293, 0.099648381148142, 0.11541515887896191, 0.17560452394942147, 0.10961562139040158, 0.1263648471851079, 0.115287777829163))",2
111814205522880,Cena Lagunera con nuestra red de Concesionarios en el emblemático Cerro de las Noas de la ciudad de Torreón Coahuila ConGebesa2017,"List(cena, lagunera, con, nuestra, red, de, concesionarios, en, el, emblemático, cerro, de, las, noas, de, la, ciudad, de, torreón, coahuila, congebesa2017)","List(cena, lagunera, con, nuestra, red, de, concesionarios, en, el, emblemático, cerro, de, las, noas, de, la, ciudad, de, torreón, coahuila, congebesa2017)","List(0, 1000, List(3, 40, 93, 95, 262, 281, 291, 361, 376, 420, 488, 610, 785, 788, 874, 879, 888, 939), List(1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 40, 93, 95, 262, 281, 291, 361, 376, 420, 488, 610, 785, 788, 874, 879, 888, 939), List(2.3574385638229733, 4.052024196477655, 3.6104739500271767, 4.171972203752553, 3.6928149809371895, 8.202052094847383, 3.695003291629054, 3.537941401594337, 4.37021020927545, 2.5919043690326036, 3.629946053439997, 4.222454305593775, 2.3934901619291846, 3.5416930102989923, 3.9274718649037306, 3.8947008133156995, 3.974532364908916, 2.9211484301694886))","List(0, 1000, List(3, 40, 93, 95, 262, 281, 291, 361, 376, 420, 488, 610, 785, 788, 874, 879, 888, 939), List(0.13854913464546784, 0.23814170795359638, 0.21219133728985395, 0.24519117802904788, 0.21703060595774984, 0.4820431961608846, 0.21715921527013912, 0.20792852341499146, 0.25684183333796107, 0.15232893570338693, 0.21333573321133167, 0.2481580640520895, 0.14066792484295307, 0.20814900938969932, 0.23082163691443508, 0.22889564787335454, 0.23358743181223632, 0.17167895417585152))",2
102209433243222,Je serai présent Mercredi 18 Octobre à l'Espace Morning Trudaine à l'occasion de la soiree Como smart de 18h à 22h. Suivez le lien pour vous inscrire (AGENDA):,"List(je, serai, présent, mercredi, 18, octobre, à, l'espace, morning, trudaine, à, l'occasion, de, la, soiree, como, smart, de, 18h, à, 22h., , suivez, le, lien, pour, vous, inscrire, (agenda):)","List(je, serai, présent, mercredi, 18, octobre, à, l'espace, morning, trudaine, à, l'occasion, de, la, soiree, como, smart, de, 18h, à, 22h., , suivez, le, lien, pour, vous, inscrire, (agenda):)","List(0, 1000, List(39, 53, 64, 165, 263, 281, 296, 358, 369, 372, 395, 415, 489, 503, 563, 634, 675, 678, 684, 734, 745, 772, 785, 807, 841, 903), List(1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(39, 53, 64, 165, 263, 281, 296, 358, 369, 372, 395, 415, 489, 503, 563, 634, 675, 678, 684, 734, 745, 772, 785, 807, 841, 903), List(3.091922368003184, 3.967088350283624, 3.8147772390012693, 4.152836385254535, 3.5180383055728277, 4.101026047423692, 3.6917800817474853, 3.3946894722713723, 3.9255824667688115, 0.692932335326463, 3.71092552933192, 4.126753424475631, 6.773306417469971, 4.241987332658576, 3.9371196520602996, 4.060130784010643, 3.498523624690295, 2.5009375814569688, 4.2780967323738155, 4.171972203752553, 2.6219660525186645, 2.4905117702516324, 2.3934901619291846, 3.7488911728701546, 2.8823962213604877, 3.3818780299148377))","List(0, 1000, List(39, 53, 64, 165, 263, 281, 296, 358, 369, 372, 395, 415, 489, 503, 563, 634, 675, 678, 684, 734, 745, 772, 785, 807, 841, 903), List(0.16277760493756704, 0.20885166681980136, 0.20083308324975765, 0.21863057348554374, 0.18521094041232142, 0.21590296208417553, 0.19435776456805665, 0.17871710736384772, 0.20666654458669623, 0.0364801739835017, 0.19536569741122978, 0.21725740773946703, 0.35658805911613334, 0.22332402175789376, 0.2072739322135344, 
0.21374998660828542, 0.18418356888189355, 0.1316645701783343, 0.22522504025118015, 0.21963799939502382, 0.13803624524125044, 0.13111569204507384, 0.12600788429629192, 0.19736443987288424, 0.15174687380562016, 0.17804263509243062))",2
387799227966054,"កីឡាហ្រ្វង់កូហ្វូនីលើកទី៨នៅទីក្រុងអាប៊ីដហ្សង់បានប្រព្រឹត្តទៅប្រកបដោយការចែករម្លែក និងសាមគ្គីភាព។ សូមអបអរសាទរដល់អត្តពលិកកម្ពុជា ជ័យ ចាន់រស្មី ដែលទទួលបានមេដាយសំរិទ្ធលើវិញ្ញាសាចំបាប់នារី! Partage et solidarité ont été au cœur de la 8e édition des Jeux de la Francophonie ️ à Abidjan. Bravo à l’athlète cambodgienne️, Chey Chan Raksmey, qui a remporté une médaille de bronze en lutte libre féminine !","List(កីឡាហ្រ្វង់កូហ្វូនីលើកទី៨នៅទីក្រុងអាប៊ីដហ្សង់បានប្រព្រឹត្តទៅប្រកបដោយការចែករម្លែក, និងសាមគ្គីភាព។, សូមអបអរសាទរដល់អត្តពលិកកម្ពុជា, ជ័យ, ចាន់រស្មី, ដែលទទួលបានមេដាយសំរិទ្ធលើវិញ្ញាសាចំបាប់នារី!, , , partage, et, solidarité, ont, été, au, cœur, de, la, 8e, édition, des, jeux, de, la, francophonie, ️, à, abidjan., bravo, à, l’athlète, cambodgienne️,, chey, chan, raksmey,, qui, a, remporté, une, médaille, de, bronze, en, lutte, libre, féminine, !)","List(កីឡាហ្រ្វង់កូហ្វូនីលើកទី៨នៅទីក្រុងអាប៊ីដហ្សង់បានប្រព្រឹត្តទៅប្រកបដោយការចែករម្លែក, និងសាមគ្គីភាព។, សូមអបអរសាទរដល់អត្តពលិកកម្ពុជា, ជ័យ, ចាន់រស្មី, ដែលទទួលបានមេដាយសំរិទ្ធលើវិញ្ញាសាចំបាប់នារី!, , , partage, et, solidarité, ont, été, au, cœur, de, la, 8e, édition, des, jeux, de, la, francophonie, ️, à, abidjan., bravo, à, l’athlète, cambodgienne️,, chey, chan, raksmey,, qui, remporté, une, médaille, de, bronze, en, lutte, libre, féminine, !)","List(0, 1000, List(3, 22, 38, 42, 61, 104, 118, 121, 134, 145, 197, 236, 246, 259, 281, 372, 375, 427, 457, 471, 489, 496, 498, 566, 572, 605, 626, 657, 670, 740, 769, 785, 792, 845, 892, 944, 947, 953), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 22, 38, 42, 61, 104, 118, 121, 134, 145, 197, 236, 246, 259, 281, 372, 375, 427, 457, 471, 489, 496, 498, 566, 572, 605, 626, 657, 670, 740, 769, 785, 792, 845, 892, 944, 947, 953), List(2.3574385638229733, 3.210533717850982, 2.455722108308975, 
4.070483189040282, 3.880859278104689, 3.899923494394419, 2.936096872149222, 4.057973643872748, 3.3148766866552717, 3.1960780135991707, 3.81192194828789, 3.74791817352314, 3.069836499292839, 3.2711450220488736, 6.151539071135538, 1.385864670652926, 3.591789665538521, 3.277504343957166, 3.543970577258674, 4.26211278551817, 6.773306417469971, 3.4797543049247546, 3.2876567154211838, 2.3862685596183986, 3.6350367765921425, 2.697324404197966, 3.9986076017837013, 3.657882361056424, 4.342731656973723, 3.3009408156187012, 4.453296426325783, 7.180470485787554, 2.7000064552320593, 4.014192332800399, 4.254417324292936, 3.563289530525856, 4.044146263070945, 3.698236924196119))","List(0, 1000, List(3, 22, 38, 42, 61, 104, 118, 121, 134, 145, 197, 236, 246, 259, 281, 372, 375, 427, 457, 471, 489, 496, 498, 566, 572, 605, 626, 657, 670, 740, 769, 785, 792, 845, 892, 944, 947, 953), List(0.09895206357375361, 0.13476021875169683, 0.1030774561466406, 0.17085607976338144, 0.16289673033305133, 0.16369693932735316, 0.12324089747665037, 0.17033099914083946, 0.1391399520100909, 0.13415344927397652, 0.16000312744247397, 0.15731660755318733, 0.12885453776001982, 0.13730434173259512, 0.25820714676708856, 0.05817083469300761, 0.15076320748378494, 0.13757127043878156, 0.14875603005989782, 0.1788996166361059, 0.2843054565710971, 0.1460605907131558, 0.13799741011510158, 0.10016218527981142, 0.15257847891780246, 0.11321856697313147, 0.16783914528680258, 0.1535374085633001, 0.18228354519990164, 0.138555002218428, 0.18692443478821838, 0.30139592305934065, 0.11333114444959529, 0.1684933049828429, 0.17857660428699346, 0.1495669314853723, 0.16975060316142543, 0.15523126704112805))",2
1458317351061262,Les lecteurs de 26in testent le casque Urge Bike Products All-M.,"List(les, lecteurs, de, 26in, testent, le, casque, urge, bike, products, all-m.)","List(les, lecteurs, de, 26in, testent, le, casque, urge, bike, products, all-m.)","List(0, 1000, List(34, 124, 182, 222, 281, 290, 420, 716, 744, 772, 784), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(34, 124, 182, 222, 281, 290, 420, 716, 744, 772, 784), List(3.4790108104667135, 4.336142570977663, 3.6813758262300524, 3.721643493071609, 2.050513023711846, 3.7619968782431648, 2.5919043690326036, 3.0032984051432297, 3.2742439247971387, 2.4905117702516324, 4.158135621045226))","List(0, 1000, List(34, 124, 182, 222, 281, 290, 420, 716, 744, 772, 784), List(0.3091785816296702, 0.3853516079356639, 0.32716269606729725, 0.3307412707823147, 0.18222843872087488, 0.33432746325799245, 0.23034171498585887, 0.26690217182402703, 0.29098101377918373, 0.22133098705580068, 0.3695321870431627))",2
448154871874384,"// CALENDRIER* DE L'AVENT // Des livres à gagner tous les jours du 1er au 24 décembre Aujourd'hui, tentez de remporter « Jours parfaits » de Raphael Montes, publié aux éditions 10/18. Un roman de captivité traversé de grands élans d’humour noir, best-seller au Brésil, traduit dans neuf pays et en cours d’adaptation au cinéma. Pour tenter de remporter ce livre, rien de plus simple : 1/ Cliquez sur la photo ci-dessous 2/ Connectez-vous à votre compte sur la plateforme web Collibris (ou créez gratuitement un compte) 3/ La magie de Noël fera ensuite le reste ! Nous vous souhaitons bonne chance ;) * Jeu concours ouvert aux personnes résidant en France métropolitaine.","List(//, calendrier*, de, l'avent, //, , , des, livres, à, gagner, tous, les, jours, du, 1er, au, 24, décembre, , , aujourd'hui,, tentez, de, remporter, «, jours, parfaits, », de, raphael, montes,, publié, aux, éditions, 10/18., un, roman, de, captivité, traversé, de, grands, élans, d’humour, noir,, best-seller, au, brésil,, traduit, dans, neuf, pays, et, en, cours, d’adaptation, au, cinéma., , pour, tenter, de, remporter, ce, livre,, rien, de, plus, simple, :, , 1/, cliquez, sur, la, photo, ci-dessous, , 2/, connectez-vous, à, votre, compte, sur, la, plateforme, web, collibris, (ou, créez, gratuitement, un, compte), , 3/, la, magie, de, noël, fera, ensuite, le, reste, !, , , , nous, vous, souhaitons, bonne, chance, ;), , , , , *, jeu, concours, ouvert, aux, personnes, résidant, en, france, métropolitaine.)","List(//, calendrier*, de, l'avent, //, , , des, livres, à, gagner, tous, les, jours, du, 1er, au, 24, décembre, , , aujourd'hui,, tentez, de, remporter, «, jours, parfaits, », de, raphael, montes,, publié, aux, éditions, 10/18., un, roman, de, captivité, traversé, de, grands, élans, d’humour, noir,, best-seller, au, brésil,, traduit, dans, neuf, pays, et, en, cours, d’adaptation, au, cinéma., , pour, tenter, de, remporter, ce, livre,, rien, de, plus, simple, :, , 1/, cliquez, sur, 
la, photo, ci-dessous, , 2/, connectez-vous, à, votre, compte, sur, la, plateforme, web, collibris, (ou, créez, gratuitement, un, compte), , 3/, la, magie, de, noël, fera, ensuite, le, reste, !, , , , nous, vous, souhaitons, bonne, chance, ;), , , , , *, jeu, concours, ouvert, aux, personnes, résidant, en, france, métropolitaine.)","List(0, 1000, List(2, 3, 34, 35, 38, 50, 88, 111, 118, 157, 176, 191, 198, 203, 214, 244, 246, 256, 258, 262, 270, 281, 288, 295, 302, 303, 311, 314, 337, 344, 345, 351, 362, 366, 372, 373, 385, 390, 402, 412, 413, 420, 431, 433, 439, 440, 456, 458, 460, 464, 489, 501, 539, 548, 566, 576, 607, 610, 624, 651, 678, 693, 698, 700, 721, 731, 745, 764, 767, 772, 779, 785, 789, 797, 813, 843, 855, 857, 860, 861, 862, 866, 905, 906, 926, 945, 973, 974, 977, 980, 981, 997, 999), List(1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 8.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 15.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(2, 3, 34, 35, 38, 50, 88, 111, 118, 157, 176, 191, 198, 203, 214, 244, 246, 256, 258, 262, 270, 281, 288, 295, 302, 303, 311, 314, 337, 344, 345, 351, 362, 366, 372, 373, 385, 390, 402, 412, 413, 420, 431, 433, 439, 440, 456, 458, 460, 464, 489, 501, 539, 548, 566, 576, 607, 610, 624, 651, 678, 693, 698, 700, 721, 731, 745, 764, 767, 772, 779, 785, 789, 797, 813, 843, 855, 857, 860, 861, 862, 866, 905, 906, 926, 945, 973, 974, 977, 980, 981, 997, 999), List(3.8291772783786517, 4.714877127645947, 3.4790108104667135, 3.378763680358356, 2.455722108308975, 3.8095918501462673, 4.314915869732134, 4.001579837057452, 2.936096872149222, 4.326137880699627, 4.382287375551935, 3.230621917561955, 
3.6218758530794877, 2.83850736339219, 3.4272345409429934, 3.8850334875884864, 9.209509497878518, 3.8236545818960006, 2.9474430485365235, 3.6928149809371895, 3.8253607310072972, 16.404104189694767, 3.516976273129292, 3.4463828304946675, 4.111598949431253, 4.005504180842761, 3.6887963751501984, 4.269867926426362, 4.297494566583951, 3.2044439100488926, 4.104975254306251, 4.116507795193133, 2.9524786366265463, 4.298969959411391, 10.393985029896946, 2.4287428544849488, 3.909165047336312, 3.0677407085978405, 3.5748785592846466, 3.1273652149329827, 3.500230271544279, 2.5919043690326036, 7.060373523014993, 3.0402285734581214, 3.9945545954561066, 3.7668167602800646, 3.5754922444834096, 4.147383809041079, 8.226441832080445, 3.8407151028104476, 4.515537611646647, 3.4488136216983296, 3.821297016953346, 4.0392538854065005, 2.3862685596183986, 7.116497378245925, 3.061601953899516, 8.44490861118755, 3.094385656790193, 8.039840544177668, 2.5009375814569688, 3.742704460591343, 4.205790482681114, 3.338005558791457, 3.5962714073671562, 3.700669029164731, 2.6219660525186645, 3.7046193073352085, 3.574469644946501, 2.4905117702516324, 2.6643302550033408, 7.180470485787554, 3.8965327027873364, 4.49732315673318, 3.1006023730818417, 7.644164509056096, 2.867014036198173, 3.8829442048450673, 2.9837459871580663, 3.8522526603621507, 3.862424308377209, 3.8245729158405, 3.573448090051247, 4.019760718837598, 2.8541997653481443, 4.4182305678959635, 3.6984682985608717, 3.848884973472175, 4.013082366778251, 3.4956856698955767, 3.7437934557984858, 3.8976616896416605, 4.1011604412873295))","List(0, 1000, List(2, 3, 34, 35, 38, 50, 88, 111, 118, 157, 176, 191, 198, 203, 214, 244, 246, 256, 258, 262, 270, 281, 288, 295, 302, 303, 311, 314, 337, 344, 345, 351, 362, 366, 372, 373, 385, 390, 402, 412, 413, 420, 431, 433, 439, 440, 456, 458, 460, 464, 489, 501, 539, 548, 566, 576, 607, 610, 624, 651, 678, 693, 698, 700, 721, 731, 745, 764, 767, 772, 779, 785, 789, 797, 813, 843, 855, 857, 860, 861, 862, 
866, 905, 906, 926, 945, 973, 974, 977, 980, 981, 997, 999), List(0.08617367278631584, 0.10610589410931665, 0.07829335583234079, 0.0760373455304666, 0.0552647678681366, 0.0857329127584615, 0.09710497091851003, 0.0900535040396239, 0.06607535581028091, 0.09735751652578122, 0.09862113213946429, 0.07270349105857063, 0.08150846042001061, 0.06387915391529667, 0.07712815741407315, 0.08743068815687877, 0.20725529279456192, 0.08604938733150513, 0.06633069569670923, 0.08310490914707103, 0.08608778334311622, 0.36916596021219894, 0.07914774911810984, 0.07755907985988436, 0.09252942779574602, 0.0901418193858905, 0.08301447248275132, 0.0960911900345464, 0.09671291341689484, 0.07211436841168614, 0.09238036493058145, 0.09263989885458973, 0.06644401902671186, 0.09674611637656479, 0.2339113077813562, 0.054657613583539204, 0.08797375654529589, 0.0690379329023663, 0.08045081040203239, 0.0703797518690624, 0.07877088893218576, 0.05832947987328766, 0.15889008878144137, 0.06841878639677884, 0.08989540457673992, 0.08477028127666807, 0.0804646210730114, 0.09333474772703611, 0.18513185864654563, 0.08643332561379541, 0.10161986043255263, 0.07761378357051667, 0.08599633154035961, 0.0909013391955719, 0.053701751339101836, 0.16015312738858173, 0.06889978337304682, 0.1900483415797744, 0.06963756381000304, 0.18093249226671387, 0.05628231892536618, 0.08422764632601326, 0.09464915999288048, 0.07512010488686338, 0.0809322454875895, 0.08328166047844178, 0.059006002658158696, 0.08337055946475923, 0.08044160799435805, 0.056047691385812566, 0.05995938732994738, 0.16159280939744144, 0.08768947210335218, 0.10120995345682784, 0.06977746782503698, 0.17202800581932995, 0.06452068198151602, 0.08738366991917375, 0.06714767473072229, 0.08669297243527523, 0.08692188016245112, 0.08607005396642138, 0.08041861842482989, 0.09046265546918433, 0.06423230338137632, 0.09943001526781645, 0.08323213416373741, 0.08661718436723717, 0.09031236257771558, 0.07866861500049012, 0.08425215360520769, 0.08771487937407686, 
0.09229451451809474))",2
149788201749395,"HUMOUR • GSL - Gilles Saint-Louis , Mat Le Buzz, Safia Enjoylife, Cleeveland Roumillac vous attendent pour la 1ère édition du MADGWAYA Comedy Show le 08 Décembre au @Palais Palais des sports du Gosier ! Venez vous tordre de rire à partir de 20€ au lieu de 25€ sur Shopping-97.com ! Suivez ce lien pour en profiter :","List(humour, , •, , gsl, -, gilles, saint-louis, ,, mat, le, buzz,, safia, enjoylife,, cleeveland, roumillac, vous, attendent, pour, la, 1ère, édition, du, madgwaya, comedy, show, le, 08, décembre, au, @palais, palais, des, sports, du, gosier, !, venez, vous, tordre, de, rire, à, partir, de, 20€, au, lieu, de, 25€, sur, shopping-97.com, !, suivez, ce, lien, pour, en, profiter, :)","List(humour, , •, , gsl, -, gilles, saint-louis, ,, mat, le, buzz,, safia, enjoylife,, cleeveland, roumillac, vous, attendent, pour, la, 1ère, édition, du, madgwaya, comedy, show, le, 08, décembre, au, @palais, palais, des, sports, du, gosier, !, venez, vous, tordre, de, rire, à, partir, de, 20€, au, lieu, de, 25€, sur, shopping-97.com, !, suivez, ce, lien, pour, en, profiter, :)","List(0, 1000, List(3, 19, 38, 75, 108, 118, 143, 148, 157, 191, 199, 203, 212, 246, 260, 281, 296, 310, 372, 390, 409, 423, 460, 489, 499, 547, 604, 626, 638, 664, 671, 675, 678, 680, 700, 723, 732, 745, 752, 767, 772, 785, 824, 840, 897, 926, 931, 947, 963), List(1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 3.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 19, 38, 75, 108, 118, 143, 148, 157, 191, 199, 203, 212, 246, 260, 281, 296, 310, 372, 390, 409, 423, 460, 489, 499, 547, 604, 626, 638, 664, 671, 675, 678, 680, 700, 723, 732, 745, 752, 767, 772, 785, 824, 840, 897, 926, 931, 947, 963), List(2.3574385638229733, 3.325656753002941, 4.91144421661795, 4.50584703631608, 4.396138012485834, 2.936096872149222, 
3.5011796687484114, 3.7000894198052023, 2.1630689403498136, 3.230621917561955, 3.942569256827864, 5.67701472678438, 3.7958516660825876, 6.139672998585678, 2.934586542286632, 6.151539071135538, 3.6917800817474853, 3.0743511796473655, 2.078797005979389, 3.0677407085978405, 3.645735568599651, 4.020558739788097, 2.7421472773601483, 2.2577688058233236, 2.2295712966342998, 3.600458579819932, 3.6748005843115346, 3.9986076017837013, 4.013240857942701, 7.539340366439552, 3.894982424131416, 3.498523624690295, 5.0018751629139375, 4.16918952873556, 3.338005558791457, 3.631027017765339, 3.96981645139969, 5.243932105037329, 4.138541380227673, 3.574469644946501, 4.981023540503265, 2.3934901619291846, 4.167708595477117, 3.92645405311103, 3.2270772254574647, 2.8541997653481443, 4.18186680515362, 4.044146263070945, 3.3468390373582304))","List(0, 1000, List(3, 19, 38, 75, 108, 118, 143, 148, 157, 191, 199, 203, 212, 246, 260, 281, 296, 310, 372, 390, 409, 423, 460, 489, 499, 547, 604, 626, 638, 664, 671, 675, 678, 680, 700, 723, 732, 745, 752, 767, 772, 785, 824, 840, 897, 926, 931, 947, 963), List(0.08496297848236423, 0.11985792863543578, 0.17701030928124692, 0.16239243332412692, 0.15843847856405271, 0.10581804302293746, 0.12618384098051103, 0.13335262372561235, 0.07795782365449258, 0.11643283718127993, 0.14209168886670107, 0.20460176035029268, 0.13680393134174557, 0.2212761396511877, 0.10576361016280454, 0.22170379739896331, 0.13305315203568818, 0.11080078061505101, 0.07492063123028345, 0.11056253673537668, 0.13139369034712298, 0.1449024593632418, 0.09882802618788378, 0.0813708426641845, 0.08035459375604828, 0.1297618905274071, 0.13244120451882369, 0.14411133203778292, 0.14463872013062373, 0.2717206815715455, 0.1403766413976058, 0.12608811614694762, 0.1802694748273582, 0.15025916927583505, 0.12030298427191911, 0.13086358860567363, 0.1430736880761976, 0.1889932986761484, 0.14915459839871445, 0.12882523947387975, 0.17951797446022777, 0.08626229172902022, 0.1502057958804466, 
0.14151089082273766, 0.11630508513563934, 0.1028664402835541, 0.15071606311818228, 0.14575256263375205, 0.12062134618425506))",2
47612613547,Savez-vous que le système de billetterie proposé par Weezevent propose plusieurs choix de langues ? Toutes les infos ici :,"List(savez-vous, que, le, système, de, billetterie, proposé, par, weezevent, propose, plusieurs, choix, de, langues, ?, , toutes, les, infos, ici, :)","List(savez-vous, que, le, système, de, billetterie, proposé, par, weezevent, propose, plusieurs, choix, de, langues, ?, , toutes, les, infos, ici, :)","List(0, 1000, List(114, 167, 237, 281, 324, 354, 372, 404, 408, 420, 509, 679, 693, 712, 772, 819, 872, 926, 970, 976), List(1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(114, 167, 237, 281, 324, 354, 372, 404, 408, 420, 509, 679, 693, 712, 772, 819, 872, 926, 970, 976), List(3.6702909938055592, 3.8819706980579403, 3.6558845801478017, 4.101026047423692, 3.2652759307094916, 3.7541373265480553, 0.692932335326463, 3.402662423761032, 3.3237459243960514, 2.5919043690326036, 4.47242982154077, 4.342731656973723, 3.742704460591343, 3.0070581342445175, 2.4905117702516324, 4.340750368488499, 3.993310814201236, 2.8541997653481443, 2.896253493969596, 3.100411490887389))","List(0, 1000, List(114, 167, 237, 281, 324, 354, 372, 404, 408, 420, 509, 679, 693, 712, 772, 819, 872, 926, 970, 976), List(0.2357626211247144, 0.2493599522348835, 0.23483721933758306, 0.2634310608813157, 0.20974631532452664, 0.24114852409873705, 0.04451078781337935, 0.21857139207283624, 0.21350215893255506, 0.16649202168957467, 0.28728833198849385, 0.2789571181188038, 0.24041412934653045, 0.19315959110609934, 0.1599790272453759, 0.2788298492544906, 0.2565119294657287, 0.18334067217768735, 0.18604201038345747, 0.19915618159172002))",2
158153034248900,"[PREVENTION] ️ Avant le départ en vacances, vérifiez que votre pays de destination accepte la carte d'identité d'apparence périmée dont la validité a été prolongée de 5 ans ️","List([prevention], ️, avant, le, départ, en, vacances,, vérifiez, que, votre, pays, de, destination, accepte, la, carte, d'identité, d'apparence, périmée, dont, la, validité, a, été, prolongée, de, 5, ans, ️)","List([prevention], ️, avant, le, départ, en, vacances,, vérifiez, que, votre, pays, de, destination, accepte, la, carte, d'identité, d'apparence, périmée, dont, la, validité, été, prolongée, de, 5, ans, ️)","List(0, 1000, List(3, 22, 63, 79, 110, 198, 281, 320, 324, 422, 427, 531, 572, 607, 639, 646, 678, 735, 772, 785, 849, 876, 893, 911, 977), List(1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 22, 63, 79, 110, 198, 281, 320, 324, 422, 427, 531, 572, 607, 639, 646, 678, 735, 772, 785, 849, 876, 893, 911, 977), List(2.3574385638229733, 6.421067435701964, 4.000014401546783, 3.932722752114263, 3.9795770901226155, 3.6218758530794877, 4.101026047423692, 3.5776431133414945, 3.2652759307094916, 4.294130391838319, 3.277504343957166, 3.7183334061150455, 3.6350367765921425, 3.061601953899516, 3.753281423824166, 3.6841093131257128, 2.5009375814569688, 3.4598714702560156, 2.4905117702516324, 4.786980323858369, 3.194399221934601, 4.1599694744841615, 3.6831973203584436, 3.704852163314838, 4.013082366778251))","List(0, 1000, List(3, 22, 63, 79, 110, 198, 281, 320, 324, 422, 427, 531, 572, 607, 639, 646, 678, 735, 772, 785, 849, 876, 893, 911, 977), List(0.12442210044509967, 0.33889438720052334, 0.21111480964492782, 0.20756325649173263, 0.21003615875074635, 0.191157219580199, 0.21644605404819253, 0.18882268137797426, 0.1723364061597708, 0.22663781408209097, 0.17298180361985094, 0.19624810573801954, 0.19185183354485152, 0.16158679665155598, 0.19809261012362261, 0.19444180902220787, 
0.1319957651249869, 0.18260686925441122, 0.13144550631912924, 0.25264970032472395, 0.16859563890766194, 0.2195570004435052, 0.19439367540064262, 0.19553658579788075, 0.21180451740978518))",2
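
### Reading the vector columns

The sparse vectors in the output above are hard to read, so here is a small plain-Python sketch that checks how the columns appear to relate to each other. It uses only numbers copied from the table: the `idf` and `features` values of the row for page `1458317351061262`, and the values at hash bucket 281 from that row and from the row for page `111814205522880`. The helper `l2_normalize` is a name made up for this illustration. That `features` is the L2-normalized `idf` column is an assumption, inferred from the `Normalizer` import and from the numbers themselves, not something the pipeline definition shown here confirms.

```python
import math

# Values copied from the row for page_id 1458317351061262 in the table above.
# All term counts in that row are 1.0, so the "idf" column equals the raw IDF weights.
idf_col = [
    3.4790108104667135, 4.336142570977663, 3.6813758262300524,
    3.721643493071609, 2.050513023711846, 3.7619968782431648,
    2.5919043690326036, 3.0032984051432297, 3.2742439247971387,
    2.4905117702516324, 4.158135621045226,
]
features_col = [
    0.3091785816296702, 0.3853516079356639, 0.32716269606729725,
    0.3307412707823147, 0.18222843872087488, 0.33432746325799245,
    0.23034171498585887, 0.26690217182402703, 0.29098101377918373,
    0.22133098705580068, 0.3695321870431627,
]


def l2_normalize(values):
    """Scale a vector to unit Euclidean length (hypothetical helper)."""
    norm = math.sqrt(sum(v * v for v in values))
    return [v / norm for v in values]


# Assumption: "features" is the "idf" column divided by its Euclidean norm.
recomputed = l2_normalize(idf_col)
max_diff = max(abs(a - b) for a, b in zip(recomputed, features_col))
print("largest difference to the features column:", max_diff)

# The "idf" column holds term count times IDF weight, not the bare weight.
# Bucket 281 has count 1.0 in the row above and count 4.0 in the row for
# page_id 111814205522880, where the "idf" column shows 8.202052094847383.
idf_weight_281 = 2.050513023711846
tfidf_281 = 4.0 * idf_weight_281
print("count 4.0 times the weight of bucket 281:", tfidf_281)
```

If the assumption holds, every `features` vector has length 1, so K-Means compares posts by the direction of their TF-IDF vectors and long posts do not dominate short ones just because they contain more words.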


In [23]:
display(
  predictions
  .filter(col('prediction') == 3)
)

page_id,message,words,noStopWords,hashingTF,idf,features,prediction
303499083413372,Having FOMO? You don’t have too! We are giving away 2 wristbands to ACL weekend 2. Sign up at for more details. . . . . elanparkside acl liveconnected leasetoday tourwithus 78752,"List(having, fomo?, you, don’t, have, too!, we, are, giving, away, 2, wristbands, to, acl, weekend, 2., sign, up, at, , for, more, details., ., ., ., ., elanparkside, acl, liveconnected, leasetoday, tourwithus, 78752)","List(fomo?, don’t, too!, giving, away, 2, wristbands, acl, weekend, 2., sign, , details., ., ., ., ., elanparkside, acl, liveconnected, leasetoday, tourwithus, 78752)","List(0, 1000, List(26, 56, 157, 205, 245, 256, 372, 466, 486, 737, 756, 769, 786, 825, 841, 873, 964, 989, 992), List(1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0))","List(0, 1000, List(26, 56, 157, 205, 245, 256, 372, 466, 486, 737, 756, 769, 786, 825, 841, 873, 964, 989, 992), List(3.9920685780166445, 13.74473655077632, 2.1630689403498136, 4.3420707912155585, 3.3859327744463292, 3.8236545818960006, 0.692932335326463, 3.1228655497061975, 3.0921116364810843, 3.1665545560293524, 3.8486160478284943, 4.453296426325783, 3.5141817639630744, 3.6562172665938975, 2.8823962213604877, 6.729487178479817, 2.9096744113606685, 2.690460552150776, 3.8513085642639737))","List(0, 1000, List(26, 56, 157, 205, 245, 256, 372, 466, 486, 737, 756, 769, 786, 825, 841, 873, 964, 989, 992), List(0.19278719309348816, 0.6637684517809881, 0.10446012670577114, 0.20968969437583107, 0.1635153462000554, 0.18465405085025358, 0.033463473214599554, 0.15081115766686623, 0.14932597260768174, 0.15292101142651016, 0.18585950382747046, 0.21506106452488646, 0.16970881763543172, 0.17656807501958807, 0.1391982792979585, 0.32498413259781733, 0.14051561280063557, 0.12992921535320925, 0.1859895323266833))",3
295886277206213,SNACK For a post work out snack why not rustle up a bowl of bircher muesli. For a delicious and healthy recipe check out Jamie Oliver's tasty recipe: . Photo: @lucasfitfrench muesli birchermuesli foodinspo wellness healthy superfoods deepheat stayactive,"List(snack, , for, a, post, work, out, snack, why, not, rustle, up, a, bowl, of, bircher, muesli., for, a, delicious, and, healthy, recipe, check, out, jamie, oliver's, tasty, recipe:, ., , photo:, @lucasfitfrench, , muesli, birchermuesli, foodinspo, wellness, healthy, superfoods, deepheat, stayactive)","List(snack, , post, work, snack, rustle, bowl, bircher, muesli., delicious, healthy, recipe, check, jamie, oliver's, tasty, recipe:, ., , photo:, @lucasfitfrench, , muesli, birchermuesli, foodinspo, wellness, healthy, superfoods, deepheat, stayactive)","List(0, 1000, List(15, 43, 56, 126, 146, 295, 337, 372, 374, 387, 418, 437, 441, 465, 491, 506, 527, 532, 543, 568, 605, 615, 620, 861, 882, 973), List(1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(15, 43, 56, 126, 146, 295, 337, 372, 374, 387, 418, 437, 441, 465, 491, 506, 527, 532, 543, 568, 605, 615, 620, 861, 882, 973), List(3.70333956776203, 3.936972776830543, 3.43618413769408, 3.629082122500733, 6.697099803523719, 3.4463828304946675, 4.297494566583951, 2.078797005979389, 3.616851522663978, 3.844992612340219, 2.996623657996016, 3.891327654811639, 3.445484034820284, 3.8383170201264347, 4.322877181282239, 3.721643493071609, 3.367897642319437, 3.9390309957282654, 3.4834800916490103, 4.307868323210694, 2.697324404197966, 7.13549317629475, 3.7780290367048304, 3.8522526603621507, 2.9993227149651807, 3.6984682985608717))","List(0, 1000, List(15, 43, 56, 126, 146, 295, 337, 372, 374, 387, 418, 437, 441, 465, 491, 506, 527, 532, 543, 568, 605, 615, 620, 861, 882, 973), List(0.18236853501013092, 0.19387364959331468, 0.1692126405775456, 
0.17871177568301325, 0.32979429988466497, 0.1697148685345097, 0.21162730934650106, 0.10236899901485576, 0.17810948779293137, 0.18934414654851878, 0.14756677222976963, 0.19162588541165293, 0.16967060792934197, 0.18901541137582475, 0.2128772596069496, 0.183269899841134, 0.16584991677256233, 0.19397500523684663, 0.17154169889824383, 0.2121381582070915, 0.13282797621983186, 0.3513826948151134, 0.1860465690608984, 0.18970166286524773, 0.14769968552501817, 0.18212865254415656))",3
131920966828784,It’s Fall Quarter Finals Week. You made it! Best of luck on your final exams. We are proud of each and every one of you. Your Fall Quarter grades will be available online on December 12. To get them just go to the Online Services page and click ‘Get my Grades.’ Student ID and Global PIN required. .,"List(it’s, fall, quarter, finals, week., you, made, it!, best, of, luck, on, your, final, exams., we, are, proud, of, each, and, every, one, of, you., your, fall, quarter, grades, will, be, available, online, on, december, 12., to, get, them, just, go, to, the, online, services, page, and, click, ‘get, my, grades.’, student, id, and, global, pin, required., .)","List(it’s, fall, quarter, finals, week., made, it!, best, luck, final, exams., proud, every, one, you., fall, quarter, grades, available, online, december, 12., get, go, online, services, page, click, ‘get, grades.’, student, id, global, pin, required., .)","List(0, 1000, List(0, 4, 12, 20, 44, 56, 77, 82, 89, 140, 169, 189, 344, 356, 371, 398, 500, 559, 617, 673, 712, 719, 729, 763, 771, 781, 827, 900, 908, 936, 959, 987), List(2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0))","List(0, 1000, List(0, 4, 12, 20, 44, 56, 77, 82, 89, 140, 169, 189, 344, 356, 371, 398, 500, 559, 617, 673, 712, 719, 729, 763, 771, 781, 827, 900, 908, 936, 959, 987), List(6.192695986465059, 3.5868104021389136, 3.92718095581077, 4.188266224696665, 2.475925497057267, 6.87236827538816, 3.0915439384811654, 3.700553080413949, 3.194399221934601, 3.639171711754565, 3.7321185323120734, 3.4799402649438864, 3.2044439100488926, 3.4842269176732663, 2.829296098614324, 3.411654357077755, 3.4834800916490103, 4.209448201649882, 4.034223448834982, 3.9495348277850035, 3.0070581342445175, 2.872824334216562, 7.65756357395087, 3.105322371639268, 4.086215561225933, 3.779031794372209, 3.6468337457361537, 3.4171409809001703, 
7.1344782054533775, 3.4063704433590747, 2.4718452465817693, 4.070986840524109))","List(0, 1000, List(0, 4, 12, 20, 44, 56, 77, 82, 89, 140, 169, 189, 344, 356, 371, 398, 500, 559, 617, 673, 712, 719, 729, 763, 771, 781, 827, 900, 908, 936, 959, 987), List(0.2662999977178024, 0.15424099681163508, 0.16887770396860347, 0.18010496373727708, 0.10647042187390916, 0.29552745041121203, 0.13294341358939651, 0.15913219040990967, 0.13736642447331185, 0.15649265209418492, 0.1604895765607564, 0.149645338094763, 0.1377983688845599, 0.14982967390174365, 0.1216661606270881, 0.14670888890548361, 0.14979755868585226, 0.18101583687340853, 0.17348077437773674, 0.1698389712532777, 0.12931033204872672, 0.12353797365761161, 0.3292926322758294, 0.13353591752089533, 0.17571655334077885, 0.16250695342001997, 0.15682213696414626, 0.14694482619589494, 0.3067987729387085, 0.14648166861010645, 0.10629496183280786, 0.17506168374973088))",3
198950333449602,"All of your favorite festive flavors, none of the guilt. Watch how on YouTube. Link . . . organicturkey healthyholidayrecipes healthyburger holidaysandwich turkeyburger cleaneating cleaneatingrecipe cleancheat fitnessbyeve medium proteinstyle fitgirls lifehacks cranberries bestburgers turkeyleftovers","List(all, of, your, favorite, festive, flavors,, none, of, the, guilt., watch, how, on, youtube., link, , ., ., ., organicturkey, healthyholidayrecipes, healthyburger, holidaysandwich, turkeyburger, cleaneating, cleaneatingrecipe, cleancheat, fitnessbyeve, medium, proteinstyle, fitgirls, lifehacks, cranberries, bestburgers, turkeyleftovers)","List(favorite, festive, flavors,, none, guilt., watch, youtube., link, , ., ., ., organicturkey, healthyholidayrecipes, healthyburger, holidaysandwich, turkeyburger, cleaneating, cleaneatingrecipe, cleancheat, fitnessbyeve, medium, proteinstyle, fitgirls, lifehacks, cranberries, bestburgers, turkeyleftovers)","List(0, 1000, List(2, 54, 56, 66, 84, 86, 101, 125, 202, 266, 282, 372, 475, 582, 634, 636, 638, 642, 707, 738, 818, 831, 889, 942, 962), List(1.0, 2.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(2, 54, 56, 66, 84, 86, 101, 125, 202, 266, 282, 372, 475, 582, 634, 636, 638, 642, 707, 738, 818, 831, 889, 942, 962), List(3.8291772783786517, 8.095498560796276, 10.30855241308224, 3.6083570991096887, 3.5300889859024323, 3.3004744339545926, 3.9031838275680992, 3.875182558118184, 3.6548871843760833, 3.947751766391347, 3.8768407041385577, 0.692932335326463, 3.4743764523715597, 4.1912919456132025, 4.060130784010643, 4.235235917021601, 4.013240857942701, 3.493892455314727, 3.616744895804571, 4.430664970094392, 3.9209467928146986, 3.4152215945143665, 3.919213940395525, 3.945231192738668, 4.024718745787387))","List(0, 1000, List(2, 54, 56, 66, 84, 86, 101, 125, 202, 266, 282, 372, 475, 582, 634, 636, 638, 642, 707, 738, 818, 
831, 889, 942, 962), List(0.17179805224670153, 0.3632088001158472, 0.46249862497889754, 0.16189083878093105, 0.15837929872297782, 0.14807752110226638, 0.175118394993964, 0.1738621033150456, 0.16397843552521765, 0.17711795900624344, 0.1739364969082201, 0.03108877361725345, 0.15587972660852464, 0.18804451721768334, 0.18215990272856897, 0.1900160866037694, 0.18005616153749118, 0.15675532234346454, 0.1622671616905437, 0.19878411384029857, 0.17591533977077614, 0.1532257128029535, 0.17583759443573355, 0.1770048721438201, 0.18057112301152828))",3
114877038548253,The M&P Shield has quite a few options on our site! Go over and check them out at taguagunleather.com and get FREE SHIPPING now!!! . . pistol holster gun guns handmade repost taguagunleather metal firearms mandpshield keepitreal weapons countryboy firearm glockteam gunpictures edc 9mm leatherholster everydaycarry tactical military tacticool ar15,"List(the, m&p, shield, has, quite, a, few, options, on, our, site!, go, over, and, check, them, out, at, taguagunleather.com, and, get, free, shipping, now!!!, ., ., pistol, holster, gun, guns, handmade, repost, taguagunleather, metal, firearms, mandpshield, keepitreal, weapons, countryboy, firearm, glockteam, gunpictures, edc, 9mm, leatherholster, everydaycarry, tactical, military, tacticool, , ar15)","List(m&p, shield, quite, options, site!, go, check, taguagunleather.com, get, free, shipping, now!!!, ., ., pistol, holster, gun, guns, handmade, repost, taguagunleather, metal, firearms, mandpshield, keepitreal, weapons, countryboy, firearm, glockteam, gunpictures, edc, 9mm, leatherholster, everydaycarry, tactical, military, tacticool, , ar15)","List(0, 1000, List(56, 69, 73, 77, 136, 182, 197, 207, 246, 255, 315, 321, 361, 372, 455, 475, 488, 501, 534, 555, 620, 628, 651, 654, 665, 729, 739, 746, 756, 764, 855, 877, 882, 916, 934, 959, 986, 994), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(56, 69, 73, 77, 136, 182, 197, 207, 246, 255, 315, 321, 361, 372, 455, 475, 488, 501, 534, 555, 620, 628, 651, 654, 665, 729, 739, 746, 756, 764, 855, 877, 882, 916, 934, 959, 986, 994), List(6.87236827538816, 3.6294060091432407, 3.059950930807626, 3.0915439384811654, 3.4639812527427645, 3.6813758262300524, 3.81192194828789, 3.706133841323088, 3.069836499292839, 3.5814440321914467, 2.962626528696895, 4.020399084642871, 3.537941401594337, 
0.692932335326463, 4.01609800274348, 3.4743764523715597, 3.629946053439997, 3.4488136216983296, 3.3942626143658856, 4.263740456274265, 3.7780290367048304, 3.7622434873689525, 4.019920272088834, 3.8855913676587432, 3.3531265687892713, 3.828781786975435, 3.607300351679635, 3.596480350117603, 3.8486160478284943, 3.7046193073352085, 2.867014036198173, 3.7994290303167655, 2.9993227149651807, 4.3243905314791835, 3.739083017715062, 2.4718452465817693, 3.9556510548024395, 3.8822487461620536))","List(0, 1000, List(56, 69, 73, 77, 136, 182, 197, 207, 246, 255, 315, 321, 361, 372, 455, 475, 488, 501, 534, 555, 620, 628, 651, 654, 665, 729, 739, 746, 756, 764, 855, 877, 882, 916, 934, 959, 986, 994), List(0.3020902848108995, 0.15953864098392562, 0.1345069721460529, 0.13589571330538686, 0.1522670266330531, 0.16182309027654426, 0.16756153641521299, 0.1629113840831484, 0.1349415143707341, 0.15743036524885629, 0.13022886084954782, 0.17672561420824712, 0.15551811003486135, 0.030459387236728058, 0.1765365505545085, 0.15272397083200925, 0.1595623798362223, 0.1516002995604471, 0.14920238829006358, 0.1874222272703943, 0.16607169784677295, 0.16537780879664013, 0.1767045669338544, 0.1707998401538667, 0.14739416160222185, 0.1683026482513214, 0.15856693747626754, 0.158091320160293, 0.1691745074518204, 0.16284480936168116, 0.12602600036055758, 0.16701238232498009, 0.13184192361292746, 0.19008823668058142, 0.16435973866510273, 0.1086554743031694, 0.17387947005653026, 0.1706529077661531))",3
1446762168929844,"AVIVA captures the essence and energy of life, in the heart of one of Miami's most sophisticated neighborhoods. Be rejuvenated and inspired when you wake up in your sleek and modern apartment home at AVIVA. Indulge yourself in amenities that will refresh and replenish. See more on our community here . Call 305-707-6147. ‪avivacoralgables‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬ coralgables apartments","List(aviva, captures, the, essence, and, energy, of, life,, in, the, heart, of, one, of, miami's, most, sophisticated, neighborhoods., be, rejuvenated, and, inspired, when, you, wake, up, in, your, sleek, and, modern, apartment, home, at, aviva., indulge, yourself, in, amenities, that, will, refresh, and, replenish., see, more, on, our, community, here, ., call, 305-707-6147., ‪avivacoralgables‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬, coralgables, apartments)","List(aviva, captures, essence, energy, life,, heart, one, miami's, sophisticated, neighborhoods., rejuvenated, inspired, wake, sleek, modern, apartment, home, aviva., indulge, amenities, refresh, replenish., see, community, ., call, 305-707-6147., ‪avivacoralgables‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬‬, coralgables, apartments)","List(0, 1000, List(44, 56, 104, 140, 146, 160, 176, 195, 200, 225, 231, 254, 334, 362, 365, 372, 387, 420, 458, 479, 515, 577, 661, 778, 780, 800, 830, 903, 935, 946), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(44, 56, 104, 140, 146, 160, 176, 195, 200, 225, 231, 254, 334, 362, 365, 372, 387, 420, 458, 479, 515, 577, 661, 778, 780, 800, 830, 903, 935, 946), List(2.475925497057267, 3.43618413769408, 3.899923494394419, 3.639171711754565, 3.3485499017618596, 4.021836899412498, 4.382287375551935, 3.9504275521412717, 3.4705922400011175, 3.6583268548963908, 3.4401081594954825, 3.3864407754732557, 3.4400188059681924, 2.9524786366265463, 3.2346884397586577, 0.692932335326463, 3.844992612340219, 
2.5919043690326036, 4.147383809041079, 4.158868758907976, 2.8731284088185403, 3.757077424219052, 4.183556787717141, 3.746096347550126, 3.4918201109408145, 3.4978607144903475, 3.3190639335395344, 3.3818780299148377, 3.800069194220581, 4.325039811882904))","List(0, 1000, List(44, 56, 104, 140, 146, 160, 176, 195, 200, 225, 231, 254, 334, 362, 365, 372, 387, 420, 458, 479, 515, 577, 661, 778, 780, 800, 830, 903, 935, 946), List(0.12722236249609564, 0.17656406240358644, 0.20039273439390204, 0.1869943272722263, 0.1720610858220664, 0.20665710358626171, 0.2251784044863918, 0.20298782279613, 0.17833208008817003, 0.18797859055775343, 0.17676569339961357, 0.1740080613980013, 0.17676110207937687, 0.1517094548350995, 0.1662104557402955, 0.0356054690882043, 0.19757017911215558, 0.1331817930645322, 0.21310817590892076, 0.2136983158210273, 0.1476321417421424, 0.19305276615067632, 0.21496687957797667, 0.1924885171382436, 0.17942284792222657, 0.17973323684765558, 0.17054598589592257, 0.17377361037348524, 0.19526184495938198, 0.2222368093916379))",3
140066932696973,"Stay comfy, healthy, & beat boredom! TripIt's travel tips from top frequent flyers that anyone can benefit from ️","List(stay, comfy,, healthy,, &, beat, boredom!, tripit's, travel, tips, from, top, frequent, flyers, that, anyone, can, benefit, from, ️)","List(stay, comfy,, healthy,, &, beat, boredom!, tripit's, travel, tips, top, frequent, flyers, anyone, benefit, ️)","List(0, 1000, List(22, 56, 109, 200, 209, 290, 299, 348, 433, 441, 541, 562, 569, 934, 942), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(22, 56, 109, 200, 209, 290, 299, 348, 433, 441, 541, 562, 569, 934, 942), List(3.210533717850982, 3.43618413769408, 4.209834004123803, 3.4705922400011175, 3.6517905218705455, 3.7619968782431648, 4.370889941940116, 3.507946432714921, 3.0402285734581214, 3.445484034820284, 3.874906467558727, 2.610557519489445, 4.101852962090629, 3.739083017715062, 3.945231192738668))","List(0, 1000, List(22, 56, 109, 200, 209, 290, 299, 348, 433, 441, 541, 562, 569, 934, 942), List(0.22696443111270018, 0.24291648882985506, 0.29760864198754655, 0.24534892407336706, 0.2581584967417461, 0.26594938921549277, 0.3089942783073139, 0.24798962927818186, 0.2149249343780062, 0.24357393274608602, 0.2739313831636294, 0.1845499080117136, 0.2899750651651793, 0.26432952417858613, 0.27890289651509365))",3
118359534897262,"When it’s snow I usually have ️ choices️ 1. Make snow or 2. Shovel as fast as possible and sit by the place. Can you guess what I chose⁉️ . . . . Doing work in the snow with my @turnerfootwear ! Sun, rain, or snow we got you covered. legendsevolve turnerfootwear","List(when, it’s, snow, , i, usually, have, , ️, choices️, 1., , make, snow, , or, 2., shovel, as, fast, as, possible, and, sit, by, the, , place., , , can, you, guess, what, i, chose⁉️, ., ., ., ., doing, work, in, the, snow, with, my, @turnerfootwear, !, , , sun,, rain,, or, snow, we, got, you, covered., , , legendsevolve, turnerfootwear)","List(it’s, snow, , usually, , ️, choices️, 1., , make, snow, , 2., shovel, fast, possible, sit, , place., , , guess, chose⁉️, ., ., ., ., work, snow, @turnerfootwear, !, , , sun,, rain,, snow, got, covered., , , legendsevolve, turnerfootwear)","List(0, 1000, List(7, 22, 38, 56, 94, 100, 263, 372, 396, 444, 512, 525, 527, 597, 630, 682, 743, 777, 792, 847, 900, 964, 977, 981, 982, 995), List(1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 11.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(7, 22, 38, 56, 94, 100, 263, 372, 396, 444, 512, 525, 527, 597, 630, 682, 743, 777, 792, 847, 900, 964, 977, 981, 982, 995), List(3.422306688661753, 3.210533717850982, 2.455722108308975, 13.74473655077632, 3.929802189290644, 3.604979430709536, 3.5180383055728277, 7.6222556885910935, 4.010075797538204, 4.067466595500856, 3.700437145105721, 2.660880842910326, 3.367897642319437, 3.9258729111507544, 2.976244330387577, 4.470675215250289, 3.700205314804463, 16.778067347838924, 2.7000064552320593, 3.845260565182122, 3.4171409809001703, 2.9096744113606685, 4.013082366778251, 3.7437934557984858, 3.8417827660641426, 3.747675071540662))","List(0, 1000, List(7, 22, 38, 56, 94, 100, 263, 372, 396, 444, 512, 525, 527, 597, 630, 682, 743, 777, 792, 847, 900, 964, 977, 981, 982, 995), List(0.11942327481383705, 0.11203334048239146, 
0.08569377407893197, 0.4796301441321189, 0.13713260952635206, 0.12579774065199376, 0.12276388225636138, 0.2659828059290319, 0.13993380125174804, 0.14193648472239845, 0.12912879011556308, 0.0928529009987747, 0.11752464120650304, 0.13699549520381787, 0.10385768340358394, 0.1560066713492065, 0.12912070026965036, 0.5854798911986321, 0.09421821076719442, 0.13418248303927555, 0.11924301459938483, 0.1015346894531516, 0.14003871713955862, 0.13064173243132052, 0.13406112332560938, 0.1307771835482632))",3
171650321163,"Our intofriends use lego with their International Year One Management teacher, Dr.Karen Johnston (thanks Karen!) to learn about the Principles of Scientific Management..learning can be fun when u use lego intoqueens belfast qub intostudy studyabroad","List(our, intofriends, use, lego, with, their, international, year, one, management, teacher,, dr.karen, johnston, (thanks, karen!), to, learn, about, the, principles, of, scientific, management..learning, can, be, fun, when, u, use, lego, , intoqueens, belfast, qub, intostudy, studyabroad)","List(intofriends, use, lego, international, year, one, management, teacher,, dr.karen, johnston, (thanks, karen!), learn, principles, scientific, management..learning, fun, u, use, lego, , intoqueens, belfast, qub, intostudy, studyabroad)","List(0, 1000, List(44, 56, 79, 82, 116, 234, 259, 267, 318, 347, 372, 403, 489, 526, 591, 673, 719, 775, 809, 847, 899, 950, 964, 975), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0))","List(0, 1000, List(44, 56, 79, 82, 116, 234, 259, 267, 318, 347, 372, 403, 489, 526, 591, 673, 719, 775, 809, 847, 899, 950, 964, 975), List(2.475925497057267, 3.43618413769408, 3.932722752114263, 3.700553080413949, 3.3649922320657724, 3.661332367260668, 3.2711450220488736, 3.4956856698955767, 3.4563232701999187, 2.8844959327815496, 0.692932335326463, 4.421801575142747, 4.515537611646647, 3.2153039379961323, 3.4874071654196195, 3.9495348277850035, 2.872824334216562, 2.979339698252861, 3.514085540725088, 3.845260565182122, 3.2392831311369683, 4.328509778496269, 5.819348822721337, 3.849423041819835))","List(0, 1000, List(44, 56, 79, 82, 116, 234, 259, 267, 318, 347, 372, 403, 489, 526, 591, 673, 719, 775, 809, 847, 899, 950, 964, 975), List(0.139345422662296, 0.19338890914995244, 0.22133416969061356, 0.20826762908957444, 0.18938221904893712, 0.20606028203573284, 0.184100485345308, 0.1967376634495169, 0.19452234225786036, 
0.16233982217915746, 0.03899832578048066, 0.24885959216034553, 0.25413506900359006, 0.18095774156384192, 0.19627174809520392, 0.2222803555887572, 0.1616829430294755, 0.16767764215887465, 0.19777321067447065, 0.21641178595188104, 0.1823072938049259, 0.24360911719650705, 0.3275137407346161, 0.21664605070140336))",3
315889008467707,"The average age of a Roehampton Online student is 38 years old. By this stage, students are faced with juggling professional and personal commitments. Read this article and find out how online learning can work for you.","List(the, average, age, of, a, roehampton, online, student, is, 38, years, old., by, this, stage,, students, are, faced, with, juggling, professional, and, personal, commitments., read, this, article, and, find, out, how, online, learning, can, work, for, you.)","List(average, age, roehampton, online, student, 38, years, old., stage,, students, faced, juggling, professional, personal, commitments., read, article, find, online, learning, work, you.)","List(0, 1000, List(0, 4, 56, 72, 155, 309, 396, 470, 499, 500, 510, 523, 527, 619, 650, 665, 716, 812, 846, 861, 939), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(0, 4, 56, 72, 155, 309, 396, 470, 499, 500, 510, 523, 527, 619, 650, 665, 716, 812, 846, 861, 939), List(6.192695986465059, 3.5868104021389136, 3.43618413769408, 3.557344030067446, 3.5984674866811264, 4.364338467021407, 4.010075797538204, 3.480405316340446, 2.2295712966342998, 3.4834800916490103, 2.9397743410062183, 3.0933743426277713, 3.367897642319437, 4.15484313590517, 3.3184307716663834, 3.3531265687892713, 3.0032984051432297, 4.132630032964617, 4.146840183042369, 3.8522526603621507, 2.9211484301694886))","List(0, 1000, List(0, 4, 56, 72, 155, 309, 396, 470, 499, 500, 510, 523, 527, 619, 650, 665, 716, 812, 846, 861, 939), List(0.36445754437985406, 0.21109386189420704, 0.20222908335854844, 0.2093596831729486, 0.21177991404599503, 0.2568535713701523, 0.23600421870251048, 0.20483162386986412, 0.1312165301779955, 0.20501258302899747, 0.17301397318643832, 0.18205376450351346, 0.19821023139601973, 0.24452418298994671, 0.19529896718318393, 0.19734091224997663, 0.17675254270042787, 0.24321687952010654, 0.24405318674622034, 
0.22671588400188472, 0.17191778604270813))",3

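The vector columns in the output above can be checked by hand. The sketch below rests on two assumptions that this output does not state outright: that each `List(0, 1000, List(...), List(...))` entry is a sparse vector of size 1000 (bucket indices, then values), and that `features` is the `idf` column scaled to unit L2 length. The numbers are copied from the rows for pages 140066932696973 and 118359534897262; the helper name `l2_normalize` is only illustrative.

```python
import math

# "idf" values for page 140066932696973 (all hashingTF counts in that row are 1.0,
# so these are the raw per-bucket IDF weights).
idf_values = [
    3.210533717850982, 3.43618413769408, 4.209834004123803,
    3.4705922400011175, 3.6517905218705455, 3.7619968782431648,
    4.370889941940116, 3.507946432714921, 3.0402285734581214,
    3.445484034820284, 3.874906467558727, 2.610557519489445,
    4.101852962090629, 3.739083017715062, 3.945231192738668,
]

# First entry of the "features" column for the same row.
expected_first_feature = 0.22696443111270018


def l2_normalize(values):
    """Scale a list of floats to unit Euclidean length (assumed Normalizer behaviour)."""
    norm = math.sqrt(sum(v * v for v in values))
    return [v / norm for v in values]


normalized = l2_normalize(idf_values)

# In the row for page 118359534897262, bucket 56 has a count of 4.0;
# its "idf" entry should then be 4 times the weight seen for bucket 56 above.
bucket_56_weight = 3.43618413769408
bucket_56_tfidf = 4.0 * bucket_56_weight

print(normalized[0], bucket_56_tfidf)
```

The same arithmetic also accounts for bucket 372, whose weight (about 0.69) is by far the lowest in every row. Its count in each row matches the number of empty-string and lone `.` tokens in `noStopWords`, so those tokens would hash to it. Filtering them out before hashing may be worth trying.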

In [24]:
# Show the posts that the model assigned to cluster 4
display(
  predictions
  .filter(col('prediction') == 4)
)

page_id,message,words,noStopWords,hashingTF,idf,features,prediction
240229921942,"So, I think it's great you guys are making waves in the industry; including Bixby. However, I noticed you are extremely bias towards anything remotely conservative. This to me seems like a short fall to what could potentially be a great brand. It makes me wonder who you are, Like, who your team is. I wonder how a group of writers with so much going for them can be so bias and still feel like they are creative? I mean, how is it that I can be exposed to Flipboard for less then 3 minutes, and already know: your entire team does not support Trump; your entire team (much like AP) has an extremely progressive agenda; your entire team ONLY values progressive views; your entire team will NOT expose any good truths about any opposing views without constructing the narrative in such a way your paper still reads as if the opposing view is less good then your progressive agenda. I cant scroll down your news feed, which you managed to pitch of to Samsung's Bixby, without feeling a extremely bias narrative that onlky captures one side of the story. It's kind of like, have you ever noticed that AP ONLY posts pictures of Trump making a duck face. You see, it's not so much I am a huge Trump supporter, rather I bring it up as a metaphorical similarity to your Flipboard camp, and it scares me because, based on your boards, there is no doubt in my mind that your entire camp only has one point of view, which is damaging for any news business who wants to be deemed as credible. All of your stories, all of the news orginizations you post on, your entire flip board spells - Damn the president, loss faith in the system, fear big business, there are only good Democrates, the only good republicans are ones who side with Democrares, focus on black athletes, black talent, black calture, love imigrants, demand change and feel guilty if you happen to miss our message. I hear what you are saying, and a lot of what the progressives push for definetly has a place in our society. 
However, to completely disregard a very serious narrative told through a conservitives lense is extremely damaging. I mean the world isn't build so plainly, and relationships at the top arnt so black and white. Your demographic is a very young and vulnerable generation, your corrupting them and the youth by not capturing the story. Your helping to destroy due process, and frankly your making talk smack about your company to everyone I know. Which, i know it's like who the hell is this guy. You never know though.","List(so,, i, think, it's, great, you, guys, are, making, waves, in, the, industry;, including, bixby., however,, i, noticed, you, are, extremely, bias, towards, anything, remotely, conservative., this, to, me, seems, like, a, short, fall, to, what, could, potentially, be, a, great, brand., it, makes, me, wonder, who, you, are,, like,, who, your, team, is., i, wonder, how, a, group, of, writers, with, so, much, going, for, them, can, be, so, bias, and, still, feel, like, they, are, creative?, i, mean,, how, is, it, that, i, can, be, exposed, to, flipboard, for, less, then, 3, minutes,, and, already, know:, your, entire, team, does, not, support, trump;, your, entire, team, (much, like, ap), has, an, extremely, progressive, agenda;, your, entire, team, only, values, progressive, views;, your, entire, team, will, not, expose, any, good, truths, about, any, opposing, views, without, constructing, the, narrative, in, such, a, way, your, paper, still, reads, as, if, the, opposing, view, is, less, good, then, your, progressive, agenda., i, cant, scroll, down, your, news, feed,, which, you, managed, to, pitch, of, to, samsung's, bixby,, without, feeling, a, extremely, bias, narrative, that, onlky, captures, one, side, of, the, story., it's, kind, of, like,, have, you, ever, noticed, that, ap, only, posts, pictures, of, trump, making, a, duck, face., you, see,, it's, not, so, much, i, am, a, huge, trump, supporter,, rather, i, bring, it, up, as, a, metaphorical, 
similarity, to, your, flipboard, camp,, and, it, scares, me, because,, based, on, your, boards,, there, is, no, doubt, in, my, mind, that, your, entire, camp, only, has, one, point, of, view,, which, is, damaging, for, any, news, business, who, wants, to, be, deemed, as, credible., all, of, your, stories,, all, of, the, news, orginizations, you, post, on,, your, entire, flip, board, spells, -, damn, the, president,, loss, faith, in, the, system,, fear, big, business,, there, are, only, good, democrates,, the, only, good, republicans, are, ones, who, side, with, democrares,, focus, on, black, athletes,, black, talent,, black, calture,, love, imigrants,, demand, change, and, feel, guilty, if, you, happen, to, miss, our, message., , , i, hear, what, you, are, saying,, and, a, lot, of, what, the, progressives, push, for, definetly, has, a, place, in, our, society., however,, to, completely, disregard, a, very, serious, narrative, told, through, a, conservitives, lense, is, extremely, damaging., i, mean, the, world, isn't, build, so, plainly,, and, relationships, at, the, top, arnt, so, black, and, white., your, demographic, is, a, very, young, and, vulnerable, generation,, your, corrupting, them, and, the, youth, by, not, capturing, the, story., your, helping, to, destroy, due, process,, and, frankly, your, making, talk, smack, about, your, company, to, everyone, i, know., which,, i, know, it's, like, who, the, hell, is, this, guy., you, never, know, though.)","List(so,, think, great, guys, making, waves, industry;, including, bixby., however,, noticed, extremely, bias, towards, anything, remotely, conservative., seems, like, short, fall, potentially, great, brand., makes, wonder, are,, like,, team, is., wonder, group, writers, much, going, bias, still, feel, like, creative?, mean,, exposed, flipboard, less, 3, minutes,, already, know:, entire, team, support, trump;, entire, team, (much, like, ap), extremely, progressive, agenda;, entire, team, values, progressive, 
views;, entire, team, expose, good, truths, opposing, views, without, constructing, narrative, way, paper, still, reads, opposing, view, less, good, progressive, agenda., cant, scroll, news, feed,, managed, pitch, samsung's, bixby,, without, feeling, extremely, bias, narrative, onlky, captures, one, side, story., kind, like,, ever, noticed, ap, posts, pictures, trump, making, duck, face., see,, much, huge, trump, supporter,, rather, bring, metaphorical, similarity, flipboard, camp,, scares, because,, based, boards,, doubt, mind, entire, camp, one, point, view,, damaging, news, business, wants, deemed, credible., stories,, news, orginizations, post, on,, entire, flip, board, spells, -, damn, president,, loss, faith, system,, fear, big, business,, good, democrates,, good, republicans, ones, side, democrares,, focus, black, athletes,, black, talent,, black, calture,, love, imigrants,, demand, change, feel, guilty, happen, miss, message., , , hear, saying,, lot, progressives, push, definetly, place, society., however,, completely, disregard, serious, narrative, told, conservitives, lense, extremely, damaging., mean, world, build, plainly,, relationships, top, arnt, black, white., demographic, young, vulnerable, generation,, corrupting, youth, capturing, story., helping, destroy, due, process,, frankly, making, talk, smack, company, everyone, know., which,, know, like, hell, guy., never, know, though.)","List(0, 1000, List(5, 6, 8, 10, 13, 18, 26, 37, 42, 44, 52, 55, 57, 62, 71, 86, 88, 89, 99, 109, 111, 114, 115, 117, 125, 126, 130, 134, 149, 150, 158, 159, 161, 162, 168, 178, 184, 187, 191, 195, 197, 198, 212, 216, 240, 248, 260, 264, 268, 281, 293, 294, 298, 304, 305, 320, 322, 324, 329, 330, 348, 361, 362, 372, 374, 382, 385, 390, 393, 397, 399, 407, 412, 418, 420, 422, 433, 436, 437, 440, 441, 445, 446, 450, 456, 457, 458, 468, 491, 493, 497, 498, 499, 506, 511, 513, 517, 524, 531, 536, 538, 542, 564, 567, 588, 594, 598, 608, 612, 617, 618, 620, 625, 627, 631, 657, 
669, 674, 675, 682, 684, 690, 691, 695, 718, 729, 732, 735, 745, 746, 756, 775, 779, 781, 784, 791, 800, 816, 817, 826, 827, 828, 836, 842, 856, 865, 879, 884, 887, 893, 898, 906, 909, 911, 925, 927, 930, 936, 938, 945, 952, 954, 959, 964, 966, 967, 969, 972, 973, 978, 984, 996, 998, 999), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 3.0, 1.0, 1.0, 1.0, 3.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 2.0, 1.0, 1.0, 2.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 5.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 6.0, 1.0, 2.0, 2.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 3.0, 2.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 3.0))","List(0, 1000, List(5, 6, 8, 10, 13, 18, 26, 37, 42, 44, 52, 55, 57, 62, 71, 86, 88, 89, 99, 109, 111, 114, 115, 117, 125, 126, 130, 134, 149, 150, 158, 159, 161, 162, 168, 178, 184, 187, 191, 195, 197, 198, 212, 216, 240, 248, 260, 264, 268, 281, 293, 294, 298, 304, 305, 320, 322, 324, 329, 330, 348, 361, 362, 372, 374, 382, 385, 390, 393, 397, 399, 407, 412, 418, 420, 422, 433, 436, 437, 440, 441, 445, 446, 450, 456, 457, 458, 468, 491, 493, 497, 498, 499, 506, 511, 513, 517, 524, 531, 536, 538, 542, 564, 567, 588, 594, 598, 608, 612, 617, 618, 620, 625, 627, 631, 657, 669, 674, 675, 682, 684, 690, 691, 695, 718, 729, 732, 735, 745, 746, 756, 775, 779, 781, 784, 791, 800, 816, 817, 826, 827, 828, 836, 842, 856, 865, 879, 884, 887, 893, 898, 906, 909, 911, 925, 927, 930, 936, 938, 945, 952, 954, 959, 964, 966, 967, 
969, 972, 973, 978, 984, 996, 998, 999), List(4.40033089074587, 4.0210378582193655, 3.6917800817474853, 3.9556510548024395, 3.6447482392451325, 3.7668167602800646, 3.9920685780166445, 3.8393821247405073, 8.140966378080565, 7.427776491171802, 3.782046114665132, 3.8419163041560815, 3.574571857856081, 11.320961268811674, 3.460600864890833, 6.600948867909185, 4.314915869732134, 3.194399221934601, 6.724025181457354, 4.209834004123803, 4.001579837057452, 3.6702909938055592, 5.729298153468523, 3.921235893712925, 3.875182558118184, 3.629082122500733, 4.1016797869235155, 3.3148766866552717, 14.024142659842179, 3.392557002490692, 3.682058497777785, 3.083880102993244, 3.8202509917533662, 3.9312614044919316, 12.93875375903463, 6.8453157478311635, 3.9289276811457117, 4.103239444115162, 6.46124383512391, 11.851282656423816, 3.81192194828789, 3.6218758530794877, 3.7958516660825876, 3.999388912863372, 3.219523504421303, 4.428261700492592, 5.869173084573264, 3.5786689648881684, 4.072499318755467, 2.050513023711846, 3.342290990860326, 3.3676482761238886, 4.0189633342167665, 4.229316404866608, 3.7183334061150455, 3.5776431133414945, 3.2794048348556966, 3.2652759307094916, 3.883500920238708, 11.360890014386142, 3.507946432714921, 3.537941401594337, 2.9524786366265463, 1.385864670652926, 3.616851522663978, 3.8622880035995153, 3.909165047336312, 6.135481417195681, 3.3629220951115175, 8.911521391370401, 3.49975591073656, 3.8432526667764506, 3.1273652149329827, 2.996623657996016, 2.5919043690326036, 4.294130391838319, 9.120685720374365, 3.90432035276502, 3.891327654811639, 3.7668167602800646, 3.445484034820284, 3.723895879774548, 2.7747311039125733, 3.9159010097457028, 3.5754922444834096, 10.631911731776022, 4.147383809041079, 14.619321518567938, 4.322877181282239, 2.9813709577675054, 3.9416835171773967, 3.2876567154211838, 2.2295712966342998, 3.721643493071609, 21.231788584209077, 3.9118826346485283, 7.803813557591497, 6.935480966638728, 3.7183334061150455, 3.8499613998409132, 
3.7629836798527845, 12.956343071345287, 3.450798644456059, 3.699973538236036, 3.973160936122537, 3.1301796292003075, 3.354684137657673, 3.815557371663381, 3.963160852787954, 4.034223448834982, 4.211958582421762, 15.112116146819321, 3.9274718649037306, 3.759534126105672, 3.6921249291559257, 3.657882361056424, 4.120381721502938, 3.708001037571024, 3.498523624690295, 4.470675215250289, 4.2780967323738155, 3.6131263307662893, 3.5696773849465853, 6.609359440229086, 3.391024435140914, 3.828781786975435, 3.96981645139969, 3.4598714702560156, 2.6219660525186645, 3.596480350117603, 7.697232095656989, 2.979339698252861, 5.3286605100066815, 7.558063588744418, 4.158135621045226, 4.091858048625015, 6.995721428980695, 3.921525078214651, 3.2818427455557644, 2.956823002666339, 3.6468337457361537, 7.660993427703539, 4.049884454076199, 3.2682061706062187, 3.9009146449180374, 3.848481612122168, 3.8947008133156995, 7.2877428615431965, 3.9829544344443577, 7.366394640716887, 12.103155979852833, 8.039521437675196, 3.7518158642066934, 11.114556489944514, 3.7140936164639524, 3.6929300358760493, 4.0321214187942775, 3.4063704433590747, 8.486766012595101, 4.4182305678959635, 2.8679716734276797, 3.5971074404391565, 2.4718452465817693, 2.9096744113606685, 3.150439856099582, 8.367865455142384, 8.215517665999796, 3.641792086657145, 3.6984682985608717, 3.337844203405871, 3.6926999392344455, 8.97066322529971, 3.700205314804463, 12.303481323861988))","List(0, 1000, List(5, 6, 8, 10, 13, 18, 26, 37, 42, 44, 52, 55, 57, 62, 71, 86, 88, 89, 99, 109, 111, 114, 115, 117, 125, 126, 130, 134, 149, 150, 158, 159, 161, 162, 168, 178, 184, 187, 191, 195, 197, 198, 212, 216, 240, 248, 260, 264, 268, 281, 293, 294, 298, 304, 305, 320, 322, 324, 329, 330, 348, 361, 362, 372, 374, 382, 385, 390, 393, 397, 399, 407, 412, 418, 420, 422, 433, 436, 437, 440, 441, 445, 446, 450, 456, 457, 458, 468, 491, 493, 497, 498, 499, 506, 511, 513, 517, 524, 531, 536, 538, 542, 564, 567, 588, 594, 598, 608, 612, 617, 618, 620, 
625, 627, 631, 657, 669, 674, 675, 682, 684, 690, 691, 695, 718, 729, 732, 735, 745, 746, 756, 775, 779, 781, 784, 791, 800, 816, 817, 826, 827, 828, 836, 842, 856, 865, 879, 884, 887, 893, 898, 906, 909, 911, 925, 927, 930, 936, 938, 945, 952, 954, 959, 964, 966, 967, 969, 972, 973, 978, 984, 996, 998, 999), List(0.058743046393330806, 0.05367960258431423, 0.04928411385428962, 0.05280670913107189, 0.04865625341045576, 0.05028583013374278, 0.053292871719481154, 0.051254555140312956, 0.10867936469047494, 0.09915850190671403, 0.050489137269818365, 0.05128838564594406, 0.04771942058356024, 0.15113130570009953, 0.046197943337080556, 0.08812061074840379, 0.05760278247530379, 0.04264423434327618, 0.08976364118774757, 0.05619997230947973, 0.053419844064280984, 0.0489972412255567, 0.07648434528826503, 0.052347277453157176, 0.05173247977166942, 0.04844711563296385, 0.054756147207146576, 0.044252570960488534, 0.18721793504809783, 0.04528957897426434, 0.049154333737219384, 0.041168838539525215, 0.05099916047556534, 0.05248111489843807, 0.17272833139373794, 0.09138283245869364, 0.05244996041887734, 0.05477696814471726, 0.08625559208820567, 0.1582109309940499, 0.050887970340356754, 0.04835091418142635, 0.05067343707988643, 0.053390595908909814, 0.04297963568657501, 0.05911591400112947, 0.0783516320380933, 0.047774115685915046, 0.05436659704877559, 0.027373709994253807, 0.044618543380232044, 0.04495705523800597, 0.05365190833523286, 0.056460056289321985, 0.049638592461455536, 0.047760420887388494, 0.043779032790824526, 0.04359041632257371, 0.051843527314250855, 0.15166434201303708, 0.0468299919162131, 0.047230415405309256, 0.0394146699038974, 0.01850086156344568, 0.048283840935795116, 0.05156034148636355, 0.05218613541490742, 0.08190676530570025, 0.04489396219401311, 0.11896603403833983, 0.04672065099969878, 0.051306225670559424, 0.041749351235392564, 0.04000399218504562, 0.0346011157745801, 0.05732530282149144, 0.12175831266182921, 0.052121460251827886, 0.05194801178224686, 
0.05028583013374278, 0.04599613836554492, 0.04971284975172637, 0.037041795722440755, 0.05227605841436884, 0.04773170745827293, 0.141932709065807, 0.05536628166254858, 0.19516338737396388, 0.05770906350401531, 0.03980037338923386, 0.0526202468556021, 0.04388919282535967, 0.02976408214911021, 0.049682781091011624, 0.2834377624733983, 0.05222241435881484, 0.10417847958265648, 0.09258651003726315, 0.049638592461455536, 0.051395785166749815, 0.05023465970430016, 0.172963143232835, 0.04606708674832525, 0.049393493945600685, 0.05304046059119307, 0.04178692278898259, 0.0447839880286804, 0.05093650210974142, 0.05290696259437288, 0.05385562611083711, 0.05622833476780887, 0.20174204212254992, 0.05243052572563127, 0.0501886092365435, 0.04928871745974921, 0.048831589844476886, 0.05500581220687523, 0.049500658560618464, 0.04670420036492087, 0.05968212120849093, 0.05711125846320034, 0.04823411078460803, 0.04765408089517498, 0.08823288926841723, 0.045269119677642286, 0.05111304393386523, 0.05299581276217431, 0.04618820614596108, 0.035002429882888896, 0.0480118921292872, 0.10275565027310277, 0.0397732567075869, 0.07113595757353923, 0.10089779406614148, 0.055509815002893564, 0.054625030060991654, 0.0933907014405851, 0.05235113797615486, 0.04381157813909778, 0.03947266583696731, 0.04868409427233578, 0.10227187534669917, 0.054064695651354105, 0.04362953411222881, 0.05207599511317702, 0.05137603046175766, 0.051993042397311014, 0.09728909658103742, 0.05317120074245797, 0.09833907332770317, 0.16157336138639825, 0.10732510634296437, 0.05008557284494825, 0.14837586621115156, 0.04958199259059221, 0.04929946538361813, 0.053827564664717, 0.04547398410604418, 0.11329568207147105, 0.0589820015063349, 0.03828652827477799, 0.048020263589699255, 0.032998363896328396, 0.03884324683298424, 0.04205739050566872, 0.11170839667497354, 0.10967460115647162, 0.04861678969444588, 0.04937339946492033, 0.04455918015318753, 0.049296393665143254, 0.1197555590946492, 0.04939658809057524, 0.16424764231414343))",4
157801977596418,Stay refreshed with Firefly Orchard Blueberry Wired! A delicious blend of blueberries with tart lemonade! Available at www.apolloecigs.com @cleanbuilds,"List(stay, refreshed, with, firefly, orchard, blueberry, wired!, a, delicious, blend, of, blueberries, with, tart, lemonade!, available, at, www.apolloecigs.com, , @cleanbuilds)","List(stay, refreshed, firefly, orchard, blueberry, wired!, delicious, blend, blueberries, tart, lemonade!, available, www.apolloecigs.com, , @cleanbuilds)","List(0, 1000, List(72, 107, 175, 351, 371, 372, 433, 465, 547, 571, 786, 797, 938, 947, 999), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(72, 107, 175, 351, 371, 372, 433, 465, 547, 571, 786, 797, 938, 947, 999), List(3.557344030067446, 3.5622793273891324, 4.140699654446765, 4.116507795193133, 2.829296098614324, 0.692932335326463, 3.0402285734581214, 3.8383170201264347, 3.600458579819932, 3.7586735937518743, 3.5141817639630744, 4.49732315673318, 4.243383006297551, 4.044146263070945, 4.1011604412873295))","List(0, 1000, List(72, 107, 175, 351, 371, 372, 433, 465, 547, 571, 786, 797, 938, 947, 999), List(0.24983446008175658, 0.25018106904936976, 0.2908038845233111, 0.2891048753625664, 0.19870320588456783, 0.04866506427442592, 0.21351712338056827, 0.2695673660575706, 0.25286242143945414, 0.2639739592183192, 0.246803147575306, 0.31584977252096486, 0.29801539950532974, 0.2840228804372421, 0.2880270212544645))",4
230191010339228,"***WE ARE HIRING*** Here at the Oaklands Hall Hotel we are currently looking for food & beverage staff. With various hours available during the week and over the weekends. However, as the business is open 7 days a week, flexibility is key. Candidates must be well presented and able to work well in a busy environment. If you are interested please forward your CV to for the attention of Ben Leeming. We will be looking forward to hearing from you. The Oaklands Hall Hotel","List(***we, are, hiring***, , here, at, the, oaklands, hall, hotel, we, are, currently, looking, for, food, &, beverage, staff., with, various, hours, available, during, the, week, and, over, the, weekends., however,, as, the, business, is, open, 7, days, a, week,, flexibility, is, key., candidates, must, be, well, presented, and, able, to, work, well, in, a, busy, environment., if, you, are, interested, please, forward, your, cv, to, , for, the, attention, of, ben, leeming., we, will, be, looking, forward, to, hearing, from, you., , the, oaklands, hall, hotel)","List(***we, hiring***, , oaklands, hall, hotel, currently, looking, food, &, beverage, staff., various, hours, available, week, weekends., however,, business, open, 7, days, week,, flexibility, key., candidates, must, well, presented, able, work, well, busy, environment., interested, please, forward, cv, , attention, ben, leeming., looking, forward, hearing, you., , oaklands, hall, hotel)","List(0, 1000, List(3, 4, 19, 23, 35, 89, 96, 122, 157, 178, 191, 232, 315, 371, 372, 374, 377, 496, 504, 527, 562, 604, 607, 633, 653, 657, 681, 701, 718, 721, 763, 766, 783, 789, 801, 841, 847, 858, 922, 984, 999), List(1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 2.0, 1.0, 3.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(3, 4, 19, 23, 35, 89, 96, 122, 157, 178, 191, 232, 315, 371, 372, 374, 377, 496, 504, 527, 562, 
604, 607, 633, 653, 657, 681, 701, 718, 721, 763, 766, 783, 789, 801, 841, 847, 858, 922, 984, 999), List(2.3574385638229733, 3.5868104021389136, 3.325656753002941, 3.836987232696433, 6.757527360716712, 3.194399221934601, 4.066128802659197, 3.389324332725303, 4.326137880699627, 3.4226578739155817, 3.230621917561955, 7.042656201017829, 5.92525305739379, 2.829296098614324, 2.078797005979389, 3.616851522663978, 3.752792665261161, 3.4797543049247546, 6.8099782168495295, 3.367897642319437, 2.610557519489445, 3.6748005843115346, 3.061601953899516, 3.6373197585045207, 3.4517022317588864, 3.657882361056424, 3.7151518798351018, 3.78709026155181, 3.391024435140914, 10.788814222101468, 3.105322371639268, 3.797766519543152, 2.947333860350577, 3.8965327027873364, 3.684451524958175, 2.8823962213604877, 3.845260565182122, 3.602663883942892, 4.042512809587855, 3.6926999392344455, 4.1011604412873295))","List(0, 1000, List(3, 4, 19, 23, 35, 89, 96, 122, 157, 178, 191, 232, 315, 371, 372, 374, 377, 496, 504, 527, 562, 604, 607, 633, 653, 657, 681, 701, 718, 721, 763, 766, 783, 789, 801, 841, 847, 858, 922, 984, 999), List(0.08715555307254433, 0.13260597716609124, 0.12295101051005751, 0.14185512595316188, 0.2498287945078098, 0.11809836115971928, 0.15032659179260655, 0.1253048292727803, 0.1599392431457545, 0.12653718512836362, 0.11943753034089229, 0.2603701346533098, 0.21905924304321311, 0.10460033617200147, 0.07685405064718714, 0.13371661076412447, 0.13874241531750278, 0.12864806559814052, 0.25176792600652037, 0.12451267499080017, 0.0965134735345228, 0.13585901336813438, 0.11318878704833646, 0.13447333055418714, 0.12761096796636262, 0.13523353912357433, 0.1373508187252365, 0.1400104127192772, 0.125367682815853, 0.3988672642222865, 0.11480515035347026, 0.14040511872954606, 0.10896424476560877, 0.1440565484879326, 0.13621581293976953, 0.10656347134641588, 0.14216099473787117, 0.133192087445254, 0.14945349801655822, 0.13652076048718476, 0.15162172706634244))",4
8760099910,"“The Moon Shots Program was conceived around the idea that we could improve patients’ lives faster if we brought together experts from various areas, and armed them with key resources to better utilize new knowledge,” says Dr. Andrew Futreal, co-leader of our Moon Shots Program. Learn more about what makes this ambitious initiative unique. endcancer","List(“the, moon, shots, program, was, conceived, around, the, idea, that, we, could, improve, patients’, lives, faster, if, we, brought, together, experts, from, various, areas,, and, armed, them, with, key, resources, to, better, utilize, new, knowledge,”, says, dr., andrew, futreal,, co-leader, of, our, moon, shots, program., , , learn, more, about, what, makes, this, ambitious, initiative, unique., , endcancer)","List(“the, moon, shots, program, conceived, around, idea, improve, patients’, lives, faster, brought, together, experts, various, areas,, armed, key, resources, better, utilize, new, knowledge,”, says, dr., andrew, futreal,, co-leader, moon, shots, program., , , learn, makes, ambitious, initiative, unique., , endcancer)","List(0, 1000, List(25, 97, 145, 164, 210, 217, 263, 301, 312, 355, 372, 378, 385, 415, 496, 522, 644, 645, 654, 675, 691, 737, 750, 754, 764, 770, 775, 784, 794, 808, 904, 921, 941, 958, 999), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(25, 97, 145, 164, 210, 217, 263, 301, 312, 355, 372, 378, 385, 415, 496, 522, 644, 645, 654, 675, 691, 737, 750, 754, 764, 770, 775, 784, 794, 808, 904, 921, 941, 958, 999), List(4.390762549340335, 4.219527759753206, 3.1960780135991707, 3.5139893267451225, 4.177747720505446, 4.257043323175983, 3.5180383055728277, 3.588882083012448, 3.530871458555814, 3.464438939601411, 2.078797005979389, 3.858343225308499, 3.909165047336312, 8.253506848951263, 3.4797543049247546, 4.225977503601093, 
3.5666304599811807, 3.860925976693232, 3.8855913676587432, 3.498523624690295, 3.5696773849465853, 3.1665545560293524, 3.7520599750342205, 4.1065400638057685, 3.7046193073352085, 4.078910678662239, 2.979339698252861, 8.316271242090451, 4.261503091362508, 4.208869776877175, 3.1862552309372094, 4.1159806933624274, 3.4909735699505995, 3.619734753878063, 4.1011604412873295))","List(0, 1000, List(25, 97, 145, 164, 210, 217, 263, 301, 312, 355, 372, 378, 385, 415, 496, 522, 644, 645, 654, 675, 691, 737, 750, 754, 764, 770, 775, 784, 794, 808, 904, 921, 941, 958, 999), List(0.1793001945715849, 0.17230768911373243, 0.1305143248501087, 0.14349647992295386, 0.17060156880270413, 0.17383966624653754, 0.14366182311413436, 0.14655478371866953, 0.14418587487073917, 0.1414730514282156, 0.08488928824058845, 0.15755843848256065, 0.15963378700703013, 0.33703835433715323, 0.14209846625116543, 0.17257106940676073, 0.145646121431833, 0.15766390713876988, 0.15867113751153616, 0.14286492598295925, 0.1457705449762636, 0.12930871155920332, 0.15321827951478828, 0.16769374890632716, 0.15128100305006195, 0.16656548153216172, 0.12166364762126393, 0.3396013870166851, 0.17402178433983054, 0.17187246211570217, 0.1301132039006762, 0.1680792643372724, 0.1425566136411641, 0.14781467646556584, 0.16747406784786892))",4
1043592612385397,"Speech by Chair Yellen on financial stability a decade after the onset of the crisis at the Fostering a Dynamic Global Recovery Symposium in Jackson Hole, WY: go.usa.gov/xRGfX Learn more about Chair Yellen: go.usa.gov/xRGwG","List(speech, by, chair, yellen, on, financial, stability, a, decade, after, the, onset, of, the, crisis, at, the, fostering, a, dynamic, global, recovery, symposium, in, jackson, hole,, wy:, go.usa.gov/xrgfx, , learn, more, about, chair, yellen:, go.usa.gov/xrgwg)","List(speech, chair, yellen, financial, stability, decade, onset, crisis, fostering, dynamic, global, recovery, symposium, jackson, hole,, wy:, go.usa.gov/xrgfx, , learn, chair, yellen:, go.usa.gov/xrgwg)","List(0, 1000, List(7, 8, 82, 92, 178, 224, 310, 317, 343, 352, 372, 476, 540, 615, 656, 775, 781, 808, 834, 999), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(7, 8, 82, 92, 178, 224, 310, 317, 343, 352, 372, 476, 540, 615, 656, 775, 781, 808, 834, 999), List(3.422306688661753, 3.6917800817474853, 3.700553080413949, 3.7253210581630434, 3.4226578739155817, 3.556339812268653, 3.0743511796473655, 3.842183433849135, 8.128918148873803, 4.227154666774108, 0.692932335326463, 4.267412431024386, 7.990665490111434, 3.567746588147375, 4.014192332800399, 2.979339698252861, 3.779031794372209, 4.208869776877175, 3.6245586899089157, 4.1011604412873295))","List(0, 1000, List(7, 8, 82, 92, 178, 224, 310, 317, 343, 352, 372, 476, 540, 615, 656, 775, 781, 808, 834, 999), List(0.17848468510287926, 0.19253861950561948, 0.192996160587364, 0.1942878930682162, 0.17850300058280105, 0.18547495863376717, 0.16033764712349519, 0.20038265494149027, 0.4239501389035858, 0.2204601861311773, 0.036138727741174194, 0.22255976253646045, 0.41673980256944315, 0.18606985996376715, 0.20935349156052355, 0.15538248211913155, 0.19708908673992814, 0.219506568261041, 0.1890328001720588, 0.2138891678950582))",4
108560955847639,"What a way to celebrate International Education Week? Journey to the USA explores opportunities for higher education for Nepalis in the U.S. A media team, currently traveling through various states such as Texas, Oklahoma, and Maryland, is meeting with Nepalis to learn about their experiences of studying in the U.S. According to the 2017 Open Doors Report published this week, Nepali students are the fastest growing international student population in the United States. The report showed an exciting 20% increase of Nepali students enrolled in the U.S. higher education institution, totaling 11,607 students, highest growth among the top 25 for undergraduate students.","List(what, a, way, to, celebrate, international, education, week?, , journey, to, the, usa, explores, opportunities, for, higher, education, for, nepalis, in, the, u.s., , a, media, team,, currently, traveling, through, various, states, such, as, texas,, oklahoma,, and, maryland,, is, meeting, with, nepalis, to, learn, about, their, experiences, of, studying, in, the, u.s., , according, to, the, 2017, open, doors, report, published, this, week,, nepali, students, are, the, fastest, growing, international, student, population, in, the, united, states., the, report, showed, an, exciting, 20%, increase, of, nepali, students, enrolled, in, the, u.s., higher, education, institution,, totaling, 11,607, students,, highest, growth, among, the, top, 25, for, undergraduate, students.)","List(way, celebrate, international, education, week?, , journey, usa, explores, opportunities, higher, education, nepalis, u.s., , media, team,, currently, traveling, various, states, texas,, oklahoma,, maryland,, meeting, nepalis, learn, experiences, studying, u.s., , according, 2017, open, doors, report, published, week,, nepali, students, fastest, growing, international, student, population, united, states., report, showed, exciting, 20%, increase, nepali, students, enrolled, u.s., higher, education, 
institution,, totaling, 11,607, students,, highest, growth, among, top, 25, undergraduate, students.)","List(0, 1000, List(1, 14, 41, 54, 57, 86, 90, 101, 117, 159, 178, 198, 200, 226, 254, 261, 307, 332, 338, 339, 361, 363, 372, 382, 384, 405, 427, 441, 457, 475, 488, 500, 505, 508, 523, 550, 566, 591, 604, 621, 623, 632, 633, 742, 747, 763, 775, 782, 783, 802, 826, 834, 849, 900, 935, 963, 999), List(1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(1, 14, 41, 54, 57, 86, 90, 101, 117, 159, 178, 198, 200, 226, 254, 261, 307, 332, 338, 339, 361, 363, 372, 382, 384, 405, 427, 441, 457, 475, 488, 500, 505, 508, 523, 550, 566, 591, 604, 621, 623, 632, 633, 742, 747, 763, 775, 782, 783, 802, 826, 834, 849, 900, 935, 963, 999), List(3.5329283670629916, 4.368625960896118, 3.54080319858375, 4.047749280398138, 7.149143715712162, 3.3004744339545926, 4.181304111712907, 3.9031838275680992, 3.921235893712925, 3.083880102993244, 3.4226578739155817, 3.6218758530794877, 3.4705922400011175, 4.397766487895154, 3.3864407754732557, 3.880859278104689, 4.2481830155135825, 7.570391447136056, 3.7493780277970057, 3.5653130034947016, 3.537941401594337, 3.9820322087411224, 2.078797005979389, 3.8622880035995153, 4.063291921323997, 3.847944049953599, 3.277504343957166, 3.445484034820284, 3.543970577258674, 10.423129357114679, 3.629946053439997, 3.4834800916490103, 3.813089036535516, 3.3184307716663834, 6.186748685255543, 8.268835326087624, 2.3862685596183986, 6.974814330839239, 3.6748005843115346, 3.5179417105473965, 4.035843407070344, 4.025841710532011, 7.274639517009041, 3.7069503107216843, 3.689942901301934, 3.105322371639268, 2.979339698252861, 3.5366611808276285, 2.947333860350577, 3.945675538446934, 2.956823002666339, 
10.873676069726747, 3.194399221934601, 3.4171409809001703, 3.800069194220581, 3.3468390373582304, 4.1011604412873295))","List(0, 1000, List(1, 14, 41, 54, 57, 86, 90, 101, 117, 159, 178, 198, 200, 226, 254, 261, 307, 332, 338, 339, 361, 363, 372, 382, 384, 405, 427, 441, 457, 475, 488, 500, 505, 508, 523, 550, 566, 591, 604, 621, 623, 632, 633, 742, 747, 763, 775, 782, 783, 802, 826, 834, 849, 900, 935, 963, 999), List(0.10263782999776734, 0.12691632609325817, 0.10286660780895193, 0.11759428987813432, 0.2076952944089479, 0.09588463129407371, 0.12147429440912383, 0.1133944072794801, 0.11391885179224029, 0.08959218213859535, 0.09943427675424582, 0.10522191209623326, 0.10082685503667162, 0.12776291004442356, 0.09838210586179547, 0.11274583955476647, 0.12341724508503499, 0.21993328752760655, 0.10892607622676201, 0.10357866107705145, 0.10278346753483093, 0.11568509248495559, 0.06039273699652004, 0.11220631111901488, 0.11804583114116617, 0.1117895938457379, 0.0952173094728421, 0.10009741718026759, 0.10295862577257843, 0.30281037933685334, 0.1054563657184386, 0.10120126996645196, 0.11077699393708221, 0.09640629472620761, 0.17973601325068903, 0.24022431996945792, 0.06932533052028433, 0.20263071683782746, 0.10675946932991931, 0.10220244106715018, 0.11724840315878118, 0.11695783614967872, 0.2113411698359809, 0.10769347585677762, 0.1071993804731857, 0.09021511804201607, 0.08655509811796429, 0.10274627485279468, 0.08562527180725377, 0.11462886678286417, 0.0859009482078595, 0.3158995597816789, 0.09280295840203716, 0.09927400123529936, 0.11039874444451495, 0.09723160518871074, 0.1191459787553944))",4
155947571132061,"Get a head start in your architectural career at the University at Buffalo through spring admissions into our Master of Architecture program. At UB, you’ll earn your professional MArch without a heavy load of student debt – as New York State’s leading public research university, UB is ranked nationally year after year for exceptional value.","List(get, a, head, start, in, your, architectural, career, at, the, university, at, buffalo, through, spring, admissions, into, our, master, of, architecture, program., at, ub,, you’ll, earn, your, professional, march, without, a, heavy, load, of, student, debt, –, as, new, york, state’s, leading, public, research, university,, ub, is, ranked, nationally, year, after, year, for, exceptional, value.)","List(get, head, start, architectural, career, university, buffalo, spring, admissions, master, architecture, program., ub,, you’ll, earn, professional, march, without, heavy, load, student, debt, –, new, york, state’s, leading, public, research, university,, ub, ranked, nationally, year, year, exceptional, value.)","List(0, 1000, List(25, 28, 64, 67, 155, 225, 258, 261, 270, 283, 344, 353, 360, 390, 412, 432, 442, 450, 464, 498, 500, 542, 547, 566, 614, 683, 740, 814, 868, 884, 899, 959, 968, 996, 999), List(2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(25, 28, 64, 67, 155, 225, 258, 261, 270, 283, 344, 353, 360, 390, 412, 432, 442, 450, 464, 498, 500, 542, 547, 566, 614, 683, 740, 814, 868, 884, 899, 959, 968, 996, 999), List(4.390762549340335, 4.220502323713004, 3.8147772390012693, 4.102545962811364, 3.5984674866811264, 3.6583268548963908, 2.9474430485365235, 3.880859278104689, 3.8253607310072972, 4.147565083400317, 3.2044439100488926, 4.257650303467759, 4.010549918447915, 3.0677407085978405, 3.1273652149329827, 3.419501637734144, 3.658549175928876, 
3.9159010097457028, 3.8407151028104476, 3.2876567154211838, 3.4834800916490103, 4.318781023781763, 3.600458579819932, 2.3862685596183986, 3.9781987301524944, 3.874354515012632, 3.3009408156187012, 4.419419487308393, 3.2720510243758527, 3.6438714307715983, 6.478566262273937, 2.4718452465817693, 4.0999496826857325, 2.990221075099903, 4.1011604412873295))","List(0, 1000, List(25, 28, 64, 67, 155, 225, 258, 261, 270, 283, 344, 353, 360, 390, 412, 432, 442, 450, 464, 498, 500, 542, 547, 566, 614, 683, 740, 814, 868, 884, 899, 959, 968, 996, 999), List(0.19531287789941462, 0.1877392470584406, 0.1696915086438597, 0.1824922322050178, 0.16006947152192424, 0.16273217654046837, 0.13111016088555763, 0.17263101472958273, 0.1701622907138747, 0.18449480326276044, 0.14254224752666989, 0.18939168869073533, 0.17840000176028858, 0.13646132299626462, 0.1391135807293543, 0.15210859123956907, 0.16274206597543195, 0.17418976480464968, 0.17084529429503204, 0.14624377597707264, 0.15495452422210873, 0.1921109468556721, 0.16015804067744926, 0.10614761663437759, 0.17696093425927018, 0.17234166544565152, 0.14683468833243812, 0.19658761525162408, 0.14554959304291903, 0.1620891605596807, 0.2881839787799232, 0.10995429686909546, 0.18237674271145676, 0.1330130420787587, 0.18243060049680998))",4
246075005477935,Lendemain de VICTOIRE Bonne fin d'année à tous nos supporters !,"List(lendemain, de, victoire, , , bonne, fin, d'année, à, tous, nos, supporters, !)","List(lendemain, de, victoire, , , bonne, fin, d'année, à, tous, nos, supporters, !)","List(0, 1000, List(38, 263, 274, 281, 372, 456, 489, 767, 831, 835, 866, 999), List(1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(38, 263, 274, 281, 372, 456, 489, 767, 831, 835, 866, 999), List(2.455722108308975, 3.5180383055728277, 4.115278322833114, 2.050513023711846, 1.385864670652926, 3.5754922444834096, 2.2577688058233236, 3.574469644946501, 3.4152215945143665, 3.619948657622194, 3.8245729158405, 4.1011604412873295))","List(0, 1000, List(38, 263, 274, 281, 372, 456, 489, 767, 831, 835, 866, 999), List(0.2167686311794343, 0.3105401646853965, 0.3632590373090758, 0.18100048855759826, 0.1223314261173622, 0.3156116716166469, 0.19929512867193303, 0.3155214059057546, 0.3014644481606708, 0.31953587614721474, 0.3375982294607308, 0.3620128401732197))",4
294990634643,"New CITROËN C3 AIRCROSS Compact SUV. 12 driving aids, great technological features and 85 personalisation combinations, gives you endless possibilities to make it the SUV to suit your needs.","List(new, citroën, c3, aircross, compact, suv., , 12, driving, aids,, great, technological, features, and, 85, personalisation, combinations,, gives, you, endless, possibilities, to, make, it, the, suv, to, suit, your, needs.)","List(new, citroën, c3, aircross, compact, suv., , 12, driving, aids,, great, technological, features, 85, personalisation, combinations,, gives, endless, possibilities, make, suv, suit, needs.)","List(0, 1000, List(25, 31, 232, 260, 352, 372, 521, 525, 531, 532, 605, 642, 651, 661, 682, 747, 755, 790, 805, 928, 970, 990, 999), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))","List(0, 1000, List(25, 31, 232, 260, 352, 372, 521, 525, 531, 532, 605, 642, 651, 661, 682, 747, 755, 790, 805, 928, 970, 990, 999), List(2.1953812746701673, 3.8648809767343577, 3.5213281005089145, 2.934586542286632, 4.227154666774108, 0.692932335326463, 4.0597986132718304, 2.660880842910326, 3.7183334061150455, 3.9390309957282654, 2.697324404197966, 3.493892455314727, 4.019920272088834, 4.183556787717141, 4.470675215250289, 3.689942901301934, 3.8696751780955942, 3.2019760787241576, 4.117738781016388, 4.214087684164783, 2.896253493969596, 3.8900656480536644, 4.1011604412873295))","List(0, 1000, List(25, 31, 232, 260, 352, 372, 521, 525, 531, 532, 605, 642, 651, 661, 682, 747, 755, 790, 805, 928, 970, 990, 999), List(0.126951374654263, 0.2234928203736513, 0.20362633503624747, 0.1696970817249774, 0.24444186621013286, 0.04006990199244529, 0.23476423923300632, 0.15386961923022285, 0.21501847664255638, 0.22778066182995169, 0.1559770254726569, 0.20203982570772455, 0.23245821144160428, 0.24192075029184038, 0.2585238248850587, 0.21337675105496134, 0.22377005260670377, 0.1851593021650283, 
0.23811472054927574, 0.24368624739167213, 0.16748041292366994, 0.2249491635979881, 0.2371560519834224))",4
330647383625289,"Do not buy the GU10 LED bulbs. They are a rip off. They tell you that the expected life span is ten years. Our bulbs started to go after two weeks. Whilst they initially replaced the first few, we then found the rest gradually going until most had stopped working. However they now say it is over the 30 day money back warranty, which is rubbish because I reported problems within 30 days. Also they break the law which says the seller is supposed to be responsible for return carriage if items are faulty. But when I tried to return them, they say you have to pay postage then they test the bulbs before repaying carriage - they can then say anything they like about the state of the bulbs. DO NOT BUY FROM THIS COMPANY","List(do, not, buy, the, gu10, led, bulbs., they, are, a, rip, off., they, tell, you, that, the, expected, life, span, is, ten, years., our, bulbs, started, to, go, after, two, weeks., whilst, they, initially, replaced, the, first, few,, we, then, found, the, rest, gradually, going, until, most, had, stopped, working., however, they, now, say, it, is, over, the, 30, day, money, back, warranty,, which, is, rubbish, because, i, reported, problems, within, 30, days., also, they, break, the, law, which, says, the, seller, is, supposed, to, be, responsible, for, return, carriage, if, items, are, faulty., , but, when, i, tried, to, return, them,, they, say, you, have, to, pay, postage, then, they, test, the, bulbs, before, repaying, carriage, -, they, can, then, say, anything, they, like, about, the, state, of, the, bulbs., do, not, buy, from, this, company)","List(buy, gu10, led, bulbs., rip, off., tell, expected, life, span, ten, years., bulbs, started, go, two, weeks., whilst, initially, replaced, first, few,, found, rest, gradually, going, stopped, working., however, say, 30, day, money, back, warranty,, rubbish, reported, problems, within, 30, days., also, break, law, says, seller, supposed, responsible, return, carriage, items, faulty., , 
tried, return, them,, say, pay, postage, test, bulbs, repaying, carriage, -, say, anything, like, state, bulbs., buy, company)","List(0, 1000, List(20, 26, 66, 77, 78, 113, 114, 118, 167, 183, 187, 192, 193, 195, 213, 219, 231, 242, 258, 301, 330, 349, 372, 378, 392, 408, 412, 419, 426, 430, 468, 477, 493, 499, 502, 525, 526, 586, 605, 657, 673, 708, 783, 789, 792, 795, 805, 810, 845, 850, 852, 868, 870, 887, 912, 955, 966, 983, 994, 995, 997, 999), List(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0))","List(0, 1000, List(20, 26, 66, 77, 78, 113, 114, 118, 167, 183, 187, 192, 193, 195, 213, 219, 231, 242, 258, 301, 330, 349, 372, 378, 392, 408, 412, 419, 426, 430, 468, 477, 493, 499, 502, 525, 526, 586, 605, 657, 673, 708, 783, 789, 792, 795, 805, 810, 845, 850, 852, 868, 870, 887, 912, 955, 966, 983, 994, 995, 997, 999), List(4.188266224696665, 3.9920685780166445, 3.6083570991096887, 3.0915439384811654, 4.128886379425773, 3.3644121616123828, 3.6702909938055592, 5.872193744298444, 3.8819706980579403, 3.0531312003027793, 4.103239444115162, 3.6366669459473364, 7.236904582749241, 3.9504275521412717, 7.011880444690689, 3.7453685454801544, 3.4401081594954825, 3.8236545818960006, 2.9474430485365235, 3.588882083012448, 2.8402225035965354, 3.605822778851446, 0.692932335326463, 3.858343225308499, 4.013240857942701, 3.3237459243960514, 3.1273652149329827, 7.689449462555616, 3.9892791759290662, 3.0028943530727377, 2.9238643037135876, 2.6612908638271366, 2.9813709577675054, 2.2295712966342998, 3.689369473910798, 2.660880842910326, 3.2153039379961323, 3.3615994529413507, 2.697324404197966, 3.657882361056424, 3.9495348277850035, 4.260690743633027, 2.947333860350577, 3.8965327027873364, 5.400012910464119, 
4.329814126507282, 4.117738781016388, 3.7686767645363135, 4.014192332800399, 3.7031070636921615, 9.929295618526162, 3.2720510243758527, 3.938883839521687, 3.9829544344443577, 8.42933827174444, 3.2629528752228727, 3.150439856099582, 4.068136163654429, 3.8822487461620536, 3.747675071540662, 3.8976616896416605, 8.202320882574659))","List(0, 1000, List(20, 26, 66, 77, 78, 113, 114, 118, 167, 183, 187, 192, 193, 195, 213, 219, 231, 242, 258, 301, 330, 349, 372, 378, 392, 408, 412, 419, 426, 430, 468, 477, 493, 499, 502, 525, 526, 586, 605, 657, 673, 708, 783, 789, 792, 795, 805, 810, 845, 850, 852, 868, 870, 887, 912, 955, 966, 983, 994, 995, 997, 999), List(0.12388911939816451, 0.1180855834309779, 0.10673537915207176, 0.0914480206298023, 0.12213266067612066, 0.09951947599148164, 0.10856738678079546, 0.17369977763751965, 0.1148288827668083, 0.09031176995917137, 0.12137402307754647, 0.10757278580361936, 0.21406799086423378, 0.11685383985266472, 0.2074117658747576, 0.11078818442462465, 0.10175856730435102, 0.11310388920373865, 0.08718550927021783, 0.10615930722514286, 0.0840139203163737, 0.10666041383510713, 0.02049697230799281, 0.11412998096938391, 0.11871186050482377, 0.0983165667092922, 0.09250761573604588, 0.22745429050219926, 0.11800307278104841, 0.08882576156553809, 0.08648804884723836, 0.07872111367653171, 0.0881890984816347, 0.06595082780236884, 0.10913172915372199, 0.0787089852377952, 0.0951088474574324, 0.09943627593167575, 0.07978698755986241, 0.10820033881828517, 0.1168274330226234, 0.1260314402040754, 0.08718227947834696, 0.11525962757762259, 0.15973264552077787, 0.12807611324410734, 0.12180291417094402, 0.11147764268221577, 0.11874000517756561, 0.10953808773012544, 0.29370904914535406, 0.09678742904294378, 0.11651237626981202, 0.11781598661915613, 0.24934023759946114, 0.09651830534682594, 0.09319016474826051, 0.1203356663277718, 0.11483710744330232, 0.11085641157809933, 0.11529302306389076, 0.24262505214480407))",4


After experimenting with the data and the input parameters of the learning algorithms, you may notice that the posts get clustered largely by language (with some errors, of course: a French post can still end up in a mostly English cluster). Look at a sample of posts from each cluster to identify which clusters contain English posts, and save those posts to a table. You can use this table as input to the next notebook, where you will do LDA.

In [26]:
# Generate a random 20-letter table name so that tables saved by different users do not collide
table_name = ''.join([random.choice('abcdefghijklmnoprstuvwxy') for _ in range(20)])

(
  predictions
  .select('page_id', 'message')
  .filter(col('prediction').isin([4])) # list here the IDs of all clusters that contain English posts
  .repartition(32)
  .write
  .mode('overwrite')
  .saveAsTable(table_name)
)

print(table_name)
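If you want the saved table to be easier to recognize in the next notebook, the random name can be given a readable prefix. The helper below is a minimal sketch; `random_table_name` and the `english_posts_` prefix are hypothetical names chosen for illustration, not part of the original notebook.

```python
import random
import string


def random_table_name(prefix="english_posts_", length=20):
    """Build `prefix` followed by `length` random lowercase letters."""
    suffix = "".join(random.choice(string.ascii_lowercase) for _ in range(length))
    return prefix + suffix


name = random_table_name()
print(name)
```

The result can be passed to `saveAsTable` in place of `table_name`.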