# Look at uses of a target word over time

In [51]:
from __future__ import print_function
import time
import numpy as np
import pandas as pd
import pyarrow
import fastparquet
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE
%matplotlib inline
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D
import seaborn as sns
import csv
import textwrap
from scipy.spatial.distance import cosine
import spacy
from collections import defaultdict 
from tqdm import tqdm

pd.set_option('display.max_colwidth', 500)

Load the token file for the target word

In [59]:
target = "meaning"

In [60]:
tokens = pd.read_csv('./data/logic_words/{}.csv'.format(target))

In [61]:
len(tokens)

83397

In [63]:
parquet_file = "/Volumes/data_gabriella_chronis/corpora/acl-publication-info.74k.parquet"

df = pd.read_parquet(parquet_file, engine='pyarrow')

Left-join the large metadata file onto the token file. Open questions: would a constant-time lookup be better, or would it be enough to pull in only the year column?

In [65]:
data = tokens.join(df.set_index("corpus_paper_id"), on="corpus_id")
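
One way to settle the question in the note above is to join only the year column instead of the whole metadata table. The sketch below uses small made-up frames: the column names `corpus_id`, `corpus_paper_id`, and `year` come from the cells above, while `tokens_demo`, `papers_demo`, and all of their values are hypothetical.

```python
import pandas as pd

# Hypothetical stand-ins for the token file and the paper metadata.
tokens_demo = pd.DataFrame({
    "corpus_id": [1, 2, 2, 9],
    "sentence": ["a", "b", "c", "d"],
})
papers_demo = pd.DataFrame({
    "corpus_paper_id": [1, 2, 3],
    "year": [1957, 1964, 1999],
    "title": ["t1", "t2", "t3"],
})

# Keep only the column needed for the lookup, then left-join on the id.
years = papers_demo.set_index("corpus_paper_id")[["year"]]
joined = tokens_demo.join(years, on="corpus_id")

# Token rows whose corpus_id has no metadata get NaN for year.
n_missing = int(joined["year"].isna().sum())
```

Checking `n_missing` before going further shows how many token rows would lack a year after the join.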

Add a decade column 

In [67]:
data["year"] = data["year"].astype(int)

In [68]:
data["decade"] = (data["year"] // 10) * 10
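
The integer cast above assumes every token row found a year in the join; `astype(int)` raises an error if any year is missing. A minimal sketch of one way to guard against that, on a hypothetical frame `demo` with made-up years, is to drop the unmatched rows first:

```python
import pandas as pd

# Hypothetical frame with one missing year, as a left join could produce.
demo = pd.DataFrame({"year": [1957.0, 1964.0, None, 2021.0]})

# Drop rows without a year, then cast and floor to the start of the decade.
demo = demo.dropna(subset=["year"]).copy()
demo["year"] = demo["year"].astype(int)
demo["decade"] = (demo["year"] // 10) * 10
decades = demo["decade"].tolist()
```

Dropping is only one option; whether unmatched rows should be dropped or inspected depends on how many there are.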

### Look at 5 example sentences from each decade

In [71]:
#df.style.set_properties(subset=['sentence'], **{'width': '300px'})
pd.set_option('display.max_rows', 1000)


data.groupby('decade').sample(5)[['decade', 'sentence']]

Unnamed: 0,decade,sentence
51784,1950,This rank be 1 for the first meaning of any word; the rank be 2 for each recurrence of that word with a second meaning; the rank or weighting coefficient be 3 for each occurrence of a third meaning; etc.
51773,1950,"A warning to the reader may be helpful here before describing the three rules for a single word order, meaning, and form."
52445,1950,"In particular, many highly interesting cases of parallel development in the evolution of the expression of meaning amongst structurally unrelated languages do exist."
70698,1950,"Thus, the meaning of the conjunction ""i"", which has at least four equivalents (and, but, also, even) can be pinpointed in more than 90% of all occurrences by simple reference to the grammatical category of contiguous words; the pronoun-adjective ""ikh"", meaning ""(of) their"" or ""(of) them"", can be similarly resolved."
70849,1950,"Verbs are divided, then, into stems and endings, and the following results cover each of the Four Regular Conjugations: If now the French verb stems and their English translations are entered into the machine's dictionary together with an instruction that each word fed in for translation is to be identified with the stem in the dictionary which it most closely resembles, we will be supplied with the basic meaning of every verb in the passage to be translated."
59093,1960,"2 or 3 , iff i can take on meaning m, which occurs iff p(m,l) ~ O. (I.e., we require that if a word ca..~n take on a meaning, there is a non-zero probability that it wil_.~l do so. )"
27641,1960,The word bug takes on a different meaning in each sentence.
79492,1960,"Explanation of the diagram Each connection line is marked by an arrow symbol, meaning regular bus or train line taxi C oling charter The mode of communication is indicated by T for (underground) train, Ship for ship, Plane for plane; unmarked represents bus or car."
70744,1960,"Abstractly, a transformation is defined as a pair of dependency types, linking different grammatic types, but equivalent in meaning."
70424,1960,"If, in the source language, it is necessary to consider a group of words, rather than individual words in order to extract meaning, a human being can do this automatically, even if the words of the group are not contiguous."


In [75]:
save = data.groupby('decade').sample(5)[['decade', 'sentence']]
save

Unnamed: 0,decade,sentence
52444,1950,"Prof. Reifler has been investigating languages in order to discover such patterns of verbally conveying meaning, underlying the actual words and syntax of a language, as are common to all languages."
70936,1950,"The augmented word ""le"" has the unique meaning ""it"". ("
70975,1950,Many multiple-meaning problems will be solved at the same time.
70315,1950,"With these facilities in mind, we may now examine some of the procedures that can be mechanized to allow the machine to guess at a sequence of words which constitute its best estimate of the meaning of the sentence in the foreign language."
69708,1950,"If (M) [B] is 0, we consider successively meanings M-l, M-2,….., as above, and assign finally to all three words the highest numbered meaning which is non-blank for all."
27658,1960,"If we are not ready to understand the exact meaning of codes and rules and to prepare the right program for the representation chosen, the rules established on the basis of ad hoc definitions will result in a chaos."
70438,1960,"В might have been translated as ""(to) turn into"", and F as ""(to) point to""; in both cases, then, the basic meaning of the verb would have been retained, and in B, the preposition would have had its basic meaning ""into"" which holds in the presence of a verb of motion, whereas in F, the preposition would have to assume a secondary meaning, ""to"", rather than ""on""."
52622,1960,"But we, when we read the sentence, nevertheless represent to ourselves the garden together with the house, not together with the family, and so, contrary to the usual way of taking the meaning of the order of the words, we jump over ""for a large family"" and constitute the correlation ""a house with garden""."
52563,1960,"The strings ,  are defined on a vocabulary V = V N UV T , where V N is the non-terminal vocabulary which includes an initial symbol S meaning sentence."
51592,1960,An adaptation of the form-meaning method to the special circumstances of computer use is the 'cyclical' method with posteditors [11] .
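
The two cells above return different sentences because each call draws a fresh random sample. If the notes need to be traced back to a fixed set of examples, a seed can be passed to `sample`. A minimal sketch on a hypothetical frame `demo` (the seed value `0` is arbitrary):

```python
import pandas as pd

# Hypothetical data: four sentences in each of two decades.
demo = pd.DataFrame({
    "decade": [1950] * 4 + [1960] * 4,
    "sentence": ["s{}".format(i) for i in range(8)],
})

# A fixed random_state makes the per-decade sample repeatable across runs.
first = demo.groupby("decade").sample(n=2, random_state=0)[["decade", "sentence"]]
second = demo.groupby("decade").sample(n=2, random_state=0)[["decade", "sentence"]]

same = first.equals(second)
counts = first["decade"].value_counts().to_dict()
```

Note that `sample(n=...)` raises an error when a decade has fewer than `n` rows, so sparse decades may need a smaller `n`.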


## ACL Meaning

| decade | notes |
|--------|-------|
| 1950 | patterns of verbally conveying meaning, "le" has the unique meaning "it", the meaning of the sentence (in the foreign language) |
| 1960 | the exact meaning of codes and rules, the basic meaning of the verb (translation), an initial symbol S meaning sentence |
| 1970 | lexical meaning of the respective lexical unit (translation), conceptual graphs...serve as a representation of meaning in natural language, the precise meaning of the word |
| 1980 | one knows the communicative meaning of a language expression without knowing its meaning proper, t translates the words, not the meaning, the constraint carries meaning |
| 1990 | precision has less meaning the more the test corpus has undergone editing (meaning in a non metametalinguistic sense!), a morpheme having meaning in Persian, the meaning of rule definitions |
| 2000 | The first meaning might be intended when Mary... (ambiguity), the original meaning of the input sentence (translation), Neighborhoods of meaning points assigned to words |
| 2010 | meaning that GBS can be parallelized (gloss) |
| 2020 | derived meaning (vs base meaning), its [enclitics] meaning differs based on the context, the 'only cat' meaning (ambiguity), embeddings can capture the semantic meaning of words, system of forms and form-meaning mappings |

### COCA Meaning

| sense | examples |
|--|--|
| providing a gloss | meaning he flip flops a lot, meaning they were unable to stop him, meaning that you have permission, meaning that students want to be in school |
| important or worthwhile quality | i derive meaning from many aspects of my life, It had little meaning in the scheme of life. |
| implied or implicit significance | the meaning of poverty |
| what is meant by a text, action, etc. | the more a word is used the more it loses its meaning, the inner meaning of the words on the printed pages |

    

NLP strictly uses the "what texts mean" sense, unsurprisingly. But are there further divisions within that sense?

The gloss sense is conspicuously missing, which makes sense because glossing is a metalinguistic speech act that constructs language, and NLP does not do too much of that.
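
How rare the gloss sense is per decade could be estimated with a rough pattern match instead of by eye. The sketch below is an assumption, not the method used in this notebook: the regular expression `GLOSS` is a hypothetical heuristic that flags "meaning that ..." and ", meaning ..." constructions, and `demo` holds four short excerpts taken from the sentences and notes above.

```python
import re
import pandas as pd

# Hypothetical heuristic: "meaning that ..." or ", meaning <word>" introducing a gloss.
# It will miss glosses without a comma and can over-match, so treat counts as rough.
GLOSS = re.compile(r"(?:,\s*meaning\s+\S|\bmeaning\s+that\b)", re.IGNORECASE)

demo = pd.DataFrame({
    "decade": [1950, 1950, 1960, 2010],
    "sentence": [
        'the pronoun-adjective "ikh", meaning "(of) their"',
        "the three rules for a single word order, meaning, and form",
        "The word bug takes on a different meaning in each sentence.",
        "meaning that GBS can be parallelized",
    ],
})

# Flag candidate gloss uses, then take the share of flagged sentences per decade.
demo["gloss"] = demo["sentence"].str.contains(GLOSS)
gloss_share = demo.groupby("decade")["gloss"].mean().to_dict()
```

Applied to the full `data` frame, the same two lines would give a per-decade share that could be compared against the manual notes, with flagged sentences still checked by hand.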