# Stuart Russell and Peter Norvig: _Artificial Intelligence: A Modern Approach_ (Third Edition)

**22.1** This exercise explores the quality of the n-gram model of language. Find or create a monolingual corpus of 100,000 words or more. Segment it into words, and compute the frequency of each word. How many distinct words are there? Also count frequencies of bigrams (two consecutive words) and trigrams (three consecutive words). Now use those frequencies to generate language: from the unigram, bigram, and trigram models, in turn, generate a 100-word text by making random choices according to the frequency counts. Compare the three generated texts with actual language.

## Učitavanje korpusa Gutenberg

In [283]:
import urllib
from urllib import request

url = "http://www.gutenberg.org/files/2554/2554-0.txt"
response = request.urlopen(url)
raw = response.read().decode('utf8')
print(type(raw),"\n Karaktera:",len(raw),"\n Prvih 75 karaktera:", raw[:75])

<class 'str'> 
 Karaktera: 1176967 
 Prvih 75 karaktera: ﻿The Project Gutenberg EBook of Crime and Punishment, by Fyodor Dostoevsky


## Rastavljanje na reči (tokenizacija)

In [298]:
import nltk
from nltk.tokenize import word_tokenize
tokens = word_tokenize(raw)
print(type(tokens),'\nBroj tokena:', len(tokens), '\nPrvih deset tokena:', tokens[:10])

<class 'list'> 
Broj tokena: 257727 
Prvih deset tokena: ['\ufeffThe', 'Project', 'Gutenberg', 'EBook', 'of', 'Crime', 'and', 'Punishment', ',', 'by']


## Frekvencije reči

In [299]:
from nltk import FreqDist
fdist1 = FreqDist(tokens)
print(fdist1)
filtered_word_freq = dict((word, freq) for word, freq in fdist1.items() if not word.isdigit())
print(filtered_word_freq)

<FreqDist with 11539 samples and 257727 outcomes>


## Jedinstvene reči

In [301]:
V = set(tokens)
print(len(V))

11539


## Frekvencije bigrama i trigrama

* Bigrami

In [306]:
bigrams = nltk.bigrams(tokens)
bigrams_freq = nltk.FreqDist(bigrams)

for a, b in bigrams_freq.items():
    print(a, b)

('\ufeffThe', 'Project') 1
('Project', 'Gutenberg') 27
('Gutenberg', 'EBook') 1
('EBook', 'of') 1
('of', 'Crime') 1
('Crime', 'and') 3
('and', 'Punishment') 3
('Punishment', ',') 2
(',', 'by') 44
('by', 'Fyodor') 2
('Fyodor', 'Dostoevsky') 4
('Dostoevsky', 'This') 1
('This', 'eBook') 2
('eBook', 'is') 2
('is', 'for') 5
('for', 'the') 238
('the', 'use') 12
('use', 'of') 20
('of', 'anyone') 5
('anyone', 'anywhere') 2
('anywhere', 'at') 2
('at', 'no') 5
('no', 'cost') 2
('cost', 'and') 2
('and', 'with') 48
('with', 'almost') 2
('almost', 'no') 2
('no', 'restrictions') 2
('restrictions', 'whatsoever') 2
('whatsoever', '.') 2
('.', 'You') 219
('You', 'may') 21
('may', 'copy') 2
('copy', 'it') 4
('it', ',') 305
(',', 'give') 6
('give', 'it') 11
('it', 'away') 5
('away', 'or') 2
('or', 're-use') 2
('re-use', 'it') 2
('it', 'under') 6
('under', 'the') 43
('the', 'terms') 13
('terms', 'of') 16
('of', 'the') 619
('the', 'Project') 26
('Gutenberg', 'License') 2
('License', 'included') 2
('include

(',', 'lived') 1
('lived', 'on') 2
('the', 'floor') 20
('floor', 'below') 2
('below', ',') 5
('and', 'every') 3
('every', 'time') 5
('time', 'he') 25
('he', 'went') 78
('went', 'out') 67
('out', 'he') 2
('was', 'obliged') 2
('obliged', 'to') 7
('to', 'pass') 15
('pass', 'her') 1
('her', 'kitchen') 2
('kitchen', ',') 6
(',', 'the') 316
('the', 'door') 171
('door', 'of') 10
('of', 'which') 23
('which', 'invariably') 1
('invariably', 'stood') 1
('stood', 'open') 4
('open', '.') 8
('.', 'And') 350
('And', 'each') 1
('each', 'time') 4
('he', 'passed') 11
('passed', ',') 3
('the', 'young') 27
('man', 'had') 3
('had', 'a') 76
('a', 'sick') 4
('sick', ',') 6
(',', 'frightened') 7
('frightened', 'feeling') 1
('feeling', ',') 5
(',', 'which') 87
('which', 'made') 6
('made', 'him') 19
('him', 'scowl') 1
('scowl', 'and') 1
('and', 'feel') 2
('feel', 'ashamed') 1
('ashamed', '.') 4
('was', 'hopelessly') 3
('hopelessly', 'in') 1
('in', 'debt') 3
('debt', 'to') 3
('landlady', ',') 16
('was', 'afraid'

('tremulously', 'at') 1
('at', 'his') 52
('his', 'hat') 16
('hat', '.') 5
('a', 'tall') 4
('tall', 'round') 1
('round', 'hat') 2
('hat', 'from') 1
('from', 'Zimmerman') 1
('Zimmerman', '’') 1
('s', ',') 32
('but', 'completely') 1
('completely', 'worn') 1
('worn', 'out') 7
('out', ',') 64
(',', 'rusty') 1
('rusty', 'with') 1
('with', 'age') 2
('age', ',') 3
('all', 'torn') 1
('torn', 'and') 3
('and', 'bespattered') 1
('bespattered', ',') 1
(',', 'brimless') 1
('brimless', 'and') 1
('and', 'bent') 1
('bent', 'on') 1
('on', 'one') 18
('one', 'side') 14
('side', 'in') 2
('a', 'most') 11
('most', 'unseemly') 2
('unseemly', 'fashion') 1
('fashion', '.') 2
('.', 'Not') 13
('Not', 'shame') 1
('shame', ',') 2
('but', 'quite') 2
('quite', 'another') 2
('another', 'feeling') 3
('feeling', 'akin') 1
('akin', 'to') 1
('to', 'terror') 1
('terror', 'had') 1
('had', 'overtaken') 1
('overtaken', 'him') 1
('I', 'knew') 28
('knew', 'it') 10
('he', 'muttered') 25
('muttered', 'in') 8
('in', 'confusion') 5

('spiteful', 'old') 1
('old', 'widows') 1
('widows', 'that') 1
('that', 'one') 15
('one', 'finds') 1
('finds', 'such') 1
('such', 'cleanliness') 1
('cleanliness', ',') 1
('Raskolnikov', 'thought') 11
('thought', 'again') 2
('again', ',') 79
('he', 'stole') 2
('stole', 'a') 5
('a', 'curious') 1
('curious', 'glance') 1
('glance', 'at') 8
('the', 'cotton') 1
('cotton', 'curtain') 1
('curtain', 'over') 1
('over', 'the') 27
('door', 'leading') 4
('leading', 'into') 1
('into', 'another') 5
('another', 'tiny') 1
('tiny', 'room') 2
('which', 'stood') 3
('stood', 'the') 2
('s', 'bed') 2
('bed', 'and') 8
('and', 'chest') 1
('chest', 'of') 9
('of', 'drawers') 9
('drawers', 'and') 1
('and', 'into') 1
('had', 'never') 28
('never', 'looked') 1
('looked', 'before') 1
('before', '.') 30
('.', 'These') 10
('These', 'two') 2
('rooms', 'made') 1
('made', 'up') 14
('“', 'What') 232
('What', 'do') 55
('do', 'you') 191
('you', 'want') 47
('want', '?') 21
('said', 'severely') 1
('severely', ',') 4
(',', 'com

('Walking', 'along') 1
('the', 'crowded') 1
('crowded', 'row') 1
('row', 'He') 1
('He', 'met') 4
('met', 'the') 3
('the', 'one') 8
('one', 'he') 2
('he', 'used') 7
('used', 'to') 59
('to', 'know.') 2
('know.', '”') 6
('no', 'one') 64
('one', 'shared') 1
('shared', 'his') 1
('his', 'enjoyment') 1
('enjoyment', ':') 1
(':', 'his') 7
('his', 'silent') 1
('silent', 'companion') 1
('companion', 'looked') 1
('looked', 'with') 8
('with', 'positive') 3
('positive', 'hostility') 1
('hostility', 'and') 1
('and', 'mistrust') 1
('mistrust', 'at') 1
('these', 'manifestations') 1
('manifestations', '.') 1
('was', 'another') 8
('another', 'man') 8
('man', 'in') 17
('room', 'who') 1
('who', 'looked') 4
('looked', 'somewhat') 1
('somewhat', 'like') 2
('a', 'retired') 2
('retired', 'government') 1
('government', 'clerk') 5
('clerk', '.') 8
('was', 'sitting') 19
('sitting', 'apart') 1
('apart', ',') 3
(',', 'now') 31
('then', 'sipping') 1
('sipping', 'from') 1
('his', 'pot') 1
('pot', 'and') 1
('looking'

('retain', 'your') 1
('your', 'innate') 1
('innate', 'nobility') 1
('nobility', 'of') 2
('of', 'soul') 1
('soul', ',') 9
('but', 'in') 18
('in', 'beggary') 2
('beggary', '--') 1
('--', 'never') 1
('never', '--') 2
('one', '.') 26
('For', 'beggary') 1
('beggary', 'a') 1
('man', 'is') 17
('not', 'chased') 1
('chased', 'out') 1
('of', 'human') 1
('human', 'society') 1
('society', 'with') 1
('a', 'stick') 2
('stick', ',') 2
('he', 'is') 130
('is', 'swept') 1
('swept', 'out') 1
('out', 'with') 10
('a', 'broom') 2
('broom', ',') 1
('to', 'make') 79
('make', 'it') 11
('as', 'humiliating') 1
('humiliating', 'as') 1
('possible', ';') 2
('and', 'quite') 10
('quite', 'right') 5
(',', 'forasmuch') 1
('forasmuch', 'as') 1
('as', 'in') 11
('beggary', 'I') 1
('am', 'ready') 11
('ready', 'to') 31
('first', 'to') 5
('to', 'humiliate') 1
('humiliate', 'myself') 1
('myself', '.') 11
('.', 'Hence') 2
('Hence', 'the') 1
('the', 'pot-house') 1
('pot-house', '!') 1
('!', 'Honoured') 2
('ago', 'Mr.') 1
('Mr.'

('more', 'or') 6
('or', 'less') 5
('less', 'in') 2
('the', 'order') 1
('order', 'of') 2
('of', 'things') 9
('things', ',') 19
('her', 'stockings') 3
('stockings', ',') 1
(',', 'her') 35
('stockings', 'I') 1
('sold', 'for') 2
('drink', '!') 4
('!', 'Her') 3
('Her', 'mohair') 1
('mohair', 'shawl') 1
('shawl', 'I') 1
('I', 'sold') 2
('drink', ',') 13
('a', 'present') 10
('present', 'to') 3
('her', 'long') 2
('long', 'ago') 21
('her', 'own') 20
('own', 'property') 1
('property', ',') 6
('not', 'mine') 1
('mine', ';') 2
('and', 'we') 23
('we', 'live') 4
('live', 'in') 9
('a', 'cold') 9
('cold', 'room') 1
('she', 'caught') 1
('caught', 'cold') 1
('cold', 'this') 1
('this', 'winter') 1
('winter', 'and') 1
('has', 'begun') 5
('begun', 'coughing') 1
('coughing', 'and') 1
('and', 'spitting') 1
('spitting', 'blood') 2
('blood', 'too') 1
('.', 'We') 68
('We', 'have') 11
('have', 'three') 1
('three', 'little') 2
('little', 'children') 7
('children', 'and') 8
('and', 'Katerina') 10
('Ivanovna', 'is'

('of', 'Persia') 1
('Persia', '.') 1
('Since', 'she') 2
('has', 'attained') 1
('attained', 'years') 1
('of', 'maturity') 1
('maturity', ',') 1
('has', 'read') 1
('read', 'other') 1
('other', 'books') 1
('of', 'romantic') 1
('romantic', 'tendency') 1
('tendency', 'and') 1
('late', 'she') 3
('had', 'read') 3
('read', 'with') 1
('with', 'great') 9
('great', 'interest') 3
('interest', 'a') 1
('a', 'book') 5
('book', 'she') 1
('she', 'got') 8
('got', 'through') 1
('through', 'Mr.') 2
('Lebeziatnikov', ',') 18
(',', 'Lewes') 1
('Lewes', '’') 1
('’', 'Physiology') 1
('Physiology', '--') 1
('--', 'do') 1
('know', 'it') 24
('?', '--') 5
('even', 'recounted') 1
('recounted', 'extracts') 1
('extracts', 'from') 1
('to', 'us') 12
('us', ':') 2
('whole', 'of') 8
('her', 'education') 1
('And', 'now') 13
('now', 'may') 1
('may', 'I') 6
('venture', 'to') 11
('to', 'address') 3
('address', 'you') 1
('on', 'my') 40
('own', 'account') 4
('account', 'with') 1
('a', 'private') 2
('private', 'question') 1
('

('maybe', 'when') 1
('when', 'no') 1
('can', 'see.') 1
('see.', '’') 1
('’', 'Do') 2
('hear', ',') 12
('hear', '?') 14
('?', 'I') 118
('a', 'nap') 1
('nap', 'after') 1
('after', 'dinner') 5
('dinner', 'and') 4
('you', 'think') 40
('think', ':') 1
(':', 'though') 1
('had', 'quarrelled') 1
('quarrelled', 'to') 1
('last', 'degree') 2
('degree', 'with') 1
('with', 'our') 1
('landlady', 'Amalia') 1
('Fyodorovna', 'only') 1
('a', 'week') 8
('week', 'before') 4
('she', 'could') 27
('not', 'resist') 6
('resist', 'then') 1
('then', 'asking') 1
('asking', 'her') 2
('in', 'to') 9
('to', 'coffee') 1
('coffee', '.') 3
('For', 'two') 1
('two', 'hours') 4
('hours', 'they') 1
('were', 'sitting') 6
('sitting', ',') 2
(',', 'whispering') 2
('whispering', 'together') 1
('together', '.') 9
('service', 'again') 1
('and', 'receiving') 1
('receiving', 'a') 2
('went', 'himself') 1
('excellency', 'and') 1
('excellency', 'himself') 1
('himself', 'came') 1
('made', 'all') 2
('others', 'wait') 1
('wait', 'and') 3

('plunged', 'in') 4
('in', 'deep') 2
('His', 'words') 1
('words', 'had') 3
('had', 'created') 1
('created', 'a') 1
('certain', 'impression') 1
('impression', ';') 1
('moment', 'of') 8
('of', 'silence') 1
('silence', ';') 1
('but', 'soon') 1
('soon', 'laughter') 1
('and', 'oaths') 1
('oaths', 'were') 1
('heard', 'again') 1
('s', 'his') 4
('his', 'notion') 1
('notion', '!') 1
('“', 'Talked') 2
('Talked', 'himself') 1
('himself', 'silly') 2
('silly', '!') 2
('A', 'fine') 3
('fine', 'clerk') 1
('clerk', 'he') 1
('And', 'so') 14
('so', 'on') 21
('on', '.') 36
('“', 'Let') 21
('Let', 'us') 13
('us', 'go') 14
('”', 'said') 135
('said', 'Marmeladov') 1
('Marmeladov', 'all') 1
('and', 'addressing') 1
('addressing', 'Raskolnikov') 7
('Raskolnikov', '--') 3
('“', 'come') 5
('come', 'along') 5
('along', 'with') 4
('with', 'me') 41
('...', 'Kozel') 1
('Kozel', '’') 3
('house', ',') 28
('looking', 'into') 5
('the', 'yard') 16
('yard', '.') 6
('m', 'going') 4
('--', 'time') 1
('time', 'I') 7
('I', 'd

('gowns', 'flung') 1
('flung', 'open') 3
('in', 'costumes') 1
('costumes', 'of') 1
('of', 'unseemly') 1
('unseemly', 'scantiness') 1
('scantiness', ',') 1
('some', 'of') 19
('them', 'with') 13
('with', 'cards') 1
('cards', 'in') 2
('hands', '.') 29
('were', 'particularly') 3
('particularly', 'diverted') 1
('diverted', ',') 1
('when', 'Marmeladov') 3
(',', 'dragged') 1
('dragged', 'about') 1
('about', 'by') 1
(',', 'shouted') 3
('shouted', 'that') 2
('that', 'it') 108
('They', 'even') 4
('even', 'began') 3
('room', ';') 10
(';', 'at') 7
('last', 'a') 4
('a', 'sinister') 2
('sinister', 'shrill') 1
('shrill', 'outcry') 1
('outcry', 'was') 1
('was', 'heard') 2
('heard', ':') 1
(':', 'this') 3
('this', 'came') 1
('from', 'Amalia') 3
('Lippevechsel', 'herself') 1
('herself', 'pushing') 1
('pushing', 'her') 1
('her', 'way') 7
('way', 'amongst') 1
('amongst', 'them') 2
('and', 'trying') 3
('to', 'restore') 4
('restore', 'order') 2
('order', 'after') 1
('after', 'her') 7
('own', 'fashion') 1
('

('are', 'our') 3
('our', 'all') 1
(',', 'our') 10
('our', 'one') 5
('one', 'hope') 2
('hope', ',') 6
('one', 'stay') 1
('stay', '.') 2
('a', 'grief') 1
('grief', 'it') 1
('was', 'to') 12
('me', 'when') 5
('heard', 'that') 11
('you', 'had') 28
('the', 'university') 16
('university', 'some') 1
('some', 'months') 1
('months', 'ago') 9
('for', 'want') 2
('of', 'means') 2
('keep', 'yourself') 1
('yourself', 'and') 4
('lost', 'your') 3
('your', 'lessons') 2
('lessons', 'and') 4
('your', 'other') 2
('other', 'work') 3
('work', '!') 1
('!', 'How') 22
('How', 'could') 18
('could', 'I') 22
('I', 'help') 4
('help', 'you') 2
('you', 'out') 6
('my', 'hundred') 1
('and', 'twenty') 4
('twenty', 'roubles') 6
('roubles', 'a') 2
('year', 'pension') 1
('pension', '?') 2
('?', 'The') 22
('The', 'fifteen') 1
('fifteen', 'roubles') 5
('I', 'sent') 4
('sent', 'you') 6
('you', 'four') 1
('four', 'months') 4
('ago', 'I') 2
('I', 'borrowed') 1
('borrowed', ',') 1
('on', 'security') 1
('security', 'of') 2
('my',

('the', 'nobility') 1
('feelings', 'and') 4
('her', 'behavior') 1
('behavior', '.') 1
('What', 'was') 11
('showed', 'and') 1
('and', 'read') 7
('read', 'to') 7
('to', 'everyone') 3
('everyone', 'the') 1
('letter', 'in') 3
('in', 'Dounia') 7
('s', 'own') 3
('own', 'handwriting') 1
('handwriting', 'to') 1
('to', 'Mr.') 2
('Svidrigaïlov', 'and') 5
('even', 'allowed') 1
('allowed', 'them') 1
('them', 'to') 21
('take', 'copies') 1
('copies', 'of') 8
('must', 'say') 2
('say', 'I') 7
('I', 'think') 36
('think', 'was') 1
('was', 'superfluous') 1
('superfluous', '.') 1
('In', 'this') 5
('this', 'way') 9
('way', 'she') 1
('was', 'busy') 4
('busy', 'for') 1
('for', 'several') 1
('several', 'days') 1
('days', 'in') 4
('in', 'driving') 2
('driving', 'about') 1
('about', 'the') 66
('whole', 'town') 1
('because', 'some') 1
('some', 'people') 3
('people', 'had') 3
('taken', 'offence') 1
('offence', 'through') 1
('through', 'precedence') 1
('precedence', 'having') 1
('having', 'been') 3
('been', 'given

('wasting', 'words') 1
('words', 'offer') 1
('offer', 'to') 1
('it', 'of') 9
(',', '(') 4
('could', 'refuse') 1
('refuse', 'Dounia') 1
('Dounia', 'that') 2
('that', ')') 3
(')', 'the') 1
('more', 'readily') 1
('readily', 'since') 1
('may', 'by') 1
('your', 'own') 22
('own', 'efforts') 1
('efforts', 'become') 1
('become', 'his') 3
('his', 'right') 11
('right', 'hand') 15
('hand', 'in') 6
('and', 'receive') 1
('receive', 'this') 1
('this', 'assistance') 1
('assistance', 'not') 1
('not', 'as') 3
('a', 'charity') 1
('charity', ',') 2
('salary', 'earned') 1
('earned', 'by') 1
('own', 'work') 2
('Dounia', 'wants') 1
('to', 'arrange') 2
('all', 'like') 3
('this', 'and') 7
('I', 'quite') 2
('quite', 'agree') 1
('agree', 'with') 10
('our', 'plans') 1
('plans', 'for') 2
('another', 'reason') 1
('I', 'particularly') 3
('particularly', 'wanted') 2
('wanted', 'you') 2
('feel', 'on') 1
('on', 'an') 6
('an', 'equal') 2
('equal', 'footing') 1
('footing', 'when') 1
('you', 'first') 2
('first', 'meet') 

('sacking', '(') 1
('(', 'I') 10
('been', 'driven') 1
('driven', 'in') 3
('it', ')') 6
('versts', 'and') 1
('they', 'can') 5
('can', '‘') 1
('‘', 'travel') 1
('travel', 'very') 1
('very', 'comfortably') 1
('comfortably', ',') 1
(',', 'third') 1
('class', ',') 3
('’', 'for') 3
('a', 'thousand') 14
('thousand', 'versts') 1
('versts', '!') 1
('!', 'Quite') 3
('Quite', 'right') 2
('One', 'must') 1
('must', 'cut') 1
('cut', 'one') 1
('one', '’') 26
('s', 'coat') 2
('coat', 'according') 1
('according', 'to') 8
('to', 'one') 8
('s', 'cloth') 1
('cloth', ',') 1
('what', 'about') 6
('?', 'She') 22
('is', 'your') 12
('your', 'bride') 1
('bride', '...') 1
('you', 'must') 39
('be', 'aware') 6
('aware', 'that') 4
('that', 'her') 19
('to', 'raise') 6
('raise', 'money') 1
('money', 'on') 5
('her', 'pension') 2
('pension', 'for') 1
('journey', '.') 3
('sure', 'it') 2
('a', 'matter') 11
('matter', 'of') 12
('a', 'partnership') 1
('partnership', 'for') 1
('for', 'mutual') 1
('mutual', 'benefit') 1
('ben

('suddenly', ',') 39
('frenzy', '--') 1
('“', 'accept') 1
('accept', 'one') 1
('s', 'lot') 1
('lot', 'humbly') 1
('humbly', 'as') 1
('once', 'for') 13
('and', 'stifle') 1
('stifle', 'everything') 1
('in', 'oneself') 1
(',', 'giving') 2
('giving', 'up') 1
('all', 'claim') 1
('claim', 'to') 2
('to', 'activity') 1
('activity', ',') 2
(',', 'life') 3
('and', 'love') 4
('love', '!') 1
('“', 'Do') 29
('Marmeladov', '’') 4
('s', 'question') 1
('question', 'came') 1
('came', 'suddenly') 1
('suddenly', 'into') 1
('turn', '...') 2
('He', 'gave') 3
('sudden', 'start') 1
('start', ';') 1
(';', 'another') 1
('another', 'thought') 3
('had', 'yesterday') 1
(',', 'slipped') 5
('slipped', 'back') 1
('not', 'start') 1
('start', 'at') 2
('the', 'thought') 19
('thought', 'recurring') 1
('recurring', 'to') 1
('knew', ',') 7
('had', '_felt') 1
('_felt', 'beforehand_') 1
('beforehand_', ',') 1
('must', 'come') 1
('was', 'expecting') 3
('expecting', 'it') 1
(';', 'besides') 4
('besides', 'it') 1
('only', 'yes

('Razumihin', '.') 38
('was', '...') 5
('remember', '.') 7
('What', 'for') 13
('though', '?') 2
('what', 'put') 1
('put', 'the') 18
('Razumihin', 'into') 1
('head', 'just') 2
('s', 'curious.') 1
('curious.', '”') 2
('He', 'wondered') 3
('wondered', 'at') 3
('at', 'himself') 8
('.', 'Razumihin') 49
('Razumihin', 'was') 17
('old', 'comrades') 1
('comrades', 'at') 1
('university', '.') 3
('was', 'remarkable') 1
('remarkable', 'that') 1
('that', 'Raskolnikov') 16
('had', 'hardly') 6
('hardly', 'any') 2
('any', 'friends') 1
('friends', 'at') 1
('university', ';') 1
('he', 'kept') 13
('kept', 'aloof') 1
('aloof', 'from') 2
('see', 'no') 1
('not', 'welcome') 1
('welcome', 'anyone') 1
('anyone', 'who') 3
('who', 'came') 6
('indeed', 'everyone') 1
('everyone', 'soon') 1
('soon', 'gave') 1
('gave', 'him') 19
('no', 'part') 2
('the', 'students') 1
('students', '’') 1
('’', 'gatherings') 1
('gatherings', ',') 1
(',', 'amusements') 1
('amusements', 'or') 1
('or', 'conversations') 1
('conversations'

('father', 'past') 1
('past', 'the') 1
('graveyard', ';') 1
('was', 'holding') 2
('holding', 'his') 4
('s', 'hand') 7
('with', 'dread') 3
('dread', 'at') 1
('A', 'peculiar') 1
('peculiar', 'circumstance') 1
('circumstance', 'attracted') 1
('attracted', 'his') 2
('attention', ':') 2
('there', 'seemed') 4
('some', 'kind') 1
('of', 'festivity') 2
('festivity', 'going') 1
('were', 'crowds') 2
('crowds', 'of') 3
('of', 'gaily') 1
('gaily', 'dressed') 1
('dressed', 'townspeople') 1
('townspeople', ',') 2
(',', 'peasant') 1
('peasant', 'women') 1
(',', 'their') 7
('their', 'husbands') 1
('husbands', ',') 1
('and', 'riff-raff') 1
('riff-raff', 'of') 1
('all', 'singing') 1
('all', 'more') 2
('less', 'drunk') 1
('entrance', 'of') 1
('tavern', 'stood') 1
('a', 'cart') 3
('but', 'a') 9
('strange', 'cart') 1
('cart', '.') 3
('those', 'big') 1
('big', 'carts') 1
('carts', 'usually') 1
('usually', 'drawn') 1
('drawn', 'by') 1
('by', 'heavy') 1
('heavy', 'cart-horses') 1
('cart-horses', 'and') 1
('and

('been', 'torturing') 2
('torturing', 'myself') 1
('myself', 'for') 4
('for', 'till') 1
('till', 'now') 7
('?', 'Yesterday') 2
('Yesterday', ',') 1
('make', 'that') 1
('...', '_experiment_') 1
('_experiment_', ',') 1
('yesterday', 'I') 3
('I', 'realised') 1
('realised', 'completely') 1
('completely', 'that') 1
('never', 'bear') 1
('bear', 'to') 2
('I', 'hesitating') 1
('hesitating', '?') 1
('?', 'As') 7
('As', 'I') 1
('stairs', 'yesterday') 1
('said', 'myself') 1
('myself', 'that') 6
('was', 'base') 2
('base', ',') 4
(',', 'vile') 4
('vile', ',') 3
('vile', '...') 1
('me', 'feel') 1
('feel', 'sick') 1
('sick', 'and') 5
('filled', 'me') 1
('horror', '.') 8
('I', 'couldn') 18
('couldn', '’') 41
('!', 'Granted') 1
('granted', 'that') 2
('no', 'flaw') 1
('flaw', 'in') 1
('that', 'reasoning') 1
('reasoning', ',') 2
('that', 'all') 28
('have', 'concluded') 1
('concluded', 'this') 1
('month', 'is') 1
('is', 'clear') 3
(',', 'true') 1
('true', 'as') 1
('as', 'arithmetic') 1
('arithmetic', '...

('mother', '.') 14
('was', 'thirty-five') 1
('thirty-five', '.') 1
('She', 'worked') 1
('worked', 'day') 1
('night', 'for') 3
('and', 'besides') 8
('besides', 'doing') 1
('doing', 'the') 2
('the', 'cooking') 1
('cooking', 'and') 1
('the', 'washing') 1
('washing', ',') 2
('did', 'sewing') 2
('sewing', 'and') 1
('and', 'worked') 2
('worked', 'as') 1
('a', 'charwoman') 1
('charwoman', 'and') 1
('and', 'gave') 12
('sister', 'all') 1
('she', 'earned') 1
('earned', '.') 1
('not', 'dare') 6
('accept', 'an') 1
('an', 'order') 2
('order', 'or') 2
('or', 'job') 1
('job', 'of') 1
('any', 'kind') 2
('kind', 'without') 1
('without', 'her') 4
('s', 'permission') 1
('permission', '.') 2
('woman', 'had') 4
('her', 'will') 5
('and', 'Lizaveta') 3
('Lizaveta', 'knew') 1
('by', 'this') 5
('this', 'will') 3
('farthing', ';') 1
(';', 'nothing') 1
('the', 'movables') 1
('movables', ',') 1
('on', ';') 1
(';', 'all') 5
('money', 'was') 4
('left', 'to') 4
('a', 'monastery') 4
('monastery', 'in') 1
('of', 'N') 

('left', 'corner') 1
('corner', 'and') 6
('and', 'drew') 5
('the', '_pledge_') 1
('_pledge_', ',') 1
('and', 'hidden') 2
('hidden', 'there') 1
('This', 'pledge') 1
('a', 'smoothly') 1
('smoothly', 'planed') 1
('planed', 'piece') 1
('wood', 'the') 1
('the', 'size') 2
('size', 'and') 1
('and', 'thickness') 1
('thickness', 'of') 1
('a', 'silver') 4
('silver', 'cigarette') 2
('cigarette', 'case') 3
('He', 'picked') 3
('up', 'this') 3
('this', 'piece') 1
('wood', 'in') 1
('his', 'wanderings') 1
('wanderings', 'in') 1
('a', 'courtyard') 2
('courtyard', 'where') 1
('was', 'some') 3
('a', 'workshop') 1
('workshop', '.') 1
('Afterwards', 'he') 2
('had', 'added') 1
('added', 'to') 3
('the', 'wood') 4
('wood', 'a') 1
('thin', 'smooth') 1
('smooth', 'piece') 1
('of', 'iron') 1
('iron', ',') 1
('had', 'also') 2
('also', 'picked') 1
('Putting', 'the') 1
('the', 'iron') 1
('iron', 'which') 1
('little', 'the') 1
('the', 'smaller') 1
('smaller', 'on') 1
('the', 'piece') 3
('he', 'fastened') 1
('fastene

('here', 'was') 7
('gate', '.') 3
('Suddenly', 'a') 2
('clock', 'somewhere') 1
('somewhere', 'struck') 1
('struck', 'once') 1
('!', 'can') 1
('be', 'half-past') 1
('half-past', 'seven') 1
('seven', '?') 1
('?', 'Impossible') 1
('Impossible', ',') 1
('be', 'fast') 1
('fast', '!') 1
('”', 'Luckily') 1
('Luckily', 'for') 1
(',', 'everything') 9
('everything', 'went') 1
('went', 'well') 1
('well', 'again') 2
('gates', '.') 1
('though', 'expressly') 1
('his', 'benefit') 2
('waggon', 'of') 1
('hay', 'had') 1
('just', 'driven') 1
('gate', ',') 2
(',', 'completely') 7
('completely', 'screening') 1
('screening', 'him') 1
('passed', 'under') 1
('the', 'waggon') 2
('waggon', 'had') 1
('scarcely', 'had') 1
('drive', 'through') 1
('through', 'into') 1
('had', 'slipped') 1
('slipped', 'in') 3
('flash', 'to') 1
('waggon', 'he') 1
('hear', 'shouting') 1
('shouting', 'and') 1
('and', 'quarrelling') 1
('quarrelling', ';') 1
('one', 'noticed') 1
('one', 'met') 1
('met', 'him') 7
('Many', 'windows') 1
('w

('sees', 'babies') 1
('babies', '’') 1
('’', 'mouths') 1
('mouths', ',') 1
('they', 'begin') 3
('begin', 'to') 7
(',', 'stare') 1
('stare', 'intently') 1
('at', 'what') 3
('what', 'frightens') 2
('frightens', 'them') 2
('are', 'on') 4
('of', 'screaming') 1
('screaming', '.') 2
('this', 'hapless') 1
('hapless', 'Lizaveta') 1
('so', 'simple') 1
('simple', 'and') 1
('so', 'thoroughly') 1
('thoroughly', 'crushed') 1
('and', 'scared') 1
('scared', 'that') 1
('even', 'raise') 2
('raise', 'a') 2
('to', 'guard') 1
('guard', 'her') 2
('though', 'that') 6
('most', 'necessary') 1
('necessary', 'and') 1
('and', 'natural') 1
('natural', 'action') 1
('action', 'at') 1
('axe', 'was') 1
('raised', 'over') 1
('only', 'put') 3
('her', 'empty') 1
('empty', 'left') 1
('left', 'hand') 4
('slowly', 'holding') 1
('out', 'before') 1
('though', 'motioning') 1
('motioning', 'him') 1
('The', 'axe') 1
('axe', 'fell') 1
('fell', 'with') 1
('the', 'sharp') 1
('sharp', 'edge') 1
('edge', 'just') 1
('skull', 'and') 1

('falling', 'from') 1
('from', 'fatigue') 2
('fatigue', ',') 4
('went', 'a') 1
('long', 'way') 1
('way', 'round') 1
('round', 'so') 1
('get', 'home') 1
('home', 'from') 1
('from', 'quite') 1
('different', 'direction') 1
('direction', '.') 1
('not', 'fully') 3
('fully', 'conscious') 3
('conscious', 'when') 1
('gateway', 'of') 2
('his', 'house') 2
('house', '!') 1
('staircase', 'before') 1
('he', 'recollected') 4
('recollected', 'the') 1
('very', 'grave') 2
('grave', 'problem') 1
('problem', 'before') 1
('escape', 'observation') 1
('observation', 'as') 1
('possible', 'in') 1
('in', 'doing') 3
('doing', 'so') 4
('course', 'incapable') 1
('of', 'reflecting') 1
('might', 'perhaps') 3
('perhaps', 'be') 2
('be', 'far') 1
('far', 'better') 2
('restore', 'the') 2
('drop', 'it') 2
('it', 'later') 3
('on', 'in') 8
('in', 'somebody') 1
('somebody', '’') 1
('s', 'yard') 1
('happened', 'fortunately') 1
('fortunately', ',') 1
('closed', 'but') 1
('seemed', 'most') 1
('porter', 'was') 3
('all', 'power

('writing', ',') 1
(',', 'dressed') 2
('dressed', 'hardly') 1
('hardly', 'better') 1
('a', 'queer-looking') 1
('queer-looking', 'set') 1
('set', '.') 1
('He', 'showed') 1
('showed', 'the') 2
('notice', 'he') 2
('received', '.') 2
('student', '?') 2
('man', 'asked') 1
(',', 'glancing') 7
('glancing', 'at') 9
('notice', '.') 1
(',', 'formerly') 3
('a', 'student.') 1
('student.', '”') 1
('The', 'clerk') 1
('but', 'without') 4
('the', 'slightest') 14
('slightest', 'interest') 1
('a', 'particularly') 2
('particularly', 'unkempt') 1
('unkempt', 'person') 1
('person', 'with') 2
('the', 'look') 4
('look', 'of') 19
('a', 'fixed') 1
('fixed', 'idea') 3
('idea', 'in') 4
('eye', '.') 3
('There', 'would') 1
('no', 'getting') 2
('getting', 'anything') 1
('anything', 'out') 1
('no', 'interest') 4
('in', 'anything') 4
('Go', 'in') 2
('there', 'to') 5
('head', 'clerk') 22
(',', 'pointing') 6
('pointing', 'towards') 1
('furthest', 'room') 1
('that', 'room') 3
('room', '--') 3
('fourth', 'in') 1
('order'

('regiment', 'it') 1
('cried', 'Ilya') 3
('much', 'gratified') 1
('gratified', 'at') 1
('this', 'agreeable') 1
('agreeable', 'banter') 1
('banter', ',') 1
('though', 'still') 4
('still', 'sulky') 1
('sulky', '.') 1
('something', 'exceptionally') 1
('exceptionally', 'pleasant') 1
('pleasant', 'to') 3
('“', 'Excuse') 12
(',', 'Captain') 1
('began', 'easily') 1
('suddenly', 'addressing') 3
('addressing', 'Nikodim') 2
('“', 'will') 1
('you', 'enter') 1
('my', 'position') 2
('position', '?') 2
('ask', 'pardon') 1
('pardon', ',') 1
('been', 'ill-mannered') 1
('ill-mannered', '.') 1
('a', 'poor') 9
('poor', 'student') 3
(',', 'sick') 1
('and', 'shattered') 1
('shattered', '(') 1
('(', 'shattered') 1
('shattered', 'was') 1
('the', 'word') 9
('word', 'he') 6
('used', ')') 1
(')', 'by') 1
('poverty', '.') 1
('not', 'studying') 1
('not', 'keep') 2
('keep', 'myself') 4
('myself', 'now') 2
('shall', 'get') 7
('and', 'sister') 25
('of', 'X') 1
('X', '.') 5
('will', 'send') 3
('send', 'it') 2
('will'

('Neva', '.') 3
('many', 'people') 5
('less', 'observed') 1
('observed', ',') 9
('convenient', 'in') 1
('all', 'it') 1
('was', 'further') 1
('further', 'off') 1
('wondered', 'how') 1
('been', 'wandering') 1
('wandering', 'for') 1
('good', 'half-hour') 1
(',', 'worried') 1
('worried', 'and') 2
('and', 'anxious') 3
('anxious', 'in') 1
('this', 'dangerous') 1
('dangerous', 'past') 1
('past', 'without') 1
('without', 'thinking') 4
('it', 'before') 7
('that', 'half-hour') 1
('half-hour', 'he') 1
('lost', 'over') 1
('over', 'an') 2
('an', 'irrational') 1
('irrational', 'plan') 1
('plan', ',') 4
('in', 'delirium') 6
('delirium', '!') 3
('become', 'extremely') 1
('extremely', 'absent') 1
('absent', 'and') 2
('and', 'forgetful') 1
('forgetful', 'and') 1
('was', 'aware') 2
('He', 'certainly') 1
('certainly', 'must') 4
('must', 'make') 3
('Neva', 'along') 1
('along', 'V') 1
('V', '--') 3
('--', 'Prospect') 3
('way', 'another') 1
('another', 'idea') 2
('idea', 'struck') 9
('Why', 'to') 1
('Would',

('incident', '.') 1
('A', 'coachman') 1
('coachman', ',') 2
('after', 'shouting') 1
('him', 'two') 2
(',', 'gave') 4
('violent', 'lash') 1
('lash', 'on') 1
('back', 'with') 3
('his', 'whip') 1
('for', 'having') 7
('having', 'almost') 1
('almost', 'fallen') 1
('fallen', 'under') 1
('his', 'horses') 1
('horses', '’') 1
('’', 'hoofs') 1
('hoofs', '.') 1
('The', 'lash') 1
('lash', 'so') 1
('so', 'infuriated') 1
('infuriated', 'him') 1
('he', 'dashed') 1
('dashed', 'away') 1
('away', 'to') 5
('the', 'railing') 5
('railing', '(') 1
('reason', 'he') 2
('been', 'walking') 2
('very', 'middle') 1
('bridge', 'in') 1
('the', 'traffic') 1
('traffic', ')') 1
('He', 'angrily') 1
('angrily', 'clenched') 1
('clenched', 'and') 1
('and', 'ground') 1
('ground', 'his') 3
('teeth', '.') 1
('He', 'heard') 4
('heard', 'laughter') 2
('laughter', ',') 4
('“', 'Serves') 1
('Serves', 'him') 1
('him', 'right') 1
('A', 'pickpocket') 1
('pickpocket', 'I') 1
('dare', 'say.') 1
('say.', '”') 3
('“', 'Pretending') 1
('

('should', 'we') 5
('we', 'trouble') 2
('trouble', 'you') 3
('judgment', '...') 1
('keep', 'your') 4
('your', 'visitor') 1
('see', 'he') 1
('is', 'waiting') 1
('waiting', ',') 2
('made', 'ready') 3
('hold', 'Raskolnikov') 1
('in', 'earnest') 6
('earnest', '.') 1
('it', 'alone') 3
('and', 'signing') 1
('signing', 'his') 1
('The', 'messenger') 1
('messenger', 'took') 1
('went', 'away') 10
('“', 'Bravo') 2
('Bravo', '!') 1
('you', 'hungry') 1
('hungry', '?') 1
('there', 'any') 2
('any', 'soup') 1
('soup', '?') 1
('Some', 'of') 4
('of', 'yesterday') 4
('answered', 'Nastasya') 2
('“', 'With') 4
('With', 'potatoes') 1
('potatoes', 'and') 1
('and', 'rice') 1
('rice', 'in') 1
('it', 'by') 7
('by', 'heart') 5
('.', 'Bring') 1
('Bring', 'soup') 1
('give', 'us') 4
('us', 'some') 3
('some', 'tea.') 1
('tea.', '”') 2
('Very', 'well.') 1
('well.', '”') 5
('this', 'with') 2
('with', 'profound') 1
('profound', 'astonishment') 1
('astonishment', 'and') 1
('a', 'dull') 1
('dull', ',') 1
(',', 'unreasoni

(';', 'forgotten') 1
('forgotten', 'it') 1
('I', 'remembered') 2
('minute', 'ago.') 1
('in', 'miserable') 1
('miserable', 'bewilderment') 1
('bewilderment', 'about') 1
('listened', ';') 1
('Suddenly', ',') 1
('though', 'recalling') 1
('recalling', 'something') 2
('a', 'hole') 3
('began', 'examining') 2
('examining', 'it') 1
('fumbled', '--') 1
('not', 'it') 9
('began', 'rummaging') 1
('rummaging', 'in') 1
('the', 'ashes') 1
('ashes', ';') 1
('frayed', 'edges') 1
('edges', 'of') 1
('rags', 'cut') 1
('pocket', 'were') 1
('were', 'lying') 4
('there', 'just') 2
('had', 'thrown') 2
('thrown', 'them') 1
('had', 'looked') 2
('sock', 'about') 1
('about', 'which') 1
('which', 'Razumihin') 4
('Razumihin', 'had') 7
('been', 'telling') 1
('sofa', 'under') 1
('the', 'quilt') 2
('quilt', ',') 1
('so', 'covered') 1
('with', 'dust') 1
('and', 'grime') 1
('grime', 'that') 1
('that', 'Zametov') 3
('Zametov', 'could') 1
('seen', 'anything') 2
('“', 'Bah') 6
('Bah', ',') 2
(',', 'Zametov') 3
('Zametov', '

('am', 'fond') 3
('.', 'Porfiry') 19
('Porfiry', 'Petrovitch') 81
('the', 'Investigation') 1
('Investigation', 'Department') 1
('Department', 'here') 1
('know', 'him.') 1
('he', 'a') 2
('relation', 'of') 3
('very', 'distant') 1
('distant', 'one') 1
('you', 'scowling') 1
('scowling', '?') 1
('?', 'Because') 5
('Because', 'you') 2
('you', 'quarrelled') 1
('quarrelled', 'once') 1
('come', 'then') 1
('care', 'a') 4
('a', 'damn') 1
('damn', 'for') 1
('So', 'much') 2
('much', 'the') 5
('some', 'students') 1
('a', 'teacher') 1
('teacher', ',') 1
('a', 'musician') 1
('musician', ',') 1
('officer', 'and') 2
('and', 'Zametov.') 1
('Zametov.', '”') 3
('Do', 'tell') 1
('you', 'or') 6
('or', 'he') 4
('he', '”') 1
('--', 'Zossimov') 1
('Zossimov', 'nodded') 1
('nodded', 'at') 4
('have', 'in') 3
('this', 'Zametov') 1
('you', 'particular') 1
('particular', 'gentleman') 1
('gentleman', '!') 2
('!', 'Principles') 1
('Principles', '!') 1
('are', 'worked') 1
('worked', 'by') 1
('by', 'principles') 1
('pri

('flat', 'open') 2
('there', 'at') 4
(',', 'flinging') 1
('flinging', 'away') 1
('away', 'their') 1
('their', 'booty') 1
('booty', ',') 1
('they', 'rolled') 1
('rolled', 'about') 1
('about', 'like') 1
('and', 'attracting') 1
('attracting', 'general') 1
('general', 'attention') 1
('dozen', 'witnesses') 1
('witnesses', 'to') 2
('swear', 'to') 3
('is', 'strange') 4
('strange', '!') 5
('no', '_buts_') 1
('_buts_', '.') 1
('ear-rings', 'being') 1
('being', 'found') 1
('found', 'in') 5
('in', 'Nikolay') 1
('hands', 'at') 1
('hour', 'of') 4
('murder', 'constitutes') 1
('constitutes', 'an') 1
('important', 'piece') 1
('of', 'circumstantial') 1
('circumstantial', 'evidence') 2
('--', 'although') 1
('although', 'the') 1
('explanation', 'given') 1
('him', 'accounts') 1
('accounts', 'for') 2
('and', 'therefore') 7
('therefore', 'it') 1
('not', 'tell') 5
('tell', 'seriously') 1
('seriously', 'against') 1
('take', 'into') 1
('into', 'consideration') 3
('consideration', 'the') 1
('facts', 'which') 2


('--', 'filthy') 1
('filthy', ',') 1
(',', 'stinking') 1
('stinking', 'and') 1
('of', 'doubtful') 1
('doubtful', 'character') 1
('.', 'Things') 1
('Things', 'have') 1
('happened', 'there') 1
('of', 'queer') 1
('queer', 'people') 1
('went', 'there') 4
('there', 'about') 1
('a', 'scandalous') 1
('scandalous', 'business') 1
('s', 'cheap') 1
('cheap', ',') 2
(',', 'find') 3
('much', 'about') 7
('Petersburg', 'myself') 1
('Petrovitch', 'replied') 3
('replied', 'huffily') 1
('huffily', '.') 1
('“', 'However') 2
('However', ',') 5
('rooms', 'are') 1
('are', 'exceedingly') 2
('exceedingly', 'clean') 1
('for', 'so') 4
('already', 'taken') 1
('a', 'permanent') 3
('permanent', ',') 2
('our', 'future') 1
('future', 'flat') 1
('am', 'having') 1
('having', 'it') 2
('it', 'done') 1
('done', 'up') 2
('meanwhile', 'I') 2
('am', 'myself') 1
('myself', 'cramped') 1
('cramped', 'for') 1
('for', 'room') 1
('room', 'in') 6
('lodging', 'with') 2
('friend', 'Andrey') 1
('Andrey', 'Semyonovitch') 21
('Semyonov

('not', 'fall') 1
('had', 'dressed') 1
('in', 'entirely') 1
('new', 'clothes') 4
('money', 'lying') 2
('and', 'after') 5
('thought', 'put') 1
('was', 'twenty-five') 1
('took', 'also') 1
('also', 'all') 1
('the', 'copper') 2
('copper', 'change') 1
('change', 'from') 1
('the', 'ten') 2
('roubles', 'spent') 1
('spent', 'by') 1
('Razumihin', 'on') 2
('softly', 'unlatched') 1
('slipped', 'downstairs') 1
('and', 'glanced') 2
('glanced', 'in') 1
('open', 'kitchen') 1
('kitchen', 'door') 1
('blowing', 'up') 1
('s', 'samovar') 1
('samovar', '.') 1
('She', 'heard') 3
('have', 'dreamed') 1
('indeed', '?') 2
('nearly', 'eight') 2
('sun', 'was') 3
('was', 'setting') 3
('setting', '.') 3
('as', 'stifling') 1
('stifling', 'as') 1
('he', 'eagerly') 1
('drank', 'in') 1
('the', 'stinking') 1
('stinking', ',') 1
(',', 'dusty') 1
('dusty', 'town') 1
('town', 'air') 1
('air', '.') 7
('head', 'felt') 1
('felt', 'rather') 1
('rather', 'dizzy') 1
('dizzy', ';') 1
('of', 'savage') 2
('savage', 'energy') 1
('en

('hang', 'oneself') 1
('oneself', 'at') 1
('notes', 'either') 1
('who', 'changed') 1
('changed', 'the') 1
('notes', 'took') 1
('hands', 'trembled') 3
('trembled', '.') 2
('He', 'counted') 1
('counted', 'the') 1
('first', 'four') 1
('four', 'thousand') 1
('thousand', ',') 4
('not', 'count') 1
('count', 'the') 3
('fifth', 'thousand') 1
('thousand', '--') 1
('get', 'the') 4
('and', 'run') 2
('course', 'he') 3
('he', 'roused') 2
('roused', 'suspicion') 1
('thing', 'came') 1
('a', 'crash') 1
('crash', 'through') 1
('through', 'one') 1
('one', 'fool') 1
('fool', '!') 5
('!', 'Is') 4
('possible', '?') 6
('That', 'his') 1
('trembled', '?') 1
('observed', 'Zametov') 1
('“', 'yes') 1
('quite', 'possible') 1
('That', ',') 3
('feel', 'quite') 2
('quite', 'sure') 1
('is', 'possible') 3
('Sometimes', 'one') 2
('t', 'stand') 6
('stand', 'things.') 1
('things.', '”') 1
('Can', '’') 1
('stand', 'that') 2
('you', 'stand') 2
('t', '.') 10
('face', 'such') 1
('terrible', 'experience') 1
('experience', '?'

('the', 'drowning') 2
('drowning', 'woman') 2
('woman', 'floated') 1
('floated', 'to') 1
('the', 'surface') 3
('surface', ',') 1
('moving', 'slowly') 1
('slowly', 'with') 1
('the', 'current') 1
('current', ',') 1
('legs', 'in') 1
('her', 'skirt') 1
('skirt', 'inflated') 1
('inflated', 'like') 1
('a', 'balloon') 2
('balloon', 'over') 1
('A', 'woman') 2
('woman', 'drowning') 2
('drowning', '!') 3
('shouted', 'dozens') 1
('of', 'voices') 1
('voices', ';') 2
('ran', 'up') 5
('both', 'banks') 1
('banks', 'were') 1
('were', 'thronged') 1
('thronged', 'with') 1
('with', 'spectators') 1
('spectators', ',') 1
('bridge', 'people') 1
('people', 'crowded') 1
('crowded', 'about') 1
('about', 'Raskolnikov') 3
('pressing', 'up') 1
('up', 'behind') 1
('!', 'it') 1
('s', 'our') 1
('our', 'Afrosinya') 1
('Afrosinya', '!') 1
('”', 'a') 6
('cried', 'tearfully') 1
('tearfully', 'close') 1
('Mercy', '!') 2
('!', 'save') 1
('!', 'kind') 1
('kind', 'people') 1
(',', 'pull') 1
('pull', 'her') 2
('A', 'boat') 1

('walking', 'to') 3
('fro', 'in') 3
('room', 'from') 3
('from', 'window') 1
('window', 'to') 1
('to', 'stove') 1
('stove', 'and') 2
('and', 'back') 1
('arms', 'folded') 3
('folded', 'across') 1
('across', 'her') 1
('coughing', '.') 3
('talk', 'more') 1
('her', 'eldest') 1
(',', 'Polenka') 13
('Polenka', ',') 13
('of', 'ten') 5
('ten', ',') 2
('mother', 'needed') 1
('needed', 'her') 1
('so', 'always') 1
('always', 'watched') 1
('watched', 'her') 2
('her', 'big') 3
('big', 'clever') 1
('clever', 'eyes') 1
('and', 'strove') 1
('strove', 'her') 1
('her', 'utmost') 3
('appear', 'to') 4
('time', 'Polenka') 1
('Polenka', 'was') 3
('was', 'undressing') 1
('undressing', 'her') 2
('been', 'unwell') 1
('unwell', 'all') 1
('to', 'bed') 10
('boy', 'was') 1
('was', 'waiting') 5
('his', 'shirt') 4
('shirt', ',') 5
('be', 'washed') 1
('washed', 'at') 1
('night', '.') 10
('sitting', 'straight') 1
('and', 'motionless') 2
('motionless', 'on') 1
('a', 'silent') 3
(',', 'serious') 1
('serious', 'face') 1
(

('doctor', 'changed') 1
('changed', 'places') 1
('places', 'with') 1
(',', 'exchanging') 2
('exchanging', 'glances') 1
('glances', 'with') 2
('Raskolnikov', 'begged') 1
('begged', 'the') 1
('doctor', 'to') 1
('remain', 'a') 2
('He', 'shrugged') 1
('and', 'remained') 3
('All', 'stepped') 1
('The', 'confession') 1
('confession', 'was') 3
('was', 'soon') 1
('soon', 'over') 1
('The', 'dying') 1
('man', 'probably') 1
('probably', 'understood') 1
('understood', 'little') 1
('only', 'utter') 1
('utter', 'indistinct') 1
('indistinct', 'broken') 1
('broken', 'sounds') 1
('sounds', '.') 1
('Ivanovna', 'took') 2
('took', 'little') 1
('lifted', 'the') 1
('boy', 'from') 1
('children', 'kneel') 1
('kneel', 'in') 1
('trembling', ';') 1
(',', 'kneeling') 1
('kneeling', 'on') 1
('little', 'bare') 1
('bare', 'knees') 1
('hand', 'rhythmically') 1
('rhythmically', ',') 1
('with', 'precision') 1
('precision', 'and') 1
('and', 'bowed') 3
('bowed', 'down') 8
(',', 'touching') 4
('to', 'afford') 1
('afford', 

('of', 'liquor') 1
('liquor', 'made') 1
('made', 'Razumihin') 1
('Razumihin', 'quite') 1
('quite', 'drunk') 1
('was', 'perceptibly') 1
('perceptibly', 'affected') 1
('affected', 'by') 2
('Raskolnikov', 'hastened') 2
('ve', 'only') 9
('ve', 'won') 1
('won', 'your') 1
('your', 'bet') 1
('bet', 'and') 1
('one', 'really') 2
('really', 'knows') 1
('not', 'happen') 2
('so', 'weak') 1
('weak', 'that') 1
('down', 'directly') 1
('so', 'good') 3
('good', 'evening') 1
('and', 'good-bye') 1
('good-bye', '!') 3
('Come', 'and') 1
('me', 'to-morrow.') 1
('to-morrow.', '”') 1
('ll', 'see') 6
('re', 'weak') 1
('weak', 'yourself') 1
('must', '...') 1
('your', 'visitors') 1
('visitors', '?') 1
('the', 'curly-headed') 1
('curly-headed', 'one') 1
('just', 'peeped') 1
('peeped', 'out') 2
('He', '?') 1
('?', 'Goodness') 2
('Goodness', 'only') 1
('only', 'knows') 1
('knows', '!') 1
('Some', 'friend') 1
('of', 'uncle') 1
('uncle', '’') 1
('I', 'expect') 5
('expect', ',') 2
('or', 'perhaps') 5
('come', 'without

('his', 'arguments') 1
('arguments', ',') 1
('he', 'squeezed') 3
('squeezed', 'their') 1
('hands', 'painfully') 1
('painfully', 'as') 2
('a', 'vise') 3
('vise', '.') 3
('He', 'stared') 3
('at', 'Avdotya') 4
('Romanovna', 'without') 1
('least', 'regard') 2
('regard', 'for') 3
('good', 'manners') 4
('manners', '.') 1
('They', 'sometimes') 1
('sometimes', 'pulled') 1
('pulled', 'their') 1
('hands', 'out') 1
('his', 'huge') 1
('huge', 'bony') 1
('bony', 'paws') 1
('paws', ',') 1
('from', 'noticing') 1
('drew', 'them') 2
('the', 'closer') 2
('closer', 'to') 1
('d', 'told') 1
('to', 'jump') 3
('jump', 'head') 1
('foremost', 'from') 1
('thought', 'or') 1
('or', 'hesitation') 1
('hesitation', 'in') 1
('their', 'service') 1
('service', '.') 6
('Though', 'Pulcheria') 2
('Alexandrovna', 'felt') 1
('really', 'too') 1
('too', 'eccentric') 1
('eccentric', 'and') 2
('and', 'pinched') 1
('pinched', 'her') 1
('hand', 'too') 1
('her', 'anxiety') 2
('anxiety', 'over') 1
('Rodya', 'she') 1
('presence', 'a

('exaggerated', ';') 1
('that', 'certainly') 1
('certainly', 'the') 1
('patient', 'had') 2
('some', 'fixed') 1
('something', 'approaching') 1
('approaching', 'a') 1
('a', 'monomania') 1
('monomania', '--') 1
('now', 'particularly') 1
('particularly', 'studying') 1
('studying', 'this') 1
('this', 'interesting') 2
('interesting', 'branch') 1
('branch', 'of') 1
('of', 'medicine') 2
('medicine', '--') 1
('be', 'recollected') 1
('recollected', 'that') 3
('that', 'until') 1
('until', 'to-day') 1
('to-day', 'the') 1
('delirium', 'and') 1
('doubt', 'the') 1
('family', 'would') 1
('favourable', 'effect') 1
('his', 'recovery') 1
('and', 'distract') 1
('only', 'all') 1
('all', 'fresh') 1
('fresh', 'shocks') 1
('shocks', 'can') 1
('be', 'avoided') 2
('avoided', ',') 2
('added', 'significantly') 1
('took', 'leave') 2
('leave', 'with') 1
('an', 'impressive') 1
('impressive', 'and') 1
('and', 'affable') 1
('affable', 'bow') 1
('while', 'blessings') 1
('blessings', ',') 1
('warm', 'gratitude') 1
('and

('my', '_bonjour_') 1
('_bonjour_', 'through') 1
('the', 'samovar') 2
('samovar', 'was') 2
('taken', 'into') 3
('not', 'vouchsafed') 1
('vouchsafed', 'a') 1
('a', 'personal') 1
('personal', 'interview') 1
('interview', '...') 1
('At', 'nine') 1
('precisely', 'Razumihin') 1
('Razumihin', 'reached') 1
('lodgings', 'at') 1
('at', 'Bakaleyev') 1
('Both', 'ladies') 1
('ladies', 'were') 3
('were', 'waiting') 1
('impatience', '.') 3
('had', 'risen') 2
('risen', 'at') 1
('clock', 'or') 1
('or', 'earlier') 1
('He', 'entered') 1
('entered', 'looking') 1
('looking', 'as') 3
('as', 'black') 1
('black', 'as') 1
('as', 'night') 1
(',', 'bowed') 4
('bowed', 'awkwardly') 1
('awkwardly', 'and') 1
('once', 'furious') 1
('furious', 'with') 1
('himself', 'for') 2
('had', 'reckoned') 2
('reckoned', 'without') 1
('his', 'host') 1
('host', ':') 1
(':', 'Pulcheria') 1
('Alexandrovna', 'fairly') 1
('fairly', 'rushed') 1
('almost', 'kissing') 1
('kissing', 'them') 3
('glanced', 'timidly') 1
('her', 'proud') 1
(

('and', 'sombre') 1
('sombre', '.') 1
('a', 'wounded') 1
('wounded', 'man') 1
('man', 'or') 5
('has', 'undergone') 1
('undergone', 'some') 1
('some', 'terrible') 3
('terrible', 'physical') 1
('physical', 'suffering') 1
('His', 'brows') 1
('brows', 'were') 1
('were', 'knitted') 1
('knitted', ',') 1
('eyes', 'feverish') 1
('feverish', '.') 2
('spoke', 'little') 1
('though', 'performing') 1
('performing', 'a') 2
('a', 'duty') 3
('a', 'restlessness') 1
('restlessness', 'in') 1
('his', 'movements') 1
('He', 'only') 3
('wanted', 'a') 1
('a', 'sling') 1
('sling', 'on') 1
('arm', 'or') 1
('a', 'bandage') 1
('bandage', 'on') 1
('finger', 'to') 2
('complete', 'the') 1
('impression', 'of') 2
('a', 'painful') 3
('painful', 'abscess') 1
('abscess', 'or') 1
('broken', 'arm') 1
('The', 'pale') 1
('sombre', 'face') 1
('face', 'lighted') 2
('sister', 'entered') 1
('entered', ',') 1
('this', 'only') 1
('only', 'gave') 1
('suffering', ',') 9
('its', 'listless') 1
('listless', 'dejection') 1
('dejection',

('lying', ',') 8
('nails', 'vindictively') 1
('vindictively', '.') 1
('“', 'Proud') 1
('Proud', 'creature') 1
('She', 'won') 2
('t', 'admit') 4
('admit', 'she') 1
('of', 'charity') 1
('charity', '!') 1
('!', 'Too') 1
('Too', 'haughty') 1
('haughty', '!') 1
(',', 'base') 4
('base', 'characters') 1
('characters', '!') 1
('even', 'love') 1
('love', 'as') 1
('they', 'hate') 1
('hate', '...') 1
('...', 'hate') 1
('hate', 'them') 4
('continued', 'Dounia') 1
('am', 'marrying') 1
('marrying', 'Pyotr') 1
('Petrovitch', 'because') 1
('because', 'of') 9
('of', 'two') 3
('two', 'evils') 1
('evils', 'I') 1
('choose', 'the') 1
('the', 'less') 2
('I', 'intend') 2
('intend', 'to') 6
('do', 'honestly') 1
('honestly', 'all') 1
('he', 'expects') 1
('expects', 'of') 1
('not', 'deceiving') 1
('deceiving', 'him') 1
('you', 'smile') 1
('smile', 'just') 1
(',', 'flushed') 3
('of', 'anger') 1
('anger', 'in') 2
('All', '?') 1
('malignant', 'grin') 1
('“', 'Within') 1
('Within', 'certain') 1
('certain', 'limits'

('about', 'some') 3
('business', 'or') 1
('gets', 'out') 1
('air', '...') 2
('fearfully', 'close') 1
('close', 'in') 1
('room', '...') 2
('very', 'streets') 1
('streets', 'here') 1
('here', 'feel') 1
('feel', 'like') 1
('like', 'shut-up') 1
('shut-up', 'rooms') 1
('a', 'town') 5
('town', '!') 1
('...', 'stay') 1
('stay', '...') 1
('this', 'side') 2
('will', 'crush') 1
('crush', 'you') 1
('--', 'carrying') 1
('carrying', 'something') 1
('piano', 'they') 1
('declare', '...') 1
('they', 'push') 1
('push', '!') 1
('that', 'young') 3
('What', 'young') 1
('that', 'Sofya') 5
('Semyonovna', ',') 23
('just', 'now.') 2
('presentiment', ',') 1
('may', 'believe') 3
('chief', 'cause') 1
('trouble', '...') 1
('Nothing', 'of') 3
('in', 'vexation') 1
('your', 'presentiments') 1
('presentiments', ',') 1
('her', 'acquaintance') 1
('acquaintance', 'the') 1
('evening', 'before') 3
('her', 'when') 1
('came', 'in.') 1
('She', 'worries') 1
('those', 'eyes') 4
('scarcely', 'sit') 1
('sit', 'still') 4
('my', '

('my', 'honour') 3
('honour', 'I') 1
('rage', 'with') 1
('we', 'came') 4
('came', 'along') 1
('along', 'that') 2
('like', 'Romeo') 1
('Romeo', '...') 1
('and', 'proved') 2
('proved', 'it') 2
('ejaculated', 'Razumihin') 1
('grave', 'grounds') 1
('furious', 'at') 1
('Porfiry', 'laughed') 1
('you', 'sharp') 1
('sharp', 'lawyer') 1
('lawyer', '!') 1
('...', 'Damn') 1
('Damn', 'you') 1
('”', 'snapped') 1
('snapped', 'Razumihin') 1
('suddenly', 'bursting') 1
('bursting', 'out') 1
('laughing', 'himself') 1
('Porfiry', 'with') 2
('cheerful', 'face') 1
('though', 'nothing') 1
('nothing', 'had') 1
('all', 'fools') 1
('To', 'come') 1
('friend', 'Rodion') 1
('and', 'wants') 1
('little', 'matter') 2
('.', 'Bah') 2
('what', 'brought') 1
('you', 'met') 1
('met', 'before') 1
('you', 'known') 1
('known', 'each') 1
('other', 'long') 1
('does', 'this') 1
('this', 'mean') 1
('Raskolnikov', 'uneasily') 1
('Zametov', 'seemed') 1
('seemed', 'taken') 1
('your', 'rooms') 1
('rooms', 'we') 1
('we', 'met') 2
('m

('phalanstery', 'is') 1
('but', 'your') 3
('your', 'human') 1
('not', 'ready') 2
('ready', 'for') 2
('the', 'phalanstery') 1
('phalanstery', '--') 1
('it', 'wants') 1
('wants', 'life') 1
('it', 'hasn') 2
('t', 'completed') 1
('completed', 'its') 1
('its', 'vital') 1
('vital', 'process') 1
('soon', 'for') 1
('graveyard', '!') 1
('t', 'skip') 1
('skip', 'over') 1
('over', 'nature') 1
('nature', 'by') 1
('by', 'logic') 1
('logic', '.') 1
('.', 'Logic') 1
('Logic', 'presupposes') 1
('presupposes', 'three') 1
('three', 'possibilities') 1
('possibilities', ',') 2
('are', 'millions') 1
('millions', '!') 1
('!', 'Cut') 1
('Cut', 'away') 1
('and', 'reduce') 1
('reduce', 'it') 1
('comfort', '!') 1
('the', 'easiest') 1
('easiest', 'solution') 1
('solution', 'of') 3
('the', 'problem') 2
('problem', '!') 1
('s', 'seductively') 1
('seductively', 'clear') 1
('clear', 'and') 2
('you', 'musn') 1
('musn', '’') 1
('think', 'about') 3
('whole', 'secret') 1
('two', 'pages') 1
('pages', 'of') 1
('of', 'prin

('to', 'kill') 7
('kill', 'others') 1
('these', 'extraordinary') 1
('admit', 'it') 6
('s', 'alarming') 1
('alarming', 'if') 1
('same', 'tone') 1
('People', 'with') 1
('with', 'new') 3
('new', 'ideas') 2
('people', 'with') 1
('faintest', 'capacity') 1
('capacity', 'for') 1
('something', '_new_') 2
('_new_', ',') 1
('are', 'extremely') 2
('extremely', 'few') 1
('few', 'in') 1
('in', 'number') 1
('number', ',') 2
(',', 'extraordinarily') 1
('extraordinarily', 'so') 1
('thing', 'only') 2
('only', 'is') 1
('these', 'grades') 1
('grades', 'and') 1
('and', 'sub-divisions') 1
('sub-divisions', 'of') 1
('men', 'must') 2
('must', 'follow') 1
('follow', 'with') 1
('with', 'unfailing') 2
('unfailing', 'regularity') 2
('regularity', 'some') 1
('some', 'law') 1
('nature', '.') 1
('That', 'law') 1
('is', 'unknown') 1
('unknown', 'at') 1
('it', 'exists') 1
('day', 'may') 1
('may', 'become') 4
('become', 'known') 1
('The', 'vast') 1
('vast', 'mass') 1
('mankind', 'is') 2
('is', 'mere') 1
('mere', 'mate

('less', 'he') 1
('he', 'suspects') 1
('suspects', 'that') 1
('be', 'caught') 2
('a', 'simple') 3
('simple', 'thing') 1
('the', 'simpler') 1
('simpler', 'the') 1
('trap', 'he') 1
('Porfiry', 'is') 1
('not', 'such') 2
('fool', 'as') 1
('a', 'knave') 1
('knave', 'then') 1
('the', 'strangeness') 1
('strangeness', 'of') 1
('own', 'frankness') 1
('frankness', ',') 1
('the', 'eagerness') 1
('eagerness', 'with') 1
('made', 'this') 2
('this', 'explanation') 1
('had', 'kept') 2
('kept', 'up') 2
('the', 'preceding') 3
('preceding', 'conversation') 1
('with', 'gloomy') 2
('gloomy', 'repulsion') 1
('obviously', 'with') 1
('a', 'motive') 3
('motive', ',') 4
('from', 'necessity') 1
('necessity', '.') 1
('am', 'getting') 1
('a', 'relish') 1
('relish', 'for') 1
('certain', 'aspects') 1
('aspects', '!') 1
('But', 'almost') 2
('suddenly', 'uneasy') 1
('and', 'alarming') 1
('alarming', 'idea') 1
('idea', 'had') 3
('had', 'occurred') 2
('His', 'uneasiness') 1
('uneasiness', 'kept') 1
('kept', 'on') 2
('on

('buzz', '.') 1
('corner', 'between') 2
('cupboard', 'something') 1
('a', 'cloak') 1
('cloak', 'hanging') 1
('hanging', 'on') 1
('that', 'cloak') 1
('cloak', 'here') 1
('there', 'before') 1
('it', 'quietly') 1
('someone', 'hiding') 1
('hiding', 'behind') 1
('behind', 'it') 2
('He', 'cautiously') 1
('cautiously', 'moved') 1
('moved', 'the') 1
('the', 'cloak') 1
('cloak', 'and') 2
('woman', 'bent') 1
('bent', 'double') 1
('double', 'so') 1
('see', 'her') 13
('was', 'she') 3
('is', 'afraid') 1
('He', 'stealthily') 1
('stealthily', 'took') 1
('her', 'one') 2
('another', 'on') 2
('But', 'strange') 2
('stir', ',') 1
('down', 'nearer') 1
('nearer', 'and') 3
('bent', 'her') 1
('head', 'lower') 1
('lower', '.') 1
('bent', 'right') 1
('right', 'down') 1
('ground', 'and') 2
('and', 'peeped') 2
('peeped', 'up') 1
('up', 'into') 2
('face', 'from') 1
('he', 'peeped') 1
('peeped', 'and') 1
('turned', 'cold') 1
('horror', ':') 1
('sitting', 'and') 3
('with', 'noiseless') 1
('noiseless', 'laughter') 1


('awake', '.') 1
('awake', 'every') 1
('comes', ',') 1
(',', 'speaks') 2
('speaks', 'to') 1
('goes', 'out') 1
('--', 'always') 1
('can', 'almost') 1
('almost', 'hear') 1
('hear', 'her.') 1
('sort', 'must') 1
('be', 'happening') 1
('much', 'excited') 1
('Svidrigaïlov', 'asked') 2
('?', 'Didn') 1
('common', 'between') 1
('never', 'said') 1
('cried', 'sharply') 1
('“', 'Didn') 2
('No', '!') 6
('did', '.') 4
('lying', 'with') 1
('your', 'eyes') 5
('eyes', 'shut') 1
('the', 'man.') 1
('man.', '’') 1
('by', '‘') 3
('Svidrigaïlov', 'muttered') 3
('muttered', 'ingenuously') 1
('ingenuously', ',') 1
('were', 'puzzled') 1
('minute', 'they') 2
('They', 'stared') 1
('s', 'faces') 1
('Raskolnikov', 'shouted') 2
('shouted', 'with') 1
('she', 'say') 1
('She', '!') 1
('she', 'talks') 1
('talks', 'of') 1
('the', 'silliest') 1
('silliest', 'trifles') 1
('trifles', 'and') 1
('--', 'man') 1
('strange', 'creature') 1
('creature', '--') 1
('angry', '.') 4
('in', '(') 1
('tired', 'you') 1
('know', ':') 1
('f

('say', 'to-morrow') 1
('till', 'that') 2
('never', 'occurred') 1
('to', 'wonder') 2
('wonder', 'what') 1
('Razumihin', 'would') 1
('think', 'when') 1
('knew', '.') 5
('Porfiry', 'had') 4
('had', 'very') 3
('little', 'interest') 1
('much', 'had') 1
('gone', 'since') 1
('corridor', 'they') 1
('upon', 'Luzhin') 1
('Luzhin', ';') 1
('had', 'arrived') 1
('arrived', 'punctually') 1
('punctually', 'at') 1
('three', 'went') 1
('together', 'without') 1
('without', 'greeting') 1
('greeting', 'or') 1
('or', 'looking') 2
('men', 'walked') 1
('in', 'first') 1
(',', 'lingered') 1
('lingered', 'a') 1
('Alexandrovna', 'came') 1
('came', 'forward') 1
('greet', 'him') 1
('was', 'welcoming') 1
('welcoming', 'her') 1
('Petrovitch', 'walked') 1
('quite', 'amiably') 1
('little', 'put') 1
('yet', 'recover') 1
('recover', 'himself') 1
('who', 'seemed') 2
('seemed', 'also') 1
('little', 'embarrassed') 1
('embarrassed', ',') 2
(',', 'hastened') 2
('make', 'them') 3
('all', 'sit') 1
('the', 'round') 1
('table',

('secret', 'proposals') 1
('of', 'Arkady') 1
('has', 'entrusted') 1
('entrusted', 'to') 2
('which', 'have') 2
('great', 'and') 1
('and', 'possibly') 1
('possibly', 'a') 1
('very', 'agreeable') 1
('agreeable', 'interest') 1
('Razumihin', 'could') 1
('you', 'ashamed') 1
('ashamed', 'now') 1
('white', 'with') 1
('with', 'anger') 3
('had', 'apparently') 1
('all', 'expected') 1
('expected', 'such') 2
('a', 'conclusion') 3
('conclusion', '.') 3
('had', 'too') 1
('much', 'confidence') 1
('confidence', 'in') 2
('power', 'and') 2
('the', 'helplessness') 1
('helplessness', 'of') 1
('his', 'victims') 1
('victims', '.') 1
('lips', 'quivered') 1
('quivered', '.') 1
('this', 'door') 3
('door', 'now') 1
('after', 'such') 1
('a', 'dismissal') 1
('dismissal', ',') 1
('may', 'reckon') 2
('never', 'come') 2
('.', 'Consider') 2
('Consider', 'what') 1
('are', 'doing') 2
('be', 'shaken.') 1
('shaken.', '”') 1
('What', 'insolence') 1
('insolence', '!') 1
(',', 'springing') 1
('springing', 'up') 1
('want', 'y

('one', 'book') 1
('book', 'myself') 1
('myself', 'which') 1
('go', 'well') 1
('manage', 'it') 1
('knows', 'the') 1
('talk', 'it') 3
('over', 'later') 1
('“', 'Hurrah') 1
('Hurrah', '!') 1
(',', 'stay') 4
('this', 'house') 1
(',', 'belonging') 1
('belonging', 'to') 2
('same', 'owner') 1
('owner', '.') 1
('special', 'flat') 1
('flat', 'apart') 1
('not', 'communicating') 1
('communicating', 'with') 1
('lodgings', '.') 5
('s', 'furnished') 1
('furnished', ',') 1
(',', 'rent') 1
('rent', 'moderate') 1
('moderate', ',') 1
('three', 'rooms') 1
('Suppose', 'you') 1
('ll', 'pawn') 1
('pawn', 'your') 1
('your', 'watch') 1
('watch', 'to-morrow') 1
('to-morrow', 'and') 4
('be', 'arranged') 1
('arranged', 'then') 1
('can', 'all') 2
('three', 'live') 1
('live', 'together') 1
('and', 'Rodya') 1
('Rodya', 'will') 1
('At', 'such') 1
('Dounia', 'looked') 1
('brother', 'with') 1
('with', 'incredulous') 1
('incredulous', 'wonder') 1
('wonder', '.') 1
('held', 'his') 2
('cap', 'in') 1
('was', 'preparing')

('God', 'at') 1
('of', 'malignance') 1
('malignance', ',') 1
('changed', ';') 1
('tremor', 'passed') 1
('with', 'unutterable') 1
('unutterable', 'reproach') 1
('reproach', ',') 1
('and', 'broke') 1
('into', 'bitter') 1
(',', 'bitter') 1
('bitter', 'sobs') 1
('sobs', ',') 2
(',', 'hiding') 2
('hiding', 'her') 2
('say', 'Katerina') 1
('unhinged', ';') 1
(';', 'your') 1
('own', 'mind') 1
('brief', 'silence') 1
('.', 'Five') 4
('Five', 'minutes') 5
('still', 'paced') 1
('paced', 'up') 2
('glittered', '.') 1
('his', 'two') 2
('two', 'hands') 2
('her', 'tearful') 1
('tearful', 'face') 1
('were', 'hard') 1
('feverish', 'and') 4
('and', 'piercing') 1
('were', 'twitching') 1
('twitching', '.') 2
('down', 'quickly') 1
('and', 'dropping') 2
('dropping', 'to') 1
('Sonia', 'drew') 1
('And', 'certainly') 1
('certainly', 'he') 1
('she', 'muttered') 2
('turning', 'pale') 2
('sudden', 'anguish') 2
('anguish', 'clutched') 1
('not', 'bow') 1
('I', 'bowed') 1
('the', 'suffering') 3
('said', 'wildly') 1
('

('She', 'jumped') 3
('wept', 'and') 3
('and', 'wrung') 2
('then', 'sank') 1
('sank', 'again') 1
('again', 'into') 1
('into', 'feverish') 1
('feverish', 'sleep') 1
('sleep', 'and') 1
('and', 'dreamt') 1
('dreamt', 'of') 1
('of', 'Polenka') 1
('the', 'gospel') 2
('gospel', 'and') 2
('...', 'him') 1
('with', 'pale') 1
('with', 'burning') 1
('burning', 'eyes') 1
('...', 'kissing') 1
('her', 'feet') 4
('which', 'divided') 1
('divided', 'Sonia') 1
('from', 'Madame') 1
('room', 'which') 3
('long', 'stood') 1
('stood', 'empty') 1
('A', 'card') 1
('card', 'was') 1
('was', 'fixed') 1
('notice', 'stuck') 1
('windows', 'over') 1
('canal', 'advertising') 1
('advertising', 'it') 1
('let', '.') 3
('been', 'accustomed') 1
('room', '’') 1
('being', 'uninhabited') 1
('uninhabited', '.') 1
('time', 'Mr.') 1
('empty', 'room') 1
('which', 'adjoined') 1
('adjoined', 'the') 1
('brought', 'a') 4
('noiselessly', 'carried') 1
('carried', 'it') 1
('door', 'that') 1
('that', 'led') 1
('The', 'conversation') 1
('c

('strangers', 'as') 1
('our', 'peasants') 1
('.', 'He-he') 1
('He-he', '!') 7
('surface', '.') 1
('merely', 'that') 1
('has', 'nowhere') 1
('is', '_psychologically_') 1
('_psychologically_', 'unable') 1
('expression', '!') 1
('!', 'Through') 1
('Through', 'a') 1
('nature', 'he') 1
('had', 'anywhere') 1
('anywhere', 'to') 1
('a', 'butterfly') 1
('butterfly', 'round') 1
('round', 'a') 1
('candle', '?') 2
('keep', 'circling') 2
('circling', 'and') 1
('and', 'circling') 1
('Freedom', 'will') 1
('will', 'lose') 2
('lose', 'its') 1
('its', 'attractions') 1
('attractions', '.') 1
('to', 'brood') 1
('brood', ',') 1
('ll', 'weave') 1
('weave', 'a') 1
('tangle', 'round') 1
('round', 'himself') 1
('ll', 'worry') 1
('worry', 'himself') 1
('will', 'provide') 1
('provide', 'me') 1
('a', 'mathematical') 1
('mathematical', 'proof') 1
('proof', '--') 1
('only', 'give') 1
('long', 'enough') 1
('enough', 'interval') 1
('interval', '...') 1
('ll', 'keep') 1
('getting', 'nearer') 1
('and', 'nearer') 1
('--

('and', 'alarm') 1
('alarm', 'in') 1
('was', 'peremptory') 1
('peremptory', ',') 1
(',', 'stern') 2
('stern', ',') 2
('once', 'laying') 1
('laying', 'aside') 1
('aside', 'all') 1
('all', 'mystification') 1
('mystification', '.') 1
(',', 'bewildered') 1
('suddenly', 'fell') 1
('into', 'actual') 1
('actual', 'frenzy') 1
('he', 'again') 1
('again', 'obeyed') 1
('obeyed', 'the') 1
('the', 'command') 2
('command', 'to') 1
('speak', 'quietly') 1
('perfect', 'paroxysm') 1
('paroxysm', 'of') 1
('allow', 'myself') 1
('be', 'tortured') 2
('tortured', ',') 1
('instantly', 'recognising') 1
('recognising', 'with') 1
('hatred', 'that') 1
('help', 'obeying') 1
('obeying', 'the') 1
('command', 'and') 1
('and', 'driven') 2
('to', 'even') 1
('even', 'greater') 1
('greater', 'fury') 1
('fury', 'by') 1
('“', 'Arrest') 1
('Arrest', 'me') 1
(',', 'search') 3
('search', 'me') 1
('kindly', 'act') 1
('form', 'and') 1
('Porfiry', 'interrupted') 1
('same', 'sly') 1
('gloating', 'with') 1
('with', 'enjoyment') 1


('of', 'wounded') 1
('wounded', 'vanity') 1
('vanity', 'had') 1
('been', 'gnawing') 1
('gnawing', 'at') 1
('heart', 'all') 1
('Petrovitch', 'immediately') 1
('immediately', 'looked') 1
('looking-glass', '.') 1
('had', 'jaundice') 1
('jaundice', '.') 1
('However', 'his') 1
('health', 'seemed') 1
('seemed', 'unimpaired') 1
('unimpaired', 'so') 1
('far', ',') 1
('his', 'noble') 1
('noble', ',') 2
(',', 'clear-skinned') 1
('clear-skinned', 'countenance') 1
('countenance', 'which') 1
('grown', 'fattish') 1
('fattish', 'of') 1
('instant', 'was') 1
('positively', 'comforted') 1
('comforted', 'in') 1
('find', 'another') 1
('another', 'bride') 1
('bride', 'and') 1
('a', 'better') 1
('better', 'one') 1
('But', 'coming') 2
('coming', 'back') 2
('the', 'sense') 1
('present', 'position') 1
('turned', 'aside') 1
('and', 'spat') 1
('spat', 'vigorously') 1
('vigorously', ',') 1
('which', 'excited') 1
('excited', 'a') 1
('a', 'sarcastic') 1
('sarcastic', 'smile') 1
('smile', 'in') 1
('in', 'Andrey') 1


('is', 'protest') 1
('.', 'Varents') 1
('Varents', 'had') 1
('been', 'married') 1
('married', 'seven') 1
('she', 'abandoned') 1
('abandoned', 'her') 2
('her', 'two') 1
('two', 'children') 2
('husband', 'straight') 1
('straight', 'out') 4
('have', 'realised') 2
('happy', 'with') 1
('deceived', 'me') 1
('by', 'concealing') 1
('concealing', 'from') 1
('another', 'organisation') 1
('organisation', 'of') 2
('society', 'by') 1
('the', 'communities') 1
('communities', '.') 1
('only', 'lately') 1
('lately', 'learned') 1
('learned', 'it') 1
('a', 'great-hearted') 1
('great-hearted', 'man') 1
('given', 'myself') 1
('am', 'establishing') 1
('establishing', 'a') 1
('a', 'community') 4
('community', '.') 3
('plainly', 'because') 1
('it', 'dishonest') 1
('dishonest', 'to') 1
('deceive', 'you') 2
('Do', 'as') 1
('Do', 'not') 4
('hope', 'you') 2
('be', 'happy.') 1
('happy.', '’') 1
('how', 'letters') 1
('letters', 'like') 1
('that', 'ought') 2
('be', 'written') 1
('written', '!') 1
('that', 'Terebyeva

('looking', 'carefully') 2
('carefully', 'at') 1
('not', 'nonsense') 1
('suffered', 'distress') 1
('distress', 'and') 1
('and', 'annoyance') 2
('annoyance', 'as') 1
('did', 'yesterday') 2
('who', 'yet') 1
('yet', 'can') 1
('can', 'sympathise') 1
('...', 'even') 1
('social', 'mistake') 1
('mistake', '--') 1
('still', 'deserving') 1
('respect', '!') 1
('it', 'indeed') 2
('as', 'according') 1
('your', 'ideas') 2
('a', 'drawback') 1
('drawback', 'your') 1
('How', 'distressed') 1
('are', 'for') 3
('instance', 'by') 1
('the', 'simple-hearted') 1
('simple-hearted', 'Lebeziatnikov') 1
('a', 'return') 1
('return', 'of') 1
('of', 'affection') 1
('affection', 'for') 2
('for', 'Pyotr') 1
('with', 'marriage') 1
('with', '_legal_') 1
('_legal_', 'marriage') 1
(',', 'noble') 1
('noble', 'Pyotr') 1
('you', 'cling') 1
('cling', 'to') 2
('this', '_legality_') 1
('_legality_', 'of') 1
('positively', 'glad') 2
('glad', 'it') 1
('are', 'free') 1
('free', ',') 1
('quite', 'lost') 1
('for', 'humanity') 1
('h

('cap', 'for') 1
(')', 'Have') 1
('wants', 'everyone') 1
('everyone', 'to') 3
('is', 'patronising') 1
('patronising', 'me') 1
('doing', 'me') 1
('an', 'honour') 3
('honour', 'by') 1
('by', 'being') 2
('sensible', 'woman') 4
('woman', 'to') 1
('invite', 'people') 1
('especially', 'those') 1
('who', 'knew') 1
('my', 'late') 4
('the', 'set') 1
('of', 'fools') 2
('fools', 'she') 1
('brought', '!') 1
('The', 'sweeps') 1
('sweeps', '!') 1
('one', 'with') 1
('the', 'spotty') 1
('And', 'those') 1
('those', 'wretched') 1
('wretched', 'Poles') 1
('Poles', ',') 1
(')', 'Not') 1
('Not', 'one') 1
('them', 'has') 1
('ever', 'poked') 1
('poked', 'his') 1
('nose', 'in') 1
('ve', 'never') 4
('never', 'set') 2
('set', 'eyes') 2
('There', 'they') 2
('a', 'row') 1
('row', '.') 1
('.', 'Hey') 2
(',', '_pan_') 1
('_pan_', '!') 1
('“', 'have') 1
('you', 'tasted') 1
('tasted', 'the') 1
('the', 'pancakes') 2
('pancakes', '?') 1
('?', 'Take') 1
('Take', 'some') 1
('!', 'Won') 1
('some', 'vodka') 1
('s', 'jumped

('father', '--') 1
('probably', 'some') 1
('some', 'Finnish') 1
('Finnish', 'milkman') 1
('milkman', ',') 1
('that', 'probably') 1
('probably', 'she') 1
('father', 'at') 1
('still', 'uncertain') 1
('whether', 'her') 1
('was', 'Amalia') 2
('Ivanovna', 'or') 1
('or', 'Amalia') 1
('Ludwigovna', '.') 1
(',', 'lashed') 1
('lashed', 'to') 1
('and', 'shrieked') 1
('shrieked', 'that') 1
('not', 'Ludwigovna') 1
('her', '_Vater_') 1
('_Vater_', 'was') 2
('was', 'named') 1
('named', 'Johann') 1
('Johann', 'and') 1
('a', 'burgomeister') 1
('burgomeister', ',') 1
('s', '_Vater_') 1
('quite', 'never') 1
('never', 'a') 1
('a', 'burgomeister.') 1
('burgomeister.', '”') 1
('Ivanovna', 'rose') 1
('a', 'stern') 1
('calm', 'voice') 1
('voice', '(') 1
('chest', 'was') 1
('was', 'heaving') 1
('heaving', ')') 1
(')', 'observed') 1
('she', 'dared') 1
('dared', 'for') 1
('her', 'contemptible') 1
('wretch', 'of') 4
('her', 'papa') 1
('papa', ',') 1
('would', 'tear') 1
('tear', 'her') 1
('her', 'cap') 1
('cap', 

KeyboardInterrupt: 

* Trigrami

In [308]:
trigrams = nltk.trigrams(tokens)
trigrams_freq = nltk.FreqDist(trigrams)

for c, d in trigrams_freq.items():
    print(c, d)

('\ufeffThe', 'Project', 'Gutenberg') 1
('Project', 'Gutenberg', 'EBook') 1
('Gutenberg', 'EBook', 'of') 1
('EBook', 'of', 'Crime') 1
('of', 'Crime', 'and') 1
('Crime', 'and', 'Punishment') 3
('and', 'Punishment', ',') 2
('Punishment', ',', 'by') 2
(',', 'by', 'Fyodor') 2
('by', 'Fyodor', 'Dostoevsky') 2
('Fyodor', 'Dostoevsky', 'This') 1
('Dostoevsky', 'This', 'eBook') 1
('This', 'eBook', 'is') 2
('eBook', 'is', 'for') 2
('is', 'for', 'the') 3
('for', 'the', 'use') 3
('the', 'use', 'of') 12
('use', 'of', 'anyone') 2
('of', 'anyone', 'anywhere') 2
('anyone', 'anywhere', 'at') 2
('anywhere', 'at', 'no') 2
('at', 'no', 'cost') 2
('no', 'cost', 'and') 2
('cost', 'and', 'with') 2
('and', 'with', 'almost') 2
('with', 'almost', 'no') 2
('almost', 'no', 'restrictions') 2
('no', 'restrictions', 'whatsoever') 2
('restrictions', 'whatsoever', '.') 2
('whatsoever', '.', 'You') 2
('.', 'You', 'may') 15
('You', 'may', 'copy') 2
('may', 'copy', 'it') 2
('copy', 'it', ',') 2
('it', ',', 'give') 2
(',

('and', 'through', 'it') 1
('through', 'it', 'he') 1
('it', 'he', 'became') 1
('he', 'became', 'great.') 1
('became', 'great.', '”') 1
('great.', '”', 'CRIME') 1
('”', 'CRIME', 'AND') 1
('AND', 'PUNISHMENT', 'PART') 1
('PUNISHMENT', 'PART', 'I') 1
('PART', 'I', 'CHAPTER') 1
('I', 'CHAPTER', 'I') 1
('CHAPTER', 'I', 'On') 1
('I', 'On', 'an') 1
('On', 'an', 'exceptionally') 1
('an', 'exceptionally', 'hot') 1
('exceptionally', 'hot', 'evening') 1
('hot', 'evening', 'early') 1
('evening', 'early', 'in') 1
('early', 'in', 'July') 1
('in', 'July', 'a') 1
('July', 'a', 'young') 1
('a', 'young', 'man') 6
('young', 'man', 'came') 1
('man', 'came', 'out') 1
('came', 'out', 'of') 2
('out', 'of', 'the') 69
('of', 'the', 'garret') 1
('the', 'garret', 'in') 1
('garret', 'in', 'which') 1
('in', 'which', 'he') 4
('which', 'he', 'lodged') 1
('he', 'lodged', 'in') 1
('lodged', 'in', 'S.') 1
('in', 'S.', 'Place') 1
('S.', 'Place', 'and') 1
('Place', 'and', 'walked') 1
('and', 'walked', 'slowly') 1
('walke

('a', 'tangle', 'and') 1
('tangle', 'and', 'that') 1
('and', 'that', 'he') 15
('that', 'he', 'was') 66
('he', 'was', 'very') 8
('was', 'very', 'weak') 1
('very', 'weak', ';') 1
('weak', ';', 'for') 1
(';', 'for', 'two') 1
('for', 'two', 'days') 2
('two', 'days', 'he') 1
('days', 'he', 'had') 1
('he', 'had', 'scarcely') 1
('had', 'scarcely', 'tasted') 1
('scarcely', 'tasted', 'food') 1
('tasted', 'food', '.') 1
('food', '.', 'He') 2
('He', 'was', 'so') 3
('was', 'so', 'badly') 1
('so', 'badly', 'dressed') 1
('badly', 'dressed', 'that') 1
('dressed', 'that', 'even') 1
('that', 'even', 'a') 2
('even', 'a', 'man') 1
('a', 'man', 'accustomed') 1
('man', 'accustomed', 'to') 1
('accustomed', 'to', 'shabbiness') 1
('to', 'shabbiness', 'would') 1
('shabbiness', 'would', 'have') 1
('would', 'have', 'been') 28
('have', 'been', 'ashamed') 1
('been', 'ashamed', 'to') 1
('ashamed', 'to', 'be') 2
('to', 'be', 'seen') 5
('be', 'seen', 'in') 3
('seen', 'in', 'the') 4
('the', 'street', 'in') 3
('street'

('on', 'the', 'landing') 3
('the', 'landing', ',') 2
('landing', ',', 'she') 1
(',', 'she', 'grew') 1
('she', 'grew', 'bolder') 1
('grew', 'bolder', ',') 1
('bolder', ',', 'and') 1
(',', 'and', 'opened') 1
('and', 'opened', 'the') 1
('opened', 'the', 'door') 14
('the', 'door', 'wide') 2
('door', 'wide', '.') 1
('wide', '.', 'The') 1
('young', 'man', 'stepped') 1
('man', 'stepped', 'into') 1
('stepped', 'into', 'the') 3
('into', 'the', 'dark') 1
('the', 'dark', 'entry') 1
('dark', 'entry', ',') 1
('entry', ',', 'which') 1
(',', 'which', 'was') 11
('which', 'was', 'partitioned') 1
('was', 'partitioned', 'off') 1
('partitioned', 'off', 'from') 1
('off', 'from', 'the') 1
('from', 'the', 'tiny') 1
('the', 'tiny', 'kitchen') 1
('tiny', 'kitchen', '.') 1
('kitchen', '.', 'The') 1
('.', 'The', 'old') 11
('The', 'old', 'woman') 16
('old', 'woman', 'stood') 1
('woman', 'stood', 'facing') 1
('stood', 'facing', 'him') 4
('facing', 'him', 'in') 2
('him', 'in', 'silence') 4
('in', 'silence', 'and') 

('...', 'but', 'how') 3
('but', 'how', 'degrading') 1
('how', 'degrading', 'it') 1
('degrading', 'it', 'all') 1
('it', 'all', 'is.') 1
('all', 'is.', '”') 1
('is.', '”', 'The') 1
('”', 'The', 'old') 3
('old', 'woman', 'came') 1
('woman', 'came', 'back') 1
('came', 'back', '.') 2
('back', '.', '“') 9
('.', '“', 'Here') 15
('“', 'Here', ',') 4
('Here', ',', 'sir') 1
(',', 'sir', ':') 1
('sir', ':', 'as') 1
(':', 'as', 'we') 1
('as', 'we', 'say') 1
('we', 'say', 'ten') 1
('say', 'ten', 'copecks') 1
('ten', 'copecks', 'the') 1
('copecks', 'the', 'rouble') 1
('the', 'rouble', 'a') 1
('rouble', 'a', 'month') 1
('a', 'month', ',') 2
('month', ',', 'so') 2
(',', 'so', 'I') 11
('so', 'I', 'must') 2
('I', 'must', 'take') 1
('must', 'take', 'fifteen') 1
('take', 'fifteen', 'copecks') 1
('fifteen', 'copecks', 'from') 1
('copecks', 'from', 'a') 1
('from', 'a', 'rouble') 1
('a', 'half', 'for') 3
('half', 'for', 'the') 2
('for', 'the', 'month') 1
('the', 'month', 'in') 1
('month', 'in', 'advance') 1


('avoided', 'society', 'of') 1
('society', 'of', 'every') 1
('of', 'every', 'sort') 1
('every', 'sort', ',') 1
('sort', ',', 'more') 1
(',', 'more', 'especially') 1
('more', 'especially', 'of') 1
('especially', 'of', 'late') 1
('of', 'late', '.') 8
('late', '.', 'But') 2
('.', 'But', 'now') 10
('But', 'now', 'all') 1
('now', 'all', 'at') 1
('all', 'at', 'once') 19
('at', 'once', 'he') 12
('he', 'felt', 'a') 6
('felt', 'a', 'desire') 1
('a', 'desire', 'to') 2
('desire', 'to', 'be') 1
('to', 'be', 'with') 4
('be', 'with', 'other') 1
('with', 'other', 'people') 1
('other', 'people', '.') 1
('people', '.', 'Something') 1
('.', 'Something', 'new') 1
('Something', 'new', 'seemed') 1
('new', 'seemed', 'to') 1
('seemed', 'to', 'be') 29
('to', 'be', 'taking') 1
('be', 'taking', 'place') 1
('taking', 'place', 'within') 1
('place', 'within', 'him') 1
('within', 'him', ',') 2
('and', 'with', 'it') 1
('with', 'it', 'he') 1
('it', 'he', 'felt') 1
('felt', 'a', 'sort') 1
('a', 'sort', 'of') 37
('sort

('and', 'my', 'wife') 1
('my', 'wife', 'is') 1
('wife', 'is', 'a') 1
('is', 'a', 'very') 6
('a', 'very', 'different') 4
('very', 'different', 'matter') 2
('different', 'matter', 'from') 1
('matter', 'from', 'me') 1
('from', 'me', '!') 2
('me', '!', 'Do') 1
('!', 'Do', 'you') 19
('Do', 'you', 'understand') 15
('you', 'understand', '?') 15
('understand', '?', 'Allow') 1
('?', 'Allow', 'me') 2
('Allow', 'me', 'to') 7
('me', 'to', 'ask') 4
('to', 'ask', 'you') 11
('ask', 'you', 'another') 1
('you', 'another', 'question') 1
('another', 'question', 'out') 1
('question', 'out', 'of') 1
('out', 'of', 'simple') 1
('of', 'simple', 'curiosity') 1
('simple', 'curiosity', ':') 1
('curiosity', ':', 'have') 1
(':', 'have', 'you') 1
('have', 'you', 'ever') 2
('you', 'ever', 'spent') 1
('ever', 'spent', 'a') 1
('spent', 'a', 'night') 1
('a', 'night', 'on') 1
('night', 'on', 'a') 1
('on', 'a', 'hay') 1
('a', 'hay', 'barge') 1
('hay', 'barge', ',') 2
('barge', ',', 'on') 1
('on', 'the', 'Neva') 1
('the',

('you', 'go', '?') 4
('go', '?', '”') 5
('?', '”', 'put') 1
('”', 'put', 'in') 9
('put', 'in', 'Raskolnikov') 1
('in', 'Raskolnikov', '.') 1
('Well', ',', 'when') 1
(',', 'when', 'one') 1
('when', 'one', 'has') 2
('one', 'has', 'no') 1
('has', 'no', 'one') 1
('no', 'one', ',') 6
('one', ',', 'nowhere') 1
(',', 'nowhere', 'else') 1
('nowhere', 'else', 'one') 1
('else', 'one', 'can') 1
('one', 'can', 'go') 1
('can', 'go', '!') 1
('go', '!', 'For') 1
('!', 'For', 'every') 1
('For', 'every', 'man') 1
('every', 'man', 'must') 2
('must', 'have', 'somewhere') 2
('have', 'somewhere', 'to') 2
('somewhere', 'to', 'go') 1
('to', 'go', '.') 8
('go', '.', 'Since') 1
('.', 'Since', 'there') 1
('Since', 'there', 'are') 1
('there', 'are', 'times') 1
('are', 'times', 'when') 1
('times', 'when', 'one') 1
('when', 'one', 'absolutely') 1
('one', 'absolutely', 'must') 1
('absolutely', 'must', 'go') 1
('must', 'go', 'somewhere') 1
('go', 'somewhere', '!') 1
('somewhere', '!', 'When') 1
('!', 'When', 'my') 1

('ran', 'away', 'with') 1
('away', 'with', 'him') 1
('with', 'him', 'from') 1
('him', 'from', 'her') 1
('from', 'her', 'father') 1
('her', 'father', '’') 3
('’', 's', 'house') 17
('s', 'house', '.') 3
('house', '.', 'She') 1
('She', 'was', 'exceedingly') 1
('was', 'exceedingly', 'fond') 1
('exceedingly', 'fond', 'of') 1
('fond', 'of', 'her') 1
('of', 'her', 'husband') 2
('her', 'husband', ';') 2
('husband', ';', 'but') 1
(';', 'but', 'he') 12
('but', 'he', 'gave') 1
('he', 'gave', 'way') 3
('gave', 'way', 'to') 2
('way', 'to', 'cards') 1
('to', 'cards', ',') 1
('cards', ',', 'got') 1
(',', 'got', 'into') 1
('got', 'into', 'trouble') 1
('into', 'trouble', 'and') 1
('trouble', 'and', 'with') 1
('and', 'with', 'that') 1
('with', 'that', 'he') 1
('that', 'he', 'died') 1
('he', 'died', '.') 2
('.', 'He', 'used') 3
('He', 'used', 'to') 3
('used', 'to', 'beat') 3
('to', 'beat', 'her') 3
('beat', 'her', 'at') 1
('her', 'at', 'the') 5
('at', 'the', 'end') 8
('the', 'end', ':') 1
('end', ':', 'a

('children', ';', 'and') 1
(';', 'and', 'it') 3
('and', 'it', 'was') 14
('it', 'was', 'said') 2
('was', 'said', 'more') 1
('said', 'more', 'to') 1
('more', 'to', 'wound') 1
('to', 'wound', 'her') 1
('wound', 'her', 'than') 1
('her', 'than', 'anything') 1
('than', 'anything', 'else') 1
('anything', 'else', '...') 1
('else', '...', '.') 2
('...', '.', 'For') 3
('.', 'For', 'that') 2
('For', 'that', '’') 1
('’', 's', 'Katerina') 1
('s', 'Katerina', 'Ivanovna') 1
('Katerina', 'Ivanovna', '’') 28
('Ivanovna', '’', 's') 35
('’', 's', 'character') 6
('s', 'character', ',') 2
('character', ',', 'and') 1
(',', 'and', 'when') 11
('and', 'when', 'children') 1
('when', 'children', 'cry') 1
('children', 'cry', ',') 1
('cry', ',', 'even') 1
(',', 'even', 'from') 1
('even', 'from', 'hunger') 1
('from', 'hunger', ',') 2
('hunger', ',', 'she') 1
(',', 'she', 'falls') 1
('she', 'falls', 'to') 1
('falls', 'to', 'beating') 1
('to', 'beating', 'them') 1
('beating', 'them', 'at') 1
('them', 'at', 'once') 2


('heard', 'in', 'the') 2
('in', 'the', 'entry') 3
('the', 'entry', '.') 3
('entry', '.', 'The') 1
('.', 'The', 'room') 6
('The', 'room', 'was') 5
('room', 'was', 'filled') 1
('was', 'filled', 'with') 2
('filled', 'with', 'noise') 1
('with', 'noise', '.') 1
('noise', '.', 'The') 1
('.', 'The', 'tavern-keeper') 1
('The', 'tavern-keeper', 'and') 1
('tavern-keeper', 'and', 'the') 1
('and', 'the', 'boys') 1
('the', 'boys', 'were') 1
('boys', 'were', 'busy') 1
('were', 'busy', 'with') 1
('busy', 'with', 'the') 2
('with', 'the', 'new-comers') 1
('the', 'new-comers', '.') 1
('new-comers', '.', 'Marmeladov') 1
('.', 'Marmeladov', 'paying') 1
('Marmeladov', 'paying', 'no') 1
('paying', 'no', 'attention') 1
('no', 'attention', 'to') 1
('attention', 'to', 'the') 1
('to', 'the', 'new') 1
('the', 'new', 'arrivals') 1
('new', 'arrivals', 'continued') 1
('arrivals', 'continued', 'his') 1
('continued', 'his', 'story') 1
('his', 'story', '.') 1
('story', '.', 'He') 1
('.', 'He', 'appeared') 2
('He', 'ap

('on', 'the', 'Egyptian') 1
('the', 'Egyptian', 'bridge') 1
('Egyptian', 'bridge', '.') 1
('bridge', '.', 'I') 1
('.', 'I', 'exchanged') 1
('I', 'exchanged', 'it') 1
('exchanged', 'it', 'for') 1
('it', 'for', 'the') 1
('for', 'the', 'garments') 1
('the', 'garments', 'I') 1
('garments', 'I', 'have') 1
('I', 'have', 'on') 1
('have', 'on', '...') 1
('on', '...', 'and') 1
('...', 'and', 'it') 3
('end', 'of', 'everything') 2
('of', 'everything', '!') 2
('everything', '!', '”') 6
('!', '”', 'Marmeladov') 1
('”', 'Marmeladov', 'struck') 1
('struck', 'his', 'forehead') 1
('with', 'his', 'fist') 2
('his', 'fist', ',') 2
('fist', ',', 'clenched') 1
(',', 'clenched', 'his') 1
('clenched', 'his', 'teeth') 1
('his', 'teeth', ',') 3
('teeth', ',', 'closed') 1
(',', 'closed', 'his') 1
('closed', 'his', 'eyes') 2
('his', 'eyes', 'and') 12
('eyes', 'and', 'leaned') 1
('and', 'leaned', 'heavily') 2
('leaned', 'heavily', 'with') 1
('heavily', 'with', 'his') 1
('with', 'his', 'elbow') 2
('his', 'elbow', '

('go', 'and', 'he') 1
('he', 'had', 'meant') 1
('had', 'meant', 'to') 1
('meant', 'to', 'help') 1
('to', 'help', 'him') 1
('help', 'him', '.') 1
('him', '.', 'Marmeladov') 3
('.', 'Marmeladov', 'was') 2
('Marmeladov', 'was', 'much') 1
('was', 'much', 'unsteadier') 1
('much', 'unsteadier', 'on') 1
('unsteadier', 'on', 'his') 1
('on', 'his', 'legs') 2
('his', 'legs', 'than') 1
('legs', 'than', 'in') 1
('than', 'in', 'his') 1
('in', 'his', 'speech') 1
('his', 'speech', 'and') 1
('speech', 'and', 'leaned') 1
('leaned', 'heavily', 'on') 1
('heavily', 'on', 'the') 2
('on', 'the', 'young') 1
('man', '.', 'They') 2
('.', 'They', 'had') 10
('They', 'had', 'two') 1
('or', 'three', 'hundred') 1
('three', 'hundred', 'paces') 1
('hundred', 'paces', 'to') 1
('paces', 'to', 'go') 1
('go', '.', 'The') 1
('.', 'The', 'drunken') 1
('The', 'drunken', 'man') 1
('drunken', 'man', 'was') 1
('man', 'was', 'more') 1
('was', 'more', 'and') 1
('and', 'more', 'overcome') 1
('more', 'overcome', 'by') 1
('overcome

('and', 'in', 'a') 5
('in', 'a', 'fury') 3
('a', 'fury', 'she') 1
('fury', 'she', 'seized') 1
('she', 'seized', 'him') 1
('seized', 'him', 'by') 5
('him', 'by', 'the') 11
('by', 'the', 'hair') 2
('the', 'hair', 'and') 2
('hair', 'and', 'dragged') 1
('and', 'dragged', 'him') 2
('dragged', 'him', 'into') 1
('him', 'into', 'the') 6
('room', '.', 'Marmeladov') 1
('.', 'Marmeladov', 'seconded') 1
('Marmeladov', 'seconded', 'her') 1
('seconded', 'her', 'efforts') 1
('her', 'efforts', 'by') 1
('efforts', 'by', 'meekly') 1
('by', 'meekly', 'crawling') 1
('meekly', 'crawling', 'along') 1
('crawling', 'along', 'on') 1
('along', 'on', 'his') 1
('his', 'knees', '.') 2
('knees', '.', '“') 1
('“', 'And', 'this') 2
('And', 'this', 'is') 2
('this', 'is', 'a') 2
('is', 'a', 'consolation') 1
('a', 'consolation', 'to') 2
('consolation', 'to', 'me') 1
('me', '!', 'This') 1
('!', 'This', 'does') 1
('This', 'does', 'not') 1
('does', 'not', 'hurt') 1
('not', 'hurt', 'me') 1
('hurt', 'me', ',') 1
(',', 'but',

('s', 'past', 'nine') 1
('past', 'nine', ',') 1
('nine', ',', 'I') 1
('I', 'have', 'brought') 2
('have', 'brought', 'you') 1
('brought', 'you', 'some') 1
('you', 'some', 'tea') 2
('some', 'tea', ';') 1
('tea', ';', 'will') 1
(';', 'will', 'you') 1
('will', 'you', 'have') 2
('you', 'have', 'a') 3
('have', 'a', 'cup') 2
('a', 'cup', '?') 1
('cup', '?', 'I') 1
('?', 'I', 'should') 3
('I', 'should', 'think') 9
('should', 'think', 'you') 1
('think', 'you', '’') 1
('you', '’', 're') 34
('’', 're', 'fairly') 1
('re', 'fairly', 'starving') 1
('fairly', 'starving', '?') 1
('starving', '?', '”') 1
('?', '”', 'Raskolnikov') 64
('”', 'Raskolnikov', 'opened') 1
('Raskolnikov', 'opened', 'his') 2
('opened', 'his', 'eyes') 5
('his', 'eyes', ',') 9
('eyes', ',', 'started') 1
(',', 'started', 'and') 1
('started', 'and', 'recognised') 1
('and', 'recognised', 'Nastasya') 1
('recognised', 'Nastasya', '.') 1
('Nastasya', '.', '“') 6
('.', '“', 'From') 7
('“', 'From', 'the') 3
('From', 'the', 'landlady') 2


('send', 'you', 'anything') 1
('you', 'anything', 'all') 1
('anything', 'all', 'this') 1
('all', 'this', 'time') 8
('this', 'time', '.') 3
('time', '.', 'But') 2
('But', 'now', ',') 3
('now', ',', 'thank') 1
(',', 'thank', 'God') 5
('thank', 'God', ',') 3
('God', ',', 'I') 1
(',', 'I', 'believe') 7
('I', 'believe', 'I') 9
('believe', 'I', 'shall') 1
('shall', 'be', 'able') 2
('be', 'able', 'to') 15
('able', 'to', 'send') 3
('send', 'you', 'something') 1
('you', 'something', 'more') 1
('something', 'more', 'and') 1
('more', 'and', 'in') 1
('and', 'in', 'fact') 1
('in', 'fact', 'we') 1
('fact', 'we', 'may') 1
('we', 'may', 'congratulate') 1
('may', 'congratulate', 'ourselves') 1
('congratulate', 'ourselves', 'on') 1
('ourselves', 'on', 'our') 1
('on', 'our', 'good') 1
('our', 'good', 'fortune') 1
('good', 'fortune', 'now') 1
('fortune', 'now', ',') 1
('now', ',', 'of') 2
('which', 'I', 'hasten') 1
('I', 'hasten', 'to') 2
('hasten', 'to', 'inform') 1
('to', 'inform', 'you') 4
('inform', '

('looks', ',', 'whispers') 1
(',', 'whispers', ',') 1
('whispers', ',', 'and') 1
('and', 'even', 'remarks') 1
('even', 'remarks', 'made') 1
('remarks', 'made', 'aloud') 1
('made', 'aloud', 'about') 1
('aloud', 'about', 'us') 1
('about', 'us', '.') 4
('us', '.', 'All') 1
('.', 'All', 'our') 1
('All', 'our', 'acquaintances') 1
('our', 'acquaintances', 'avoided') 1
('acquaintances', 'avoided', 'us') 1
('avoided', 'us', ',') 1
('us', ',', 'nobody') 1
(',', 'nobody', 'even') 1
('nobody', 'even', 'bowed') 1
('even', 'bowed', 'to') 1
('bowed', 'to', 'us') 1
('to', 'us', 'in') 1
('us', 'in', 'the') 2
('and', 'I', 'learnt') 1
('I', 'learnt', 'that') 3
('learnt', 'that', 'some') 1
('that', 'some', 'shopmen') 1
('some', 'shopmen', 'and') 1
('shopmen', 'and', 'clerks') 1
('and', 'clerks', 'were') 1
('clerks', 'were', 'intending') 1
('were', 'intending', 'to') 1
('intending', 'to', 'insult') 1
('to', 'insult', 'us') 1
('insult', 'us', 'in') 1
('us', 'in', 'a') 1
('in', 'a', 'shameful') 1
('a', 'sha

(',', 'he', 'has') 12
('he', 'has', 'two') 1
('has', 'two', 'posts') 1
('two', 'posts', 'in') 1
('posts', 'in', 'the') 1
('in', 'the', 'government') 2
('the', 'government', 'and') 1
('government', 'and', 'has') 1
('and', 'has', 'already') 3
('has', 'already', 'made') 2
('already', 'made', 'his') 1
('made', 'his', 'fortune') 2
('his', 'fortune', '.') 1
('fortune', '.', 'It') 1
('.', 'It', 'is') 19
('It', 'is', 'true') 4
('is', 'true', 'that') 3
('true', 'that', 'he') 1
('that', 'he', 'is') 8
('he', 'is', 'forty-five') 1
('is', 'forty-five', 'years') 1
('forty-five', 'years', 'old') 1
('old', ',', 'but') 1
('but', 'he', 'is') 3
('he', 'is', 'of') 2
('is', 'of', 'a') 1
('of', 'a', 'fairly') 1
('a', 'fairly', 'prepossessing') 1
('fairly', 'prepossessing', 'appearance') 1
('prepossessing', 'appearance', 'and') 1
('appearance', 'and', 'might') 1
('and', 'might', 'still') 1
('might', 'still', 'be') 1
('still', 'be', 'thought') 1
('be', 'thought', 'attractive') 1
('thought', 'attractive', 'by'

('are', 'a', 'student') 2
('a', 'student', 'of') 1
('student', 'of', 'law') 1
('of', 'law', '.') 1
('law', '.', 'I') 1
('.', 'I', 'am') 56
('I', 'am', 'in') 10
('am', 'in', 'complete') 1
('in', 'complete', 'agreement') 1
('complete', 'agreement', 'with') 1
('agreement', 'with', 'her') 1
('her', ',', 'Rodya') 1
(',', 'Rodya', ',') 45
('Rodya', ',', 'and') 4
(',', 'and', 'share') 1
('and', 'share', 'all') 1
('share', 'all', 'her') 1
('all', 'her', 'plans') 2
('her', 'plans', 'and') 1
('plans', 'and', 'hopes') 1
('and', 'hopes', ',') 1
('hopes', ',', 'and') 1
(',', 'and', 'think') 1
('and', 'think', 'there') 1
('think', 'there', 'is') 2
('there', 'is', 'every') 1
('is', 'every', 'probability') 1
('every', 'probability', 'of') 1
('probability', 'of', 'realising') 1
('of', 'realising', 'them') 1
('realising', 'them', '.') 1
('them', '.', 'And') 8
('.', 'And', 'in') 10
('And', 'in', 'spite') 1
('spite', 'of', 'Pyotr') 1
('of', 'Pyotr', 'Petrovitch') 9
('’', 's', 'evasiveness') 1
('s', 'evasi

KeyboardInterrupt: 

#### Generisanje teksta od 100 reči na osnovu frekvencija, tj. verovatnoća pojavljivanja unigrama, bigrama i trigrama

Za generisanje tekstova primenjen je model _skrivenih Markovljevih lanaca_ (Hidden Markov Models), na osnovu frekvencija:

* Unigrama

In [319]:
import random
import string

try:
    maketrans = ''.maketrans
except AttributeError:
    # Alternativa za Python 2 (da ne bi prijavljivao grešku "ImportError: cannot import name 'maketrans'")
    from string import maketrans

def tekst_u_listu(datoteka):
    """Pretvara čist tekst datoteke u listu reči, 
    pri čemu su uklonjeni interpunkcijski znaci
    i sve reči svedene na mala slova.
    
    Argument:
        datoteka - Niska koja sadrži putanju datoteke.
    Vraća:
        Vraća listu reči iz datoteke.
    """
    ucitavanje = open(datoteka, 'r')
    text = ucitavanje.read().lower()
    text = text.translate(
        maketrans(string.punctuation,
                         " " * len(string.punctuation)))
    return text.split()

def nasumicni_odabir(raspodela):
    """
    Bira element iz raspodele verovatnoća predstavljene datim rečnikom,
    npr. >>> nasumicni_odabir({'a':.9, 'b':.1})
    'a'
    """
    
    # Vrednost zbira raspodele verovatnoća je blizu 1. 
    assert abs(sum(raspodela.values()) - 1.0) < .000001, \
        "Vrednost zbira raspodele verovatnoća nije blizu 1."

    r = random.random()
    ukupno = 0
    for element in raspodela:
        ukupno += raspodela[element]
        if r < ukupno:
            return element

    assert False, "Greška u odabiru!"
    
def frekvencije_u_verovatnoce(frekvencije):
    """ Pretvara rečnik frekvencija u verovatnoće.

    Argument:
       frekvencije - rečnik mapira elemente na cele brojeve 

    Vraća:
       Novi rečnik gde se svaka frekvencija deli sa zbirom svih elemenata u argumentu frekvencije.

    Npr:

    >>> frekvencije_u_verovatnoce({'a':9, 'b':1})
    {'a': 0.9, 'b': 0.1}

    """
    verovatnoce = {}
    ukupno = 0
    for element in frekvencije:
        ukupno += frekvencije[element]
    for element in frekvencije:
        verovatnoce[element] = frekvencije[element] / float(ukupno)
    return verovatnoce

def racunanje_unigrama(word_list):
    """ Računa raspodelu verovatnoća nad pojedinačnim rečima.

    Argumenti:
       lista_reci - lista reči koja odgovara rečima u dokumentu.
                   Reči su svedene na mala slova, i iz teksta su uklonjeni
                   interpunkcijski znaci.
                   
    Vraća:
       Rečnik mapira reči na verovatnoće.

    Npr:

    >>> u = racunanje_unigrama(['i', 'think', 'therefore', 'i', 'am'])
    >>> print(u)
    {'i': 0.4, 'am': 0.2, 'think': 0.2, 'therefore': 0.2}

    """
    unigrami = {}
    for rec in lista_reci:
        if rec in unigrami:
            unigrami[rec] += 1
        else:
            unigrami[rec] = 1
    return frekvencije_u_verovatnoce(unigrami)

def nasumicni_tekst_unigrama(unigrami, broj_reci):
    """Generiše nasumični niz prema datim verovatnoćama. 

    Argumenti:
       unigrami -   Raspodela verovatnoća nad rečima koje je vratila 
                    funkcija racunanje_unigrama funkcija.
       broj_reci -  Broj reči od kojih će biti generisan tekst. 

    Vraća:
       Nasumičnu nisku reči, pri čemu je svaka naredna reč razdvojena razmakom. 

    Npr:

    >>> u = racunanje_unigrama(['i', 'think', 'therefore', 'i', 'am'])
    >>> nasumicni_tekst_unigrama(u, 5)
    'think i therefore i i'

    """
    rezultat = ""
    for i in range(broj_reci):
        sledeca_rec = nasumicni_odabir(unigrami)
        rezultat += sledeca_rec + " "
    return rezultat.rstrip()

def unigrami_main():
    """ Generiše tekst na osnovu Gutenbergovog korpusa unigrama."""
    reci = tekst_u_listu('gutenberg.txt')
    unigrami = racunanje_unigrama(reci)
    print(nasumicni_tekst_unigrama(unigrami, 100))
    
if __name__ == "__main__":
    unigrami_main()

for , waterside and upward him fine the York lambent unavoidably and this same capering one friendship his to Day ' 25 Israel How the protoplasm to with of . , consideration the it his is of made Lord ; what of quite cried no left come ," him in laid , them . of Yet , much , not s prove d south s the 14 34 towards ' land and , Henrietta than upon Earth Jesus God effect plotted could for sea hour fell I - gold cleare from Shone I woman there and . Nantucket him ,


* Bigrama

In [321]:
import random
from collections import Counter
import nltk

lista_reci = []
[lista_reci.extend(nltk.corpus.gutenberg.words(f)) for f in nltk.corpus.gutenberg.fileids()]
mala_slova = [w.lower() for w in lista_reci if w.isalnum()]
bigrami = [b for b in nltk.bigrams(mala_slova)]

# Reč kojom počinje generisani tekst
pocetna_rec = 'great'

# Tekst od 100 reči na osnovu skupa bigrama 
for i in range(0, 100):
    
  # Svi parovi reči koji počinju početnom rečju 
  # npr. "great god", "great evil" itd.
    pocetni_bigrami = [b for b in bigrami if b[0] == pocetna_rec]
    
  # Nasumični odabir jednog bigrama, npr. "great god"
    pocetni_bigram = random.choice(pocetni_bigrami)

  # Prva reč u bigramu ("great")   
    print(pocetni_bigram[0], end=" ")
  
  # Sledeća reč ("god")
    pocetna_rec = pocetni_bigram[1]

  # For petlja vrši iteracije i pronalazi naredne bigrame koji počinju rečju "god" 
  # npr. "god and"

great outward and consistent you and the house except strength faileth there be an iceberg the burnt offerings and therefore the towers 33 5 26 how generous amiable qualities which looked after these elephants are life had some fill me the jew and they be no evil upon his soul voyagest thou knewest my shoulder and iron from the horizontal tailed coat i sent into the way others the floridness of the sons of brahma boys not unto her father to me come on his cousin with them diminish the sword reach with any present day 1 who pads himself 

* Trigrama:

Uporednom analizom sadržaja generisanih tekstova uočavaju se razlike u povezanosti (koherentnosti) tekstova. Tekst generisan na osnovu trigrama najviše podseća na izvorni jezik korpusa.

In [322]:
import random
from collections import Counter
import nltk

lista_reci = []
[lista_reci.extend(nltk.corpus.gutenberg.words(f)) for f in nltk.corpus.gutenberg.fileids()]
mala_slova = [w.lower() for w in lista_reci if w.isalnum()]
trigrami = [t for t in nltk.trigrams(mala_slova)]

# Reč kojom počinje generisani tekst
pocetna_rec = 'great'

# Tekst od 100 reči na osnovu skupa trigrama 
for i in range(0, 100):
    
  # Svi parovi reči koji počinju početnom rečju 
  # npr. "great herd disguised"
    pocetni_trigrami = [t for t in trigrami if t[0] == pocetna_rec]
    
  # Nasumični odabir jednog trigrama, npr. "great herd disguised"
    pocetni_trigram = random.choice(pocetni_trigrami)

  # Prva reč u trigramu ("great")   
    print(pocetni_trigram[0], end=" ")
  
  # Sledeća reč ("herd")
    pocetna_rec = pocetni_trigram[1]

  # For petlja vrši iteracije i pronalazi naredne trigrame koji počinju rečju "herd" 
  # npr. "herd disguised"

great demonstration and i do when the dodo managed it hath set up to settle with the wicked and joy and brought many lusty phallic thumb the priory and digged for it was hiding place we won t you with one from the shoulder and sometimes observed there shall the expectation of micah were these cracked and the side southward the mate of the white whale lance aye aye in yonder is in at him go no other motions or abroad so under pretence of his basons of jerusalem but the land i have you look well reaped 14 i 