Skip to content

Commit 0118961

Browse files
Vectorizing
1 parent f19e4b1 commit 0118961

File tree

5 files changed

+7411
-0
lines changed

5 files changed

+7411
-0
lines changed
Lines changed: 57 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,57 @@
1+
{
2+
"cells": [
3+
{
4+
"cell_type": "markdown",
5+
"metadata": {},
6+
"source": [
7+
"# Vectorizing\n",
8+
"<ul>\n",
9+
" <li>\n",
10+
" The process that we use to convert text to a form that Python and a machine learning model can understand is called vectorizing\n",
11+
" </li>\n",
12+
" <li>\n",
13+
" Vectorization is used to speed up the Python code without using loop\n",
14+
" </li>\n",
15+
"</ul>\n",
16+
"___\n",
17+
"\n",
18+
"- ### Count Vectorization\n",
19+
"Creates a document-term matrix where the entry of each cell will be a count of the number of times that word occurred in that document."
20+
]
21+
},
22+
{
23+
"cell_type": "code",
24+
"execution_count": null,
25+
"metadata": {},
26+
"outputs": [],
27+
"source": [
28+
"import nltk\n",
29+
"import re\n",
30+
"import string\n",
31+
"import pandas as pd\n",
32+
"pd.set_options('.max_')"
33+
]
34+
}
35+
],
36+
"metadata": {
37+
"kernelspec": {
38+
"display_name": "Python 3",
39+
"language": "python",
40+
"name": "python3"
41+
},
42+
"language_info": {
43+
"codemirror_mode": {
44+
"name": "ipython",
45+
"version": 3
46+
},
47+
"file_extension": ".py",
48+
"mimetype": "text/x-python",
49+
"name": "python",
50+
"nbconvert_exporter": "python",
51+
"pygments_lexer": "ipython3",
52+
"version": "3.7.5"
53+
}
54+
},
55+
"nbformat": 4,
56+
"nbformat_minor": 2
57+
}
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
{
2+
"cells": [],
3+
"metadata": {},
4+
"nbformat": 4,
5+
"nbformat_minor": 2
6+
}

0 commit comments

Comments
 (0)