
## String Formatter

In [137]:
name = 'Ridvan'

In [138]:
print("My name is {}".format(name))

My name is Ridvan


In [139]:
print(f"My name is {name}")

My name is Ridvan
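
As a small sketch of how the two styles relate: `str.format` accepts positional or keyword arguments, and an f-string evaluates the names directly. The variable `language` and its value are made up for illustration and are not part of the notebook.

```python
name = 'Ridvan'
language = 'Python'  # illustrative value, not from the notebook

# Three ways to build the same string.
positional = "My name is {0} and I use {1}".format(name, language)
keyword = "My name is {n} and I use {l}".format(n=name, l=language)
fstring = f"My name is {name} and I use {language}"

print(fstring)
print(positional == keyword == fstring)
```

All three produce the same string; f-strings are usually the shortest to read when the values are already in local variables.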


In [140]:
data_science_tuts = [('Python for Beginners', 19),
                     ('Feature selection for ML', 11),
                     ('Machine Learning Tutorials', 11),
                     ('Deep Learning Tutorials', 19)]
data_science_tuts

[('Python for Beginners', 19),
 ('Feature selection for ML', 11),
 ('Machine Learning Tutorials', 11),
 ('Deep Learning Tutorials', 19)]

In [141]:
for info in data_science_tuts:
  print(info)

('Python for Beginners', 19)
('Feature selection for ML', 11)
('Machine Learning Tutorials', 11)
('Deep Learning Tutorials', 19)


* **<** `-->` Left-align the value within the given width
* **>** `-->` Right-align the value within the given width

In [142]:
print(f"{'Course':<50} {'Count':>10}")
print("-" * 60)

for info in data_science_tuts:
    print(f"{info[0]:<50} {info[1]:>10}")

Course                                                  Count
------------------------------------------------------------
Python for Beginners                                       19
Feature selection for ML                                   11
Machine Learning Tutorials                                 11
Deep Learning Tutorials                                    19


* **^** `-->` Center

In [143]:
print(f"{'Course':^50} {'Count':^10}")
print("-" * 60)

for info in data_science_tuts:
    print(f"{info[0]:^50} {info[1]:^10}")

                      Course                         Count   
------------------------------------------------------------
               Python for Beginners                    19    
             Feature selection for ML                  11    
            Machine Learning Tutorials                 11    
             Deep Learning Tutorials                   19    


* **.>**  `-->` Right-align and pad with `.`
* **-<**  `-->` Left-align and pad with `-`

In [144]:
print(f"{'Course':<50} {'Count':>10}")
print("-" * 60)

for info in data_science_tuts:
    print(f"{info[0]:-<50} {info[1]:.>10}")

Course                                                  Count
------------------------------------------------------------
Python for Beginners------------------------------ ........19
Feature selection for ML-------------------------- ........11
Machine Learning Tutorials------------------------ ........11
Deep Learning Tutorials--------------------------- ........19
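
The widths 50 and 10 used above are hard-coded. As a minimal sketch, the widths can instead be computed from the data and passed in through nested replacement fields such as `{name:<{name_width}}`. The helper name `format_table` is hypothetical and not part of the notebook.

```python
def format_table(rows, headers=("Course", "Count")):
    """Return the table as a list of lines, sizing each column to fit its data."""
    # Column width = longest cell in that column, header included.
    name_width = max([len(headers[0])] + [len(name) for name, _ in rows])
    count_width = max([len(headers[1])] + [len(str(count)) for _, count in rows])

    lines = [f"{headers[0]:<{name_width}} {headers[1]:>{count_width}}"]
    lines.append("-" * (name_width + 1 + count_width))
    for name, count in rows:
        lines.append(f"{name:<{name_width}} {count:>{count_width}}")
    return lines


data_science_tuts = [('Python for Beginners', 19),
                     ('Feature selection for ML', 11),
                     ('Machine Learning Tutorials', 11),
                     ('Deep Learning Tutorials', 19)]

table_lines = format_table(data_science_tuts)
print("\n".join(table_lines))
```

Returning a list of lines rather than printing inside the helper keeps the formatting logic easy to check and reuse.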


## Reading and Writing `.csv` and `.tsv` Files

In [145]:
import numpy as np
import pandas as pd

In [146]:
data = pd.read_csv('https://raw.githubusercontent.com/purviljain/Semantic-and-Sentiment-analysis/refs/heads/master/moviereviews.tsv', sep='\t')
data.head()

Unnamed: 0,label,review
0,neg,how do films like mouse hunt get into theatres...
1,neg,some talented actresses are blessed with a dem...
2,pos,this has been an extraordinary year for austra...
3,pos,according to hollywood movies made in last few...
4,neg,my first press screening of 1998 and already i...


In [147]:
data.shape

(2000, 2)

In [148]:
data['label'].value_counts()

label,count
neg,1000
pos,1000


In [149]:
pos = data[data['label']=='pos']
pos.head()

Unnamed: 0,label,review
2,pos,this has been an extraordinary year for austra...
3,pos,according to hollywood movies made in last few...
11,pos,"with stars like sigourney weaver ( "" alien "" t..."
16,pos,i remember hearing about this film when it fir...
18,pos,garry shandling makes his long overdue starrin...


In [150]:
pos.to_csv('pos.tsv', sep = '\t', index=False) # Saves the 'pos' DataFrame to a TSV file named 'pos.tsv', using tab as the separator.

In [151]:
pd.read_csv('pos.tsv', sep='\t').head()

Unnamed: 0,label,review
0,pos,this has been an extraordinary year for austra...
1,pos,according to hollywood movies made in last few...
2,pos,"with stars like sigourney weaver ( "" alien "" t..."
3,pos,i remember hearing about this film when it fir...
4,pos,garry shandling makes his long overdue starrin...
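
The same write-then-read round trip can be sketched without pandas, using the standard library's `csv` module with a tab delimiter. This is an alternative to the pandas calls above, not the notebook's method; the two sample rows and the file name `sample.tsv` are made up for illustration.

```python
import csv
import os
import tempfile

# Made-up sample rows; the second review contains a tab on purpose.
rows = [
    {"label": "pos", "review": "a short, made-up positive review"},
    {"label": "neg", "review": "a made-up review\twith a tab inside"},
]

path = os.path.join(tempfile.mkdtemp(), "sample.tsv")

# newline="" lets the csv module control line endings itself.
with open(path, "w", newline="", encoding="utf-8") as f:
    writer = csv.DictWriter(f, fieldnames=["label", "review"], delimiter="\t")
    writer.writeheader()
    writer.writerows(rows)

with open(path, newline="", encoding="utf-8") as f:
    loaded = list(csv.DictReader(f, delimiter="\t"))

print(loaded == rows)
```

The writer quotes any field that contains the delimiter, which is why the review with an embedded tab survives the round trip.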


## Built-in magic command in Jupyter/Colab: `%%writefile`

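`%%writefile` is a cell magic, so it must be the first line of its cell. The cell below creates `TextFile.txt` (or overwrites it if it already exists) and writes two lines of text to it.

In [152]:
%%writefile TextFile.txt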
Hello, this is first test file.
This is the second line of the file.

Overwriting TextFile.txt


To add to `TextFile.txt` instead of overwriting it, use the `-a` (append) flag:

In [153]:
%%writefile -a TextFile.txt
Thank you for adding

Appending to TextFile.txt
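
Outside a notebook the magic is not available. A rough plain-Python equivalent, assuming mode `"w"` stands in for `%%writefile` and mode `"a"` for `%%writefile -a`, might look like this. The file is written to a temporary directory so the sketch does not touch the notebook's own files.

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), "TextFile.txt")

# Mode "w" creates the file or truncates an existing one (like %%writefile).
with open(path, "w", encoding="utf-8") as f:
    f.write("Hello, this is first test file.\n")
    f.write("This is the second line of the file.\n")

# Mode "a" keeps the existing content and writes at the end (like %%writefile -a).
with open(path, "a", encoding="utf-8") as f:
    f.write("Thank you for adding\n")

with open(path, encoding="utf-8") as f:
    content = f.read()

print(content)
```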


## Use Python's built-in functions to read and write text files

In [154]:
file = open('TextFile.txt', 'r')
file

<_io.TextIOWrapper name='TextFile.txt' mode='r' encoding='UTF-8'>

In [155]:
file.read()

'Hello, this is first test file.\nThis is the second line of the file.\nThank you for adding\n'

In [156]:
file.seek(0)  # Moves the file pointer to the beginning of the file (position 0).

0

In [157]:
file.readline()  # Reads a single line from the file and returns it as a string.

'Hello, this is first test file.\n'

In [158]:
file.seek(0)

0

In [159]:
file.readlines()

['Hello, this is first test file.\n',
 'This is the second line of the file.\n',
 'Thank you for adding\n']

In [160]:
file.close()
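
The calls to `seek(0)` above are needed because each read advances the file pointer. A minimal sketch of that behavior, using a temporary file with two of the lines from above:

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), "TextFile.txt")
with open(path, "w", encoding="utf-8") as f:
    f.write("Hello, this is first test file.\n")
    f.write("This is the second line of the file.\n")

with open(path, "r", encoding="utf-8") as f:
    first_read = f.read()      # reads everything; the pointer is now at the end
    second_read = f.read()     # nothing left to read, so this is an empty string
    f.seek(0)                  # rewind to the start of the file
    after_seek = f.readline()  # the first line is available again

print(repr(second_read))
print(repr(after_seek))
```

Without the `seek(0)`, the `readline()` call would also return an empty string.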

In [161]:
with open('TextFile.txt', 'r') as file:
  text_data = file.readlines()
  print(text_data)

['Hello, this is first test file.\n', 'This is the second line of the file.\n', 'Thank you for adding\n']


In [162]:
for temp in text_data:
  print(temp.strip())

Hello, this is first test file.
This is the second line of the file.
Thank you for adding


In [163]:
for i, temp in enumerate(text_data):
  print(f"{i}--->  {temp.strip()}")

0--->  Hello, this is first test file.
1--->  This is the second line of the file.
2--->  Thank you for adding
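
`readlines()` loads every line into a list at once. For larger files, one alternative is to loop over the file object itself, which yields one line at a time. The sketch below assumes numbering should start at 1 (via `enumerate(..., start=1)`) and uses `rstrip("\n")` so that only the trailing newline is removed, whereas `strip()` also removes leading whitespace.

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), "TextFile.txt")
with open(path, "w", encoding="utf-8") as f:
    f.write("Hello, this is first test file.\n")
    f.write("This is the second line of the file.\n")
    f.write("Thank you for adding\n")

numbered = []
with open(path, "r", encoding="utf-8") as f:
    # A file object is iterable and yields one line per iteration.
    for number, line in enumerate(f, start=1):
        numbered.append(f"{number}--->  " + line.rstrip("\n"))

print("\n".join(numbered))
```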


## File Writing

In [164]:
file = open('TextFile2.txt', 'w')
file

<_io.TextIOWrapper name='TextFile2.txt' mode='w' encoding='UTF-8'>

In [165]:
file.write('This is second file')

19

In [166]:
file.close()

In [167]:
with open('TextFile3.txt', 'w') as file:
  file.write('This is third file\n')

In [168]:
text_data

['Hello, this is first test file.\n',
 'This is the second line of the file.\n',
 'Thank you for adding\n']

In [169]:
with open('TextFile3.txt', 'a') as file:
  for temp in text_data:
    file.write(temp)
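
The append loop can also be written with `writelines`, which writes each string in a list as-is (it does not add newlines). A minimal sketch that repeats the two steps in a temporary directory and reads the result back to check it:

```python
import os
import tempfile

text_data = ['Hello, this is first test file.\n',
             'This is the second line of the file.\n',
             'Thank you for adding\n']

path = os.path.join(tempfile.mkdtemp(), "TextFile3.txt")

with open(path, "w", encoding="utf-8") as f:
    f.write("This is third file\n")

with open(path, "a", encoding="utf-8") as f:
    # Equivalent to calling f.write() on each element in turn.
    f.writelines(text_data)

with open(path, "r", encoding="utf-8") as f:
    result = f.readlines()

print(result)
```

Reading the file back confirms that the line written in `"w"` mode is still there and that the appended lines follow it.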