# Code Documentation
This code extracts keywords from text data stored in a CSV file using the spaCy and pandas libraries.
## Importing Libraries

- `pandas`: for reading CSV files and data manipulation.
- `spacy`: for natural language processing tasks such as tokenization and stop word removal.
- `collections.Counter`: for counting occurrences of keywords.

In [None]:
import pandas as pd
import spacy
from collections import Counter

## Mounting Google Drive in Google Colab
This code mounts Google Drive in Google Colab, enabling access to files stored in Google Drive.

Libraries used:
- google.colab: for mounting Google Drive in Google Colab.

In [3]:
from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive


## Loading spaCy Language Model for English
This code loads spaCy's small English language model (`en_core_web_sm`), which the keyword extraction function below uses to tokenize text and to flag stop words and punctuation.

In [2]:
nlp = spacy.load("en_core_web_sm")

## Keyword Extraction Function using spaCy

This code defines a function, `extract_keywords`, that extracts keywords from a given text using the spaCy library. A token counts as a keyword if it is not a stop word, is not punctuation, and consists only of alphabetic characters. Keywords are lowercased before they are counted.

Parameters:
- text (str): The input text from which keywords are to be extracted.

Returns:
- list of tuples: A list of (keyword, frequency) pairs sorted in descending order of frequency. Returns an empty list if the input is not a string.

In [16]:
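def extract_keywords(text):
    if isinstance(text, str):
        # Tokenize the text with the loaded spaCy pipeline
        doc = nlp(text)

        keyword_freq = Counter()

        for token in doc:
            # Keep only alphabetic tokens that are neither stop words nor punctuation
            if not token.is_stop and not token.is_punct:
                if token.text.isalpha():
                    keyword_freq[token.text.lower()] += 1

        # Sort (keyword, frequency) pairs by descending frequency
        sorted_keywords = sorted(keyword_freq.items(), key=lambda x: x[1], reverse=True)

        return sorted_keywords
    else:
        # Non-string input (for example, a missing value) yields no keywords
        return []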
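
The counting and sorting logic of the function can be tried without loading a spaCy model. The following is a minimal sketch, not the function itself: `count_keywords` and `DEMO_STOP_WORDS` are hypothetical names, whitespace splitting replaces spaCy's tokenizer, and a tiny hand-written set replaces spaCy's stop word list. It also uses `Counter.most_common()`, which gives the same descending-frequency ordering as the `sorted(...)` call.

```python
from collections import Counter

# Hypothetical stand-in for spaCy's stop word flag (token.is_stop);
# the real function relies on the loaded spaCy model instead.
DEMO_STOP_WORDS = {"the", "was", "and", "a", "is", "but"}


def count_keywords(text):
    """Count alphabetic, non-stop-word tokens, most frequent first."""
    if not isinstance(text, str):
        return []  # mirrors the non-string guard in extract_keywords
    # Naive tokenization (assumption): split on whitespace, trim punctuation
    words = (w.strip(".,!?").lower() for w in text.split())
    counts = Counter(w for w in words if w.isalpha() and w not in DEMO_STOP_WORDS)
    # most_common() sorts by descending count; ties keep first-seen order
    return counts.most_common()


pairs = count_keywords("The food was great and the service was great, but slow.")
print(pairs)  # [('great', 2), ('food', 1), ('service', 1), ('slow', 1)]

# A missing value (NaN is a float, not a string) produces an empty list
missing = count_keywords(float("nan"))
print(missing)  # []
```

Results from the real `extract_keywords` may differ, since spaCy's tokenizer and stop word list are more thorough than this stand-in.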

## Reading CSV File into DataFrame in Google Colab

This code snippet reads a CSV file from Google Drive into a DataFrame in the Google Colab environment.

In [12]:
df = pd.read_csv('/content/drive/MyDrive/For Capstone/Collecting data/placedata.csv')

## Performing Keyword Extraction for Each Review Text in a DataFrame

This code snippet performs keyword extraction for each review text in a DataFrame and adds the extracted keywords as a new column.

Function used:
- extract_keywords: A function defined to extract keywords from text using spaCy.


In [None]:
df['Keywords'] = df['Review Text'].apply(extract_keywords)
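
Each cell of the `Keywords` column now holds a list of (keyword, frequency) pairs. If only the most frequent keywords are needed, the list can be trimmed afterwards. The sketch below assumes a hypothetical helper, `top_keywords`, that is not part of the original code, and uses made-up sample data.

```python
def top_keywords(pairs, n=1):
    """Return the n most frequent keywords from (keyword, frequency) pairs."""
    # The pairs are already sorted by descending frequency, so slicing suffices
    return [keyword for keyword, _ in pairs[:n]]


sample_pairs = [("great", 2), ("food", 1), ("service", 1)]  # illustrative data

top_one = top_keywords(sample_pairs)
top_two = top_keywords(sample_pairs, n=2)
empty = top_keywords([])  # rows without keywords stay empty

print(top_one)  # ['great']
print(top_two)  # ['great', 'food']
print(empty)    # []
```

On the DataFrame, such a helper could be applied with `df['Keywords'].apply(top_keywords)`.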

## Displaying a Random Sample of the Review Text and Keywords Columns

In [None]:
df[['Review Text', 'Keywords']].sample(20)

## Saving DataFrame to a New CSV File in Google Colab

This code snippet saves the updated DataFrame from the Google Colab environment to a new CSV file in Google Drive.


In [None]:
file_path = '/content/drive/MyDrive/For Capstone/Collecting data/Hasil_Ekstraksi(One Keyword).csv'

# Save the updated data to a new CSV file
df.to_csv(file_path, index=False)
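
One thing to keep in mind: CSV files store text only, so the lists of tuples in the `Keywords` column are expected to be written as their string form and to come back as plain strings when the file is reloaded. The sketch below illustrates this round trip with Python's standard `csv` module as a stand-in for pandas (an assumption made to keep the example self-contained), using made-up data, and restores the list with `ast.literal_eval`.

```python
import ast
import csv
import io

keywords = [("great", 2), ("food", 1)]  # illustrative data

# Write one row; non-string cells are converted with str(), which
# approximates how the list of tuples ends up as text in the CSV file.
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(["Review Text", "Keywords"])
writer.writerow(["The food was great, really great.", keywords])

# Read the row back: the Keywords cell is now a plain string
buffer.seek(0)
rows = list(csv.DictReader(buffer))
raw_cell = rows[0]["Keywords"]
print(type(raw_cell).__name__)  # str

# ast.literal_eval safely parses the string back into Python objects
restored = ast.literal_eval(raw_cell)
print(restored)  # [('great', 2), ('food', 1)]
```

When reloading with pandas, the same conversion can likely be done in one step by passing `converters={'Keywords': ast.literal_eval}` to `pd.read_csv`.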