## **Goose**

In the context of **Natural Language Processing (NLP)**, **Goose** usually refers to the Goose article extractor, used here through **goose3**, its Python 3 version. Goose began as a Java project and was later rewritten in Python. Given the URL or raw HTML of a news article or similar web page, it extracts the main body text together with metadata such as the title, language, and links.

To break it down:

1. **Content extraction in NLP**: Web pages mix the article itself with navigation menus, advertisements, and other boilerplate. Tasks like information retrieval, semantic search, and knowledge graph generation need the article text alone, so extraction is typically the first step of any pipeline that starts from the web.

2. **Goose (goose3)**: Calling `extract` on a `Goose` instance downloads and parses a page and returns an article object. Its attributes include `cleaned_text` (the main body text), `title`, `domain`, `links`, and `infos` (a dictionary that gathers the extracted fields in one place).

This approach is useful for tasks like:
- Building text corpora and knowledge bases from web pages
- Feeding search engines with clean article text
- Supplying input to downstream steps such as summarization or entity extraction

The name **Goose** is also used by unrelated tools, so check which package a given piece of code imports. The cells below use goose3.

In [1]:
from goose3 import Goose

In [2]:
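# Create an extractor with the default configuration.
my_goose = Goose()
url = "https://en.wikipedia.org/wiki/Sundar_Pichai"
# Download the page and parse it into an article object.
wiki = my_goose.extract(url)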

In [4]:
display(wiki.domain)
display(wiki.links)

'en.wikipedia.org'

['#cite_note-britannica-3',
 '#cite_note-:1-4',
 '#cite_note-5',
 '/wiki/Indian-American',
 '#cite_note-6',
 '#cite_note-7',
 '/wiki/Chief_executive_officer',
 '/wiki/Alphabet_Inc.',
 '/wiki/Google',
 '#cite_note-8',
 '/wiki/Materials_engineer',
 '/wiki/McKinsey_%26_Co.',
 '#cite_note-britt-9',
 '/wiki/Google_Chrome',
 '/wiki/ChromeOS',
 '/wiki/Google_Drive',
 '/wiki/Gmail',
 '/wiki/Google_Maps',
 '/wiki/VP8',
 '/wiki/WebM',
 '/wiki/Chromebook',
 '/wiki/Android_(operating_system)',
 '/wiki/Google',
 '/wiki/Chief_Product_Officer',
 '/wiki/Larry_Page',
 '/wiki/Alphabet_Inc',
 '/wiki/Holding_company',
 '/wiki/Google',
 '#cite_note-10',
 '/wiki/Time_(magazine)',
 '/wiki/Time_100',
 '#cite_note-11',
 '#cite_note-12',
 '#cite_note-13',
 '/wiki/Madurai',
 '/wiki/Tamil_Nadu',
 '/wiki/India',
 '#cite_note-School_days-14',
 '#cite_note-britt-9',
 '#cite_note-Charlie-15',
 '/wiki/Tamil_Brahmin',
 '#cite_note-16',
 '#cite_note-17',
 '/wiki/Stenographer',
 '/wiki/Electrical_engineer',
 '/wiki/Gener
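The list mixes in-page citation anchors (`#cite_note-...`) with site-relative article paths (`/wiki/...`). Below is a minimal sketch, using only the standard library, of how one might drop the anchors and resolve the remaining paths against the page URL. The helper name `absolute_article_links` and the short sample list are illustrative assumptions, not part of goose3.

```python
from urllib.parse import urljoin


def absolute_article_links(links, base_url):
    """Drop in-page anchors, resolve the rest against base_url, keep first occurrences."""
    seen = set()
    result = []
    for link in links:
        if link.startswith("#"):  # in-page citation anchor, not a separate page
            continue
        full = urljoin(base_url, link)
        if full not in seen:  # skip repeats such as '/wiki/Google'
            seen.add(full)
            result.append(full)
    return result


# A few entries copied from the output above (hypothetical stand-in for wiki.links).
sample_links = [
    "#cite_note-britannica-3",
    "/wiki/Alphabet_Inc.",
    "/wiki/Google",
    "#cite_note-8",
    "/wiki/Google",
]
base = "https://en.wikipedia.org/wiki/Sundar_Pichai"
article_links = absolute_article_links(sample_links, base)
print(article_links)
```

In the notebook, the same helper could presumably be called as `absolute_article_links(wiki.links, url)`.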

In [7]:
print(wiki.cleaned_text)

Pichai Sundararajan (born June 10, 1972[3][4][5]), better known as Sundar Pichai ( ), is an Indian-born American business executive.[6][7] He is the chief executive officer (CEO) of Alphabet Inc. and its subsidiary Google.[8]

Pichai began his career as a materials engineer. Following a short stint at the management consulting firm McKinsey & Co., Pichai joined Google in 2004,[9] where he led the product management and innovation efforts for a suite of Google's client software products, including Google Chrome and ChromeOS, as well as being largely responsible for Google Drive. In addition, he went on to oversee the development of other applications such as Gmail and Google Maps. In 2010, Pichai also announced the open-sourcing of the new video codec VP8 by Google and introduced the new video format, WebM. The Chromebook was released in 2012. In 2013, Pichai added Android to the list of Google products that he oversaw.

Pichai was selected to become the next CEO of Google on August 10,
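The extracted text still carries Wikipedia's numeric citation markers (`[3][4][5]`) and an empty pair of parentheses where markup was removed. For downstream NLP work it is common to strip these first. The following is a rough sketch with the standard `re` module; the function name `strip_citations` and the exact cleanup rules are assumptions chosen for this page, not a goose3 feature.

```python
import re

# Matches bracketed numeric citation markers such as "[3]" or "[12]".
CITATION = re.compile(r"\[\d+\]")


def strip_citations(text):
    """Remove numeric citation markers and tidy the spacing they leave behind."""
    text = CITATION.sub("", text)
    text = re.sub(r"\(\s*\)", "", text)            # empty parentheses left by removed markup
    text = re.sub(r"[ \t]+([,.;:])", r"\1", text)  # stray space before punctuation
    return re.sub(r"[ \t]{2,}", " ", text)         # collapse repeated spaces


# First sentence of the output above, used as sample input.
sample = (
    "Pichai Sundararajan (born June 10, 1972[3][4][5]), better known as "
    "Sundar Pichai ( ), is an Indian-born American business executive.[6][7]"
)
cleaned = strip_citations(sample)
print(cleaned)
```

Only horizontal whitespace is collapsed, so the paragraph breaks in `cleaned_text` are left intact.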

In [8]:
wiki.infos

{'meta': {'description': '',
  'lang': 'en',
  'keywords': '',
  'favicon': '/static/apple-touch/wikipedia.png',
  'canonical': 'https://en.wikipedia.org/wiki/Sundar_Pichai',
  'encoding': 'UTF-8'},
 'image': None,
 'domain': 'en.wikipedia.org',
 'title': 'Sundar Pichai - Wikipedia',
 'cleaned_text': 'Pichai Sundararajan (born June 10, 1972[3][4][5]), better known as Sundar Pichai ( ), is an Indian-born American business executive.[6][7] He is the chief executive officer (CEO) of Alphabet Inc. and its subsidiary Google.[8]\n\nPichai began his career as a materials engineer. Following a short stint at the management consulting firm McKinsey & Co., Pichai joined Google in 2004,[9] where he led the product management and innovation efforts for a suite of Google\'s client software products, including Google Chrome and ChromeOS, as well as being largely responsible for Google Drive. In addition, he went on to oversee the development of other applications such as Gmail and Google Maps. In 20
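Because `infos` is a plain nested dictionary, the fields of interest can be pulled out with ordinary dictionary access. The sketch below is one possible way to flatten it into a compact summary. The function name `summarize_infos` is hypothetical, and the sample dictionary copies only a few keys from the output above, with `cleaned_text` replaced by a short placeholder string.

```python
def summarize_infos(infos):
    """Pick a few commonly needed fields out of a goose3-style infos dictionary."""
    meta = infos.get("meta", {})
    return {
        "title": infos.get("title"),
        "domain": infos.get("domain"),
        "lang": meta.get("lang"),
        "canonical": meta.get("canonical"),
        "n_chars": len(infos.get("cleaned_text", "")),  # size of the extracted body
    }


# Abbreviated stand-in for wiki.infos; cleaned_text is a placeholder, not the real text.
sample_infos = {
    "meta": {
        "lang": "en",
        "canonical": "https://en.wikipedia.org/wiki/Sundar_Pichai",
        "encoding": "UTF-8",
    },
    "domain": "en.wikipedia.org",
    "title": "Sundar Pichai - Wikipedia",
    "cleaned_text": "placeholder body text",
}
summary = summarize_infos(sample_infos)
print(summary)
```

Using `.get` with defaults keeps the helper from raising `KeyError` on pages where a field is missing.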