Skip to content

A Neo4j project designed to store and manage relationships between URLs

Notifications You must be signed in to change notification settings

KingAkeem/url-graph

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

URL-Graph

Neo4j Logo

URL-Graph is a Neo4j project designed to store and manage relationships between URLs. This graph database allows you to model and query the connections between different web addresses, providing valuable insights into the structure of your web data.

Introduction

In the world of web data, understanding relationships between URLs is crucial. The URL-Graph project leverages the power of Neo4j to create a graph representation of these relationships, enabling easy navigation and analysis.

Features

  • Graph Database: Utilize Neo4j's powerful graph database to model and store URL relationships.
  • Cypher Queries: Leverage the expressive Cypher query language to extract valuable insights from the graph.

Getting Started

Prerequisites

Before you begin, ensure you have the following prerequisites installed:

Installation

  1. Clone the repository:
git clone https://github.com/KingAkeem/url-graph.git
  1. Install dependencies:
cd url-graph
pip install -r requirements.txt

Configuration (TODO)

Update the configuration file with your Neo4j connection details (config.yml):

neo4j:
  uri: bolt://localhost:7687
  username: your-username
  password: your-password

Running the application

  1. Start the Neo4j database, this will be based on the OS that you're using. Check Neo4j instructions for further explanation.

  2. Execute the application

python main.py -u https://www.example.com -d 3 # -u/--url to specify URI and -d/--depth to specify depth of graph

Will dockerize project at some point.

Access the Neo4j Browser or use Cypher queries to interact with the URL graph.

Browser URL: http://localhost:7474/browser/

// Example Cypher Query to find relationships for a specific URL
MATCH (n:Node {url: 'https://example.com'})
-[relationship:parent]-()
RETURN n, relationship;

Examples

With a depth of 1

graph

With a depth of 3

graph (1)

About

A Neo4j project designed to store and manage relationships between URLs

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages