Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updated Sidebar for Pebblo Docs #217

Merged
merged 8 commits into from
Feb 29, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions docs/gh_pages/docs/development.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,8 @@
# Setting up development environment

> **Note**
> Please note that Pebblo requires Python version 3.9 or above to function optimally.

Pebblo is currently supported in MacOS and Linux.

The following instructions are **tested on Mac OSX and Linux (Debian).**
Expand Down
55 changes: 27 additions & 28 deletions docs/gh_pages/docs/entityclassifier.md
Original file line number Diff line number Diff line change
@@ -1,31 +1,30 @@
# Pebblo Entity Classifier
`Pebblo entity classifier` is designed to automatically scan your loader source files and pinpoint sensitive entities within the files. By highlighting these entities, it assists in ensuring compliance, data security, and privacy protection within your data processing pipeline.
Integrating it enhances risk mitigation and regulatory adherence while streamlining sensitive data handling.
# Pebblo Entity Classifier

`Pebblo entity classifier` is designed to automatically scan your loader source files and pinpoint sensitive entities within the files. By highlighting these entities, it assists in ensuring compliance, data security, and privacy protection within your data processing pipeline.
Integrating it enhances risk mitigation and regulatory adherence while streamlining sensitive data handling.

Pebblo Entity Classifier harnesses the power of the `Presidio Analyzer` python library for accurate entity classification.
Leveraging Presidio's robust features and capabilities, we ensure precise identification of entities within textual data.
Additionally, our solution welcomes contributions from the open-source community, encouraging collaborative efforts to improve its functionality and reliability.

# Entities Supported By Pebblo Entity Classifier

Below is the list of `entities` supported by Pebblo -

1. US Social Security Number
2. US Passport Number
3. US Driver's License
4. US Credit Card Number
5. US Bank Account Number
6. IBAN Code
7. US ITIN
8. Github Access Token
9. Slack Access Token
10. AWS Access Key
11. AWS Secret Key
12. Azure Key ID


Additionally, our solution welcomes contributions from the open-source community, encouraging collaborative efforts to improve its functionality and reliability.

# Entities Supported By Pebblo Entity Classifier

Below is the list of `entities` supported by Pebblo -

1. US Social Security Number
1. US Passport Number
1. US Driver's License
1. US Credit Card Number
1. US Bank Account Number
1. IBAN Code
1. US ITIN
1. Github Access Token
1. Slack Access Token
1. AWS Access Key
1. AWS Secret Key
1. Azure Key ID

User can get details of classified entities for their loader source files in Pebblo report.
Different sections of Pebblo report such as , `Top Files with Most Findings`, `Data Source Findings Table` and `Snippets` helps to get overview of pebblo entity classifier output for user's Rag application.
For more details refer - [Reports](reports.md)
Different sections of Pebblo report such as , `Top Files with Most Findings`, `Data Source Findings Table` and `Snippets` helps to get overview of pebblo entity classifier output for user's Rag application.

For more details refer - [Reports](reports.md)
5 changes: 4 additions & 1 deletion docs/gh_pages/docs/installation.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,8 @@
# Installation

> **Note**
> Please note that Pebblo requires Python version 3.9 or above to function optimally.

### Pre-requisites

#### Mac OSX
Expand Down Expand Up @@ -36,4 +39,4 @@ see [troubeshooting](troubleshooting.md) if you face any issues.

```bash
pebblo --config [PATH TO CONFIG FILE]
````
```
6 changes: 3 additions & 3 deletions docs/gh_pages/docs/introduction.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ slug: /

Pebblo enables developers to safely load data and promote their Gen AI app to deployment without worrying about the organization’s compliance and security requirements. The project identifies semantic topics and entities found in the loaded data and summarizes them on the UI or a PDF report.

![Pebblo Overview](../static/img/pebblo-overview.jpg)
![Pebblo Overview](../static/img/pebblo-overview.webp)

# Benefits

Expand All @@ -30,7 +30,7 @@ Pebblo has two components.
- [Installation](installation.md)
- [Development Environment](development.md)
- [Pebblo Server](daemon.md)
- [Pebblo Safe DataLoader for Langchain](rag.md)
- [Pebblo Configuration File ](config.md)
- [Safe DataLoader for Langchain](rag.md)
- [Configuration](config.md)
- [Reports](reports.md)
- [Troubleshooting](troubleshooting.md)
56 changes: 27 additions & 29 deletions docs/gh_pages/docs/topicclassifier.md
Original file line number Diff line number Diff line change
@@ -1,32 +1,30 @@
# Pebblo Topic Classifier

# Pebblo Topic Classifier

`Pebblo topic classifier` is designed to analyze loader source files and accurately identify the underlying
topics they contain. It uses machine learning model meticulously trained to identify and categorize topics within textual data. this model is open for contribution from the open-source community, allowing for collaborative enhancements
and improvements to enrich its accuracy and effectiveness.
`Pebblo topic classifier` is designed to analyze loader source files and accurately identify the underlying
topics they contain. It uses machine learning model meticulously trained to identify and categorize topics within textual data. this model is open for contribution from the open-source community, allowing for collaborative enhancements
and improvements to enrich its accuracy and effectiveness.

# Topics Supported By Pebblo Topic Classifier

Below is the list of `topics` supported by Pebblo -

1. Board Meeting
1. Enterprise Agreement
1. Patent Application Filling
1. Financial Report
1. Loan and Security Agreement
1. Consulting Agreement
1. Sexual Harassment
1. Settlement Agreement
1. Price List
1. Distribution/Partner Agreement
1. Customer List
1. Executive Severance Agreement
1. Employee Agreement
1. Merger Agreement
1. Non-Disclosure Agreement

# Topics Supported By Pebblo Topic Classifier

Below is the list of `topics` supported by Pebblo -

1. Board Meeting
2. Enterprise Agreement
3. Patent Application Filling
4. Financial Report
5. Loan and Security Agreement
6. Consulting Agreement
7. Sexual Harassment
8. Settlement Agreement
9. Price List
10. Distribution/Partner Agreement
11. Customer List
12. Executive Severance Agreement
13. Employee Agreement
14. Merger Agreement
15. Non-Disclosure Agreement


User can get details of classified topics for their loader source files in Pebblo report.
Different sections of Pebblo report such as , `Top Files With Most Findings`, `Data Source Findings Table` and `Snippets` helps to get overview of pebblo topic classifier output for user's rag application.
For more details refer - [Reports](reports.md)
Different sections of Pebblo report such as , `Top Files With Most Findings`, `Data Source Findings Table` and `Snippets` helps to get overview of pebblo topic classifier output for user's rag application.

For more details refer - [Reports](reports.md)
42 changes: 24 additions & 18 deletions docs/gh_pages/sidebars.ts
Original file line number Diff line number Diff line change
Expand Up @@ -19,39 +19,45 @@ const sidebars: SidebarsConfig = {
},
{
type: "doc",
id: "daemon", // document ID
label: "Pebblo Server", // sidebar label
id: "config", // document ID
label: "Configuration", // sidebar label
},
{
type: "doc",
id: "rag", // document ID
label: "Pebblo Safe DataLoader for Langchain", // sidebar label
type: "category",
label: "Pebblo", // sidebar label
items: [
{
type: "doc",
label: "Server",
id: "daemon",
},
{
type: "doc",
label: "Entity Classifier",
id: "entityclassifier",
},
{
type: "doc",
label: "Topic Classifier",
id: "topicclassifier",
},
],
},
{
type: "doc",
id: "config", // document ID
label: "Pebblo Configuration File", // sidebar label
id: "rag", // document ID
label: "Safe DataLoader for Langchain", // sidebar label
},
{
type: "doc",
id: "reports", // document ID
label: "Reports", // sidebar label
},
{
{
type: "doc",
id: "troubleshooting", // document ID
label: "Troubleshooting Guide", // sidebar label
},
{
type: "doc",
id: "entityclassifier", // document ID
label: "Pebblo Entity Classifier", // sidebar label
},
{
type: "doc",
id: "topicclassifier", // document ID
label: "Pebblo Topic Classifier", // sidebar label
},
],
};

Expand Down
Binary file removed docs/gh_pages/static/img/pebblo-overview.jpg
Binary file not shown.
Binary file added docs/gh_pages/static/img/pebblo-overview.webp
Binary file not shown.