Pensieve (previously named Memos)

I changed the name to Pensieve because Memos was already taken.

Pensieve (previously named Memos)

Pensieve is a privacy-focused passive recording project. It can automatically record screen content, build intelligent indices, and provide a convenient web interface to retrieve historical records.

This project draws heavily from two other projects: one called Rewind and another called Windows Recall. However, unlike both of them, Pensieve allows you to have complete control over your data, avoiding the transfer of data to untrusted data centers.

Features

🚀 Simple installation: just install dependencies via pip to get started
🔒 Complete data control: all data is stored locally, allowing for full local operation and self-managed data processing
🔍 Full-text and vector search support
🤖 Integrates with Ollama, using it as the machine learning engine for Pensieve
🌐 Compatible with any OpenAI API models (e.g., OpenAI, Azure OpenAI, vLLM, etc.)
💻 Supports Mac and Windows (Linux support is in development)
🔌 Extensible functionality through plugins

Quick Start

Important

It seems that not all versions of Python's sqlite3 library support enable_load_extension. However, I'm not sure which environments or Python versions might encounter this issue. I use conda to manage Python, and Python installed via conda works fine on macOS, Windows x86, and Ubuntu 22.04.

Please ensure the following command works in your Python environment:

import sqlite3

# Check sqlite version
print(f"SQLite version: {sqlite3.sqlite_version}")

# Test if enable_load_extension is supported
try:
    conn = sqlite3.connect(':memory:')
    conn.enable_load_extension(True)
    print("enable_load_extension is supported")
except AttributeError:
    print("enable_load_extension is not supported")
finally:
    conn.close()

If you find that this does not work properly, you can install miniconda to manage your Python environment. Alternatively, check the current issue list to see if others have encountered the same problem.

1. Install Pensieve

pip install memos

2. Initialize

Initialize the pensieve configuration file and sqlite database:

memos init

Data will be stored in the ~/.memos directory.

3. Start the Service

memos enable
memos start

This command will:

Begin recording all screens
Start the Web service
Set the service to start on boot

4. Access the Web Interface

Open your browser and visit http://localhost:8839

Mac Permission Issues

On Mac, Pensieve needs screen recording permission. When the program starts, Mac will prompt for screen recording permission - please allow it to proceed.

User Guide

Using the Appropriate Embedding Model

1. Model Selection

Pensieve uses embedding models to extract semantic information and build vector indices. Therefore, choosing an appropriate embedding model is crucial. Depending on the user's primary language, different embedding models should be selected.

For Chinese scenarios, you can use the jinaai/jina-embeddings-v2-base-zh model.
For English scenarios, you can use the jinaai/jina-embeddings-v2-base-en model.

2. Adjust Memos Configuration

Open the ~/.memos/config.yaml file with your preferred text editor and modify the embedding configuration:

embedding:
  use_local: true
  model: jinaai/jina-embeddings-v2-base-en   # Model name used
  num_dim: 768                               # Model dimensions
  use_modelscope: false                      # Whether to use ModelScope's model

3. Restart Memos Service

memos stop
memos start

The first time you use the embedding model, Pensieve will automatically download and load the model.

4. Rebuild Index

If you switch the embedding model during use, meaning you have already indexed screenshots before, you need to rebuild the index:

memos reindex --force

The --force parameter indicates rebuilding the index table and deleting previously indexed screenshot data.

Using Ollama for Visual Search

By default, Pensieve only enables the OCR plugin to extract text from screenshots and build indices. However, this method significantly limits search effectiveness for images without text.

To achieve more comprehensive visual search capabilities, we need a multimodal image understanding service compatible with the OpenAI API. Ollama perfectly fits this role.

Important Notes Before Use

Before deciding to enable the VLM feature, please note the following:

Hardware Requirements
- Recommended configuration: NVIDIA graphics card with at least 8GB VRAM or Mac with M series chip
- The minicpm-v model will occupy about 5.5GB of storage space
- CPU mode is not recommended as it will cause severe system lag
Performance and Power Consumption Impact
- Enabling VLM will significantly increase system power consumption
- Consider using other devices to provide OpenAI API compatible model services

1. Install Ollama

Visit the Ollama official documentation for detailed installation and configuration instructions.

2. Prepare the Multimodal Model

Download and run the multimodal model minicpm-v using the following command:

ollama run minicpm-v "Describe what this service is"

This command will download and run the minicpm-v model. If the running speed is too slow, it is not recommended to use this feature.

3. Configure Pensieve to Use Ollama

Open the ~/.memos/config.yaml file with your preferred text editor and modify the vlm configuration:

vlm:
  endpoint: http://localhost:11434  # Ollama service address
  modelname: minicpm-v              # Model name to use
  force_jpeg: true                  # Convert images to JPEG format to ensure compatibility
  prompt: Please describe the content of this image, including the layout and visual elements  # Prompt sent to the model

Use the above configuration to overwrite the vlm configuration in the ~/.memos/config.yaml file.

Also, modify the default_plugins configuration in the ~/.memos/plugins/vlm/config.yaml file:

default_plugins:
- builtin_ocr
- builtin_vlm

This adds the builtin_vlm plugin to the default plugin list.

4. Restart Pensieve Service

memos stop
memos start

After restarting the Pensieve service, wait a moment to see the data extracted by VLM in the latest screenshots on the Pensieve web interface:

If you do not see the VLM results, you can:

Use the command memos ps to check if the Pensieve process is running normally
Check for error messages in ~/.memos/logs/memos.log
Confirm whether the Ollama model is loaded correctly (ollama ps)

Full Indexing

Pensieve is a compute-intensive application. The indexing process requires the collaboration of OCR, VLM, and embedding models. To minimize the impact on the user's computer, Pensieve calculates the average processing time for each screenshot and adjusts the indexing frequency accordingly. Therefore, not all screenshots are indexed immediately by default.

If you want to index all screenshots, you can use the following command for full indexing:

memos scan

This command will scan and index all recorded screenshots. Note that depending on the number of screenshots and system configuration, this process may take some time and consume significant system resources. The index construction is idempotent, and running this command multiple times will not re-index already indexed data.

Sampling Strategy

Pensieve dynamically adjusts the image processing interval based on the speed of screenshot generation and the speed of processing individual images. In environments without NVIDIA GPUs, it may be challenging to ensure that image processing keeps up with the rate of screenshot generation. To address this, Pensieve processes images on a sampled basis.

To prevent excessive system load, Pensieve’s default sampling strategy is intentionally conservative. However, this conservative approach might limit the performance of devices with higher computational capacity. To provide more flexibility, additional control options have been introduced in ~/.memos/config.yaml, allowing users to configure the system for either more conservative or more aggressive processing strategies.

watch:
  # number of recent events to consider when calculating processing rates
  rate_window_size: 10
  # sparsity factor for file processing
  # a higher value means less frequent processing
  # 1.0 means process every file, can not be less than 1.0
  sparsity_factor: 3.0
  # initial processing interval for file processing, means process one file 
  # with plugins for every N files
  # but will be adjusted automatically based on the processing rate
  # 12 means processing one file every 12 screenshots generated
  processing_interval: 12

If you want every screenshot file to be processed, you can configure the settings as follows:

# A watch config like this means process every file with plugins at the beginning
# but if the processing rate is slower than file generated, the processing interval 
# will be increased automatically
watch:
  rate_window_size: 10
  sparsity_factor: 1.0
  processing_interval: 1

Remember to do memos stop && memos start to make the new config work.

Privacy and Security

During the development of Pensieve, I closely followed the progress of similar products, especially Rewind and Windows Recall. I greatly appreciate their product philosophy, but they do not do enough in terms of privacy protection, which is a concern for many users (or potential users). Recording the screen of a personal computer may expose extremely sensitive private data, such as bank accounts, passwords, chat records, etc. Therefore, ensuring that data storage and processing are completely controlled by the user to prevent data leakage is particularly important.

The advantages of Pensieve are:

The code is completely open-source and easy-to-understand Python code, allowing anyone to review the code to ensure there are no backdoors.
Data is completely localized, all data is stored locally, and data processing is entirely controlled by the user. Data will be stored in the user's ~/.memos directory.
Easy to uninstall. If you no longer use Pensieve, you can close the program with memos stop && memos disable, then uninstall it with pip uninstall memos, and finally delete the ~/.memos directory to clean up all databases and screenshot data.
Data processing is entirely controlled by the user. Pensieve is an independent project, and the machine learning models used (including VLM and embedding models) are chosen by the user. Due to Pensieve' operating mode, using smaller models can also achieve good results.

Of course, there is still room for improvement in terms of privacy, and contributions are welcome to make Pensieve better.

Development Guide

Peeling the First Layer of the Onion

In fact, after Pensieve starts, it runs three programs:

memos serve starts the web service
memos record starts the screenshot recording program
memos watch listens to the image events generated by memos record and dynamically submits indexing requests to the server based on actual processing speed

Therefore, if you are a developer or want to see the logs of the entire project running more clearly, you can use these three commands to run each part in the foreground instead of the memos enable && memos start command.

Name		Name	Last commit message	Last commit date
Latest commit History 536 Commits
.github		.github
docs		docs
memos		memos
memos_ml_backends		memos_ml_backends
migrations		migrations
screen_recorder		screen_recorder
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.pypi.md		README.pypi.md
README_JP.md		README_JP.md
README_ZH.md		README_ZH.md
build_executable.py		build_executable.py
memos_app.py		memos_app.py
pypi_md_generator.py		pypi_md_generator.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pensieve (previously named Memos)

Features

Quick Start

1. Install Pensieve

2. Initialize

3. Start the Service

4. Access the Web Interface

Mac Permission Issues

User Guide

Using the Appropriate Embedding Model

1. Model Selection

2. Adjust Memos Configuration

3. Restart Memos Service

4. Rebuild Index

Using Ollama for Visual Search

Important Notes Before Use

1. Install Ollama

2. Prepare the Multimodal Model

3. Configure Pensieve to Use Ollama

4. Restart Pensieve Service

Full Indexing

Sampling Strategy

Privacy and Security

Other Noteworthy Content

About Storage Space

About Power Consumption

Resource Usage

Performance Optimization Strategy

Development Guide

Peeling the First Layer of the Onion

About

Releases 8

Packages

Contributors 4

Languages

License

arkohut/pensieve

Folders and files

Latest commit

History

Repository files navigation

Pensieve (previously named Memos)

Features

Quick Start

1. Install Pensieve

2. Initialize

3. Start the Service

4. Access the Web Interface

Mac Permission Issues

User Guide

Using the Appropriate Embedding Model

1. Model Selection

2. Adjust Memos Configuration

3. Restart Memos Service

4. Rebuild Index

Using Ollama for Visual Search

Important Notes Before Use

1. Install Ollama

2. Prepare the Multimodal Model

3. Configure Pensieve to Use Ollama

4. Restart Pensieve Service

Full Indexing

Sampling Strategy

Privacy and Security

Other Noteworthy Content

About Storage Space

About Power Consumption

Resource Usage

Performance Optimization Strategy

Development Guide

Peeling the First Layer of the Onion

About

Resources

License

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 4

Languages

Packages