[16.0][ADD] fs_attachment: Add new addon #260

Closed
wants to merge 47 commits

Conversation

@lmignon commented Apr 13, 2023

This PR is part of RFC #252 and depends on it.

TDu and others added 30 commits April 7, 2023 14:24
Use the base_attachment_object_storage module, the same way
attachment_swift does. Fixed a few issues in attachment_swift along
the way.
When moving attachments from the filestore to an object storage, the
filesystem files are deleted only after the commit, so if the
transaction is rolled back, we still have the local files for another
try.
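
A minimal sketch of how the deletion can be deferred until after commit,
assuming a recent Odoo where cursors expose a postcommit hook (the
helper name is hypothetical):

    import os

    def _delete_filestore_file_after_commit(env, full_path):
        def _unlink():
            try:
                os.unlink(full_path)
            except OSError:
                pass  # file already gone, nothing to clean up
        # Runs only once the transaction has been committed; on rollback
        # the callback is discarded and the local file is kept.
        env.cr.postcommit.add(_unlink)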
Assume the following situation:

* We have installed the addons base, sale and attachment_s3 (hence
base_attachment_object_storage as a dependency)
* All attachments are already in S3
* We run an upgrade of the 'base' addon; 'sale' is upgraded before
attachment_s3 in the loading order
* Sale updates the icon of the Sale menu
* As attachment_s3 is not loaded yet, the attachment is created in the
filestore

Now if we don't persist the filestore or use different servers, we'll
lose the images of the menus (or any attachment loaded by the
install/upgrade of an addon).

The implemented solution is to move the attachments from the filestore
to the object storage when the module is loaded. However, this operation
can take time and shouldn't be run by two processes at the same time, so
we want to detect whether the module is loaded during a normal Odoo
startup or because some addons have just been upgraded. At this point
there is nothing left that allows us to know that modules have just been
upgraded except... the caller frame (load_modules). We have to rely on
the inspect module and get the caller frame, which is not recommended,
but it seems to be the only way. Besides, it's not called often, and if
_register_hook were called from another place it would have no effect
(unless that other place also has an 'update_module' variable).
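
A minimal sketch of that caller-frame check, with a hypothetical helper
name for the actual move:

    import inspect

    def _register_hook(self):
        super()._register_hook()
        # Walk up the call stack looking for a frame (load_modules) that
        # has an 'update_module' local: if set, modules were just upgraded.
        update_module = False
        frame = inspect.currentframe()
        try:
            caller = frame.f_back
            while caller is not None:
                if "update_module" in caller.f_locals:
                    update_module = bool(caller.f_locals["update_module"])
                    break
                caller = caller.f_back
        finally:
            del frame  # break reference cycles created by frame objects
        if update_module:
            self._move_attachments_to_object_storage()  # hypothetical helper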
The reason being:
https://github.com/odoo/odoo/blob/9032617120138848c63b3cfa5d1913c5e5ad76db/odoo/addons/base/ir/ir_attachment.py#L344-L347

I nearly deleted this domain, but it was too weird to be there for no
reason. A comment explaining the issue was sorely missing.
Some attachments (e.g. image_small, image_medium) are stored in the DB
instead of the object storage for faster access.

In some situations, we may have pushed all these files to the object
storage (migration from a filesystem to object storage) and want to
bring these attachments back from the object storage to the database.

This method is not called anywhere but can be called by RPC or scripts.
The initial issue that triggered this rework is that the forced storage
in database worked only on writes and was never applied on attachment
creation.

This feature is used to store small files that need fast reads in the
database rather than in the object storage. Reading a file from the
object storage can take 150-200ms, which is fine for downloading a PDF
file or a single image, but not if you need 40 thumbnails.

On the way to a fix, I found that:

* the logic to force storage was called in `_inverse_datas`, which is not
  called during a create
* Odoo implemented a new method `_get_datas_related_values`, a model method
  that receives only the data and the mimetype, returns the attachment
  values and writes the file to the correct place

`_get_datas_related_values` is where we want to plug this special storage,
as it is called for both create and write, and it already handles the
values and the conditional write. But with this method we have less
information about the attachment than before, so let's review the
criteria we used to have:

* res_model: we used it to always store attachments related to
  'ir.ui.view' in DB, because assets are related to this model. However,
  we don't really need this check: we should store any JavaScript and CSS
  documents in the database.
* exclude res_model: we could have an exclusion list saying that, for
  instance, for mail.message we should never store any image in DB. We no
  longer have this information, but I think it was never used and was only
  added 'just in case': the default configuration was 'mail.mail' and
  'mail.message', and I couldn't find any attachment with such a res_model
  in any of our biggest databases. So this is removed.
* mimetype and data (size) are the remaining criteria, and we still have them

The new system is based only on mimetype and data size, and I think it's
actually more versatile. Previously, we could set a global size and
include mimetypes, but we couldn't say "I want to store all images below
50KB and all files of type X below 10KB". Now we have a single system
parameter with a dict configuration
(`ir_attachment.storage.force.database`) defaulting to:

    {"image/": 51200, "application/javascript": 0, "text/css": 0}

Assets have a limit of zero, which means they will all be stored in the database
whatever their size is.
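
A minimal sketch of how such a rules dict could be evaluated (the helper
name and the JSON parsing are assumptions, not necessarily the module's
actual API):

    import json

    FORCE_DB_PARAM = "ir_attachment.storage.force.database"
    DEFAULT_RULES = {"image/": 51200, "application/javascript": 0, "text/css": 0}

    def _should_store_in_db(env, mimetype, data_size):
        raw = env["ir.config_parameter"].sudo().get_param(FORCE_DB_PARAM)
        rules = json.loads(raw) if raw else DEFAULT_RULES
        for prefix, limit in rules.items():
            if mimetype and mimetype.startswith(prefix):
                # A limit of 0 means "always store in database".
                return limit == 0 or data_size <= limit
        return False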

Overall, this is a great simplification of the module too, as the method
`_get_datas_related_values` integrates it better with the base behavior
of IrAttachment.

Note for upgrade:

I doubt we ever customized the previous system parameters, which are now
obsolete, but if so, the configuration may need to be moved to
`ir_attachment.storage.force.database`.
For the record, the params were:

* mimetypes.list.storedb (default: image)
* file.maxsize.storedb (default: 51200)
* excluded.models.storedb (mail.message,mail.mail), no equivalent now

The method IrAttachment.force_storage_to_db_for_special_fields() should be called
through a migration script on existing databases to move the attachments back into
the database.
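
For instance, a post-migration script along these lines (a sketch
following the standard Odoo migration convention) could do it:

    from odoo import SUPERUSER_ID, api

    def migrate(cr, version):
        env = api.Environment(cr, SUPERUSER_ID, {})
        # Move small images/assets back from the object storage into the DB.
        env["ir.attachment"].force_storage_to_db_for_special_fields()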
The main goal is to be able to easily grep and sed when we do mass
updates on them.
* fix: azure reading in stream monkey patch documents
@lmignon lmignon marked this pull request as ready for review June 4, 2023 21:12
@yvaucher (Member) left a comment

Only a question about the dependency. Otherwise LGTM

fs_attachment/__manifest__.py (review thread resolved)
requirements.txt (review thread resolved)
Avoid recompute of new columns when installed on an existing database
Allow providing configuration parameters through server environment files.
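
For illustration, reading such a parameter through OCA's
server_environment could look like this (the section and option names
are made up):

    from odoo.addons.server_environment import serv_config

    # serv_config behaves like a ConfigParser fed from the server
    # environment files; the section/option below are hypothetical.
    if serv_config.has_section("fs_storage.my_s3"):
        bucket = serv_config.get("fs_storage.my_s3", "bucket")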
Remove the code used to try to read a file from the root filesystem and
write it into the specialized filesystem. This code was meant to provide
a way to manage staging environments by reusing the same filesystem
storage with a different directory_path depending on the environment. A
simpler method is to configure a different filesystem storage per
environment. If a production database is restored in a pre-production
environment, you can declare a new filesystem storage with a different
code to store the attachments by default, and configure the filesystem
storage from production with information allowing to read the documents
stored in it but not to modify or delete them. This makes the
implementation far simpler.
To create a new cursor, just ask the current registry. Loading a registry
is very time-consuming and could lead to deadlocks.
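
A minimal sketch, assuming the registry for the current database is
already loaded:

    import odoo
    from odoo import SUPERUSER_ID, api

    def _run_with_new_cursor(env):
        # Reuse the registry already loaded for this database instead of
        # loading a new one, which is slow and can deadlock.
        registry = odoo.registry(env.cr.dbname)
        with registry.cursor() as cr:
            new_env = api.Environment(cr, SUPERUSER_ID, env.context)
            # ... work with new_env; committed when the cursor exits cleanly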
@lmignon commented Aug 24, 2023

superseded by #269

@lmignon lmignon closed this Aug 24, 2023
@lmignon lmignon deleted the 16.0-refactoring-rfc-fs_attachment-lmi branch August 24, 2023 09:20