Different from #849. Not audio upload, but attaching file to the current note. Eventually we can do OCR or feed image directly as input tokens to the model.
https://tiptap.dev/docs/editor/extensions/functionality/filehandler
https://www.codemzy.com/blog/tiptap-drag-drop-image
ueberdosis/tiptap#1600