New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Importer fails with Umlaut #898
Comments
I was able export and import again a file named |
Sure. What exactly would you need? |
Just any output from when you run the command |
I use: docker-compose exec webserver document_importer ../export This results into: CommandError: The manifest file refers to "2019-12-31 Antrag_auf_Kostenübernahme_einer_Individualbegleitung_für_den_Besuch_e.pdf" which does not appear to be in the source directory. The file is in the export folder: |
I'm still unable to reproduce this so far. Again using the exact same name, I'm able to import and export without issue. One this I notice is the question marks in your |
LANG=en_US.UTF-8 |
That seems about correct. I think you can determine how to get just Bash displaying the special characters instead of |
Hi! I just wanted to report that I also have this issue. Exported the documents via exporter with paperless-ng 1.5, tried importing with paperless-ngx 1.7 installed. I receive the same CommandError.
as well as in the folder
|
You're not going to be able to import from 1.5 anyway. An in place upgrade should work though. Unfortunately, as I can't reproduce this at all, it's going to be hard to fix.
|
Ah OK, I did the steps as you pointed out and it worked without errors. The deleted file with Umlaut was imported again without errors. How would I then transfer from 1.5? 1.5 is running in docker on Mac, I want to use 1.7.1 on RaspberryPi. I think I need to move maybe the databases but did not find anything in the documentation. Would it work to update first to 1.7.1 on Mac, then use the exporter. And importing on RaspberryPi would then be without issues? Thanks! |
Yes, that should work just fine. How were you running the exporter before? There must be some difference |
Hi stumpylog,
As you recommended, I upgraded to 1.7.1 on Mac which worked fine. Then I exported the documents again using the same terminal command. I then transferred those files to Raspberry Pi, also running 1.7.1. When I started the importer using
again, I get the CommandError:
Could it be a problem because the files were generated on Mac but imported on Raspbian? Thank you! |
Hm, that could be, I've never tried between systems. The easy test would be trying to import again on the same system (such as a new setup to test with). |
I will set up my Ubuntu server maybe this week and also try some things. |
You'd need to delete the exported documents from paperlesss or import to a fresh instance. The importer won't overwrite an existing file. The good news is, this is after the check which originally failed, so it appears to be getting further along. |
i have the same situation coming form the same hardware, I created on m1 MacBook and exported there. now I want to import it on unraid / linux system and I also think its not working because of the Umlaut problem |
I did that as a test with 51 documents on my MacBook m1 where I created the files. the importer imports flawlessly after exporting and deleting by webui so there is a bug between macOS language and linux language handling with umlaut somewhere |
Hi Have the same (?) problem: 15:10:47 [Q] INFO Process-1:11 stopped doing work Umlaute can't import My locale: |
@HorstyS did you get it working now? I have the same problem, migrating from a M1 Mac (Ventura 13.2) to Ubuntu (22.04.1 LTS) |
This issue has been automatically locked since there has not been any recent activity after it was closed. Please open a new discussion or issue for related concerns. |
Description
Hi,
I had to reinstall my document server (Ubuntu with docker containers). So i first exported everything with the exporter into a dedicated directory, which went fine.
After the reinstall of the server and the docker images via the provided script I started the importer.
I reveive now a lot of errors with e.g.: "The manifest file refers to "Kostenübernahme.pdf" which does not appear to be in the source directory". But the file is there as mentioned in the error message.
What I can see is, that this only happens for files with Umlaute in the filename. All others seem to be okay.
I have to say, I might have had version 1.6 for the export, and now I have 1.7 which was obviously released some days ago. But I also installed 1.6 and tried the import with the same result.
Any idea what might be the issue?
Thanks,
Richard
Steps to reproduce
Webserver logs
No response
Paperless-ngx version
1.7.0
Host OS
Ubuntu 22.04 LTS
Installation method
Docker
Browser
Chrome
Configuration changes
No response
Other
No response
The text was updated successfully, but these errors were encountered: