Replies: 5 comments 14 replies
-
这个是测试过程
|
Beta Was this translation helpful? Give feedback.
-
|
这个bug很影响使用,希望官方可以给出解决方法 谢谢 |
Beta Was this translation helpful? Give feedback.
-
|
终于解决这个问题了:ocr-det ch阶段不使用gpu加速。原因: |
Beta Was this translation helpful? Give feedback.
-
|
终于解决这个问题了:ocr-det ch阶段不使用gpu加速。原因: |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.





Uh oh!
There was an error while loading. Please reload this page.
-
我按照之前issue#3285的回复,盘查问题。我现在没有用fastapi+uvicorn,我使用pipeline解析策略,并且显现配置了环境变量
os.environ['MINERU_MODEL_SOURCE'] = "local" # 配置文件位置/root/mineru.json
os.environ['MINERU_DEVICE_MODE'] = "cuda",
以下是我的代码:
import os
import tempfile
from pathlib import Path
import uuid
from mineru.cli.common import prepare_env, read_fn
from mineru.data.data_reader_writer import FileBasedDataWriter
from mineru.utils.enum_class import MakeMode
from mineru.backend.pipeline.pipeline_analyze import doc_analyze as pipeline_doc_analyze
from mineru.backend.pipeline.model_json_to_middle_json import result_to_middle_json as pipeline_result_to_middle_json
from mineru.backend.pipeline.pipeline_middle_json_mkcontent import union_make as pipeline_union_make
pdf_extensions = [".pdf"]
image_extensions = [".png", ".jpg", ".jpeg"]
文档解析方法
def process_file(
file_bytes: bytes,
file_name: str,
) -> list:
"""
Process PDF file content
def file_parse(
filepath: str
):
"""
Execute the process of converting PDF to JSON and MD, outputting MD and JSON files
to the specified directory.
if name == "main":
file_path = "/app/3333.pdf"
content_list, middle_json_content = file_parse(file_path)
print(f"content_list:{content_list}")
print(f"middle_json_content:{middle_json_content}")
并且在测试的时候,打印出来的环境变量如下:$debian_chroot)}\u@\h:\w$
SHELL=/bin/bash
COLORTERM=truecolor
PYTHON_SHA256=ae665bc678abd9ab6a6e1573d2481625a53719bc517e9a634ed2b9fefae3817f
PYTHONUNBUFFERED=1
TERM_PROGRAM_VERSION=1.85.2
HOSTNAME=bea671b3fe0c
PYTHON_VERSION=3.10.18
SSH_AUTH_SOCK=/tmp/vscode-ssh-auth-d0c10770-f0fe-42d6-b784-64298613cf28.sock
REMOTE_CONTAINERS_IPC=/tmp/vscode-remote-containers-ipc-d0c10770-f0fe-42d6-b784-64298613cf28.sock
PWD=/app
HOME=/root
LANG=C.UTF-8
VIRTUAL_ENV=/app/venv
REMOTE_CONTAINERS=true
GPG_KEY=A035C8C19219BA821ECEA86B64E628F8D684696D
TERM=xterm-256color
REMOTE_CONTAINERS_SOCKETS=["/tmp/vscode-ssh-auth-d0c10770-f0fe-42d6-b784-64298613cf28.sock"]
PIP_DISABLE_PIP_VERSION_CHECK=1
SHLVL=2
VIRTUAL_ENV_PROMPT=(venv)
PYTHONDONTWRITEBYTECODE=1
PS1=(venv) ${debian_chroot:+(
BROWSER=/root/.vscode-server/bin/8b3775030ed1a69b13e4f4c628c612102e30a681/bin/helpers/browser.sh
PATH=/app/venv/bin:/root/.vscode-server/bin/8b3775030ed1a69b13e4f4c628c612102e30a681/bin/remote-cli:/usr/local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
PIP_NO_CACHE_DIR=1
DEBIAN_FRONTEND=noninteractive
TERM_PROGRAM=vscode
VSCODE_IPC_HOOK_CLI=/tmp/vscode-ipc-e6ad6fee-d6a1-4105-ab62-bb7e5511b7a4.sock
_=/usr/bin/env
OLDPWD=/app
PYTHONIOENCODING=UTF-8
PYDEVD_USE_FRAME_EVAL=NO
QT_QPA_PLATFORM_PLUGIN_PATH=/app/venv/lib/python3.10/site-packages/cv2/qt/plugins
QT_QPA_FONTDIR=/app/venv/lib/python3.10/site-packages/cv2/qt/fonts
LD_LIBRARY_PATH=/app/venv/lib/python3.10/site-packages/cv2/../../lib64:
NUMEXPR_MAX_THREADS=8
CUBLAS_WORKSPACE_CONFIG=:4096:8
TF_CPP_MIN_LOG_LEVEL=3
OMP_NUM_THREADS=1
TORCH_CPP_LOG_LEVEL=ERROR
KINETO_LOG_LEVEL=5
PYTORCH_ENABLE_MPS_FALLBACK=1
NO_ALBUMENTATIONS_UPDATE=1
FTLANG_CACHE=/app/venv/lib/python3.10/site-packages/mineru/resources/fasttext-langdetect
MINERU_MODEL_SOURCE=local
MINERU_DEVICE_MODE=cuda
Beta Was this translation helpful? Give feedback.
All reactions