From 61c8755283617fbda8f11da3679ef6413bc88aa5 Mon Sep 17 00:00:00 2001 From: Vildan <64216738+Meryem1425@users.noreply.github.com> Date: Wed, 10 May 2023 17:44:50 -0400 Subject: [PATCH] Update licensed_install.md (#203) * Update licensed_install.md * Update licensed_install.md --- docs/en/licensed_install.md | 90 ++++++++++++++++++++++++++++++++++++- 1 file changed, 89 insertions(+), 1 deletion(-) diff --git a/docs/en/licensed_install.md b/docs/en/licensed_install.md index 8eaac88132..3207a71d46 100644 --- a/docs/en/licensed_install.md +++ b/docs/en/licensed_install.md @@ -247,6 +247,94 @@ Either create a conda env for python 3.6, install *pyspark==3.1.2 spark-nlp nump
+### Windows Server + +1. Install JAVA 18 - https://adoptium.net/temurin/releases/?version=18 + + i) Once installed , we can check java is installed or not using following command Open Cmd and type java --version + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/56176cf8-0184-4147-9923-bb00452bcf20) + +2. Install Hadoop tools + + i) Download the pre-compiled Hadoop binaries winutils.exe, hadoop.dll from winutils/hadoop-3.2.0/bin at master ยท cdarlint/winutils + + ii) Copy files into a folder called C:\\hadoop\\bin + + Note: The version above is for Spark 3.1.2, which was built for Hadoop 3.2.0. You might have to change the hadoop version in the link, depending on which Spark version you are using. + +3. Install Apache Spark and Microsoft Visual C++ 2010 + + i) Download Apache Spark - 3.2 from https://archive.apache.org/dist/spark/spark-3.2.3/spark-3.2.3-bin-hadoop3.2.tgz + ii) Extract the archive file to location C:\\spark. (C:\\spark folder should have folder bin - so it should belike C\\spark\\bin ) + iii) Intsall Microsoft Visual C++ 2010 from Microsoft Visual C++ 2010 Service Pack 1 Redistributable Package MFC Security Update + +4. Configure Hadoop and Spark ENV variables + +Windows Explorer โ†’ This PC -> Right Click select properties -> Click on Advanced system settings -> Click on Environment Variables Under system variables -> Click on new + + +Add HADOOP_HOME as below +VARIABLE Name: HADOOP_HOME VARIABLE Value: C:\\hadoop + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/7c8eaa03-1e55-4054-9851-c30c0b8fab46) + +Add SPARK_HOME as below + +VARIABLE Name: SPARK_HOME VARIABLE Value: C:\\spark + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/b1e8d0bf-e6ba-4062-9f77-704bc87c7226) + +Include HADOOP_HOME and SPARK_HOME in Path environment variable + +Select the Path (from variable) -> Click on edit โ†’ Click on New โ†’ add %HADOOP_HOME%\bin and %SPARK_HOME%\bin + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/60221c6b-cbc5-4c8e-9abd-25b4563950e4) + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/0e290cef-bd57-41ce-90a9-fc354b616540) + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/48a809f2-2b26-4b82-bc75-64aac758a33b) + +5. Configure Conda env + + i) Download Miniconda for python 3.8 from https://repo.anaconda.com/miniconda/Miniconda3-py38_4.11.0-Windows-x86_64.exe + ii) Install miniconda exe file. + iii) Under system variables -> Select the Path (from variable) -> Click on edit -> (ADD the miniconda install location)/bin (Same steps as above) + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/331b1089-7eed-46e5-83e4-48bff30995e0) + +6. Configure conda env + + i) Open cmd and execute the following commands + + ``` +- conda --version +- java --version +- conda create -n sparknlp python=3.8 -y +- conda activate sparknlp +- pip install spark-nlp==4.4.1 pyspark==3.3.1 +- pip install jupyter +- conda env config vars set PYSPARK_PYTHON=python +- conda activate sparknlp +- conda env config vars set PYSPARK_DRIVER_PYTHON=jupyter +- conda activate sparknlp +- conda env config vars set PYSPARK_DRIVER_python_OPTS=notebook +- conda activate sparknlp +``` + + ii) Start the spark nlp + +``` +- cd C:\spark\bin +- pyspark --packages com.johnsnowlabs.nlp:spark-nlp_2.12:4.4.1 +``` + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/b4213eef-af82-4ebd-a968-4a0669807331) + + iii) Jupyter notebook will be opened in default browser and you can install johnsnowlabs using following screenshot. + +![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/b4383fec-2785-4a0b-b18e-88d916c80565) + ## Non-johnsnowlabs Clinical NLP on Ubuntu > These instructions use non-johnsnowlabs installation syntax. For simplified installation with `johnsnowlabs` library, check first section. @@ -834,4 +922,4 @@ The license json file contains: If you have asked for a trial license, but you cannot access your account on [my.JohnSnowLabs.com](https://my.johnsnowlabs.com/) and you did not receive the license information via email, please contact us at support@johnsnowlabs.com. -
\ No newline at end of file +