Update licensed_install.md (#203)
* Update licensed_install.md

* Update licensed_install.md
Meryem1425 committed May 10, 2023
1 parent 77c9343 commit 61c8755
Showing 1 changed file with 89 additions and 1 deletion.
90 changes: 89 additions & 1 deletion docs/en/licensed_install.md
@@ -247,6 +247,94 @@ Either create a conda env for python 3.6, install *pyspark==3.1.2 spark-nlp nump

</div><div class="h3-box" markdown="1">

### Windows Server

1. Install Java 18 from https://adoptium.net/temurin/releases/?version=18

i) Once installed, verify the installation: open Cmd and run `java --version`.

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/56176cf8-0184-4147-9923-bb00452bcf20)
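
The same check can be scripted, which may help when validating several servers. The following is a minimal sketch rather than part of the official instructions; the helper name `java_version_line` is hypothetical, and it only assumes that `java` is expected on the Path.

```python
import shutil
import subprocess
from typing import Optional


def java_version_line(executable: str = "java") -> Optional[str]:
    """Return the first line printed by `<executable> --version`, or None.

    None means the executable was not found on the Path or exited with an error.
    (Hypothetical helper, not part of any John Snow Labs tooling.)
    """
    path = shutil.which(executable)
    if path is None:
        return None
    result = subprocess.run(
        [path, "--version"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return None
    # Some JVM builds print version details to stderr, so fall back to it.
    output = (result.stdout or result.stderr).strip()
    return output.splitlines()[0] if output else None
```

Calling `java_version_line()` from a Python prompt returns a version string when Java is installed and `None` otherwise.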

2. Install Hadoop tools

i) Download the pre-compiled Hadoop binaries `winutils.exe` and `hadoop.dll` from the `hadoop-3.2.0/bin` folder (master branch) of the cdarlint/winutils repository on GitHub.

ii) Copy the files into a folder called C:\\hadoop\\bin

Note: The version above is for Spark 3.1.2, which was built for Hadoop 3.2.0. You might have to change the Hadoop version in the link, depending on which Spark version you are using.
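
As a quick sanity check after copying, the folder contents can be verified from Python. This is a sketch under the assumption that the two files named above are the only ones required; `missing_hadoop_files` is a hypothetical helper name.

```python
from pathlib import Path
from typing import List

# The two binaries this guide asks you to copy into the Hadoop bin folder.
REQUIRED_HADOOP_FILES = ("winutils.exe", "hadoop.dll")


def missing_hadoop_files(bin_dir: str) -> List[str]:
    """Return the names of required Hadoop binaries that are absent from bin_dir."""
    folder = Path(bin_dir)
    return [name for name in REQUIRED_HADOOP_FILES if not (folder / name).is_file()]
```

For the layout used in this guide, `missing_hadoop_files(r"C:\hadoop\bin")` should return an empty list once both files are in place.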

3. Install Apache Spark and Microsoft Visual C++ 2010

i) Download Apache Spark 3.2 from https://archive.apache.org/dist/spark/spark-3.2.3/spark-3.2.3-bin-hadoop3.2.tgz

ii) Extract the archive to C:\\spark. The C:\\spark folder should directly contain the bin folder, so the path looks like C:\\spark\\bin.

iii) Install Microsoft Visual C++ 2010 from the "Microsoft Visual C++ 2010 Service Pack 1 Redistributable Package MFC Security Update" download.

4. Configure Hadoop and Spark ENV variables

Windows Explorer -> This PC -> right-click and select Properties -> Click on Advanced system settings -> Click on Environment Variables -> Under System variables, click on New

Add HADOOP_HOME as below

Variable name: HADOOP_HOME, Variable value: C:\\hadoop

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/7c8eaa03-1e55-4054-9851-c30c0b8fab46)

Add SPARK_HOME as below

Variable name: SPARK_HOME, Variable value: C:\\spark

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/b1e8d0bf-e6ba-4062-9f77-704bc87c7226)

Include HADOOP_HOME and SPARK_HOME in the Path environment variable

Select Path (under System variables) -> Click on Edit -> Click on New -> add %HADOOP_HOME%\bin and %SPARK_HOME%\bin

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/60221c6b-cbc5-4c8e-9abd-25b4563950e4)

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/0e290cef-bd57-41ce-90a9-fc354b616540)

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/48a809f2-2b26-4b82-bc75-64aac758a33b)
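
After closing the dialogs, open a new Cmd window so the changes are picked up. The sketch below shows one way to confirm the three settings from Python. The function name `check_spark_env` is hypothetical, and the check assumes the variables are already expanded in the process environment, as they are when read through `os.environ`.

```python
from typing import List, Mapping


def check_spark_env(env: Mapping[str, str]) -> List[str]:
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    # Windows separates Path entries with ';' and compares them case-insensitively.
    raw_path = env.get("Path", env.get("PATH", ""))
    path_entries = {
        entry.strip().rstrip("\\/").lower()
        for entry in raw_path.split(";")
        if entry.strip()
    }
    for name in ("HADOOP_HOME", "SPARK_HOME"):
        home = env.get(name)
        if not home:
            problems.append(f"{name} is not set")
            continue
        # The guide adds %HADOOP_HOME%\bin and %SPARK_HOME%\bin to Path.
        expected = (home.rstrip("\\/") + "\\bin").lower()
        if expected not in path_entries:
            problems.append(f"{name}\\bin is not on Path")
    return problems
```

To run it against the current session, `import os` and call `check_spark_env(os.environ)`; any returned message points at the variable to revisit.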

5. Install Miniconda

i) Download Miniconda for Python 3.8 from https://repo.anaconda.com/miniconda/Miniconda3-py38_4.11.0-Windows-x86_64.exe

ii) Run the Miniconda installer (.exe).

iii) Under System variables -> Select Path -> Click on Edit -> add (the Miniconda install location)/bin (same steps as above)

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/331b1089-7eed-46e5-83e4-48bff30995e0)

6. Configure conda env

i) Open cmd and execute the following commands

```
REM Confirm that conda and Java are reachable from this prompt
conda --version
java --version
REM Create and activate the environment, then install the Python packages
conda create -n sparknlp python=3.8 -y
conda activate sparknlp
pip install spark-nlp==4.4.1 pyspark==3.3.1
pip install jupyter
REM Re-activate after each change so the new variable takes effect
conda env config vars set PYSPARK_PYTHON=python
conda activate sparknlp
conda env config vars set PYSPARK_DRIVER_PYTHON=jupyter
conda activate sparknlp
conda env config vars set PYSPARK_DRIVER_PYTHON_OPTS=notebook
conda activate sparknlp
```

ii) Start Spark NLP

```
cd C:\spark\bin
pyspark --packages com.johnsnowlabs.nlp:spark-nlp_2.12:4.4.1
```

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/b4213eef-af82-4ebd-a968-4a0669807331)

iii) A Jupyter notebook opens in the default browser, where you can install johnsnowlabs as shown in the following screenshot.

![image](https://github.com/JohnSnowLabs/johnsnowlabs/assets/64216738/b4383fec-2785-4a0b-b18e-88d916c80565)
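
If you prefer to start the session from inside a notebook or script rather than through the `pyspark --packages` launcher, something along these lines may work. It is a sketch, not the documented procedure: the helper names and the app name are hypothetical, the Scala suffix `2.12` and version `4.4.1` are taken from the command above, and the session settings are a bare minimum rather than tuned values.

```python
def spark_nlp_coordinate(version: str, scala_version: str = "2.12") -> str:
    """Build the Maven coordinate that the guide passes to `pyspark --packages`."""
    return f"com.johnsnowlabs.nlp:spark-nlp_{scala_version}:{version}"


def start_local_session(spark_nlp_version: str = "4.4.1"):
    """Create a local SparkSession that downloads the Spark NLP jar on startup.

    pyspark is imported lazily so this module can be loaded without it installed.
    """
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder
        .appName("spark-nlp-smoke-test")  # hypothetical app name
        .master("local[*]")
        .config("spark.jars.packages", spark_nlp_coordinate(spark_nlp_version))
        .getOrCreate()
    )
```

Inside the `sparknlp` conda environment, `spark = start_local_session()` followed by `spark.version` gives a quick confirmation that Spark started with the package resolved.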

## Non-johnsnowlabs Clinical NLP on Ubuntu
> These instructions use non-johnsnowlabs installation syntax. For simplified installation with the `johnsnowlabs` library, see the first section.

@@ -834,4 +922,4 @@ The license json file contains:
If you have asked for a trial license, but you cannot access your account on [my.JohnSnowLabs.com](https://my.johnsnowlabs.com/) and you did not receive the license information via email, please contact us at <a href="mailto:support@johnsnowlabs.com">support@johnsnowlabs.com</a>.
</div>
</div>
