Skip to content

Commit

Permalink
better comments in config
Browse files Browse the repository at this point in the history
  • Loading branch information
kitchWWW committed Apr 12, 2018
1 parent e016209 commit e7ddf3d
Show file tree
Hide file tree
Showing 4 changed files with 18 additions and 16 deletions.
10 changes: 7 additions & 3 deletions database/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,11 +11,11 @@ The important files:

To produce your database:

1. `git clone https://github.com/Bobay-Ochman/ConSpeciFix.git`
1. Download the latest stable release from [our github](https://github.com/Bobay-Ochman/ConSpeciFix/releases).
2. update the `genomes_proks.txt` with the newest list from [NCBI](https://www.ncbi.nlm.nih.gov/genome/browse#!/prokaryotes/)
3. Edit path variables in `ConSpeciFix/database/config`
4. `cd ConSpeciFix/database`
5. `python species.py`
5. Run `python species.py` to produce the list of all possible species to build into the database.
6. Remove any unwanted species from `species.txt`
7. `python runner.py`

Expand Down Expand Up @@ -83,4 +83,8 @@ The following scripts allow the Exclusion Criterion to run in order to identify
- `kmeans.py` creates the Rscript that identfies two modes in the distribution with a kmeans test and produces graphs of the distribution file, as well as `vector.txt` and `key.txt`.
- `split_kmeans.py` analyses the results of kmeans.py and produces `kmeans.txt`
- `kmeansGraph.py` generates Rscript that graphs the result of `kmeans.txt` and identifies outliers, outputing `kmeans.pdf` and `for_removal.txt`
- `criterion.py` writes the final results into `criterion.txt`
- `criterion.py` writes the final results into `criterion.txt`

### Additional Scripts

Some additional scripts for mass database editing are included in `database/auxScripts`. These scripts are not a part of the pipeline, but can be used to aid in analysis, progress reporting, aggregating results, or cleaning up the data produced. Code in this section is provided as-is, and is not intended to be stable or complete.
4 changes: 2 additions & 2 deletions database/config.py
Original file line number Diff line number Diff line change
Expand Up @@ -35,8 +35,8 @@
# RAXML where the binaries themselves are optomized for multithreading
MAX_THREADS = multiprocessing.cpu_count()

#Used to ignore species that would take computationally very long times
MAX_SPECIES_SIZE = 1000000000
# Used to ignore species that would take computationally very long times
MAX_SPECIES_SIZE = 500


############### Things users will not need to change ###############
Expand Down
6 changes: 3 additions & 3 deletions personalCompare/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,9 +10,9 @@ The important files:

To run your folder comparison:

1. `git clone https://github.com/Bobay-Ochman/ConSpeciFix.git`
2. Edit path variables in `ConSpeciFix/personalCompare/config`
3. `cd ConSpeciFix/personalCompare`
1. Download the latest stable release from [our github](https://github.com/Bobay-Ochman/ConSpeciFix/releases).
2. Edit path variables in `ConSpeciFix-#.#.#/personalCompare/config`
3. `cd ConSpeciFix-#.#.#/personalCompare`
4. `python runner.py`

## The Hard Way
Expand Down
14 changes: 6 additions & 8 deletions personalCompare/config.py
Original file line number Diff line number Diff line change
Expand Up @@ -8,13 +8,11 @@
############### VARIABLES USERS WILL NEED TO CHANGE ###############


#Path to output should be followed by a tailing slash
#This is where the full database will be built,
# and if all the species are being processed,
# this can be upwards of 500GB
PATH_TO_FOLDER = '/stor/home/be4833/testSpec/genes'
# This is where your folder of strains should be located
# Path should be followed by a tailing slash
PATH_TO_FOLDER = '/user/documents/myCoolProject/myGenomes/'

#your local instalations for the following programs,
# Your local instalations for the following programs,
# or simply the name of the program if it can be accessed from the command line
# for example
# RAXML_PATH = '/Users/Admin/programs/standard-RAxML-master/raxmlHPC-SSE3'
Expand All @@ -26,9 +24,9 @@

# command line character to pipe output to a file.
# On most shells, use ' &> '.
PIPE_CHAR = ' > '
PIPE_CHAR = ' &> '
# if the & is interpreted as launching a new subprocess instead of the pipe,
# a simple ' > ' will work instead.
# a simple ' > ' will work as well, though STD.ERR will still be printed.


############### VARIABLES USERS MIGHT NEED TO CHANGE ###############
Expand Down

0 comments on commit e7ddf3d

Please sign in to comment.