Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hotfix SlutmPlatform memory space issue #2006

Merged

Conversation

shchen-idmod and others added 30 commits January 12, 2023 14:02
…/1.7.3

Add changelog for 1.7.3 release to dev branch
…/1.7.3.nightly

bump dev to 1.7.3.nightly
…DiseaseModeling/release-1.7.x

Merge 1.7.4 to master
# Conflicts:
#	idmtools_platform_slurm/idmtools_platform_slurm/platform_operations/simulation_operations.py
@ZDu-IDM ZDu-IDM changed the base branch from dev to release-1.7.x January 27, 2023 17:44
@shchen-idmod
Copy link
Collaborator

Tested, everything looks good.
Memory dramatically reduced. Tested with 2500 serialized simulations from Anne's script. memory used 2.5G in login node(max memory size is 6G).
Before fix. script failed around 400 sims with over 4G max heapsize

Summary data:
Initializing and commssion took 26mins
Execution: 1:23:24
...
Initializing objects for creation: 99%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████ | 2544/2560 [26:43<00:09, 1.76simulation/s]WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
Commissioning Simulations: 0%| | 0/2560 [00:00<?, ?simulation/s]WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
Commissioning Simulations: 100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2560/2560 [00:01<00:00, 1741.96simulation/s]
job_id: 1554587
job_directory: /home/scj6369/example_emodpy_malaria/burnin_create_and_use_sweep_larval_habitat1
suite: a5964d87-8b14-4fcc-926d-70c0ccd101cb
experiment: c858fe53-f548-4dea-a164-d78a90d527ec
Waiting on Experiment 4EIR_pickup_PvAd to Finish running: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2560/2560 [1:23:24<00:00, 1.95s/simulation]
Experiment c858fe53-f548-4dea-a164-d78a90d527ec succeeded.

memory2

@shchen-idmod
Copy link
Collaborator

With 640 sims, took 6 mins init/commission and 18mins run:
perf_new_640.txt
Initializing objects for creation: 98%|████████████████████████████████████████████████████████████████████████████████▉ | 624/640 [06:34<00:10, 1.57simulation/s]WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
Commissioning Simulations: 0%| | 0/640 [00:00<?, ?simulation/s]WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
WARNING: During schema-based param purge, Disable_IP_Whitelist not in schema.
Commissioning Simulations: 100%|██████████████████████████████████████████████████████████████████████████████████████████| 640/640 [00:01<00:00, 453.71simulation/s]job_id: 1579154
job_directory: /home/scj6369/example_emodpy_malaria/burnin_create_and_use_sweep_larval_habitat1
suite: d8517fb6-42a1-4b89-949d-974e0fce1fc1
experiment: aaa37eaf-90f8-4a66-a342-052ae750f08a
Commissioning Simulations: 100%|██████████████████████████████████████████████████████████████████████████████████████████| 640/640 [00:15<00:00, 453.71simulation/sExperiment aaa37eaf-90f8-4a66-a342-052ae750f08a succeeded. 100%|████████████████████████████████████████████████████████████| 640/640 [18:43<00:00, 3.28s/simulation]

aaa37eaf90f84a66a342052ae750f08a
Commissioning Simulations: 100%|███████████████████████████████████████████████████████████████████████████████████████████| 640/640 [18:45<00:00, 1.76s/simulation]
Waiting on Experiment 4EIR_pickup_PvAd to Finish running: 100%|████████████████████████████████████████████████████████████| 640/640 [18:44<00:00, 1.76s/simulation]

@shchen-idmod shchen-idmod merged commit 15b54ef into InstituteforDiseaseModeling:release-1.7.x Jan 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants