dpgen init_bulk machine parameters
- init_bulk_mdata:
  - type: dict
  - argument path: init_bulk_mdata
  - machine.json file
  - api_version:
    - type: str
    - argument path: init_bulk_mdata/api_version
    - Please set to 1.0.
  - deepmd_version:
    - type: str, optional, default: 2
    - argument path: init_bulk_mdata/deepmd_version
    - DeePMD-kit version, e.g. 2.1.3
  - fp:
    - type: dict
    - argument path: init_bulk_mdata/fp
    - Parameters of command, machine, and resources for fp
    - command:
      - type: str
      - argument path: init_bulk_mdata/fp/command
      - Command of a program.
    - machine:
      - type: dict
      - argument path: init_bulk_mdata/fp/machine
      - batch_type:
        - type: str
        - argument path: init_bulk_mdata/fp/machine/batch_type
        - The batch job system type. Options: SlurmJobArray, PBS, Lebesgue, LSF, DpCloudServer, Slurm, DistributedShell, Shell, Torque
      - local_root:
        - type: NoneType | str
        - argument path: init_bulk_mdata/fp/machine/local_root
        - The directory where the tasks and related files are located. Typically the project directory.
      - remote_root:
        - type: NoneType | str, optional
        - argument path: init_bulk_mdata/fp/machine/remote_root
        - The directory where the tasks are executed on the remote machine. Only needed when the context is not lazy-local.
      - clean_asynchronously:
        - type: bool, optional, default: False
        - argument path: init_bulk_mdata/fp/machine/clean_asynchronously
        - Clean the remote directory asynchronously after the job finishes.
Depending on the value of context_type, different sub args are accepted.
      - context_type:
        - type: str (flag key)
        - argument path: init_bulk_mdata/fp/machine/context_type
        - possible choices: LocalContext, LebesgueContext, HDFSContext, DpCloudServerContext, LazyLocalContext, SSHContext
        - The connection used to reach the remote machine.
      - When context_type is set to LocalContext (or its aliases localcontext, Local, local):
        - remote_profile:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/machine[LocalContext]/remote_profile
          - The information used to maintain the connection with the remote machine. This field is empty for this context.
      - When context_type is set to LebesgueContext (or its aliases lebesguecontext, Lebesgue, lebesgue):
        - remote_profile:
          - type: dict
          - argument path: init_bulk_mdata/fp/machine[LebesgueContext]/remote_profile
          - The information used to maintain the connection with the remote machine.
          - email:
            - type: str
            - argument path: init_bulk_mdata/fp/machine[LebesgueContext]/remote_profile/email
            - Email
          - password:
            - type: str
            - argument path: init_bulk_mdata/fp/machine[LebesgueContext]/remote_profile/password
            - Password
          - program_id:
            - type: int, alias: project_id
            - argument path: init_bulk_mdata/fp/machine[LebesgueContext]/remote_profile/program_id
            - Program ID
          - keep_backup:
            - type: bool, optional
            - argument path: init_bulk_mdata/fp/machine[LebesgueContext]/remote_profile/keep_backup
            - Keep the downloaded and uploaded zip files.
          - input_data:
            - type: dict
            - argument path: init_bulk_mdata/fp/machine[LebesgueContext]/remote_profile/input_data
            - Configuration of the job
      - When context_type is set to HDFSContext (or its aliases hdfscontext, HDFS, hdfs):
        - remote_profile:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/machine[HDFSContext]/remote_profile
          - The information used to maintain the connection with the remote machine. This field is empty for this context.
      - When context_type is set to DpCloudServerContext (or its aliases dpcloudservercontext, DpCloudServer, dpcloudserver):
        - remote_profile:
          - type: dict
          - argument path: init_bulk_mdata/fp/machine[DpCloudServerContext]/remote_profile
          - The information used to maintain the connection with the remote machine.
          - email:
            - type: str
            - argument path: init_bulk_mdata/fp/machine[DpCloudServerContext]/remote_profile/email
            - Email
          - password:
            - type: str
            - argument path: init_bulk_mdata/fp/machine[DpCloudServerContext]/remote_profile/password
            - Password
          - program_id:
            - type: int, alias: project_id
            - argument path: init_bulk_mdata/fp/machine[DpCloudServerContext]/remote_profile/program_id
            - Program ID
          - keep_backup:
            - type: bool, optional
            - argument path: init_bulk_mdata/fp/machine[DpCloudServerContext]/remote_profile/keep_backup
            - Keep the downloaded and uploaded zip files.
          - input_data:
            - type: dict
            - argument path: init_bulk_mdata/fp/machine[DpCloudServerContext]/remote_profile/input_data
            - Configuration of the job
      - When context_type is set to LazyLocalContext (or its aliases lazylocalcontext, LazyLocal, lazylocal):
        - remote_profile:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/machine[LazyLocalContext]/remote_profile
          - The information used to maintain the connection with the remote machine. This field is empty for this context.
      - When context_type is set to SSHContext (or its aliases sshcontext, SSH, ssh):
        - remote_profile:
          - type: dict
          - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile
          - The information used to maintain the connection with the remote machine.
          - hostname:
            - type: str
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/hostname
            - Hostname or IP address for the SSH connection.
          - username:
            - type: str
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/username
            - Username on the target Linux system.
          - password:
            - type: str, optional
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/password
            - (deprecated) Password on the Linux system. Please use SSH keys instead to improve security.
          - port:
            - type: int, optional, default: 22
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/port
            - SSH connection port.
          - key_filename:
            - type: NoneType | str, optional, default: None
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/key_filename
            - Key filename used by the SSH connection. If left as None, a key is looked up in ~/.ssh, or the password is used for login.
          - passphrase:
            - type: NoneType | str, optional, default: None
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/passphrase
            - Passphrase of the key used by the SSH connection.
          - timeout:
            - type: int, optional, default: 10
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/timeout
            - Timeout of the SSH connection.
          - totp_secret:
            - type: NoneType | str, optional, default: None
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/totp_secret
            - Time-based one-time password secret. It should be a base32-encoded string extracted from the 2D code.
          - tar_compress:
            - type: bool, optional, default: True
            - argument path: init_bulk_mdata/fp/machine[SSHContext]/remote_profile/tar_compress
            - If True, the archive is compressed during upload and download. Otherwise, compression is skipped.
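As an illustration of the machine arguments above, here is a minimal sketch of an fp/machine section that uses SSHContext with a Slurm batch system. The host name, user name, and remote path are hypothetical placeholders, and the helper `missing_ssh_keys` is not part of dpgen or dpdispatcher; it only checks for the two remote_profile keys that the list above does not mark as optional.

```python
import json

# All concrete values below (host, user, paths) are hypothetical placeholders.
machine = {
    "batch_type": "Slurm",
    "context_type": "SSHContext",
    "local_root": "./",
    "remote_root": "/home/alice/dpgen_work",
    "clean_asynchronously": False,
    "remote_profile": {
        "hostname": "cluster.example.org",
        "username": "alice",
        "port": 22,            # documented default
        "key_filename": None,  # None: fall back to keys in ~/.ssh
        "timeout": 10,         # documented default
        "tar_compress": True,  # documented default
    },
}

# remote_profile keys not marked "optional" for SSHContext in the list above.
REQUIRED_SSH_KEYS = ("hostname", "username")


def missing_ssh_keys(machine_cfg):
    """Return the required remote_profile keys absent from machine_cfg."""
    profile = machine_cfg.get("remote_profile", {})
    return [key for key in REQUIRED_SSH_KEYS if key not in profile]


print(missing_ssh_keys(machine))
print(json.dumps(machine, indent=4))
```

The password key is left out on purpose, since the entry above marks it as deprecated in favor of SSH keys.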
    - resources:
      - type: dict
      - argument path: init_bulk_mdata/fp/resources
      - number_node:
        - type: int, optional, default: 1
        - argument path: init_bulk_mdata/fp/resources/number_node
        - The number of nodes needed for each job.
      - cpu_per_node:
        - type: int, optional, default: 1
        - argument path: init_bulk_mdata/fp/resources/cpu_per_node
        - The number of CPUs on each node assigned to each job.
      - gpu_per_node:
        - type: int, optional, default: 0
        - argument path: init_bulk_mdata/fp/resources/gpu_per_node
        - The number of GPUs on each node assigned to each job.
      - queue_name:
      - group_size:
        - type: int
        - argument path: init_bulk_mdata/fp/resources/group_size
        - The number of tasks in a job. 0 means infinity.
      - custom_flags:
        - type: list, optional
        - argument path: init_bulk_mdata/fp/resources/custom_flags
        - Extra lines passed to the header of the job submission script.
      - strategy:
        - type: dict, optional
        - argument path: init_bulk_mdata/fp/resources/strategy
        - Strategies used to generate job submission scripts.
        - if_cuda_multi_devices:
          - type: bool, optional, default: False
          - argument path: init_bulk_mdata/fp/resources/strategy/if_cuda_multi_devices
          - Use this when there are multiple NVIDIA GPUs on the node and tasks should be assigned to different GPUs. If true, dpdispatcher manually exports the environment variable CUDA_VISIBLE_DEVICES for each task. This option is usually used together with the Task.task_need_resources variable.
        - ratio_unfinished:
          - type: float, optional, default: 0.0
          - argument path: init_bulk_mdata/fp/resources/strategy/ratio_unfinished
          - The ratio of jobs that can be unfinished.
      - para_deg:
        - type: int, optional, default: 1
        - argument path: init_bulk_mdata/fp/resources/para_deg
        - Decides how many tasks are run in parallel.
      - source_list:
        - type: list, optional, default: []
        - argument path: init_bulk_mdata/fp/resources/source_list
        - The environment files to be sourced before the command is executed.
      - module_purge:
        - type: bool, optional, default: False
        - argument path: init_bulk_mdata/fp/resources/module_purge
        - Remove all modules on the HPC system before module load (module_list).
      - module_unload_list:
        - type: list, optional, default: []
        - argument path: init_bulk_mdata/fp/resources/module_unload_list
        - The modules to be unloaded on the HPC system before submitting jobs.
      - module_list:
        - type: list, optional, default: []
        - argument path: init_bulk_mdata/fp/resources/module_list
        - The modules to be loaded on the HPC system before submitting jobs.
      - envs:
        - type: dict, optional, default: {}
        - argument path: init_bulk_mdata/fp/resources/envs
        - The environment variables to be exported before submitting jobs.
      - wait_time:
        - type: int | float, optional, default: 0
        - argument path: init_bulk_mdata/fp/resources/wait_time
        - The waiting time in seconds after a single task is submitted.
Depending on the value of batch_type, different sub args are accepted.
      - batch_type:
        - type: str (flag key)
        - argument path: init_bulk_mdata/fp/resources/batch_type
        - possible choices: LSF, DpCloudServer, DistributedShell, Slurm, PBS, Shell, Lebesgue, Torque, SlurmJobArray
        - The batch job system type loaded from machine/batch_type.
      - When batch_type is set to LSF (or its alias lsf):
        - kwargs:
          - type: dict
          - argument path: init_bulk_mdata/fp/resources[LSF]/kwargs
          - Extra arguments.
          - gpu_usage:
            - type: bool, optional, default: False
            - argument path: init_bulk_mdata/fp/resources[LSF]/kwargs/gpu_usage
            - Chooses whether the GPU is used in the calculation step.
          - gpu_new_syntax:
            - type: bool, optional, default: False
            - argument path: init_bulk_mdata/fp/resources[LSF]/kwargs/gpu_new_syntax
            - For LSF >= 10.1.0.3, the new option -gpu for #BSUB can be used. If False, the old syntax is used.
          - gpu_exclusive:
            - type: bool, optional, default: True
            - argument path: init_bulk_mdata/fp/resources[LSF]/kwargs/gpu_exclusive
            - Only takes effect when the new syntax is enabled. Controls whether tasks are submitted in exclusive mode for the GPU.
          - custom_gpu_line:
            - type: NoneType | str, optional, default: None
            - argument path: init_bulk_mdata/fp/resources[LSF]/kwargs/custom_gpu_line
            - Custom GPU configuration, starting with #BSUB
      - When batch_type is set to DpCloudServer (or its alias dpcloudserver):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[DpCloudServer]/kwargs
          - This field is empty for this batch.
      - When batch_type is set to DistributedShell (or its alias distributedshell):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[DistributedShell]/kwargs
          - This field is empty for this batch.
      - When batch_type is set to Slurm (or its alias slurm):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[Slurm]/kwargs
          - Extra arguments.
          - custom_gpu_line:
            - type: NoneType | str, optional, default: None
            - argument path: init_bulk_mdata/fp/resources[Slurm]/kwargs/custom_gpu_line
            - Custom GPU configuration, starting with #SBATCH
      - When batch_type is set to PBS (or its alias pbs):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[PBS]/kwargs
          - This field is empty for this batch.
      - When batch_type is set to Shell (or its alias shell):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[Shell]/kwargs
          - This field is empty for this batch.
      - When batch_type is set to Lebesgue (or its alias lebesgue):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[Lebesgue]/kwargs
          - This field is empty for this batch.
      - When batch_type is set to Torque (or its alias torque):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[Torque]/kwargs
          - This field is empty for this batch.
      - When batch_type is set to SlurmJobArray (or its alias slurmjobarray):
        - kwargs:
          - type: dict, optional
          - argument path: init_bulk_mdata/fp/resources[SlurmJobArray]/kwargs
          - Extra arguments.
          - custom_gpu_line:
            - type: NoneType | str, optional, default: None
            - argument path: init_bulk_mdata/fp/resources[SlurmJobArray]/kwargs/custom_gpu_line
            - Custom GPU configuration, starting with #SBATCH
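To make the resources arguments above more concrete, the following sketch builds an fp/resources section intended for a Slurm batch type. The queue name, module name, environment variable value, and counts are hypothetical placeholders, and `gpu_line_ok` is an illustrative helper rather than a dpgen or dpdispatcher function. batch_type is deliberately absent, since the list above says it is loaded from machine/batch_type.

```python
import json

# Placeholder values; adjust them to the target cluster.
resources = {
    "number_node": 1,
    "cpu_per_node": 4,
    "gpu_per_node": 1,
    "queue_name": "gpu",               # hypothetical queue name
    "group_size": 5,                   # five tasks per job
    "custom_flags": [],
    "source_list": [],
    "module_list": ["vasp"],           # hypothetical module name
    "envs": {"OMP_NUM_THREADS": "1"},  # hypothetical environment setting
    "wait_time": 0,
    # Slurm-specific extra arguments; the line must start with #SBATCH.
    "kwargs": {"custom_gpu_line": "#SBATCH --gres=gpu:1"},
}


def gpu_line_ok(res, prefix="#SBATCH"):
    """Check that custom_gpu_line, if set, starts with the expected prefix."""
    line = res.get("kwargs", {}).get("custom_gpu_line")
    return line is None or line.startswith(prefix)


print(gpu_line_ok(resources))
print(json.dumps(resources, indent=4))
```

For an LSF batch type the same check could be called with `prefix="#BSUB"`, matching the custom_gpu_line description for LSF.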
    - user_forward_files:
      - type: list, optional
      - argument path: init_bulk_mdata/fp/user_forward_files
      - Files to be forwarded to the remote machine.
    - user_backward_files:
      - type: list, optional
      - argument path: init_bulk_mdata/fp/user_backward_files
      - Files to be transferred back from the remote machine.
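Putting the pieces together, the sketch below assembles a complete machine.json for a purely local run. It assumes that the keys listed under init_bulk_mdata sit at the top level of the file, and it uses Shell with LazyLocalContext so that remote_root can be left out. The fp command and the resource counts are hypothetical placeholders; whether a particular dpgen version accepts exactly this layout should be checked against that version.

```python
import json

# A minimal local configuration; all concrete values are placeholders.
mdata = {
    "api_version": "1.0",
    "deepmd_version": "2.1.3",
    "fp": {
        "command": "mpirun -n 4 vasp_std",  # hypothetical fp command
        "machine": {
            "batch_type": "Shell",
            "context_type": "LazyLocalContext",
            "local_root": "./",
            # remote_root is omitted: only needed when not lazy-local.
        },
        "resources": {
            "number_node": 1,
            "cpu_per_node": 4,
            "gpu_per_node": 0,
            "group_size": 1,
        },
        "user_forward_files": [],
        "user_backward_files": [],
    },
}

# Serialize to the text that would be saved as machine.json.
text = json.dumps(mdata, indent=4)
print(text)
```

Writing `text` to a file named machine.json gives a starting point that can then be adapted to a remote context by swapping in another context_type and its remote_profile.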