TJ Langford, Wright Lab & YCRC
September 28, 2021
Scientific Computing is a rapidly growing multidisciplinary using advanced computing capabilities to understand and solve complex problems
Ranges from laptop-based data acquisition to highly parallel super-computers
How we define it: nearly everything we do here
wright, rubin, mgm, strickland, etc.Docker and Singularity enable software to be “containerized”
To maintain security, logins require ssh-keys
This produces a key-pair: two files id_rsa and id_rsa.pub
The private key (id_rsa) is kept secret on your computer, and the public key (id_rsa.pub) is placed on the server
ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCmffiWouCCBhfyL7tG5rZUIS
QbPbjm3T8HDm4okH54pDwb4r0zTRQ8BviOvGnqW7BoloBRcCk4FjIuG3L3bmidIpT
izq3GLIV/u3S6fjMnTt8iMr5/I4lKLTTwyarOqblZncIYfvEhlMG/
iF/Bcu+IL0oZ0QvRbdeD6IrRtQCwgSAqzVC0tBTRh1GuOayYQfuq5
64B8Zm2mWqLfhgvcu4y0Z2Ifs47xZKRjGoz8Cipc95EJtJXtUI+IGJ
GQcYMIz1REdyEADagqXYMCzoC1Hjt4DxCmnM1aws7S4T8dT4NQHMta/LntwXAspv
9oJqHH0+ovTti+obZ7+xEUFQiwwt langford@yellowtail-----BEGIN OPENSSH PRIVATE KEY-----
b3BlbnNzaC1rZXktdjEAAAAABG5vbmUAAAAEbm9uZQAAAAAAAAABAAACFwAAAAdzc2gtcn
NhAAAAAwEAAQAAAgEAu7+TPw3JMC/+QKcV2JBQDqLJtne4x6pXk1KAeowAdN3hiT75dNE/
guEkNgBqwSl+CIhZGnOKNB2lSO1CHKKM1CHiSE7+3L1lpN/2wV2S2AzBhD6XneoapUKhHL
qwGDBTXPw51ywa5S6aqpr7thIJnZZbVM8jvo770hB6aFpEBmVo3fi1Emx06gJvqcf3PrBA
ZOLaIK4EECjg2+qZ8HEB5jfs166qGADaj/yXfse0dFlWcjZ9Y11uGtUIEY36pxhvXeBPSH
umBKKkVYtJ7k9eso6de29HQpPcX/ceZ/oHQklOxvqRjHH0u4HF2IzCdNTIqixBGw9VziOT
u/3jLpPV630iVcfxme8M+1WmX389IBznXs0IhUTXyRjtSwL1l/u0g/CzHy8mFJ/DQd2czd
Vu8qkJ4aDV5MDKayDCUvZlKQaeEIjaQ7O+OzcyLEDioGIBU0ULHYSItt4+VWYHYKEP3R03
GEX+KJI+dH3f12gYP54Tn5HHsp1NQEMXlPbH6xD0enI5WP1gKgAmF0YbPJ9cGnmCKAoydn
T+TVmCLnP0Wf3NIyGwdT5e1FrBj+ejFrLZeHqtRphLcwtk7wa4/EaVA0p6btWv5klmMX4A
fL8jKxuXV170I9bdIRK75BBE/QGzUNWWIJbWjY0CCzaEl20X5q73sRg5UVI775wqh4qliT
cAAAdIFh8T6xYfE+sAAAAHc3NoLXJzYQAAAgEAu7+TPw3JMC/+QKcV2JBQDqLJtne4x6pX
k1KAeowAdN3hiT75dNE/guEkNgBqwSl+CIhZGnOKNB2lSO1CHKKM1CHiSE7+3L1lpN/2wV
2S2AzBhD6XneoapUKhHLqwGDBTXPw51ywa5S6aqpr7thIJnZZbVM8jvo770hB6aFpEBmVo
3fi1Emx06gJvqcf3PrBAZOLaIK4EECjg2+qZ8HEB5jfs166qGADaj/yXfse0dFlWcjZ9Y1
1uGtUIEY36pxhvXeBPSHumBKKkVYtJ7k9eso6de29HQpPcX/ceZ/oHQklOxvqRjHH0u4HF
2IzCdNTIqixBGw9VziOTu/3jLpPV630iVcfxme8M+1WmX389IBznXs0IhUTXyRjtSwL1l/
u0g/CzHy8mFJ/DQd2czdVu8qkJ4aDV5MDKayDCUvZlKQaeEIjaQ7O+OzcyLEDioGIBU0UL
HYSItt4+VWYHYKEP3R03GEX+KJI+dH3f12gYP54Tn5HHsp1NQEMXlPbH6xD0enI5WP1gKg
AmF0YbPJ9cGnmCKAoydnT+TVmCLnP0Wf3NIyGwdT5e1FrBj+ejFrLZeHqtRphLcwtk7wa4
/EaVA0p6btWv5klmMX4AfL8jKxuXV170I9bdIRK75BBE/QGzUNWWIJbWjY0CCzaEl20X5q
73sRg5UVI775wqh4qliTcAAAADAQABAAACAHCgIaKHkJL5l1oNYUuCdqPw/3QYKZ6NDu/v
Y+cfqP5yQ+NjBZ4QEDtg96n1YhTx4QsZT+pQOS2+QvKWcTxgPn7avLWHvdeJPjpDp/CjQ3
2bWVMNgUJXtxg/+goT66L3CmsTW1c6u/+Tj3CcfDbiZyZDlhIwGE8t0t5WyDdlPr1fhCL0
GRsuOIxQXc/JhwXyEQ70Dsnf2cwf2ZPTDflwsST7k3Zm7t5rVLFfYznIbvYEyZjGGz3KRE
yQUsDFHQGz5Mq5zpW6pwLM1iwC3Jy6vCH873emb2b+8AoZYgIZuNUIyNsQYNORYLBCzv5y
ogwXwTBkT8H91wlLyknRqcC9KmuMfsO8tZtMCkE2p0qQMRjBPJpAwz7RE00KbSaqO7ddct
hlc9n2F23VhYtKvsUhgGDAVMR02TUoOMeJ1hdyNav1wvmtSV2wHZrUfRRYyz/ty4dDLLOQ
/x5eCnCJt/gGIKteCEvOAuGS6NDeYFCZh7ehtLYAytJ+5GV0CTz/j0nyyDhyFFMtGjy5p+
eoZGKN24Xz6xu+iUdd+6MGXm/gBvtVlpjGFjNR/wIfF6dTIlrO1O5fBPrU3jACCmik7d+0
0MBhaTj9va3rSqSnxdeZ9SAG9h19X0T/eiGcGyusq0zZvWatV6t/o0pLaIOjrnP//XmWza
wp1X1NVth8/XnMLBTRAAABAF9cxGFdTvsEH9FcrLKmrlFy+bmOYivfu3oUjtLgGErxEZkK
Xqc0iOA++CJT6B0SEPEXi7zgcvpH6oFVlJKeJe8lb37fxSorRZGoA3GVFJ14qyOSQst7o5
73uso6TYtqM5zff2nmbcKASQx+RUYKtJCb6oxJCCXD5jOPiYf5R0Ngw2auhfNcQFPvHFCM
UIiwJT+ZEjxDAmkYmbQpzMQm7ERB39+0r/7lHVz3IrW+f1O4UDWYndc2vNKoubEF+Yqs+H
ueQ8rGVK3pE8q0aBbIpyIIK/XY5Kjkt87ZGYep8B3YFl+7vjRUOWDt9gFZbJs2GaTeUENA
D5aLT+qQEhvLT9EAAAEBAOKckzHfsHvwx8tob77pIcFCquVmELVZwaVRsB1ZCfrd7iFXMR
U6L7yQF/8GDjiKD5jwrnz9B6vilGoiGKOiWI3gHxQnajLebOEKJ/qtcS5HYjxQ/fvtuDq3
n2gqMRsmwJWKRIzpGubCo0je5MLKxHsBEVhznTbFunAHiQoGlJcXUMUZLLtt0SbCI8/+KR
wCvFEhxgk8ZPN+M8duBH2K8/H+8/tXimbL9i5xmshVKoXUKG/UPf2wOc/XaQ996d9v0Nle
8+0RzzAkQcesNnu2aeoGBkNmKSe8yGYLUzocn8q5epy6AKJfeoNitGsQkSMwoc561shwbP
A7UUG7ydU+xIUAAAEBANQYwM4xHaZIJaAKLH7cp4rKxfjret/v0Dips8aJDF/Q7x9II26n
xPleoVSvJ7GXZXlei0IuQfbZH7fg6A2Fhn1b8jWh/bYySizLdE1Lim/VJ0joBpHfpRh/J1
Hf1L/s0446YP+PZb+GKMFZQLjB6kuIpzgPT9nlvpPRGRYTkggGlQNFJqlKcCljhpWQj6Zr
pAtug+6QyVJ+Gcq/Mq9nH6oKINiAG8UAEWFYSiwzGEc5tfajyDesoUz9BldaTDjbeiJiPx
K3uO766ElEtzQDcqzfLjVZUsuTNIvW+RpAG9tXjNSn6/hhSWfQptJrjGZ3jretZcTXQzNQ
D+eodJW0EYsAAAATbGFuZ2ZvcmRAeWVsbG93dGFpbA==
-----END OPENSSH PRIVATE KEY-----WLab Computing Contact:
help-wlab@yale.edu
Slack #computing channel
Thomas Langford: WLC254, YCRC225 thomas.langford@yale.edu


Grace clustersh ssh <netID>@grace.hpc.yale.eduThe cluster is divided into separate groups of nodes based on the types of jobs we want to run on them
day: limited to 24hrs of wall-time, users limited to 1000CPUsweek: longer run time, but fewer cores per usermpi: specific group of identical nodes for highly-parallel computation (like galaxy simulations)scavenge: special partition for high-throughput independent jobs, 10000CPUs/usermodule avail on the cluster:[tl397@grace2 ~]$ module avail
---------------------- /gpfs/loomis/apps/avx/modules/base ----------------------
miniconda/4.5.12 miniconda/4.7.10 (D)
---------------------- /gpfs/loomis/apps/avx/modules/bio -----------------------
AFNI/2019.0.24
BamTools/2.5.1-foss-2018a
Bowtie2/2.3.4.1-foss-2018a
Chimera/1.12-linux_x86_64
FSL/5.0.10-centos7_64
FSL/6.0.0-centos7_64 (D)
FastQC/0.11.7-Java-1.8.0_92
FreeSurfer/5.3.0-HCP
GROMACS/5.1.4-foss-2016b-hybrid
GROMACS/2016.5-intel-2018a
GROMACS/2019.3-foss-2018a (D)
IGV/2.5.0-Java-1.8.0_92
MRtrix3/3.0_RC3-foss-2018a
MrBayes/3.2.6-foss-2016b
PLINK/1.90-beta6.9
Rosetta/3.10
SAMtools/1.7-foss-2018aproject directory is perfect for this:
./configure --prefix=$HOME/project/cmake -DCMAKE_INSTALL_PREFIX=$HOME/project/Python users should make use of the miniconda module
Additional packages can be installed into an existing environment:
Best for prototyping code where immediate feedback is needed
srun command:
This requests 4 CPUs, 20GB of RAM, sets up X11 forwarding and requests that the jobs run on the interactive partition
Good for long code compilation, request up to 4 CPUs and make -j4
stl397@grace2:~$ srun --pty --x11 -p interactive -t 1:00:00 bsh
[tl397@c01n01 ~]$ module load miniconda
[tl397@c01n01 ~]$ source activate my_env
(my_env) [tl397@c01n01 ~]$ python my_script.pyJob is “submitted” from the login node (grace2) and then “run” on a compute node (c01n01)
squeuesqueue: lists jobs that are currently running or waiting to run
sacctsacct: details of past jobs, useful for debugging why a job failed to complete
[tl397@c01n01 ~]$ sacct -j 29432699
JobID JobName User Partition NodeList Elapsed State ExitCode MaxRSS AllocTRES
-------------------- ---------- --------- ---------- --------------- ---------- ---------- -------- ---------- --------------------------------
29432699 bash tl397 interacti+ c01n01 00:21:31 COMPLETED 0:0 billing=4,cpu=4,mem=20G,node=1
29432699.extern extern c01n01 00:21:31 COMPLETED 0:0 940K billing=4,cpu=4,mem=20G,node=1
29432699.0 bash c01n01 00:21:31 COMPLETED 0:0 30524K cpu=4,mem=20G,node=1