Updated Feb 4, 2022 - Groovy
# slurm
Here are 405 public repositories matching this topic...
A DSL for data-driven computational pipelines
docker
groovy
hello
aws
cloud
bioinformatics
pipeline
nextflow
hpc
reproducible-research
workflow-engine
slurm
pipeline-framework
sge
singularity
reproducible-science
dataflow
singularity-containers
Simplify HPC and Batch workloads on Azure
docker
serverless
hpc
azure
containers
gpu
mpi
slurm
nfs
singularity
azure-batch
azure-functions
glusterfs
rdma
infiniband
batch-processing
windows-containers
Updated Oct 18, 2021 - Python
A Slurm cluster using docker-compose
Updated Feb 4, 2022 - Dockerfile
Tools for computation on batch systems
cran
r
hpc
docker-swarm
parallel-computing
slurm
sge
torque
lsf
openlava
high-performance-computing
reproducibility
hpc-clusters
batchjobs
batchexperiments
Updated Dec 14, 2021 - R
HenrikBengtsson commented Jan 21, 2019
Just like parallel::makePSOCKCluster() sets up a SOCKcluster object of SOCKnode workers, I think it would not be too complicated(*) to provide CMQcluster and CMQnode alternatives for clustermq.
For instance,
cl <- clustermq::makeClusterMCQ("sge")
y <- parallel::parLapply(cl, 1:10, fun = sqrt)
parallel::stopCluster(cl)
(*) Roughly, S3 methods for generic functions such a
Open
Condor as scheduler
A toolset for black-box hyperparameter optimisation.
Updated Jan 26, 2020 - Python
Prometheus exporter for performance metrics from Slurm.
Updated Feb 2, 2022 - Go
An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
kubernetes
ansible
hpc
slurm
ansible-playbooks
hpc-clusters
dell-emc
k8s-cluster
slurm-cluster
dellemc
Updated Feb 4, 2022 - Python
TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline
python
kubernetes
components
machine-learning
deep-learning
slurm
pipelines
pytorch
distributed-training
Updated Feb 4, 2022 - Python
Funnel is a toolkit for distributed task execution via a simple, standard API.
Updated Dec 19, 2021 - Go
SEML: Slurm Experiment Management Library
Updated Jan 25, 2022 - Python
Singularity implementation of k8s operator for interacting with SLURM.
Updated Dec 29, 2020 - Go
Updated Oct 11, 2021 - R
Updated Feb 2, 2022 - Shell
EnsEMBL Hive - a system for creating and running pipelines on a distributed compute resource
mysql
python
java
docker
pipeline
sqlite
perl
docker-swarm
postgresql
slurm
sge
htcondor
lsf
ehive
ensembl
high-performance-computing
pbs-pro
workflow-management-system
pbspro
Updated Jan 13, 2022 - Perl
A scheduler for GPU/CPU tasks
Updated Jan 6, 2022 - C
Slurm-Mail is a drop-in replacement for Slurm's e-mails that gives users much more information about their jobs than the standard Slurm e-mails.
Updated Jan 28, 2022 - Python
Walkthroughs for DSL, AirSim, the Vector Institute, and more
ubuntu
anaconda
tensorflow
slurm
tutorials
torch
nvidia
ray
unreal-engine-4
airsim
mujoco
rllib
robomaster-sdk
brax
robomaster-s1
dji-tello-talent
Updated Jan 11, 2022 - C++
Scripts to facilitate parallel InSAR processing and analysis of Sentinel-1 time series on HPC clusters based on GMTSAR and Slurm.
Updated Jan 21, 2020 - Shell
slurmR: A Lightweight Wrapper for Slurm
Updated Sep 21, 2021 - R
As requested in https://gitter.im/bd2k-genomics-toil/Lobby?at=617126297db1e3753e527ff2, it would be useful for debugging to be able to stop the whole workflow as soon as a single job fails, even if other jobs are still waiting to run or are currently running.
┆Issue is synchronized with this Jira Task
┆Issue Number: TOIL-1062