-
Updated
Oct 13, 2021 - Python
#
mpi
Here are 1,383 public repositories matching this topic...
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
machine-learning
spark
deep-learning
uber
mxnet
tensorflow
mpi
keras
pytorch
machinelearning
baidu
deeplearning
ray
devreal
commented
Nov 6, 2019
The documentation for the function shmem_wait
seems inconsistent with the function signatures as it describes parameters target
, pe
, and value
that are actually not part of the interface. The parameter ivar
is described but at least in the C interface the variable is named var
(without the prefix i
).
See https://github.com/open-mpi/ompi/blob/master/oshmem/shmem/man/man3/shmem_wai
Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.
python
c-plus-plus
multi-platform
openmp
mpi
parallel-computing
fem
bsd-license
numerical-methods
multiphysics
dem
kratos
kratos-multiphysics
-
Updated
Oct 13, 2021 - C++
Official git repository of Elmer FEM software
mpi
parallel-computing
fem
finite-elements
multiphysics
electromagnetics
acoustics
finite-element-methods
fluid-mechanics
structural-mechanics
elmergui
elmersolver
elmergrid
glaciology
elmerice
-
Updated
Oct 12, 2021 - Fortran
OpenHPC Integration, Packaging, and Test Repo
-
Updated
Oct 12, 2021 - C
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
c
c-plus-plus
networking
hpc
mpi
gemini
pgas
drivers
rdma
infiniband
iwarp
roce
cray
verbs
shared-memory
tcp-ip
hacktoberfest
shmem
openshmem
aries
-
Updated
Oct 13, 2021 - C
High-performance stateful serverless runtime based on WebAssembly
-
Updated
Oct 13, 2021 - C++
distributed dataflows with functional list operations for data processing with C++14
-
Updated
Nov 16, 2019 - C++
LibTomMath is a free open source portable number theoretic multiple-precision integer library written entirely in C.
-
Updated
Sep 21, 2021 - C
Python bindings for MPI
-
Updated
Oct 8, 2021 - Python
C++ library for solving large sparse linear systems with algebraic multigrid method
c-plus-plus
cpp
opencl
openmp
mpi
cuda
gpgpu
scientific-computing
amg
sparse-linear-systems
multigrid
linear-solvers
-
Updated
Oct 4, 2021 - C++
This is a set of simple programs that can be used to explore the features of a parallel platform.
c
c-plus-plus
travis-ci
julia
opencl
boost
openmp
mpi
parallel-computing
python3
pgas
coarray-fortran
threading
tbb
kokkos
shmem
charmplusplus
sycl
parallel-programming
fortran2008
-
Updated
Sep 12, 2021 - C
A fast simulator and a library dedicated to the channel coding.
library
multi-threading
cpp
simulation
mpi
decoding
multithreading
turbo
reed-solomon
fec
polar
error-correcting-codes
polar-codes
ldpc-codes
ldpc
turbo-codes
channel-coding
forward-error-correction
bch
turbo-product-codes
-
Updated
Oct 6, 2021 - C++
DMTCP: Distributed MultiThreaded CheckPointing
-
Updated
Oct 13, 2021 - C++
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
python
c
performance
cplusplus
cross-platform
cpp
analysis
mpi
cuda
hardware-counters
performance-measurement
gotcha
modular-design
cross-language
papi
roofline
cupti
memory-measurements
instrumentation-api
-
Updated
Oct 12, 2021 - C++
Simplify HPC and Batch workloads on Azure
docker
serverless
hpc
azure
containers
gpu
mpi
slurm
nfs
singularity
azure-batch
azure-functions
glusterfs
rdma
infiniband
batch-processing
windows-containers
-
Updated
Oct 4, 2021 - Python
-
Updated
Oct 13, 2021 - C++
Kubernetes Operator for Allreduce-style Distributed Training
-
Updated
Oct 8, 2021 - Go
RAxML Next Generation: faster, easier-to-use and more flexible
-
Updated
Sep 21, 2021 - C++
-
Updated
Sep 24, 2021 - C++
An adaptive and distributed-memory parallel implementation of the immersed boundary (IB) method
c-plus-plus
fortran
mpi
finite-elements
finite-difference
multiphase-flow
adaptive-mesh-refinement
immersed-boundary-method
volume-penalization-method
-
Updated
Oct 13, 2021 - C++
Distributed and decentralized training framework for PyTorch over graph
machine-learning
asynchronous
decentralized
mpi
distributed-computing
pytorch
deeplearning
one-sided
nccl
-
Updated
Jul 23, 2021 - Python
Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.
-
Updated
Oct 14, 2021 - C++
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
-
Updated
Oct 12, 2021 - C++
Improve this page
Add a description, image, and links to the mpi topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the mpi topic, visit your repo's landing page and select "manage topics."
Our users are often confused by the output from programs such as zip2john sometimes being very large (multi-gigabyte). Maybe we should identify and enhance these programs to output a message to stderr to explain to users that it's normal for the output to be very large - maybe always or maybe only when the output size is above a threshold (e.g., 1 million bytes?)