Dolt – Git for Data
-
Updated
Jul 27, 2023 - Go
Dolt – Git for Data
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
lakeFS - Data version control for your data lake | Git for data
Quilt is a data mesh for connecting people with actionable data
sgr (command line client for Splitgraph) and the splitgraph Python library
Data version control for reproducible analysis pipelines in R with {targets}.
Meta data server & client tools for game development
A curated list to help you manage temporal data across many modalities
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
Git-like data versioning.
Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.
SageMaker Experiments and DVC
Deprecated. See https://github.com/datopian/ckanext-versions.
A CKAN extension for data versioning.
An abstraction layer for data storage systems
Metadata management in Go
A JSON-based format for working with machine learning data, with a focus on data interoperability.
Lesson 2 tutorial: Versioning Data and Model for the ML REPA School course: Machine Learning experiments reproducibility and engineering with DVC
Deploying a Machine Learning Model on Heroku with FastAPI using CI/CD tools as GitHub Actions and Heroku Automatic Deployment.
Add a description, image, and links to the data-version-control topic page so that developers can more easily learn about it.
To associate your repository with the data-version-control topic, visit your repo's landing page and select "manage topics."