Use Textile’s Basin to prove data availability in both the immediate term and in the long-term.
For many ML/AI and compute networks, there is a lot of data that comes from compute jobs in the form of proofs that need to be globally available for a short period of time (hours or days). Proof data may include model inference proofs, model checkpoint data (params and hyperparams), etc. This data needs to be available for a short period of time following job completion to handle disputes, and then longer-term to allow anyone on the network to verify or resolve a dispute by downloading the complete proof data.
Basin provides object storage with verifiable data pipelines and decentralized architecture, allowing compute networks to guarantee data is available for compute, dispute resolution and more:
Textile Vaults enable various data sources to combine into an open, verifiable and collaborative data environment.
Users often submit training jobs that require large, open data such as WikiText and Flickr data, and smaller ones such as mnist. Or, they include their own data that needs to be made available to compute nodes. Ideally, all this data can be made available over a common protocol to facilitate collaboration and data liquidity.