API#

Loaders#

Loader(*[, batch_sampler, chunk_size, ...])

A loader for on-disk data anndata stores.

Loader.__iter__()

Iterate over the on-disk datasets.

Samplers#

RandomSampler(chunk_size, preload_nchunks, ...)

Shuffled chunk-based sampler for batched data access.

SequentialSampler(chunk_size, ...[, ...])

Ordered chunk-based sampler for batched data access.

DistributedSampler(sampler, *, dist_info[, ...])

Distributed chunk-based sampler that shards data across distributed processes.

ChunkSampler(chunk_size, preload_nchunks, ...)

Chunk-based sampler for batched data access.

io helpers#

write_sharded(group, adata, *[, ...])

Write a sharded zarr store from a single AnnData object.

DatasetCollection(group, *[, mode, ...])

A preshuffled collection object including functionality for creating, adding to, and loading collections shuffled by annbatch.

abc#

abc.Sampler()

Base sampler class.

types#

types.LoaderOutput

The output of the loader, the "data matrix" with its obs, optional, var, optional, and index, also optional.

types.LoadRequest

Load request from sampler.