retriever¶

FileRetriever classes

class retriever.MetaRetriever[source]¶

Base class for retrieving metadata from a source

async add(files: list, dids: list | None = None) → dict[source]¶

Add the metadata for a list of files to the set.

Parameters:

Returns:

dict of MergeFile objects that were added

abstract async input_batches() → AsyncGenerator[dict, None][source]¶

Asynchronously retrieve metadata for the next batch of files.

output_chunks() → Generator[MergeChunk, None, None][source]¶

Yield chunks of files for merging.

class retriever.PathFinder(meta: MetaRetriever)[source]¶

Base class for finding paths to files

async input_batches() → AsyncGenerator[dict, None][source]¶

Asynchronously retrieve paths for the next batch of files.

abstract async process(files: dict) → None[source]¶

Process a batch of files to find their physical locations.