Dataset list
- class igua.dataset.list.DatasetList(datasets=())
A dataset consisting of a list of other datasets.
- extract_clusters(progress)
Extract the clusters from the dataset.
- Parameters:
progress (rich.progress.Progress) – A Progress instance that can be used for tracking progress.
- Yields:
Cluster – A cluster object for each gene cluster to be processed in the dataset.
- extract_proteins(progress, representatives)
Extract protein sequences from GenBank files.
- Parameters:
progress (rich.progress.Progress) – A Progress instance that can be used for tracking progress.
cluster_ids (collections.abc.Collection of str) – A collection of cluster IDs from which to extract proteins.
- Yields:
Protein – A protein object for each protein of the gene clusters to be processed in the dataset.
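The following is a minimal sketch of how a list-of-datasets wrapper could delegate both methods to its members, not the actual igua implementation. `ToyDataset`, `ToyDatasetList`, and the simplified `Cluster` and `Protein` records are hypothetical stand-ins introduced for illustration, and `progress` is passed as `None` so that the example runs without `rich` installed.

```python
from dataclasses import dataclass
from typing import Collection, Dict, Iterable, Iterator, List


@dataclass(frozen=True)
class Cluster:
    """Simplified stand-in for a gene cluster record (hypothetical)."""
    id: str


@dataclass(frozen=True)
class Protein:
    """Simplified stand-in for a protein record (hypothetical)."""
    id: str
    cluster_id: str


class ToyDataset:
    """Hypothetical in-memory dataset mapping cluster IDs to protein IDs."""

    def __init__(self, proteins_by_cluster: Dict[str, List[str]]) -> None:
        self.proteins_by_cluster = proteins_by_cluster

    def extract_clusters(self, progress) -> Iterator[Cluster]:
        for cluster_id in self.proteins_by_cluster:
            yield Cluster(cluster_id)

    def extract_proteins(
        self, progress, cluster_ids: Collection[str]
    ) -> Iterator[Protein]:
        for cluster_id, protein_ids in self.proteins_by_cluster.items():
            # Only emit proteins that belong to the requested clusters.
            if cluster_id in cluster_ids:
                for protein_id in protein_ids:
                    yield Protein(protein_id, cluster_id)


class ToyDatasetList:
    """Hypothetical wrapper that chains the output of its member datasets."""

    def __init__(self, datasets: Iterable[ToyDataset] = ()) -> None:
        self.datasets = list(datasets)

    def extract_clusters(self, progress) -> Iterator[Cluster]:
        # Delegate to each member in order and forward what it yields.
        for dataset in self.datasets:
            yield from dataset.extract_clusters(progress)

    def extract_proteins(
        self, progress, cluster_ids: Collection[str]
    ) -> Iterator[Protein]:
        for dataset in self.datasets:
            yield from dataset.extract_proteins(progress, cluster_ids)


first = ToyDataset({"c1": ["p1", "p2"], "c2": ["p3"]})
second = ToyDataset({"c3": ["p4"]})
combined = ToyDatasetList([first, second])

# progress=None is an assumption of this sketch; the documented API expects
# a rich.progress.Progress instance.
cluster_names = [c.id for c in combined.extract_clusters(None)]
protein_names = [p.id for p in combined.extract_proteins(None, {"c1", "c3"})]
```

Because both methods are generators, nothing is read from a member dataset until the caller iterates, and the members are consumed in the order they were given to the list.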