scembed.methods.scANVIMethod#

class scembed.methods.scANVIMethod(adata, scvi_params=None, max_epochs=None, early_stopping=None, accelerator=None, linear_classifier=None, batch_size=None, lr=None, n_samples_per_label=None, **kwargs)#

scANVI integration method.

Wrapper around the scANVI method [XLM+21] for probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models. Part of the scvi-tools framework [GLX+22].

Methods table#

fit()

Fit scANVI model.

fit_transform()

Fit the method and transform the data.

get_model_info()

Get information about the fitted model.

load_artifact(source[, artifact_type, ...])

Load a pre-trained model or embedding from various sources.

save_embedding([format_type, filename, ...])

Save embedding to file with preserved cell names as index.

save_model(path)

Save scANVI model and pretrained scVI model.

setup([force_recompute])

Setup scANVI-specific preprocessing.

transform()

Get scANVI latent representation.

validate_adata(adata)

Validate the AnnData object has required keys and structure.

validate_spatial_adata(adata)

Validate spatial-specific data requirements.

Methods#

scANVIMethod.fit()#

Fit scANVI model.

scANVIMethod.fit_transform()#

Fit the method and transform the data.

Modifies self.adata in place.

Return type:

None

scANVIMethod.get_model_info()#

Get information about the fitted model.

Return type:

dict[str, Any]

Returns:

Dict[str, Any] Dictionary with model information.

scANVIMethod.load_artifact(source, artifact_type='model', embedding_key=None, **kwargs)#

Load a pre-trained model or embedding from various sources.

Parameters:
  • source (str | Path | dict) – Source of the artifact. Can be: - str/Path: Local path to model directory or embedding file - dict: WandB parameters with keys ‘run_id’, ‘entity’, ‘project’

  • artifact_type (Literal['model', 'embedding'] (default: 'model')) – Type of artifact to load: ‘model’ or ‘embedding’.

  • embedding_key (str | None (default: None)) – Key to store embedding in adata.obsm. If None, uses self.embedding_key. Only used when artifact_type=’embedding’.

  • **kwargs – Additional arguments passed to loading functions.

Return type:

None

scANVIMethod.save_embedding(format_type='parquet', filename=None, compression=True)#

Save embedding to file with preserved cell names as index.

Parameters:
  • format_type (Literal['parquet', 'pickle', 'h5'] (default: 'parquet')) – Format to save embedding in. Options: ‘parquet’, ‘pickle’, or ‘h5’.

  • filename (str | None (default: None)) – Custom filename (without extension). If None, uses “embedding”.

  • compression (bool (default: True)) – Whether to use compression (gzip for all formats).

Return type:

Path

Returns:

Path Path to the saved embedding file.

Raises:

ValueError – If method is not fitted or embedding key not found in adata.obsm.

scANVIMethod.save_model(path)#

Save scANVI model and pretrained scVI model.

Return type:

Path | None

scANVIMethod.setup(force_recompute=False)#

Setup scANVI-specific preprocessing.

Return type:

None

scANVIMethod.transform()#

Get scANVI latent representation.

scANVIMethod.validate_adata(adata)#

Validate the AnnData object has required keys and structure.

Parameters:

adata (AnnData) – Annotated data object to validate.

Raises:

ValueError – If required keys are missing or data is malformed.

Return type:

AnnData

scANVIMethod.validate_spatial_adata(adata)#

Validate spatial-specific data requirements.

Parameters:

adata (AnnData) – Annotated data object to validate.

Raises:

ValueError – If required spatial keys are missing or data is malformed.

Return type:

None