scembed.methods.ScanoramaMethod#

class scembed.methods.ScanoramaMethod(adata, sigma=None, alpha=None, knn=None, approx=None, batch_size=None, verbose=None, dimred=None, return_dense=None, union=None, seed=None, sketch=None, sketch_method=None, sketch_max=None, **kwargs)#

Scanorama integration method.

Wrapper around the Scanorama method [HBB19] for efficient integration of heterogeneous single-cell transcriptomes.

Methods table#

fit()

Fit Scanorama - no explicit fitting needed.

fit_transform()

Fit the method and transform the data.

get_model_info()

Get information about the fitted model.

load_artifact(source[, artifact_type, ...])

Load a pre-trained model or embedding from various sources.

save_embedding([format_type, filename, ...])

Save embedding to file with preserved cell names as index.

save_model(path)

Save the trained model (for deep learning methods).

setup([force_recompute])

Setup Scanorama-specific preprocessing.

transform()

Apply Scanorama integration.

validate_adata(adata)

Validate the AnnData object has required keys and structure.

validate_spatial_adata(adata)

Validate spatial-specific data requirements.

Methods#

ScanoramaMethod.fit()#

Fit Scanorama - no explicit fitting needed.

ScanoramaMethod.fit_transform()#

Fit the method and transform the data.

Modifies self.adata in place.

Return type:

None

ScanoramaMethod.get_model_info()#

Get information about the fitted model.

Return type:

dict[str, Any]

Returns:

Dict[str, Any] Dictionary with model information.

ScanoramaMethod.load_artifact(source, artifact_type='model', embedding_key=None, **kwargs)#

Load a pre-trained model or embedding from various sources.

Parameters:
  • source (str | Path | dict) – Source of the artifact. Can be: - str/Path: Local path to model directory or embedding file - dict: WandB parameters with keys ‘run_id’, ‘entity’, ‘project’

  • artifact_type (Literal['model', 'embedding'] (default: 'model')) – Type of artifact to load: ‘model’ or ‘embedding’.

  • embedding_key (str | None (default: None)) – Key to store embedding in adata.obsm. If None, uses self.embedding_key. Only used when artifact_type=’embedding’.

  • **kwargs – Additional arguments passed to loading functions.

Return type:

None

ScanoramaMethod.save_embedding(format_type='parquet', filename=None, compression=True)#

Save embedding to file with preserved cell names as index.

Parameters:
  • format_type (Literal['parquet', 'pickle', 'h5'] (default: 'parquet')) – Format to save embedding in. Options: ‘parquet’, ‘pickle’, or ‘h5’.

  • filename (str | None (default: None)) – Custom filename (without extension). If None, uses “embedding”.

  • compression (bool (default: True)) – Whether to use compression (gzip for all formats).

Return type:

Path

Returns:

Path Path to the saved embedding file.

Raises:

ValueError – If method is not fitted or embedding key not found in adata.obsm.

ScanoramaMethod.save_model(path)#

Save the trained model (for deep learning methods).

Parameters:

path (Path) – Directory to save the model.

Return type:

Path | None

Returns:

Optional[Path] Path to saved model file, None if method doesn’t support saving.

ScanoramaMethod.setup(force_recompute=False)#

Setup Scanorama-specific preprocessing.

Return type:

None

ScanoramaMethod.transform()#

Apply Scanorama integration.

ScanoramaMethod.validate_adata(adata)#

Validate the AnnData object has required keys and structure.

Parameters:

adata (AnnData) – Annotated data object to validate.

Raises:

ValueError – If required keys are missing or data is malformed.

Return type:

AnnData

ScanoramaMethod.validate_spatial_adata(adata)#

Validate spatial-specific data requirements.

Parameters:

adata (AnnData) – Annotated data object to validate.

Raises:

ValueError – If required spatial keys are missing or data is malformed.

Return type:

None