coords_to_sample_dict
coords_to_sample_dict(raster, coordinates, individual_ids=None, vcf_path=None)
Convert sample coordinates to sample dictionaries for simulation and analysis. Can optionally include empirical data, which is accepted as a path to a VCF file.
This function takes a raster, a list of coordinates, and optional individual IDs and VCF path. It masks the raster with the given coordinates, retrieves the cell IDs for each individual’s locality, and returns two dictionaries: a sample dictionary containing the number of individuals to sample from the simulation, and a sample dictionary containing the range of individual indices for each cell ID. The first dictionary is used to sample individuals from the simulation, and the second dictionary is used to calculate genetic summary statistics from the sampled individuals.
Parameters
Name | Type | Description | Default |
---|---|---|---|
raster |
Union[np.ndarray, rasterio.DatasetReader] | The raster data as a numpy array or rasterio DatasetReader object. | required |
coordinates |
Union[List[Tuple[float, float]], gpd.GeoDataFrame] | A list of (x, y) coordinates or a geopandas GeoDataFrame. | required |
individual_ids |
Optional[List[str]] | A list of individual IDs corresponding to those in the VCF file, by default None. | None |
vcf_path |
Optional[str] | The path to the VCF file, by default None. | None |
Returns
Type | Description |
---|---|
Tuple[Dict[int, int], Dict[int, np.ndarray], Optional[Dict[int, np.ndarray]]] | A tuple containing two or three dictionaries. The first dictionary contains the number of individuals to sample from the simulation for each cell ID. The second dictionary contains the indices of individuals for each cell ID. The third, optional dictionary contains the indices of individuals in the VCF file for each cell ID. |