Contributors | Affiliation | Role |
---|---|---|
White, Crow | Cal Poly San Luis Obispo | Principal Investigator, Contact |
Christie, Mark | Purdue University | Co-Principal Investigator |
Davidson, Jean | Cal Poly San Luis Obispo | Co-Principal Investigator |
Toonen, Robert J. | University of Hawaiʻi at Mānoa (HIMB) | Co-Principal Investigator |
Anderson, Paul | Cal Poly San Luis Obispo | Scientist |
Daniels, Benjamin | Cal Poly San Luis Obispo | Scientist |
Lee, Andy | Purdue University | Scientist |
López, Cataixa | University of Hawaiʻi at Mānoa (HIMB) | Scientist |
Soenen, Karen | Woods Hole Oceanographic Institution (WHOI BCO-DMO) | BCO-DMO Data Manager |
Description of linked resources for this dataset, all links can be found in the related dataset section.
gDNA was extracted using HMW Circulomics Standard TissueRuptor Protocol. DNA was further cleaned and concentrated using the DNeasy PowerClean Pro Cleanup Kit. Libraries were prepared using the PacBio HiFi SMARTbell from Ultra-low DNA input procedures. Libraries were prepared for sequencing using the Binding kit 2.2.
RNA was extracted using Trizol and RNA phase separation. After fragmentation, the first strand cDNA was synthesized using random hexamer primers, followed by the second strand cDNA synthesis using dTTP for non-directional library preparation. Messenger RNA was purified from total RNA using poly-T oligo-attached magnetic beads. Libraries were prepared using end repair, A-tailing, adapter ligation, size selection, amplification, and purification.
Detailed methods in Daniels et al., (2023), see related publications
* Merged RNA and DNA sequence files
* Split lat_lon column into latitude and longitude column
File |
---|
945292_v1_kelletia.csv (Comma Separated Values (.csv), 46.23 KB) MD5:7d0b036e1424a609a44b6bc904eedda5 Primary data file for dataset ID 945292, version 1 |
Parameter | Description | Units |
Run | NCBI SRA run accession ID | unitless |
Assay_Type | Type of sequencing assay performed: WGS or RNA-Seq | unitless |
AvgSpotLen | The average length of the spots (reads) in the run | unitless |
Bases | The total number of bases sequenced | unitless |
BioProject | NCBI Bioproject accession ID | unitless |
BioSample | NCBI Biosample accession ID | unitless |
BioSampleModel | NCBI BioSample package representing type of biosample and required attributes | unitless |
Bytes | Filesize in bytes for datastore file | unitless |
Center_Name | The name of the sequencing center | unitless |
Collection_Date | Collection date of organism | unitless |
Consent | Information on the consent for data usage | unitless |
DATASTORE_filetype | The file type stored in the NCBI DataStore (e.g., FASTQ, BAM) | unitless |
DATASTORE_provider | The provider of the DataStore where files are kept | unitless |
DATASTORE_region | The geographical region of the DataStore | unitless |
Ecotype | A population within a given species displaying genetically based, phenotypic traits that reflect adaptation to a local habitat, e.g., Columbia | unitless |
env_broad_scale | Broad-scale environmental context | unitless |
env_local_scale | Local-scale environmental context | unitless |
env_medium | Material displaced by the entity at time of sampling | unitless |
Experiment | The NCBI identifier for the sequencing experiment | unitless |
geo_loc_name_country | Geographic location of the origin of the sample: country = USA | unitless |
geo_loc_name_country_continent | Geographic location of the origin of the sample: continent = North America | unitless |
geo_loc_name | Geographic location of the origin of the sample | unitless |
Instrument | The sequencing instrument used (e.g., Illumina MiSeq) | unitless |
isol_growth_condt | Isolation and growth condition specifications | unitless |
latitude | Latitude of sampling location, south is negative | decimal degrees |
longitude | Longitude of sampling location, west is negative | decimal degrees |
Library_Name | Unique identifier for the sequencing library (can be the sample name repeated) | unitless |
LibraryLayout | Single or paired end sequencing reads | unitless |
LibrarySelection | Selection used for sequencing library | unitless |
LibrarySource | Source material of sequencing library | unitless |
Organism | Name of organism from which the sample was taken: Kelletia kelletii | unitless |
Platform | Sequencing platform manufacturer | unitless |
ReleaseDate | Date of release of NCBI data to the public | unitless |
create_date | Date of creation of NCBI submission | unitless |
version | The version of the NCBI submission: 1 | unitless |
Sample_Name | Unique sample name | unitless |
source_material_id | Unique identifier assigned to the sequenced material | unitless |
SRA_Study | NCBI SRA study accession ID | unitless |
Dataset-specific Instrument Name | ILLUMINA |
Generic Instrument Name | Automated DNA Sequencer |
Generic Instrument Description | A DNA sequencer is an instrument that determines the order of deoxynucleotides in deoxyribonucleic acid sequences. |
Dataset-specific Instrument Name | Nanopore MinION |
Generic Instrument Name | Automated DNA Sequencer |
Generic Instrument Description | A DNA sequencer is an instrument that determines the order of deoxynucleotides in deoxyribonucleic acid sequences. |
Dataset-specific Instrument Name | PacBio sequel 2 |
Generic Instrument Name | Automated DNA Sequencer |
Generic Instrument Description | A DNA sequencer is an instrument that determines the order of deoxynucleotides in deoxyribonucleic acid sequences. |
NSF Award Abstract:
Where do young marine fish and shellfish come from? This project aims to improve our understanding of how coastal marine populations are connected in space and time. Coastal populations are replenished through the arrival of minuscule larvae that have been dispersed for weeks to months in the open ocean after spawning at remote sites. The combination of the long dispersal period of marine fish and shellfish larvae and the varying ocean currents results in complex patterns of "connectivity" among populations near and far. Identifying these patterns of connectivity is fundamental to marine science and critical for effective fisheries management and conservation, yet it remains an unresolved component of marine ecology. The study species is currently expanding its biogeographic range up the U.S. west coast. By genetically analyzing individuals from across the species' range, including offspring spawned in the laboratory by experimentally-crossed individuals collected in the field from throughout the species historical and expanded range, certain genes can serve to differentiate populations along the coast. The team leverages the statistical power of these geographically-informative genes to assign thousands of young collected in the field to the source populations that spawned them (across the species' range and over multiple years). The team then quantifies patterns of connectivity over multiple years, and tests fundamental hypotheses on the spatial scale, temporal variability, biogeographic patterns, and biophysical drivers of population connectivity. The project trains approximately two dozen U.S. university students in molecular ecology and marine science, as well as creating intellectual linkages among Ph.D.-granting and non-Ph.D.-granting universities. The project also supports further development of a K-12 education program that uses SCUBA diving and videography to teach elementary school students Next Generation Science Standards and train them for careers in science, technology, engineering and mathematics.
Using a kelp forest gastropod and fisheries species (Kellet's whelk, Kelletia kelletii), this project combines genome-wide Restriction site Associated DNA (RAD) loci with transcriptomic loci identified from common-garden laboratory crosses of individuals from the species' historical and expanded range to identify geographically-informative loci that maximize power for individual assignment testing. Leveraging the combined power of these loci, genetic assignment of approximately three thousand recruit samples to 20 putative source populations allows the team to construct three independent years of connectivity matrices and test some of the most fundamental questions in marine ecology, including: 1) Are marine populations open or closed and at what scales? 2) To what degree is the evolutionary pattern of gene flow represented by single versus multiple generations of connectivity events? And, 3) How spatially heterogeneous and temporally variable is population connectivity? Can one year of connectivity data predict anything about the next? Additionally, by focusing on a range-expanding species with common life history traits, the team addresses a number of questions with broad applicability and significant ecological and societal implications: 4) How much is population connectivity influenced by post-recruitment demographic and evolutionary processes? 5) How well-connected are historic- and expanded-range populations? And, of particular relevance to climate change, 6) Are El Nino oceanographic conditions, which are predicted to increase in frequency and intensity this century, driving the poleward range expansion of this coastal marine species? By coupling common-garden experimental crosses to identify maximally-informative transcriptomic loci with genomic RAD analysis of field samples, this project aims to accurately and precisely quantify marine population connectivity in high gene flow species with large population sizes.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Funding Source | Award |
---|---|
NSF Division of Ocean Sciences (NSF OCE) |