Annotation data from: Utilizing a comparative approach to assess genome evolution during diploidization in Artemisia tridentata (Asteraceae), a keystone species of western North America

Submission information

Submission Number: 197

Created: Thu, 11/30/2023 - 10:59

Dataset General Information

Title Annotation data from: Utilizing a comparative approach to assess genome evolution during diploidization in Artemisia tridentata (Asteraceae), a keystone species of western North America

Discipline/Subject Natural sciences, Biological sciences

Description/Abstract

Supporting dataset for genome assembly data found within NCBI BioProjects PRJNA1032953 (UTT2), PRJNA722258 (IDT2) and PRJNA795150 (IDT3-Reference Genome). Fasta assembly data used in the EDTA analysis to generate subsequent output files are available from the NCBI Genome database and raw sequence data are available from the NCBI SRA database. Each input fasta contains the nine pseudo-chromosomes described in Melton et al. 2022. Reads from each sample were mapped to the nine pseudo-chromosomes and used to call a consensus sequence. The EDTA analysis provides several outputs, listed below. The primary file of interest is the "SAMPLE_consensus.fasta.mod.EDTA.TEanno.gff3" file. This file was used as inputs for comparisons of TE content across the three genomes.

Please visit https://github.com/oushujun/EDTA for more information about the EDTA pipeline.

FILES:
SAMPLE_consensus.fasta.mod.EDTA.TEanno.gff3 == Whole-genome TE annotation
SAMPLE_consensus.fasta.mod.EDTA.TEanno.sum == Summary of whole-genome TE annotation
SAMPLE_consensus.fasta.mod.EDTA.TElib.fa == A non-redundant TE library
SAMPLE_consensus.fasta.mod.MAKER.masked == Low-threshold TE masking

Dataset Key Terms

GEM3 Project Affiliation(s)

Internal Affiliation(s)

GEM3 Research Universities and Colleges Boise State University

GEM3 Project Component(s) Mechanisms

External Affiliation(s)

External Affiliated Organization(s) {Empty}

Project Keywords

GEM3 Keywords Artemisia tridentata (sagebrush), genomics

Other Discipline Specific Keywords

annotations
GFF
sequence assemblies

Dataset Authors & Contact Information

Data Authors/Creators

Authors/Creators

GEM3 Authors

Other Author(s) Info {Empty}

Contact Information

Dataset Contact Information

Contact Person Sven Buerki

Contact Email svenbuerki@boisestate.edu

Geographic and Temporal Information

Geographic Research Space

Geographic Research Components This project involves a research component conducted at field site(s) or collecting data from a target population within a specific geographic area

Geographic Information

Spatial / Geographical Coverage Location Utah and Idaho USA

Spatial Extent (Lat/Lon) {Empty}

Upload Spatial Extent KML/KMZ file(s) {Empty}

GEM3 Research Sites

Research Site(s)

Temporal Information

Temporal Extent (start date) {Empty}

Temporal Extent (end date) {Empty}

Dataset Files & Links

Metadata Files

Upload Auxiliary Metadata File(s) {Empty}

Upload Field/Lab Protocol(s) {Empty}

Select a Previously Submitted Protocol {Empty}

Data Files

Data Repository Link(s)

https://doi.org/10.7923/hysd-x388

Dataset Resources

Link to Data Resource(s)

Administrative Submission Details

Language(s)

English

DOI Details

DOI 10.7923/hysd-x388

Data Licensing & Availability

Data Licensing CC-BY | Attribution

Date Data is Available 2023-11-29

Funding Information

GEM3 Funding Sources NSF Idaho EPSCoR Program: Award OIA‐1757324

Other Funding Sources {Empty}