18S non-redundant database of Blastocystis and its 28 subtypes formatted for DADA2

  1. Cabodevilla, Xabier 1
  2. Caro, Amaia 2
  1. 1 Centre Tecnologic Forestal de Catalunya
    info

    Centre Tecnologic Forestal de Catalunya

    Solsona, España

  2. 2 Universidad del País Vasco/Euskal Herriko Unibertsitatea
    info

    Universidad del País Vasco/Euskal Herriko Unibertsitatea

    Lejona, España

    ROR https://ror.org/000xsnr85

Editor: Zenodo

Año de publicación: 2024

Tipo: Dataset

CC BY 4.0

Resumen

DADA2_Blastocystis.fa This is an updated 18S Blastocystis database containing 636 sequences, 606 Blastocystis sequences with verified identification to subtype level and 30 Blastocystis sequences from non-mammalian and non-bird organisms. This DB includes data from two origins: 1) The sequences identified as Blastocystis and identified to subtype level which include the region amplified by the MiniB18S_81 primers (Cabodevilla et al., 2023) present in the GenBank nucleotide database. 2) The sequences from the reference databases “Blastocystis subtype reference sequences - ST1-ST17, ST21 and ST23-ST32” and “Blastocystis sequences from non-human/other mammal/bird sources” proposed by Stensvold & Clark (2020) that were not obtained from the GenBank search. For more information on data curation, see Caro et al. (2024). This database is provided in FASTA format (FASTA_Blastocystis.fasta) and also formatted for direct use with DADA2 (DADA2_Blastocystis.fa).    DADA2_Silva_v132_18S_&_Blastocystis.fa In this database the Blastocystis sequences are combined with those eukaryotic sequences of Silva v132 database provided by Cabodevilla (2024). This 18S database is formatted for DADA2 and includes 26990 sequences, 26324 non-redundant sequences from eukaryotic organisms other than Blastocystis, 20 from bacteria, 10 from archaea, and the 636 Blastocystis sequences (DADA2_Silva_v132_18S_&_Blastocystis.fa). –––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––   –> If you use the Blastocystis database in your work, please cite the article in which this database is described (Caro et al., 2024) and this Zenodo repository. On the other hand, if you use the combined database (Silva v132 + Blastocystis), in addition to the above, please also cite the reference to the Silva database (Quast et al., 2013) and the article describing that database (del Portillo et al., 2024).     The SILVA v132 database is release under a Creative Commons Attribution 4.0 (CC-BY 4.0) license. Thus documents are freely available for academic and commercial use as long as SILVA is credited as original author and a link to the full license is provided. https://creativecommons.org/licenses/by/4.0/