<?xml version='1.0' encoding='us-ascii'?>
<eml:eml scope="system" system="https://doi.org" packageId="https://doi.org/10.12761/sgn.2014.2" xsi:schemaLocation="https://eml.ecoinformatics.org/eml-2.2.0" xmlns:eml="https://eml.ecoinformatics.org/eml-2.2.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"><dataset><title>Illumina metabarcoding pipeline for fungal ITS</title><creator><individualName><givenName>Mikl&#243;s</givenName><surName>B&#225;lint</surName></individualName><organizationName>Senckenberg Gesellschaft f&#252;r Naturforschung</organizationName><address /></creator><associatedParty><individualName><givenName>Mikl&#243;s</givenName><surName>B&#225;lint</surName></individualName><role>associatedParty</role></associatedParty><pubDate>2014-04-11</pubDate><abstract><para>High-throughput metabarcoding studies on fungi and other eukaryotic microorganisms are rapidly becoming more frequent and more complex, requiring researchers to handle ever increasing amounts of raw sequence data. Here we provide a flexible pipeline for pruning and analyzing fungal barcode (ITS rDNA) data generated as paired-end reads on Illumina MiSeq sequencers. The pipeline presented includes specific steps fine-tuned for ITS, that are mostly missing from pipelines developed for prokaryotes. It (i) employs state of the art programs and follows best practices in fungal high-throughput metabarcoding, (ii) consists of modules and scripts easily modifiable by the user to ensure maximum flexibility with regard to specific needs of a project or future methodological developments, (iii) and is straightforward to use, also in classroom settings. We provide detailed descriptions and revision techniques for each step, thus avoiding a black-box approach and giving the user maximum control over data treatment. Employing this pipeline will improve and speed up the tedious and error-prone process of cleaning fungal Illumina metabarcoding data.</para></abstract><intellectualRights><para>no restrictions</para></intellectualRights><coverage><geographicCoverage><geographicDescription>Fl&#246;rsheim, Germany</geographicDescription><boundingCoordinates><westBoundingCoordinate>8.3992</westBoundingCoordinate><eastBoundingCoordinate>8.3992</eastBoundingCoordinate><northBoundingCoordinate>50.0069</northBoundingCoordinate><southBoundingCoordinate>50.0069</southBoundingCoordinate></boundingCoordinates></geographicCoverage><temporalCoverage><rangeOfDates><beginDate><calendarDate>2013</calendarDate></beginDate><endDate><calendarDate>2013</calendarDate></endDate></rangeOfDates></temporalCoverage><taxonomicCoverage><generalTaxonomicCoverage>Example dataset covering some soil fungal operational taxonomic units (OTUs). The number of OTUs is dependent on the method of delimitation.</generalTaxonomicCoverage><taxonomicClassification><taxonRankName>Kingdom</taxonRankName><taxonRankValue>Fungi</taxonRankValue></taxonomicClassification></taxonomicCoverage></coverage><contact><individualName><givenName>Imke</givenName><surName>Schmitt</surName></individualName><organizationName> Senckenberg Gesellschaft f&#252;r Naturforschung</organizationName><address /><electronicMailAddress>imke.schmitt@senckenberg.de</electronicMailAddress></contact><methods><methodStep><description><section>
<title>Illumina sequencing</title>
<para>Illumina sequencing of ITS rDNA fragments for the undentification of fungal communities (metabarcoding)</para>
</section></description></methodStep>
</methods><otherEntity id="85de7956-1f72-46cc-b416-a171a61b608e"><entityName>Illumina_metabarcoding_pipeline_scripts-examples.zip</entityName><physical><objectName>Illumina_metabarcoding_pipeline_scripts-examples.zip</objectName><size unit="byte">15527127</size><dataFormat><externallyDefinedFormat><formatName>ZIP</formatName></externallyDefinedFormat></dataFormat><distribution><online><url>https://dataportal.senckenberg.de/dataset/3f8de147-6017-408e-9a04-8542f2e20d66/resource/85de7956-1f72-46cc-b416-a171a61b608e/download/illumina_metabarcoding_pipeline_scripts-examples.zip</url></online></distribution></physical><entityType>Other</entityType></otherEntity></dataset><additionalMetadata>
<metadata>
<temporalDataType>timeslices</temporalDataType>
</metadata>
</additionalMetadata><additionalMetadata>
<metadata>
<temporalDataInfo>This is an example dataset for demonstrating the use of a bioinformatic pipeline.</temporalDataInfo>
</metadata>
</additionalMetadata><additionalMetadata>
<metadata>
<temporalResolution>other</temporalResolution>
</metadata>
</additionalMetadata><additionalMetadata>
<metadata>
<temporalResolutionInfo>No temporal resolution</temporalResolutionInfo>
</metadata>
</additionalMetadata><additionalMetadata>
<metadata>
<climaticNiche />
</metadata>
</additionalMetadata><additionalMetadata>
<metadata>
<ecologicalTraits />
</metadata>
</additionalMetadata><additionalMetadata>
<metadata>
<environmentalForcingData />
</metadata>
</additionalMetadata></eml:eml>