<?xml version='1.0' encoding='us-ascii'?>
<eml:eml scope="system" system="https://doi.org" packageId="https://doi.org/10.12761/sgn.2014.3" xsi:schemaLocation="https://eml.ecoinformatics.org/eml-2.2.0" xmlns:eml="https://eml.ecoinformatics.org/eml-2.2.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"><dataset><title>Gene Loss Rather Than Gene Gain Is Associated with a Host Jump from Monocots to Dicots in the Smut Fungus Melanopsichium pennsylvanicum.</title><creator><individualName><surName>Thines</surName></individualName><organizationName>Biodiversity and Climate Research Center (BiK-F)</organizationName><address /></creator><creator><individualName><surName>Sharma</surName></individualName><organizationName>Biodiversity and Climate Research Center (Bik-F)</organizationName><address /></creator><abstract><para>Smut fungi are well-suited to investigate the ecology and evolution of plant pathogens, as they are strictly biotrophic, yet cultivable on media. Here we report the genome sequence of Melanopsichium pennsylvanicum, closely related to Ustilago maydis and other Poaceae-infecting smuts, but parasitic to a dicot plant. To explore the evolutionary patterns resulting from host adaptation after this huge host jump, the genome of Me. pennsylvanicum was sequenced and compared with the genomes of U. maydis, Sporisorium reilianum, and U. hordei. Although all four genomes had a similar completeness in CEGMA (Core Eukaryotic Genes Mapping Approach) analysis, gene absence was highest in Me. pennsylvanicum, and most pronounced in putative secreted proteins, which are often considered as effector candidates. In contrast, the amount of private genes was similar among the species, highlighting that gene loss rather than gene gain is the hallmark of adaptation after the host jump to the dicot host. Our analyses revealed a trend of putative effectors to be next to another putative effector, but the majority of these are not in clusters and thus the focus on pathogenicity clusters might not be appropriate for all smut genomes. Positive selection studies revealed that Me. pennsylvanicum has the highest number and proportion of genes under positive selection. In general, putative effectors showed a higher proportion of positively selected genes than noneffector candidates. The 248 putative secreted effectors found in all four smut genomes might constitute a core set needed for pathogenicity, whereas those 92 that are found in all grass-parasitic smuts but have no ortholog in Me. pennsylvanicum might constitute a set of effectors important for successful colonization of grass hosts.</para></abstract><keywordSet><keyword> effector genes</keyword><keyword> evolutionary biology</keyword><keyword> genome assembly</keyword><keyword> host jump</keyword><keyword> positive selection</keyword><keyword> smut fungi</keyword><keyword>comparative genomics</keyword></keywordSet><coverage><taxonomicCoverage><generalTaxonomicCoverage /><taxonomicClassification><taxonRankName>Species</taxonRankName><taxonRankValue>Melanopsichium pennsylvanicum</taxonRankValue></taxonomicClassification></taxonomicCoverage></coverage><contact><individualName><surName>Sharma</surName></individualName><organizationName>Biodiversity and Climate Research Center (Bik-F)</organizationName><address /></contact><contact><individualName><surName>Thines</surName></individualName><organizationName>Biodiversity and Climate Research Center (BiK-F)</organizationName><address /></contact><dataTable id="1380113584714" scope="document"> <entityName>Repeat element masked genomic sequences of Melanopsichium pennsylvanicum</entityName><physical scope="document"> <objectName>Mp4_V1_masked_genome.fa</objectName><dataFormat> <textFormat> <attributeOrientation>column</attributeOrientation><simpleDelimited> <fieldDelimiter>#x20</fieldDelimiter></simpleDelimited></textFormat></dataFormat><distribution> <online> <url>ecogrid://knb/rsharma.3.1</url></online> <access authSystem="knb" order="allowFirst"><allow><principal>public</principal>
<permission>read</permission>
</allow>
</access>
</distribution></physical><attributeList> <attribute id="1380113584715" scope="document"> <attributeName>Scaffolds</attributeName><attributeDefinition>Repeat masked Mp4 scaffolds</attributeDefinition><measurementScale> <nominal> <nonNumericDomain> <textDomain> <definition>Text values</definition></textDomain></nonNumericDomain></nominal></measurementScale></attribute></attributeList></dataTable><otherEntity id="c990c7e2-4b50-4df3-a793-9160991a9147"><entityName>Mp4 gff file</entityName><physical><objectName>Mp4 gff file</objectName><size unit="byte">1727071</size><dataFormat><externallyDefinedFormat><formatName>gff</formatName></externallyDefinedFormat></dataFormat><distribution><online><url>https://dataportal.senckenberg.de/dataset/7f944577-8198-46a3-901e-1c9b345860df/resource/c990c7e2-4b50-4df3-a793-9160991a9147/download/mp4-gff-file</url></online></distribution></physical><entityType>Other</entityType></otherEntity><otherEntity id="c2b320a0-b00d-4fb4-af5b-25e3396b10a3"><entityName>Mp4 protein sequences</entityName><physical><objectName>Mp4 protein sequences</objectName><size unit="byte">3750742</size><dataFormat><externallyDefinedFormat><formatName>aa</formatName></externallyDefinedFormat></dataFormat><distribution><online><url>https://dataportal.senckenberg.de/dataset/7f944577-8198-46a3-901e-1c9b345860df/resource/c2b320a0-b00d-4fb4-af5b-25e3396b10a3/download/mp4-protein-sequences</url></online></distribution></physical><entityType>Other</entityType></otherEntity><otherEntity id="99d5326b-e804-473e-a19f-7a18f790af20"><entityName>Mp4_Unmasked_genome.fasta</entityName><entityDescription>Unmasked genomic sequences of Melanopsichium pennsylvanicum</entityDescription><physical><objectName>Mp4_Unmasked_genome.fasta</objectName><size unit="byte">19237552</size><dataFormat><externallyDefinedFormat><formatName>fasta</formatName></externallyDefinedFormat></dataFormat><distribution><online><url>https://dataportal.senckenberg.de/dataset/7f944577-8198-46a3-901e-1c9b345860df/resource/99d5326b-e804-473e-a19f-7a18f790af20/download/mp4_unmasked_genome.fasta</url></online></distribution></physical><entityType>Other</entityType></otherEntity><otherEntity id="b6670913-5fec-468b-bddc-75b0e4b06fcd"><entityName>Mp4_CDS.fasta</entityName><entityDescription>Melanopsichium pennsylvanicumcoding sequences</entityDescription><physical><objectName>Mp4_CDS.fasta</objectName><size unit="byte">11091514</size><dataFormat><externallyDefinedFormat><formatName>fa</formatName></externallyDefinedFormat></dataFormat><distribution><online><url>https://dataportal.senckenberg.de/dataset/7f944577-8198-46a3-901e-1c9b345860df/resource/b6670913-5fec-468b-bddc-75b0e4b06fcd/download/mp4_cds.fasta</url></online></distribution></physical><entityType>Other</entityType></otherEntity><otherEntity id="c32dd404-ed04-40ee-9966-3e6be18c529b"><entityName>Repeat element masked genomic sequences of Melanopsichium pennsylvanicum</entityName><physical><objectName>Repeat element masked genomic sequences of Melanopsichium pennsylvanicum</objectName><size unit="byte">19621963</size><dataFormat><externallyDefinedFormat><formatName>fa</formatName></externallyDefinedFormat></dataFormat><distribution><online><url>https://dataportal.senckenberg.de/dataset/7f944577-8198-46a3-901e-1c9b345860df/resource/c32dd404-ed04-40ee-9966-3e6be18c529b/download/repeat-element-masked-genomic-sequences-of-melanopsichium-pennsylvanicum</url></online></distribution></physical><entityType>Other</entityType></otherEntity></dataset></eml:eml>