Predicted Effector Gene Aggregation, Standards and Unified Schema (PEGASUS): A Community Framework for Effector Gene Reporting

Preprint Created on 17 Jun 2026 bioRxiv

Genome-wide association studies (GWAS) increasingly report predicted effector genes (PEGs), the genes hypothesised to mediate the biological effects of associated variants. These function as key outputs for advancing variant-to-function research, mechanistic understanding, and therapeutic discovery. However, the rapid growth of PEG lists has not been matched by standards for organising, annotating, and reporting these predictions. As shown by recent landscape analyses, PEG lists vary widely in methodology, evidence definition, nomenclature, provenance tracking, and data structure, limiting interoperability, benchmarking, reuse, and adherence to FAIR principles. To address this gap, we convened an international multi-stakeholder community comprising method developers, data generators, resource maintainers, curators, funders, journal editors, and downstream users. Through a 2024 workshop and a 2025 working group series, we developed the Predicted Effector Gene Aggregation, Standards and Unified Schema (PEGASUS) framework. PEGASUS specifies (i) a metadata standard to capture provenance, trait and GWAS descriptors, evidence sources, and integration methods; (ii) a structured evidence matrix reporting all genes and all evidence underpinning prioritisation at each locus; and (iii) a concise PEG list that summarises author-prioritised genes linked transparently to underlying evidence. The framework balances transparency, machine readability, burden on submitters, and alignment with existing community standards. PEGASUS provides the first community-developed schema for reporting predicted effector genes and their supporting evidence.
Adoption of this framework by authors will improve the comparability, reproducibility, and reusability of PEG outputs across studies, facilitating more robust biological inference, enabling cross-resource comparison of gene-prioritisation methods to support community benchmarking, and supporting integration into downstream resources and analytical pipelines. PEGASUS-compliant data can be shared via the PEG Data Registry platform (https://kpndataregistry.org/peg), promoting reuse and establishing the basis for future integration with publicly shared GWAS data.

McMahon, A., Ji, Y., Costanzo, M., Butterworth, A. S., Pahl, M., Szyszkowski, S., Heilbron, K., Shiyanbola, A., Tsepilov, Y. A., Spracklen, C. N., Hite, D., Shilin, A., PEG Working Group, Parkinson, H. E., Burtt, N. P., Harris, L. W.
