Team Argon FAIR Research


Best practices, tools and tips for integrating FAIR data principles into your daily work.

Automated Generation of GTEx Metadata

Goal: Produce a GTEx data asset manifest in a reusable format.

Method: Develop scripts to convert the metadata released by GTEx into the common CFDE format.

We have created a scalable process that transforms the data asset metadata of each Data Coordinating Center (DCC) into the shared Crosscut Metadata Model (C2M2) and stores the resulting manifests for use within the CFDE and by external entities. This is the process shown on the left side of the following diagram.

Figure: CFDE metadata process
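
As a rough illustration of the conversion step, the sketch below remaps columns from a source metadata table into a common-format table. It is not the released GTEx script. The namespace value, the column mapping, the target column names and the sample row are all assumptions made up for this example, and the real C2M2 tables define their own required fields.

```python
import csv
import io

# Hypothetical namespace identifier; a real one would be assigned per DCC.
ID_NAMESPACE = "example.org/gtex"

# Hypothetical mapping from source column names to common-format column names.
COLUMN_MAP = {
    "SAMPID": "local_id",
    "SMTSD": "anatomy",
}


def convert_rows(source_tsv):
    """Read tab-separated source metadata and return rows in the target layout."""
    reader = csv.DictReader(io.StringIO(source_tsv), delimiter="\t")
    converted = []
    for row in reader:
        out = {"id_namespace": ID_NAMESPACE}
        for src, dst in COLUMN_MAP.items():
            # Missing source columns become empty strings rather than errors.
            out[dst] = row.get(src, "")
        converted.append(out)
    return converted


def to_tsv(rows, fieldnames):
    """Serialize converted rows as tab-separated text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=fieldnames, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up sample input, not real GTEx data.
    sample = "SAMPID\tSMTSD\nSAMPLE-1\tLung\n"
    print(to_tsv(convert_rows(sample), ["id_namespace", "local_id", "anatomy"]))
```

Keeping the mapping in a plain dictionary means that differences between DCCs stay in data rather than in control flow, which is one way a separate script per DCC could still share the same helper functions.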

To support per-DCC extensions and heterogeneity, each DCC’s metadata is handled by its own conversion script. These scripts create BDBags, which are stored on a Globus endpoint where other CFDE team members can access them and where they serve as input for building the cross-DCC metadata catalog.

Figure: C2M2 creation
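
BDBags build on the BagIt packaging layout, so the packaging step can be pictured with a minimal sketch that uses only the Python standard library. This is an illustration of the on-disk structure, not the project's actual code: the function name `write_minimal_bag`, the bag directory name and the payload file are hypothetical. A real pipeline would more likely rely on the bdbag tooling, which also handles features such as remote file references that are omitted here.

```python
import hashlib
from pathlib import Path


def write_minimal_bag(bag_dir, payload):
    """Write a minimal BagIt-style bag.

    payload maps a file name to its text content. Files are written under
    data/, and a SHA-256 payload manifest is written next to bagit.txt.
    """
    bag = Path(bag_dir)
    data = bag / "data"
    data.mkdir(parents=True, exist_ok=True)

    manifest_lines = []
    for name, text in sorted(payload.items()):
        content = text.encode("utf-8")
        (data / name).write_bytes(content)
        digest = hashlib.sha256(content).hexdigest()
        # Manifest format: "<checksum>  <path relative to the bag root>"
        manifest_lines.append(f"{digest}  data/{name}")

    # The bag declaration required by the BagIt layout.
    (bag / "bagit.txt").write_bytes(
        b"BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n"
    )
    (bag / "manifest-sha256.txt").write_bytes(
        ("\n".join(manifest_lines) + "\n").encode("utf-8")
    )
    return bag


if __name__ == "__main__":
    # Hypothetical bag name and payload, for illustration only.
    bag = write_minimal_bag("gtex_manifest_bag", {"file.tsv": "id\tname\n"})
    print(sorted(p.name for p in bag.iterdir()))
```

The checksums in the manifest let anyone who later downloads the bag from the shared endpoint confirm that the payload files arrived intact.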

The GTEx conversion scripts have been released under the CFDE GitHub organization.
