Python code for building and training a generative adversarial network for demographic inferences from genomic data
[This application is embargoed until January 1, 2025]. A collection of python and bash scripts to implement, train and deploy a generative adversarial network for population genetic inferences. The networks have been tuned to be deployed to genomic data from Anopheles mosquitoes. However, the general framework can be applied to other species. It requires the input data to be in Variant Call Format (VCF) format and the simulations need to be in msprime format. Full details about this application can be found at
https://doi.org/10.5285/3ae572f6-4862-47ae-b4a0-4b9c496b5b54
Simple
- Date (Publication)
- 2023-12-08
- Identifier
- doi: / 10.5285/3ae572f6-4862-47ae-b4a0-4b9c496b5b54
- Other citation details
- Pui, C.S., Fumagalli, M., Mathieson, S. (2023). Python code for building and training a generative adversarial network for demographic inferences from genomic data. NERC EDS Environmental Information Data Centre 10.5285/3ae572f6-4862-47ae-b4a0-4b9c496b5b54
Point of contact
Queen Mary University of London
-
Fumagalli, M.
https://orcid.org/0000-0002-4084-2953
- Access constraints
- otherRestrictions Other restrictions
- Other constraints
- Embargoed
- Use constraints
- otherRestrictions Other restrictions
- Use constraints
- otherRestrictions Other restrictions
- Other constraints
- If you reuse this data, you should cite: Pui, C.S., Fumagalli, M., Mathieson, S. (2023). Python code for building and training a generative adversarial network for demographic inferences from genomic data. NERC EDS Environmental Information Data Centre https://doi.org/10.5285/3ae572f6-4862-47ae-b4a0-4b9c496b5b54
- Metadata language
- EnglishEnglish
- Character set
- utf8 UTF8
- Topic category
-
- Environment
- Distribution format
-
-
Python scripts
()
-
Bash scripts
()
-
R script
()
-
Python scripts
()
- Hierarchy level
- application application
- Other
- application
Conformance result
- Date (Publication)
- 2010-12-08
- Statement
- The methodology can be applied to up to two populations at the same time. The code uses the keras python package and the scripts have been tested via simulations. Inferences were tested using simulations with known output, and the power to infer the ground truth was recovered. Simulations can be performed using msprime or SLiM.
- File identifier
- 3ae572f6-4862-47ae-b4a0-4b9c496b5b54 XML
- Metadata language
- EnglishEnglish
- Character set
- ISO/IEC 8859-1 (also known as Latin 1) 8859 Part 1
- Hierarchy level
- application application
- Hierarchy level name
- application
- Date stamp
- 2024-02-08T17:25:18
- Metadata standard name
- UK GEMINI
- Metadata standard version
- 2.3
Point of contact
NERC EDS Environmental Information Data Centre
Lancaster Environment Centre, Library Avenue, Bailrigg
,
Lancaster
,
LA1 4AP
,
UK
https://eidc.ac.uk/