dnadna: A DEEP LEARNING FRAMEWORK FOR POPULATION GENETIC INFERENCE - Laboratoire Interdisciplinaire des Sciences du Numérique
Preprint, Working Paper. Year: 2022

dnadna: A DEEP LEARNING FRAMEWORK FOR POPULATION GENETIC INFERENCE

Abstract

We present dnadna, a flexible Python-based software package for deep learning inference in population genetics. It is task-agnostic and aims to facilitate the development, reproducibility, dissemination, and reusability of neural networks designed for population genetic data. dnadna defines multiple user-friendly workflows. First, users can implement new architectures and tasks while benefiting from dnadna's utility functions, training procedure, and test environment, which saves time and decreases the likelihood of bugs. Second, the implemented networks can be re-optimized on user-specified training sets and/or tasks. Newly implemented architectures and pretrained networks are easily shareable with the community for further benchmarking or other applications. Finally, users can apply pretrained networks to predict evolutionary history from alternative real or simulated genetic datasets, without requiring extensive knowledge of deep learning or of coding in general. dnadna comes with a peer-reviewed, exchangeable neural network for demographic inference from SNP data, which can be used directly or retrained to solve other tasks. Toy networks are also available to ease exploration of the software, and we expect the range of available architectures to keep expanding thanks to community contributions. Availability: the dnadna repository is available at gitlab.com/mlgenetics/dnadna and its associated documentation at mlgenetics.gitlab.io/dnadna/.
Main file
DNADNA_software_FINAL_long_arxiv_template (1).pdf (467.75 KB)

Dates and versions

hal-03352910, version 1 (23-09-2021)
hal-03352910, version 2 (17-11-2021)
hal-03352910, version 3 (04-11-2022)
hal-03352910, version 4 (05-12-2022)

Identifiers

  • HAL Id: hal-03352910, version 3

Cite

Théophile Sanchez, Erik Madison Bray, Pierre Jobic, Jérémy Guez, Anne-Catherine Letournel, et al. dnadna: A DEEP LEARNING FRAMEWORK FOR POPULATION GENETIC INFERENCE: dnadna: Deep Neural Architectures for DNA. 2022. ⟨hal-03352910v3⟩
876 views
524 downloads
