EDIR: Exome Database of Interspersed Repeats

Research output: Contribution to journal › Article › peer-review

Abstract

MOTIVATION: Intragenic exonic deletions are known to contribute to genetic diseases and are often flanked by regions of homology.

RESULTS: To get a clearer view of these interspersed repeats encompassing a coding sequence, we have developed EDIR (Exome Database of Interspersed Repeats), which contains the positions of these structures within the human exome. EDIR was calculated by an inductive strategy rather than by a brute-force approach, and can be queried through an R/Bioconductor package or a web interface, allowing rapid per-gene extraction of homology-flanked sequences throughout the exome.

AVAILABILITY AND IMPLEMENTATION: The code used to compile EDIR can be found at https://github.com/lauravongoc/EDIR. The full EDIR dataset can be queried via an Rshiny application at http://193.70.34.71:3857/edir/. The R package for querying EDIR is called 'EDIRquery' and is available on Bioconductor. The full EDIR dataset can be downloaded from https://osf.io/m3gvx/ or http://193.70.34.71/EDIR.tar.gz.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Original language: English
Article number: btac771
Number of pages: 3
Journal: Bioinformatics
Volume: 39
Issue number: 1
Early online date: 1 Dec 2022
DOIs
Publication status: Published - 1 Jan 2023

Bibliographical note

Publisher Copyright:
© The Author(s) 2022. Published by Oxford University Press.

Copyright:
This record is sourced from MEDLINE/PubMed, a database of the U.S. National Library of Medicine

Keywords

  • EDIR
  • Database
  • interspersed repeats
