Abstract

SIMSApiper is a Nextflow pipeline that creates reliable, structure-informed multiple sequence alignments (MSAs) of thousands of protein sequences in less time than standard structure-based alignment methods. Structural information can be provided by the user or collected by the pipeline from online resources. Parallelization over sequence-identity-based subsets can be activated to significantly speed up the alignment process. Finally, the number of gaps in the final alignment can be reduced by leveraging the positions of conserved secondary structure elements.
Original language: English
Article number: btae276
Number of pages: 5
Journal: Bioinformatics
Volume: 40
Issue number: 5
Publication status: Published - 22 Apr 2024

Bibliographical note

Funding Information:
This work was supported by Research Foundation Flanders (FWO) SB PhD fellowship [1SE5923N] to C.C.; the FWO large infrastructure grant [I000323N] to A.D.; and the Fonds de la Recherche Scientifique (FNRS) Aspirant fellowship to S.L.H. The resources used in this work were provided in part by the VSC (Flemish Supercomputer Center), funded by the Research Foundation Flanders (FWO) and the Flemish Government.

Publisher Copyright:
© 2024 The Author(s). Published by Oxford University Press.

Title: Large-scale Structure-Informed multiple sequence alignment of proteins with SIMSApiper
