Julie Thompson

University of Strasbourg, France

1 chapter authored

Chapters authored

By Salma Aouled El Haj Mohamed, Mourad Elloumi and Julie D. Thompson

Biology has become a data‐intensive research field. Coping with the flood of data from the new genome sequencing technologies is a major area of research. The exponential increase in the size of the datasets produced by “next‐generation sequencing” (NGS) poses unique computational challenges. In this context, motif discovery tools are widely used to identify important patterns in the sequences produced. Biological sequence motifs are defined as short, usually fixed‐length, sequence patterns that may represent important structural or functional features in nucleic acid and protein sequences, such as transcription factor binding sites, splice junctions, active sites, or interaction interfaces. They can occur in an exact or approximate form within a family or a subfamily of sequences. Motif discovery is therefore an important field in bioinformatics, and numerous methods have been developed for the identification of motifs shared by a set of functionally related sequences. This chapter will review the existing motif discovery methods for protein sequences and their ability to discover biologically important features, as well as their limitations for the discovery of new motifs. Finally, we will propose new horizons for motif discovery in order to address the shortcomings of the existing methods.

Part of the book: Pattern Recognition

Related collaborators

Francesc Pozo

Jin-Tae Kim

Nadia Abd-Alsabour

Christopher Ohagwu

Pavel Yezhov

Alexander Kuzmenko

Alexander Ostroukh

Kenneth K. Agwu

Salma Aouled El Haj Mohamed

Mourad Elloumi