Bioinformatics and Systems Biology
Today, massively parallel DNA sequencing or hybridization approaches allow the identification of not only the gene repertoire but also the gene regulatory networks of an organism. The huge amounts of data acquired from such experiments can only be handled with intensive bioinformatics support that has to provide an adequate infrastructure for storing and analyzing these data. Thus, bioinformatics has to deliver efficient data analysis algorithms, user-friendly tools and software applications, as well as extensive hardware infrastructure for answering such questions.
The group is focused on data management for genome and post-genome research projects that require new software solutions for systematic data acquisition, secure data storage of structured information, and high-throughput data analysis. All these features enable the users to browse through a hierarchy of data ranging from raw experimental data to highly structured complexes such as EST analyses, complete genome annotations or results of microarray analyses. The web-based genome annotation system called GenDB was for example successfully used for the automatic and manual annotation of a dozen microbial genomes in the past five years. Data access across our different components is mediated via the BRIDGE architecture, a domain spanning query software, allowing users for example to be able to view gene expression data projected onto a KEGG metabolic pathway which is cross-linked with all available sequence annotations for the corresponding enzyme.
- Recent publications
Rapid protein alignment in the cloud: HAMOND combines fast DIAMOND alignments with Hadoop parallelismThe introduction of next generation sequencing has caused a steady increase in the amounts of data that have to be processed in modern life science. Sequence alignment plays a key role in the analysis of sequencing data e.g. within whole genome sequencing or metagenome projects. BLAST is a commonly used alignment tool that was the standard approach for more than two decades, but in the last years faster alternatives have been proposed including RapSearch, GHOSTX, and DIAMOND.
The rapidly increasing availability of microbial genome sequences has led to a growing demand for bioinformatics software tools that support the functional analysis based on the comparison of closely related genomes. By utilizing comparative approaches on gene level it is possible to gain insights into the core genes which represent the set of shared features for a set of organisms under study.
DistAMo is a versatile web-based tool for analyzing motif distributions in bacteria, archaea and viruses. It allows for an analysis of motif over/underrepresentation from the level of single genes to the level of whole replicons.