Cactus: A user-friendly and reproducible ATAC-Seq and mRNA-Seq analysis pipeline for data preprocessing, differential analysis, and enrichment analysis

tools and software 1 session
tuesday
Authors
Affiliations

Jérôme Salignon

Department of Bioscience and Nutrition, Karolinska Institute, Sweden

Lluís Millan-Ariño

Department of Bioscience and Nutrition, Karolinska Institute, Sweden

Maxime U. Garcia

National Genomics Infrastructure, Science for Life Laboratory, Sweden

Department of Oncology-Pathology, Karolinska Institute, Sweden

Christian G. Riedel

Department of Bioscience and Nutrition, Karolinska Institute, Sweden

Time

Nov 05, 14:30

Abstract
The ever decreasing cost of Next-Generation Sequencing coupled with the emergence of efficient and reproducible analysis pipelines has rendered genomic methods more accessible. However, downstream analyses are basic or missing in most workflows, creating a significant barrier for non-bioinformaticians. To help close this gap, we developed Cactus, an end-to-end pipeline for analyzing ATAC-Seq and mRNA-Seq data, either separately or jointly. Its Nextflow-, container-, and virtual environment-based architecture ensures efficient and reproducible analyses. Cactus preprocesses raw reads, conducts differential analyses between conditions, and performs enrichment analyses in various databases, including DNA-binding motifs, ChIP-Seq binding sites, chromatin states, and ontologies. We demonstrate the utility of Cactus in a multi-modal and multi-species case study as well as by showcasing its unique capabilities as compared to other ATAC-Seq pipelines. In conclusion, Cactus can assist researchers in gaining comprehensive insights from chromatin accessibility and gene expression data in a quick, user-friendly, and reproducible manner.