Big Data Analytics in Genomics
- Length: 428 pages
- Edition: 1st ed. 2016
- Language: English
- Publisher: Springer
- Publication Date: 2016-10-25
- ISBN-10: 3319412787
- ISBN-13: 9783319412788
- Sales Rank: #4542348 (See Top 100 Books)
This contributed volume explores the emerging intersection between big data analytics and genomics. Recent sequencing technologies have enabled high-throughput sequencing data generation for genomics resulting in several international projects which have led to massive genomic data accumulation at an unprecedented pace. To reveal novel genomic insights from this data within a reasonable time frame, traditional data analysis methods may not be sufficient or scalable, forcing the need for big data analytics to be developed for genomics. The computational methods addressed in the book are intended to tackle crucial biological questions using big data, and are appropriate for either newcomers or veterans in the field.
This volume offers thirteen peer-reviewed contributions, written by international leading experts from different regions, representing Argentina, Brazil, China, France, Germany, Hong Kong, India, Japan, Spain, and the USA. In particular, the book surveys three main areas: statistical analytics, computational analytics, and cancer genome analytics. Sample topics covered include: statistical methods for integrative analysis of genomic data, computation methods for protein function prediction, and perspectives on machine learning techniques in big data mining of cancer. Self-contained and suitable for graduate students, this book is also designed for bioinformaticians, computational biologists, and researchers in communities ranging from genomics, big data, molecular genetics, data mining, biostatistics, biomedical science, cancer research, medical research, and biology to machine learning and computer science. Readers will find this volume to be an essential read for appreciating the role of big data in genomics, making this an invaluable resource for stimulating further research on the topic.
Table of Contents
Part I Statistical Analytics
Chapter 1 Introduction to Statistical Methods for Integrative Data Analysis in Genome-Wide Association Studies
Chapter 2 Robust Methods for Expression Quantitative Trait Loci Mapping
Chapter 3 Causal Inference and Structure Learning of Genotype–Phenotype Networks Using Genetic Variation
Chapter 4 Genomic Applications of the Neyman–Pearson Classification Paradigm
Part II Computational Analytics
Chapter 5 Improving Re-annotation of Annotated Eukaryotic Genomes
Chapter 6 State-of-the-Art in Smith–Waterman Protein Database Search on HPC Platforms
Chapter 7 A Survey of Computational Methods for Protein Function Prediction
Chapter 8 Genome-Wide Mapping of Nucleosome Position and Histone Code Polymorphisms in Yeast
Part III Cancer Analytics
Chapter 9 Perspectives of Machine Learning Techniques in Big Data Mining of Cancer
Chapter 10 Mining Massive Genomic Data for Therapeutic Biomarker Discovery in Cancer: Resources, Tools, and Algorithms
Chapter 11 NGS Analysis of Somatic Mutations in Cancer Genomes
Chapter 12 OncoMiner: A Pipeline for Bioinformatics Analysis of Exonic Sequence Variants in Cancer
Chapter 13 A Bioinformatics Approach for Understanding Genotype–Phenotype Correlation in Breast Cancer