The Data Bonanza
- Length: 576 pages
- Edition: 1
- Language: English
- Publisher: Wiley-IEEE Computer Society Pr
- Publication Date: 2013-04-15
- ISBN-10: 1118398645
- ISBN-13: 9781118398647
- Sales Rank: #7526352
The Data Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business (Wiley Series on Parallel and Distributed Computing)
Complete guidance for mastering the tools and techniques of the digital revolution
With the digital revolution opening up tremendous opportunities in many fields, there is a growing need for skilled professionals who can develop data-intensive systems and extract information and knowledge from them. For the first time, this book frames a systematic approach to the challenges of data-intensive computing, providing decision makers and technical experts alike with practical tools for dealing with our exploding data collections.
Emphasizing data-intensive thinking and interdisciplinary collaboration, The Data Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business examines the essential components of knowledge discovery, surveys many of the current research efforts worldwide, and points to new areas for innovation. Complete with a wealth of examples and DISPEL-based methods demonstrating how to gain more from data in real-world systems, the book:
- Outlines the concepts and rationale for implementing data-intensive computing in organizations
- Covers, from the ground up, problem-solving strategies for data analysis in a data-rich world
- Introduces techniques for data-intensive engineering using the Data-Intensive Systems Process Engineering Language (DISPEL)
- Features in-depth case studies in customer relations, environmental hazards, seismology, and more
- Showcases successful applications in areas ranging from astronomy and the humanities to transport engineering
- Includes sample program snippets throughout the text as well as additional materials on a companion website
The Data Bonanza is a must-have guide for information strategists, data analysts, and engineers in business, research, and government, and for anyone wishing to be on the cutting edge of data mining, machine learning, databases, distributed systems, or large-scale computing.
Table of Contents
PART I STRATEGIES FOR SUCCESS IN THE DIGITAL-DATA REVOLUTION
1. The Digital-Data Challenge
2. The Digital-Data Revolution
3. The Data-Intensive Survival Guide
4. Data-Intensive Thinking with DISPEL
PART II DATA-INTENSIVE KNOWLEDGE DISCOVERY
5. Data-Intensive Analysis
6. Problem Solving in Data-Intensive Knowledge Discovery
7. Data-Intensive Components and Usage Patterns
8. Sharing and Reuse in Knowledge Discovery
PART III DATA-INTENSIVE ENGINEERING
9. Platforms for Data-Intensive Analysis
10. Definition of the DISPEL Language
11. DISPEL Development
12. DISPEL Enactment
PART IV DATA-INTENSIVE APPLICATION EXPERIENCE
13. The Application Foundations of DISPEL
14. Analytical Platform for Customer Relationship Management
15. Environmental Risk Management
16. Analyzing Gene Expression Imaging Data in Developmental Biology
17. Data-Intensive Seismology: Research Horizons
PART V DATA-INTENSIVE BEACONS OF SUCCESS
18. Data-Intensive Methods in Astronomy
19. The World at One’s Fingertips: Interactive Interpretation of Environmental Data
20. Data-Driven Research in the Humanities—the DARIAH Research Infrastructure
21. Analysis of Large and Complex Engineering and Transport Data
22. Estimating Species Distributions—Across Space, Through Time, and with Features of the Environment
PART VI THE DATA-INTENSIVE FUTURE
23. Data-Intensive Trends
Appendix A: Glossary
Appendix B: DISPEL Reference Manual
Appendix C: Component Definitions