Books

Integrating Hadoop

McKnight/Dolezal, 2016, Technics Publications

Integrating Hadoop leverages the discipline of data integration and applies it to the Hadoop open-source software framework for storing data on clusters of commodity hardware. It is packed with the need-to-know for managers, architects, designers, and developers responsible for populating Hadoop in the enterprise, allowing you to harness big data and do it in such a way that the solution:

  • Complies with (and even extends) enterprise standards
  • Integrates seamlessly with the existing information infrastructure
  • Fills a critical role within enterprise architecture.

Integrating Hadoop covers the gamut of the setup, architecture and possibilities for Hadoop in the organization, including:

  • Supporting an enterprise information strategy
  • Organizing for a successful Hadoop rollout
  • Loading and extracting of data in Hadoop
  • Managing Hadoop data once it’s in the cluster
  • Utilizing Spark, streaming data, and master data in Hadoop processes – examples are provided to reinforce concepts

Chapter List:

  1. Hadoop in Support of an Information Strategy
  2. Preparing for Integration
  3. ETL vs. ELT
  4. Loading Data into Hadoop
  5. Managing Big Data
  6. Unloading/Distributing Data from Hadoop
  7. Apache Spark Cluster Computing with Hadoop
  8. Streaming Data
  9. Master Data Management and Big Data
  10. Top 10 Mistakes Integrating Hadoop Data
  11. Case Studies and Trends

 

Information Management: Strategies for Gaining a Competitive Advantage with Data

McKnight, 2014, Morgan-Kaufmann
Press release.   Podcast from OCDQ Radio.

Press coverage in Huffington Post: Handling Big Data: An Interview with Author William McKnight

Get the big picture on managing data so that your team can make smart decisions by understanding how everything from workload allocation to data stores fit together.

Information Management is about making smart decisions to make the most of company information. Expert author William McKnight develops the value proposition for information in the enterprise, and succinctly outlines the numerous forms of data storage. Information Management will enlighten you, challenge your preconceived notions, and help activate information in the enterprise.

The practical, hands-on guidance in this book includes:

Part 1: The importance of information management and analytics to business, and how data warehouses are used
Part 2: The technologies and data that advance an organization, and extend data warehouses and related functionality
Part 3: Big Data and NoSQL, and how technologies like Hadoop enable management of new forms of data
Part 4: Pulls it all together, while addressing topics of agile development, modern business intelligence, and organizational change management

Read the book cover-to-cover, or keep it within reach for a quick and useful resource. Either way, this book will enable you to master all of the possibilities for data or the broadest view across the enterprise.

• Balances business and technology, with non-product-specific technical detail
• Shows how to leverage data to deliver ROI for a business
• Engaging and approachable writing, with practical advice on the pros and cons of each domain, so that you learn how information fits together into a complete architecture
• A path for the data warehouse professional into the new normal of heterogeneity, including big data solutions

Chapter List:

  1. You’re in the Business of Information
  2. Relational Theory in Practice
  3. You’re in the Business of Analytics
  4. Data Quality: Passing the Standard
  5. Columnar Databases
  6. Data Warehouses and Appliances
  7. Master Data Management: One Chapter Here, but Ramifications Everywhere
  8. Data Stream Processing: When Storing the Data Happens Later
  9. Data Virtualization: The Perpetual Short-Term Solution
  10. Key-Value, Document and Column Stores:  Hash Tables Reborn
  11. Hadoop: Analytics at Scale
  12. Graph Databases: When Relationships are the Data
  13. Cloud Computing: On-Demand Elasticity
  14. An Elegant Architecture Where Information Flows
  15. Modern Business Intelligence: Collaboration, Mobile and Self-Service: Organizing the Discussion and Tethering the User to Information
  16. Agile Practices for Information Management
  17. Organizational Change Management: The Soft Stuff is the Hard Stuff

This book is perfect for vendors, consultants and technology organizations to issue to all employees and members. For bulk sales, contact us.