Book description
Data Simplification: Taming Information With Open Source Tools addresses the simple fact that modern data is too big and complex to analyze in its native form. Data simplification is the process whereby large and complex data is rendered usable. Complex data must be simplified before it can be analyzed, but the process of data simplification is anything but simple, requiring a specialized set of skills and tools.
This book provides data scientists from every scientific discipline with the methods and tools to simplify their data for immediate analysis or long-term storage in a form that can be readily repurposed or integrated with other data.
Drawing upon years of practical experience, and using numerous examples and use cases, Jules Berman discusses the principles, methods, and tools that must be studied and mastered to achieve data simplification, open source tools, free utilities and snippets of code that can be reused and repurposed to simplify data, natural language processing and machine translation as a tool to simplify data, and data summarization and visualization and the role they play in making data useful for the end user.
- Discusses data simplification principles, methods, and tools that must be studied and mastered
- Provides open source tools, free utilities, and snippets of code that can be reused and repurposed to simplify data
- Explains how to best utilize indexes to search, retrieve, and analyze textual data
- Shows the data scientist how to apply ontologies, classifications, classes, properties, and instances to data using tried and true methods
Table of contents
- Cover image
- Title page
- Table of Contents
- Copyright
- Dedication
- Foreword
- Preface
- Author Biography
- Chapter 1: The Simple Life
- Chapter 2: Structuring Text
- Chapter 3: Indexing Text
- Chapter 4: Understanding Your Data
- Chapter 5: Identifying and Deidentifying Data
- Chapter 6: Giving Meaning to Data
- Chapter 7: Object-Oriented Data
- Chapter 8: Problem Simplification
- Index
Product information
- Title: Data Simplification
- Author(s):
- Release date: March 2016
- Publisher(s): Morgan Kaufmann
- ISBN: 9780128038543
You might also like
book
Cleaning Data for Effective Data Science
Think about your data intelligently and ask the right questions Key Features Master data cleaning techniques …
book
Data Analysis: What Can Be Learned From the Past 50 Years
This book explores the many provocative questions concerning the fundamentals of data analysis. It is based …
book
UNIX° TEXT PROCESSING
This book shows how UNIX can be used effectively in the preparation of written documents, especially …
book
Text Mining of Web-Based Medical Content
• Includes Text Mining and Natural Language Processing Methods for extracting information from electronic health records …