Free Trial

Safari Books Online is a digital library providing on-demand subscription access to thousands of learning resources.

Help

Databases


1. 

Data Clean-Up and Management

Data Clean-Up and Management

By: Margaret Hogarth; Kenneth Furuta

Publisher: Chandos Publishing

Publication Date: 22-OCT-2012

Insert Date: 24-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Data use in the library has specific characteristics and common problems. Data Clean-up and Management addresses these, and provides methods to clean up frequently-occurring data problems using readily-available applications. The authors highlight the importance and methods of data analysis and presentation, and offer guidelines and recommendations for a data quality policy. The book gives step-by-step how-to directions for common dirty data issues. Focused towards libraries and practicing librarians Deals with practical, real-life issues and addresses common problems that all...

2. 

Beyond Knowledge Management: What Every Leader Should Know

Beyond Knowledge Management: What Every Leader Should Know

By: Jay Liebowitz

Publisher: Auerbach Publications

Publication Date: 11-NOV-2011

Insert Date: 19-APR-2014

Slots: 1.0

Table of Contents • Start Reading

This concise and easy-to-read book examines 10 areas where Knowledge Management can help an organization gain a competitive advantage. Each chapter opens with an introduction to one of these promising areas, followed by case studies from industry, government, and not-for-profits. The case studies demonstrate how leaders at organizations such as The Coca-Cola Company, e-Bay, PricewaterhouseCoopers, University of Maryland University College, Northrop Grumman, and the U.S. Department of Health and Human Services have used the concepts discussed to improve decision making. ...

3. 

Spectral Feature Selection for Data Mining

Spectral Feature Selection for Data Mining

By: Zheng Zhao; Huan Liu

Publisher: Chapman and Hall/CRC

Publication Date: 14-DEC-2011

Insert Date: 19-APR-2014

Slots: 1.0

Table of Contents • Start Reading

This timely introduction to spectral feature selection illustrates the potential of this powerful dimensionality reduction technique in high-dimensional data processing. It presents the theoretical foundations of spectral feature selection, its connections to other algorithms, and its use in handling both large-scale data sets and small sample problems. Readers learn how to use spectral feature selection to solve challenging problems in real-life applications and discover how general feature selection and extraction are connected to spectral feature selection. Source code for the...

4. 

Pig Design Patterns

Pig Design Patterns

By: Pradeep Pasupuleti

Publisher: Packt Publishing

Publication Date: 17-APR-2014

Insert Date: 19-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Simplify Hadoop programming to create complex end-to-end Enterprise Big Data solutions with Pig Quickly understand how to use Pig to design end-to-end Big Data systems Implement a hands-on programming approach using design patterns to solve commonly occurring enterprise Big Data challenges Enhances users’ capabilities to utilize Pig and create their own design patterns wherever applicable In Detail Pig makes Hadoop programming simple, intuitive, and fun to work with. It removes the complexity from Map Reduce programming by giving the programmer immense power through its...

5. 

Statistical Learning and Data Science

Statistical Learning and Data Science

By: Mireille Summa; Leon Bottou; Bernard Goldfarb; Fionn Murtagh; Catherine Pardoux; Myriam Touati

Publisher: Chapman and Hall/CRC

Publication Date: 19-DEC-2011

Insert Date: 19-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Driven by a vast range of applications, data analysis and learning from data are vibrant areas of research. Various methodologies, including unsupervised data analysis, supervised machine learning, and semi-supervised techniques, have continued to develop to cope with the increasing amount of data collected through modern technology. With a focus on applications, this volume presents contributions from some of the leading researchers in the different fields of data analysis. Synthesizing the methodologies into a coherent framework, the book covers a range of topics, from large-scale...

6. 

Statistical and Machine-Learning Data Mining: Techniques for Better Predictive Modeling and Analysis of Big Data, Second Edition

Statistical and Machine-Learning Data Mining: Techniques for Better Predictive Modeling and Analysis of Big Data, Second Edition

By: Bruce Ratner

Publisher: CRC Press

Publication Date: 19-DEC-2011

Insert Date: 19-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Focusing on uniquely large-scale statistical models that effectively consider big data identifying structures (variables) with the appropriate predictive power in order to yield reliable, robust, relevant large scale analyses, this revised edition incorporates 13 new chapters, as well as expanded explanations of the author's own popular machine-learning GenIQ Model. The book highlights the needs of data analysts, regression modelers, non-regression modelers, and data miners across all industry sectors, delivering practical yet powerful, simple yet insightful quantitative techniques that...

7. 

Getting Started with SOQL

Getting Started with SOQL

By: Magulan D

Publisher: Packt Publishing

Publication Date: 16-APR-2014

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Revolutionize the use of simple query strings to make them more efficient using SOQL Write optimized SOQL statements Discover the standards to follow while writing SOQL statements Learn how to write SOQL statements without hitting the limits set by Salesforce.com In Detail Salesforce Object Querying Language(SOQL) is used by Salesforce to search database applications. Although only one object can be queried at a time, SOQL allows for greater flexibility over the queries for more objects. This in turn allows for greater accuracy in searches, though the query does...

8. 

Outlier Detection for Temporal Data

Outlier Detection for Temporal Data

By: Manish Gupta; Jing Gao; Charu Aggarwal; Jiawei Han

Publisher: Morgan & Claypool Publishers

Publication Date: 01-MAR-2014

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Outlier (or anomaly) detection is a very broad field which has been studied in the context of a large number of research areas like statistics, data mining, sensor networks, environmental science, distributed systems, spatio-temporal mining, etc. Initial research in outlier detection focused on time series-based outliers (in statistics). Since then, outlier detection has been studied on a large variety of data types including high-dimensional data, uncertain data, stream data, network data, time series data, spatial data, and spatio-temporal data. While there have been many tutorials and...

9. 

Mining User Generated Content

Mining User Generated Content

By: Marie-Francine Moens; Juanzi Li; Tat-Seng Chua

Publisher: Chapman and Hall/CRC

Publication Date: 28-JAN-2014

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

This volume is the first focused effort to compile state-of-the-art research and address future directions of UGC. It explains how to collect, index, and analyze UGC to uncover social trends and user habits. The book describes how to mine various media, including social annotation, music information retrieval, and networks, and discusses the mining and searching of different types of UGC, such as Wikis and blogs. It also presents many applications of UGC, including the use of UGC to answer questions and summarize information. ...

10. 

A First Course in Machine Learning

A First Course in Machine Learning

By: Simon Rogers; Mark Girolami

Publisher: Chapman and Hall/CRC

Publication Date: 25-OCT-2011

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Requiring minimal mathematical prerequisites, this classroom-tested text covers the core mathematical and statistical techniques needed to understand some of the most popular machine learning algorithms, including classification, clustering, and projection algorithms. The MATLAB<SUP>®</SUP>/Octave scripts available online enable readers to recreate plots that appear in the book and investigate changing model specifications and parameter values. By experimenting with the various algorithms and concepts, readers see how an abstract set of equations can be used to solve real problems. A...

11. 

Understanding Information Retrieval Systems: Management, Types, and Standards

Understanding Information Retrieval Systems: Management, Types, and Standards

By: Marcia Bates

Publisher: Auerbach Publications

Publication Date: 20-DEC-2011

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Information retrieval (IR) is concerned with searching for documents, information within documents, metadata about documents, relational databases, and the web. This book covers the management, types, and technical standards of these increasingly important systems, including those used in libraries and museums, medicine, geographic information, music, computer-supported collaborative work, web mining, social mining, and the Semantic Web. Leading contributors in the field address digital asset management, piracy in digital media, records compliance, information storage technologies, and...

12. 

Data Mining in Biomedical Imaging, Signaling, and Systems

Data Mining in Biomedical Imaging, Signaling, and Systems

By: Sumeet Dua; U, Acharya

Publisher: Auerbach Publications

Publication Date: 16-MAY-2011

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

This comprehensive volume demonstrates the broad scope of uses for data mining and includes detailed strategies and methodologies for analyzing data from biomedical images, signals, and systems. Written by experts in the field, it presents data mining techniques in the context of various important clinical issues, including diagnosis and grading of depression, identification and classification of arrhythmia and ischemia, and description of classification paradigms for mammograms. The book provides ample information and techniques to benefit researchers, practitioners, and educators of...

13. 

Multi-Label Dimensionality Reduction

Multi-Label Dimensionality Reduction

By: Liang Sun; Shuiwang Ji; Jieping Ye

Publisher: Chapman and Hall/CRC

Publication Date: 04-NOV-2013

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

The data mining and machine learning literature currently lacks a unified treatment of multi-label dimensionality reduction that incorporates both algorithmic developments and applications. Addressing this shortfall, this book covers the methodological developments, theoretical properties, computational aspects, and applications of many multi-label dimensionality reduction algorithms, including existing dimensionality reduction algorithms and new developments of traditional algorithms. It illustrates how to apply the algorithms to solve real-world problems. A supplementary website provides...

14. 

Transforming Business: Big Data, Mobility, and Globalization

Transforming Business: Big Data, Mobility, and Globalization

By: 

Publisher: John Wiley & Sons

Publication Date: 17-DEC-2012

Insert Date: 18-APR-2014

Slots: 1.0

Table of Contents • Start Reading

A unique perspective of an evolved role for company leadership Based on the findings of an extensive research project that surveyed more than 5,500 enterprise employees and functional decision makers across the United States and China, Transforming Business: Big Data, Mobility and Globalization explores the influence of technology in the workplace and the implications to company culture, functional responsibilities and competitive advantage. This in-depth analysis illuminates emerging technological trends, the changing workforce, and the shifting face of business and industry while...

15. 

IBM System Storage N series Hardware Guide

IBM System Storage N series Hardware Guide

By: Roland Tretau; Jeff Lin; Dirk Peitzmann; Steven Pemberton; Tom Provost; Marco Schwarz

Publisher: IBM Redbooks

Publication Date: 23-SEP-2012

Insert Date: 17-APR-2014

Slots: 1.0

Table of Contents • Start Reading

This IBM® Redbooks® publication provides a detailed look at the features, benefits, and capabilities of the IBM System Storage® N series hardware offerings. The IBM System Storage N series systems can help you tackle the challenge of effective data management by using virtualization technology and a unified storage architecture. The N series delivers low- to high-end enterprise storage and data management capabilities with midrange affordability. Built-in serviceability and manageability features help support your efforts to increase reliability; simplify and unify storage infrastructure...

16. 

Gitolite Essentials

Gitolite Essentials

By: Sitaram Chamarty

Publisher: Packt Publishing

Publication Date: 11-APR-2014

Insert Date: 15-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Leverage powerful branch and user access control with Git for your own private collaborative repositories Learn to manage the many repositories and the users accessing these repositories in the Git server Walks you through the most important ideas and concepts in Gitolite supported by examples and use cases Master the most powerful tool for fine-grained access control of Git repositories In Detail If you're responsible for securing a Git server where lots of developers work with lots of repositories, you have a problem on your hands. You probably want to implement...

17. 

MySQL High Availability, 2nd Edition

MySQL High Availability, 2nd Edition

By: ; ;

Publisher: O'Reilly Media, Inc.

Publication Date: 25-APR-2014

Insert Date: 12-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Server bottlenecks and failures are a fact of life in any database deployment, but they don’t have to bring everything to a halt. This practical book explains replication, cluster, and monitoring features that can help protect your MySQL system from outages, whether it’s running on hardware, virtual machines, or in the cloud....

18. 

Knowledge Discovery, Transfer, and Management in the Information Age

Knowledge Discovery, Transfer, and Management in the Information Age

By: Murray Jennex

Publisher: IGI Global

Publication Date: 30-NOV-2013

Insert Date: 05-APR-2014

Slots: 1.0

Table of Contents • Start Reading

With the advent of electronic databases, information technologies, and the Internet, organizations now more than ever have easy access to all the knowledge they need to conduct their affairs. Identifying the useful information in all that data, however, can pose a challenge. Knowledge Discovery, Transfer, and Management in the Information Age brings together the latest empirical research in knowledge management practices and information retrieval strategies to assist organizations in effectively and efficiently utilizing the data at their disposal. Academics, managers, researchers, and...

19. 

Foundations of Data Exchange

Foundations of Data Exchange

By: Marcelo Arenas; Pablo Barceló; Leonid Libkin; Filip Murlak

Publisher: Cambridge University Press

Publication Date: 31-MAR-2014

Insert Date: 05-APR-2014

Slots: 1.0

Table of Contents • Start Reading

The problem of exchanging data between different databases with different schemas is an area of immense importance. Consequently data exchange has been one of the most active research topics in databases over the past decade. Foundational questions related to data exchange largely revolve around three key problems: how to build target solutions; how to answer queries over target solutions; and how to manipulate schema mappings themselves? The last question is also known under the name 'metadata management', since mappings represent metadata, rather than data in the database. In this book...

20. 

Innovative Techniques and Applications of Entity Resolution

Innovative Techniques and Applications of Entity Resolution

By: Hongzhi Wang

Publisher: IGI Global

Publication Date: 28-FEB-2014

Insert Date: 04-APR-2014

Slots: 1.0

Table of Contents • Start Reading

Entity resolution is an essential tool in processing and analyzing data in order to draw precise conclusions from the information being presented. Further research in entity resolution is necessary to help promote information quality and improved data reporting in multidisciplinary fields requiring accurate data representation. Innovative Techniques and Applications of Entity Resolution draws upon interdisciplinary research on tools, techniques, and applications of entity resolution. This research work provides a detailed analysis of entity resolution applied to various types of data as well...