Safari Books Online is a digital library providing on-demand subscription access to thousands of learning resources.

DATA PROFILING

Ultimately, data warehousing and BI are about reporting and analytics. The first step toward that objective is understanding the source data, because its condition directly shapes how you design the structures and build the ETL.

Data profiling is the process of analyzing the source data to better understand its condition: how clean it is, what patterns it follows, how many nulls it contains, and so on. You have probably done data profiling before with scripts and spreadsheets, perhaps without realizing that it was called data profiling.
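To make the idea concrete, here is a minimal sketch of the kind of ad hoc profiling script described above, written in plain Python rather than SSIS. It is not how the Data Profiling Task works internally. The function names (`value_pattern`, `profile_column`), the sample `customers` rows, and the `postal_code` column are all hypothetical, chosen only to show null counts, distinct counts, and value patterns for a single column.

```python
import re
from collections import Counter


def value_pattern(value):
    """Reduce a string to its shape: letters become 'A', digits become '9'."""
    shape = re.sub(r"[A-Za-z]", "A", value)
    return re.sub(r"\d", "9", shape)


def profile_column(rows, column):
    """Summarize one column: row count, nulls, distinct values, common shapes."""
    values = [row.get(column) for row in rows]
    # Treat None and blank strings as nulls (an assumption; adjust per source).
    non_null = [v for v in values if v is not None and str(v).strip() != ""]
    patterns = Counter(value_pattern(str(v)) for v in non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(set(non_null)),
        "top_patterns": patterns.most_common(3),
    }


# Hypothetical sample rows standing in for a source table.
customers = [
    {"customer_id": 1, "postal_code": "98052"},
    {"customer_id": 2, "postal_code": "98052-1234"},
    {"customer_id": 3, "postal_code": None},
    {"customer_id": 4, "postal_code": "K1A 0B1"},
    {"customer_id": 5, "postal_code": "98052"},
]

postal_profile = profile_column(customers, "postal_code")
print(postal_profile)
```

Even on five rows, a summary like this surfaces the questions that matter for ETL design: one missing value, and three different postal code formats that a single fixed-width column or validation rule would not handle.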

SSIS 2012 includes a Control Flow task called the Data Profiling Task. This task is reviewed in Chapter 3, but let’s drill into some more details about how to leverage it for data warehouse ETL.


  



  