Wednesday, February 24, 2010

The Methods & Architecture Behind Data Warehousing

The method data warehousing vendors use to provide their service is a key issue. Vendors who let you run your own server's license or those who host software tools on their servers are common options.

There are multiple options related to the architecture of the data warehousing system. The most popular is the hub-and-spoke architecture, which is a centralized data warehouse with dependent data marts. It is a sort of corporate information factory.

One other choice in data warehousing architecture is the data-mart bus architecture linking dimensional data marts, or a centralized data warehouse with no dependent marts and last but not least, a federated architecture.

Some organizations develop their own data marts, which are independent from one another. However, they have inconsistent data definitions and different dimensions, which make it difficult to analyze data across marts.

Data mart bus architecture implies building a first mart that uses dimensions and measures that will be used with the other marts that are developed, in order to obtain logically integrated marts. The data is organized in a star schema and this provides a dimensional view of the data.

The hub-and-spoke architecture focuses on building a scalable and maintainable infrastructure. It is developed in iterative manner. Dependent data marts obtain data from the warehouse and they can be developed for different departments or special purposes.

A similar data warehousing architecture is used for centralized data warehouses but without dependent data marts. Queries and applications access data from relational data and dimensional views.

Federated architecture allows data to be accessed from sources such as operational systems, data marts and data warehouses. It is logically or physically integrated by use of shared keys, global metadata, and distributed queries. It is an adequate solution for companies that have a complex decision support environment.

When deciding on a data warehousing solution, you may need to assess the following advantages of different architectures:

The level of information interdependence and the ability to share and integrate information
Senior management needs regarding information from lower levels

Urgency of the need for capabilities of data warehousing architecture

Non-routine tasks, the need to analyze data in novel ways

The availability of IT personnel, the technical skills of IT internal staff, successful experiences with similar projects and the level of confidence
Constraints on monetary resources, cost and time benefits derived from implementing a solution compatible with existing systems

The need for a point solution for a particular business unit, or a decision support infrastructure which can support a range of applications

The ability to integrate metadata, scalability in terms of number of users, volume of data and query performance, ability to maintain historical data and adapt to changes related to source systems

Depending on your organization, these factors may be considered to have a lower or higher priority. A lack of resources may prevent some organizations to implement a better architectural data warehousing solution. The centralized data warehouse architecture is faster and easier to implement since it does not require dependent data marts.

The design of user interface has an important impact on the decision to choose a certain vendor. A tool that is uncluttered, intuitive will be better used by intended users. Data warehousing solutions assume that users are trained in data management and analysis techniques.

The support offered by data warehouses comes in two different forms: technical and functional. While technical support focuses on hardware, application of software patches and updates, functional support includes user manuals, training sessions, and client services.

Security of the data warehousing services refers to the physical and electronic aspects. Physical security involves restricted access to the servers which store data and off-site emergency backups. Electronic security involves limiting access to data and use of software to identify unauthorized attempts to access the data.

The legislation regarding data ownerships is not keeping the pace with the advance in technology and business practices. You have to take this into consideration and spend time on analyzing any issues that can arise from this incompatibility.

The reputation and history of vendors of data warehousing services should be investigated thouroughly. References and people in the industry can be a valuable source to assess the quality of customer service offered and the integrity of the vendor. A visit to the physical space is also helpful in assessing the quality of the organization.

Once the selection is made, the exact services to be provided, expectations related to customer service, contact information and payment will take priority.
You will benefit from a better return on investment and higher quality of service once you reach an informed decision.

No comments:

Post a Comment