Data Warehouse

HP’s Dabble in the Data Warehousing Business {2}

by Kevin Q

When Mark Hurd became Hewlett-Packard’s President in 2005, he noticed that as a technology company, they were failing in one area internally. They had no central system that collected all their companies data together into what Mr. Hurd liked to call “a single version of the truth.”(Vance 2008) Mark Hurd used to head the teradata division at NCR, where he helped start data warehousing, however now he notice that HP needed some kind of similar system to help itself. After explaining some benefits of data warehousing like noticing trends during certain times of the year and other analysis that can be noticed once data is all collectively pulled together, HP created NeoView internally. NeoView is a data warehouse and business intelligence computer server that would solve and help with HP and Mark Hurds needs. It became available for purchse to the public, now a competitor in the data warehousing market which was dominated by much larger companies like Teradata, IBM, Oracle and Microsoft. HPs sales weren’t impressive, which may have been results of building its systems on expensive older technology, reather than cheaper and newer technology, according to expertes in the field. The NeoView can cost more than 10 million for the whole setup, which is kind of pricey, especially when competitors are using cheaper setups and therefore reducing price to customers. Their entrance into the data warehousing came at a time where companies were beginning to see the importance and advantage of data warehouses, but their approach seems to be a little off. read more...

Virtual Data Warehouse {1}

by Mike Y

            The article goes into suggesting using a virtual data warehouse to manage the data in database management systems. The reason for creating virtual data warehouse is because creating and managing a real physical data warehouse is much more expensive compared to a virtual one. The paper describes uses for a virtual data warehouse in supply chain management companies and consultancy companies. The downside is that it is best suited for solutions to temporary problems. read more...

Data Staging Area of Data Warehouses {Comments Off on Data Staging Area of Data Warehouses}

by Jintien C
A data staging area is in part of data warehouse. In the data staging area, it extracts data from a source, transforms data, and loads data into the data warehouse. There are many reasons and advantages to have a staging area in the data warehouse. It can load tables directly so that it can prevent semantic disintegrity or any structure changes. In addition, by having data staging area, the source will not be affected while data warehouse is performing process. Moreover, it makes easier to do aggregation and summation; especially for complex transformation process. read more...

Global Warehouse {Comments Off on Global Warehouse}

by James C

Data warehouses of today’s companies are increasing in users and data at an alarming rate. Used for primarily business intelligence the data that is entered into these warehouses are causing some issues with quality in query transactions. Current businesses utilize tools that analyze data and exhaust the resources that support the databases. The reports of usage of past data of these warehouses were expected and predictable, as opposed to today’s required analysis of real-time reporting and faster query response times. The issues that currently reside with today’s database systems are the concurrent complex queries of rather large data sets. These issues are creating performance problems to database systems that are already strained for resources. This article covers the good and bad of alternative methods to solving poor performance in database queries. read more...

Data warehouse management {Comments Off on Data warehouse management}

by Chris S
Data warehouses are databases that are used for reporting and analysis of data. Therefore, for businesses it is important to maintain a data warehouse that is reliable. Managing a data warehouse is up to the database administrators. Top priority is recovery of data that may be temporarily lost. Data availability is second, as businesses need to be able to access their data whenever necessary. Next, is security because many data warehouses hold crucial information on customers and employees as well as business partners. Making this data secure is big part of administration. Last, performance is important to make sure that the data warehouse provides the data in a timely manner. Almost always data warehouses are the most important part of a business and need to be working efficiently and be secure. Managing data warehouses is an important job and many companies can’t afford to have any issues with their data warehouses. Enterprise data warehouses are harder to manage since they are a lot larger than a typical data warehouse and contain historical data for a business. read more...

Social Network Face Book and Its Data Warehouse {Comments Off on Social Network Face Book and Its Data Warehouse}

by Tuyen H

         This article talks about MapReduce (MR) platform with parallel data access. MapReduce is introduced by Google in 2004 for supporting multiple accesses of data. It has been written in many languages such as C++, C#, Java, and even PHP. Famous Social Network Face Book built its massive data warehouse by using the MapReduce technology. The authors also compare the MapReduce technology with database management system (DBMS) “MapReduce complements DBMSs since databases are not designed for extract transform- load tasks, a MapReduce specialty.” Indeed, the combination of MR and DBMS create a powerful of database system. This combination allows big companies to build their database warehouse where they store a lot of information. For example, Face Book store more than 800 million users’ data and information. Google’s data warehouse stores more than 850 TB for supporting user searching. Finally, the authors mention that those technologies are not competition but complementary. 
        This article mentions data warehouse and database technologies such as MapReduce (MR) and database management system (DBMS). Those technologies are really helpful because they allow people to build massive data warehouses. For instance, Google’s data warehouse like a world library; it support business and study around the world. It also likes the world knowledge. In my work place, every time we have questions we “ask” Google. In school, when I research I also “ask’ Google.
        As you know the Jasmine Revolution happened with support of social network like Face Book and Twister. If there are not any database technologies such as MR and DBMS, there is not Face Book or Twister. Therefore, the Jasmine Revolution will not happen.  Data warehouse also use to store much important information that supports government, organization, and education. Imagine that, if the government does not have the database, how can it work? The IRS cannot collect tax. The DMV cannot manage vehicles and so on. I think data warehouse is really helpful, and both RM and DBMS are revolution of the database technology. read more...