Data modeling

New Big Data Underwriting Models introduced by Zestcash {3}

by Hongde H
The article I read for this week is about a new big data underwriting models that is introduced by ZestCash. The model helps analyze credit risk in an better accuracy that would allow the company to extend credit to 25 percent and increase repayment from customers by 20 percent.

According to what I read, ZestCash underwrites by combining Google-style machine learning techniques and data analysis, and traditional credit scoring. As a result, the company can offer credit to people who would be mistakenly turned away. read more...

Big Data and its Effect on Businesses {3}

by Andrew S
This article was a very interesting read for me, it mainly talked about big data and how it is growing in data analysis.  With the infinite amount of data on the Web, data consultants are in high demand to make sense of all the data.  It also talks about how data is the new class of economic assets, as it can serve to be a valuable tool for big companies.  Facebook and Google are just some of the online conglomerates that harness the data of the Web with their own advertising.  It talks about what exactly is Big Data, which is a marketing term for the advancing trends in technology, and how it affects the digital age today.  There are many ins and outs of the influence of data in today’s digital age and this article just touches on the surface of it. read more...

Current data models do not meet today’s needs {3}

by Brian B
The article I chose this week is named “Toward a Next Generation Data Modeling Facility: Neither the Entity-Relationship Model nor UML Meet the Need” by David Kroenke and Cary Donald from the Journal of Information Systems Education. It starts off by talking about the importance and the purpose of data modeling. The article states that “the main use for data models is to describe and document the user’s view of the world. (Kroenke & Gray, 2006)” The authors then put forth a list of criteria that they believe are required to create a new and better suited data model for today’s world. The first criterion that they go into depth about is that a model needs to be “sufficiently robust.” This means that the model needs to be to represent the user’s idea of how objects are defined.  They then talk about their next requirement, that a model be as simple as possible. The requirement states that the models need to be easy to understand to someone with even the most basic training in data modeling. The next require is that a data model must be independent from anything else. They say that “A data model should be a representation of the users’ semantics, and nothing more. (Kroenke & Gray, 2006)” The next requirement that they list is that a data model must “utilize domains with inheritable properties.” This would allow for a data model to be easily changed later down the road without having to go back and manually change every attribute that is incorrect.  The last requirement is that a data model support data migration. This would allow for data models to be more easily changed as new requirements are added to the model. The article then goes into a brief history of the E-R diagram and talks about the Erwin software and its ability to convert between IDEFIX & IE notations. After that the author starts to test the IE notation of E-R diagrams. He finds that while it is robust, it does not meet any of the other criteria that they have laid out in the article. They conclude that “Based on years of data modeling experience, we are convinced that there is a fundamental conceptual problem with the E-R model: namely, it represents entity relationships! Most users do not think of relationships as things.” Because of this many users have problems learning E-R modeling. They also have a small section on UML which finishes with them saying that it has roughly the same problems that E-R diagrams do. The article ends with a conclusion that there is a need for a new data model and that it should follow the criteria that had been laid out earlier in the article. read more...

FBI Moves from Paper to an Electronic Database {5}

by Eric C
The FBI has always been known for their high-tech equipment they use from computer forensic equipment to the specialized tech gear that agents use. However, it wasn’t until recently when the FBI finally completed a new database system that eliminates paper files and moves to a more modern approach of digitizing paperwork. This new electronic file management system is called Sentinel and it was originally scheduled for completion in 2009 with an estimated budget of $425 million. Due to delays and poor planning and organization, it was about three years late and about $26 million over budget. This project of transferring the FBI to an electronic system was contracted to Lockhead Martin Corporation, who managed the project, but was taken over by the FBI due to delays. read more...

CA Technologies-The Future of Data Modeling in the Cloud {1}

by Alexander H
John Heywood once said “two heads are better than one”. In our ever growing world of technology, this quote is quickly becoming the normal standard for the creation of products. From simple brain storms to application development, thinking in groups allows a broader range of ideas to flourish. This concept is being implemented by CA Technologies, the developer of our ERwin Data Modeler, with the help of the cloud. The ERwin software allows a user friendly approach to data modeling and is consistently utilized for data structure management. However, CA Technologies wishes to take this tool one step further in the form of three various releases that aim to take data modeling into the cloud. The first update has enabled “integration and management in Microsoft’s cloud environment”. This allows the data to be in a single area when share in the cloud. The second release in the trio is the CA ERwin Web Portal which is an “interface that allows for data architects to build their database models” in such a way that business and IT users can access the material. Finally, the ERwin Data Modeler r8.2 was altered in ways that accommodated the sharing premise, such as the availability of concurrent licensing agreements.
Being one of the programs we utilize in class, it is important to note that this is a professional program that successful businesses utilize in order to model their databases. I believe this article highlights the importance of cloud computing and how it can revolutionize how a company models its data. A group of database architects may be able to build a great database. However, if the development of the database was distributed and shared throughout various companies, whether they are sister companies or simply have a similar focus, the ideas from all areas can benefit the company immensely. This also leads to consistent data modeling throughout various companies and break free from the confines of the company for new and fresh ideas.
The main concern I have, as noted by senior director Donna Burbank, is the security of such information in the cloud. The models created could be potent in the hands of competitors if they ever got a hold of the information. I believe that overtime, the issue of security with cloud computing will be fortified and made more secure. This key element will open the lines for worldwide collaboration and I think that CA Technologies is taking a step in the right direction as technology progresses. read more...

Significance of Data Modeling in Today’s Business {2}

by Michael V
In today’s modern day business, the day to day operations of a company are so complex that it would be impossible to process by normal means. Raw data is no use to any human being until it is processed to a point of becoming “information” that can be readily absorbed by people to gain the intelligence over business affairs that they need. This is where data modeling comes into play. Data modeling is essentially a process in which basic data in the form of unintelligible words and numbers is converted into a picturesque format that allows information to be absorbed with a mere glance. read more...

The Data Modeling Approach {2}

by Anthony T
The article focuses on data modeling as a way to improve the effectiveness of businesses. It outlines strategies in which businesses can benefit from reducing cost of marketing, higher revenue, better profitability, and competitive advantages. Predictive modeling tools, according to the article, can be especially useful in analyzing factors which are directly related to product profitability. The author also mentions the benefits of reliability and accuracy of supplier relationship management when building data models. The accessibility of modeling tools is also discussed in the article and how it is no longer only at the dispose of statisticians or experts. Modeling tools and data analysis is becoming more and more available for marketers to use. read more...

CA Technologies supports ERwin Modeling Tool using Cloud Database {1}

by Davina V
CA Technologies, in an effort to make data integration easier for businesses resting in “on- premise and cloud systems”, has added support for Mricosoft’s SQL Azure cloud database to its CA ERwin data modeling tools. Sadly the author lowers the quality of the article by advertising the new ERwin 8.2 slightly in the article. This article also mentions the potential growth of the use of cloud databases in businesses by quoting Donna Burbank. The quote said that “about 10 to  15 percent of businesses are using cloud databases to day, about 75 percent are interesting in their potential use” (Whiting, 1012). read more...

Tips on how to choose the correct data types {4}

by Willen L
In this article the author focuses on the importance of proper data types in order to maintain data quality. He gives us some general tips and rules to follow to ensure the correct type is chosen. First, if the data is numeric, favor SMALLINT, INTEGER, OR DECIMAL Data types. Second, if the data is a character, use CHAR or VARCHAR data types. Third, if the data is date and time, use DATE, TIME, or TIMESTAMP data types. Lastly, if the data is a multimedia, use GRAPHIC, VARGRAPHIC, BLOB, or DBCLOB data types. These rules seem simple enough but he states that it is a widespread problem that people are using improper data types. The most common ones he encountered were using CHAR type on date data and using CHAR type on numerical data. Choosing the wrong data type may slow down the system and in general it is best to assign correct data type that best matches the values in the domain to improve overall data quality. read more...

Physical Data Warehouse Advantages with UML {Comments Off on Physical Data Warehouse Advantages with UML}

by Ermie C
This article is about how there are several approaches on how to create a data warehouse.  The specifics on which parts of a data warehouse is approached are the logical and conceptual modeling, the use  of something they call ETL(Extraction, Transformation, Loading), and unique customization of the schema of the database.  They conclude that when creating a DW, it needs many decisions to create a more efficient database.  So they started to used the design and implementation tools from UML.  UML stands for Unified Modeling Language and with this language it would “reduce the overall development time of a DW, such as replicating dimension tables, vertical and horizontal partitioning of a fact table, and the use of particular servers for certain ETL processes. (Mora)” read more...