Database

Footprint Database used catch criminals! {2}

by Stephen O
Summary:
What comes to mind when you think of databases? You would normally think about a collection of your personal information that you may or may not have given out freely. Combine that with the crime solving and what comes to mind now? Shows like NCSI and CSI and its many spinoff should be first in your thoughts. Crime solving units around the world use databases on offenders to keep track of finger prints, blood type, and DNA. However has it ever crossed your mind…that your foot print could also end up in some national database? If you are like the rest of us this has not yet crossed your mind. The United Kingdom stores the imprints of thousands of shoe types and imprints from “suspects” left at crime scenes. The database was created by The Forensic Science Service and locates matches between shoe prints and crimes, much like DNA databases work here in America. While using footprints to solve crimes is not exactly new, a database allows Scotland yard to solve those tough to solve cases faster. read more...

Google and Its Books {1}

by Joey L
Summary:

This article is relates how digitization of print media is becoming one of the largest infringements in history.  One of the most notable digital infringements is Google’s Books project.  Google Books has become an easy access for anyone to find reference to millions of books and magazines.   Many of these books are claimed to be illegally scanned by Google, which then hands their database of books to HathiTrust, a partnership of universities and libraries.   A few groups of writer unions have filed a lawsuit with Google asserting that Google Books is publishing copyrighted works without the author’s authorization.   Currently, about 7 million books from Google are claimed to be illegally scanned.   Because these scans are unauthorized, the writers are seeking to withdrawn all illegally scanned books from Google’s database.  Google is currently offering the authors to either sign a contract and giving them their share or remove their book from their database.  If the authors do not contact Google, the books will become available to the public.   The lawsuit is still active as of September 15, 2011 and hopefully, Google and the writer unions will reach an agreement soon. read more...

Social Network Face Book and Its Data Warehouse {Comments Off on Social Network Face Book and Its Data Warehouse}

by Tuyen H

         This article talks about MapReduce (MR) platform with parallel data access. MapReduce is introduced by Google in 2004 for supporting multiple accesses of data. It has been written in many languages such as C++, C#, Java, and even PHP. Famous Social Network Face Book built its massive data warehouse by using the MapReduce technology. The authors also compare the MapReduce technology with database management system (DBMS) “MapReduce complements DBMSs since databases are not designed for extract transform- load tasks, a MapReduce specialty.” Indeed, the combination of MR and DBMS create a powerful of database system. This combination allows big companies to build their database warehouse where they store a lot of information. For example, Face Book store more than 800 million users’ data and information. Google’s data warehouse stores more than 850 TB for supporting user searching. Finally, the authors mention that those technologies are not competition but complementary. 
        This article mentions data warehouse and database technologies such as MapReduce (MR) and database management system (DBMS). Those technologies are really helpful because they allow people to build massive data warehouses. For instance, Google’s data warehouse like a world library; it support business and study around the world. It also likes the world knowledge. In my work place, every time we have questions we “ask” Google. In school, when I research I also “ask’ Google.
        As you know the Jasmine Revolution happened with support of social network like Face Book and Twister. If there are not any database technologies such as MR and DBMS, there is not Face Book or Twister. Therefore, the Jasmine Revolution will not happen.  Data warehouse also use to store much important information that supports government, organization, and education. Imagine that, if the government does not have the database, how can it work? The IRS cannot collect tax. The DMV cannot manage vehicles and so on. I think data warehouse is really helpful, and both RM and DBMS are revolution of the database technology. read more...

The “One and Only” Disease-Causing Gene Database {1}

by Monica G
The “One and Only” Disease-Causing Gene Database

We live in a society where information is everything. Everyday we expose ourselves to large amounts of data. Therefore it is easy to say that the people that have all this information have power.The power to change the world. But what if this information could potentially save lives? Should that factor make it free? The human genome is exactly this. It was started over decades ago to map out every single gene, then to identify the disease-causing mutations. The process is still underway, but many of sequences that cause Alzheimer’s or Parkinson’s have been identified. So then the question becomes, why hasn’t this information been made available to the public? One reason being is that the data is not all in one place. Because different clinicians collect data around the world, it seems more logical for it to be stored somewhere accessible to them. But this makes it nearly impossible for other scientists to share all their research. Another reason being that data is big business, databases like Human Gene Mutation Database, require subscribers to pay a fee to access all the information. Other databases like Online Mendelian Inheritance in Man, were started so long ago that much of the information is still in writing, therefore the online version lacks a lot of the substance required. For all these reasons, it would be much easier to have all the information stored in one place. Luckily, some countries like China have taken the initiative to do just this. read more...

Open Source Database for Small Company {1}

by Tuyen H

 

While Eclipse is very successful for Java Development, MySQL and PostgreSQL are very populating open source Databases.We cannot compare with brand name like Oracle, Microsoft, or IBM, but MySQL and PostgreSQL are very reasonable for small business.  If your data is less than a terabyte, both MySQL and PostgreSQL definitely can handle it. However, if your data is really big, you better think of Oracle, DB2, and Microsoft enterprise RDBMSes. Open source database is less secure than brand name database, so you have to keep in mind that “how safe is my data”  (Gravelle, 2011). read more...

Data Mining in Social Media {1}

by Joe C
Summary:

This article talks about data mining in social media, in which companies such as Facebook or Twitter are able to gather data from their users through the information, posts, and other activities the users provide. This type of information gathering is emerging to become a great source of information on upcoming trends and opportunities, however the data has to be harvested and analyzed correctly. While there is a great deal of information, the amount of useful/relevant information compared to irrelevant noise is as little as 20%. Being this way, these companies have to utilize highly advanced artificial intelligence to sift through the noise and retrieve only the useful information. As technology advances, these AI systems are able to perform tasks such as match newly created profiles to old preexisting users in the database to form a more complete profile of a possible customer. This is very useful as it acts like a real-time automatically updating CRM system. read more...