Data

Google Has Done it Again {3}

by Katheryn T
Does it really surprise anyone?  Google has always been a leader in innovations and thinking outside the box.  This time, they have discovered a way to super connect all their databases in real time.  The article I read spoke about how Google was able to create an accurate connection between world-wide servers. ” Today, there are many databases designed to store data across thousands of servers” (Metz, 2012).  What Google has done is come up with a way to have multiple data centers and millions of servers instead of just a few data centers with thousands of servers.  Essentially, they have installed atomic clocks, which keep real time on the atomic level, and GPS antennas on the actual servers.  The combinations of these two devices helps keep the servers in perfect timing. This new system is called Spanner and the two devices are part of their TrueTime API. This new method will help users in different parts of the world have a consistent view.  From the article, this isn’t a very expensive idea, but not everyone can implement. read more...

Big Data joined by Nodeable {1}

by Rudy P
I chose to read the article, “Twitter’s Open Source Big Data Tool Comes to the Cloud Courtesy of Nodeable” by Klint Finley. This article gives details of a new Big Data analysis software by Nodeable called StreamReduce. This software is based on a software created by a Twitter owned company called storm. Finley goes on to state, “StreamReduce is essentially Storm hosted in the cloud, with a few extras such as connectors to Apache Hadoop” (Finley, 2012). Nodeable began as a company which developed software for both data management and analysis, however due to market demand they have shifted their attention to analysis. Nodeable is joining an increasingly competitive market where companies such as HStreaming already exists, and possible companies such as Amazon may join in the near future. read more...

Data could shape your future education {4}

by Brian B
The article I chose is “Big Data on Campus” by Marc Parry. The author talks about data being used by schools to track how students are doing in class and shaping the classes they are taking or changing their major to better suit the student. With the increasing amounts of data that colleges are collecting they are able to recommend classes to students based on past performance in other classes or from their transfer records. They are doing this by using software that is similar to what Netflix uses. They track how well the students have done in past classes and give them recommendations based on that data (Parry, 2012). These programs are coupled with new styles of classes. The classes are held in a classroom but the classes themselves are taught on computers. As a student progresses through the class they are given specific areas to focus on. The class becomes focused on the areas that a student needs help in rather than just having a static curriculum. The data that these campuses are collecting is also being used to get students more involved in campus life. Campuses can use this data to get students involved in clubs or activities “that might tie them more deeply to the institution.” (Parry, 2012). read more...

Facebook Effectively Using its Big Data {5}

by Kathy S
According to the article, Facebook processes about 2.5 billion pieces of content and 500 plus terabytes of data every day. They receive 2.7 billion “Like” actions and 300 million photo uploads per day. Facebook’s Vice President of Engineering says that Big data gives them major insights and helps them to make an impact to their business. He says if they’re not taking advantage of their collection of data, then it’s just a pile of useless data. When Facebook’s data is processed into useful information they are able to make out “new products, understand user reactions, and modify designs in near real-time.” The beneficial data that Facebook possesses is passed on to its advertisers. Facebook tracks how ads perform across different dimensions of users based on gender, age, interests, so they can see which ads are being more effective. Then those specific ads are shown more to make the ad successful. Lastly, this article talks about their “Project Prism”, which is Facebook’s plan to have their live data-set hosted across their data centers in different states across the country. The article also mentions that users might be uncomfortable with the idea that Facebook employees have access to their information and activity, but they assure that it has numerous protections against abuse. If data is being accessed then it is logged so Facebook can track which employees are looking at what. The VP assures that if there is any employee prying where they’re not supposed to, they’re fired. They have a “zero-tolerance policy.” read more...

Tackling Data {5}

by Garcello D
This article spreads some knowledge about accumulation of data in businesses. There is so much digital data that if it was broken down into bits there would be more data particles then stars in the universe. The problem is that there is no real way for businesses to stop this data agglomeration so they are faced with the hideous task of having to analyze the data. The reason they would want to analyze data in their business is so that they can accurately calculate their profit and losses, but also the demographic information on their worst and most loyal customers. All this data can be complicated to manage, but most importantly it can cause performance issues on a network that is crucial to the business. Furthermore the article gives some ways of how IT managers tend to deal with this issue, yet all of them come with a negative side effect for the business. In conclusion it goes over that the best way to solve this issue is not to add more and more hardware, but instead to approach the way we store the data in a different database formats. read more...

BioGRID Updates Database {2}

by Alexander H
The Biological General Repository for Interaction Datasets (BioGRID) offers a public database of information that breaks down genetic and protein findings from humans and organisms.  After the recent update, a total of over 500,000 genetic and protein interactions complete the database.  The recent update has brought enhancements to their web interface and allows for rapid queries across multiple data types and sources.  The database is designed to forward engineer advancements in any applicable area of science and share information across the platform.  In addition, the entire data collection of the database can be downloaded in multiple file formats to be used for any purpose. read more...

Database for Works of Art {2}

by Alexander H
Technology has a trend of being one step ahead of its consumers. Databases housing tons of information have continued to utilize their resources in order to predict and maximize the manner in which users will interact with the database. Carter Cleveland, a Computer Science Engineer from Princeton University, and Sebastian Cwilich, a former executive at Christie’s Auction House and Haunch of Venison Gallery, are teaming up to bring together a new way to appreciate and find works of art. Art.sy seeks to build on the principles set forth by major companies, like Google, by implementing a database of artistic pieces in order to connect and recommend works of art for its users. This is completed by evaluating artworks using 800+ characteristics called “genes” across various pieces of art. These “genes” are essentially categories to identify artwork, such as historical movements, subject matter, and formal elements. The concept is designed to create an experience where the user can stumble upon works of art based on the “genes” that are appealing to the user. The database is still in an infant state, however, gaining the force of global leaders in the arts can move this project forward and create a central database for all works of art. read more...

Massive data a problem… not really {2}

by Robert Q
This article titled “Storing Massive Data: Distributed Data and the no SQL Movement”, talks about the general programs that are available to handle the problem of massive data. The author points out that “big data” is just huge amounts of information that are collected from all parts of the internet. The author explains some of the available software that are there to improve the storage of these data. Some of the software that the author mentions are Hadoop, Cloudera and Hortonworks, there are many other software that the author mentions. The author noticed that businesses are trying to find better and better software to handle their demand of information. Companies are starting to as some vendors such as Amazon and Oracle to develop better software that can handle real-time analysis database. As information continues to grow the future for this industry would appear to continue in its positive growth. read more...

Hacker Busted by Smartphone {5}

by Joshua L
The article I read is about a hacker by the name of Higinio O. Ochoa III, a Linux administrator, who was charged by the FBI with hacking into U.S. law enforcement agencies and posting online the home addresses for police officers, including those of more than 100 Los Angeles police officers. He was caught because on the site he posted the information he used a picture that he took with his cell phone that was traced back to his phone. He didn’t know about the fact that EXIF data (location, camera type, and other image information) is included in every photo taken with a smartphone. read more...

Cisco Says Mobile Data Traffic Is Rising {Comments Off on Cisco Says Mobile Data Traffic Is Rising}

by Han C
According to a recent forecast, Cisco’s visual networking index has listed global mobile data traffic to increase by 18 fold within the next five years. The VNI forecast is an ongoing initiative to track the impact of visual networking applications on a global scale. By examining the trends of the last four years, the article is able to show that average smartphone usage nearly tripled in 2011 and that mobile video traffic exceeded 50% for the first time in 2011. In addition, Android data usage is actually higher than iPhone users. The crucial factor here is to promote the increase in mobile speeds such as 4G, which includes: WiMAX, and Long-Term Evolution. 4G only holds 0.2% of mobile connections but is already accounting for 6% of all mobile data traffic. As mobile data traffic continues to rise along with user demand, the article estimates that the number of mobile connected devices will exceed the world’s population sometime this year. read more...