big data

Data could shape your future education {4}

by Brian B
The article I chose is “Big Data on Campus” by Marc Parry. The author talks about data being used by schools to track how students are doing in class and shaping the classes they are taking or changing their major to better suit the student. With the increasing amounts of data that colleges are collecting they are able to recommend classes to students based on past performance in other classes or from their transfer records. They are doing this by using software that is similar to what Netflix uses. They track how well the students have done in past classes and give them recommendations based on that data (Parry, 2012). These programs are coupled with new styles of classes. The classes are held in a classroom but the classes themselves are taught on computers. As a student progresses through the class they are given specific areas to focus on. The class becomes focused on the areas that a student needs help in rather than just having a static curriculum. The data that these campuses are collecting is also being used to get students more involved in campus life. Campuses can use this data to get students involved in clubs or activities “that might tie them more deeply to the institution.” (Parry, 2012). read more...

Facebook Effectively Using its Big Data {5}

by Kathy S
According to the article, Facebook processes about 2.5 billion pieces of content and 500 plus terabytes of data every day. They receive 2.7 billion “Like” actions and 300 million photo uploads per day. Facebook’s Vice President of Engineering says that Big data gives them major insights and helps them to make an impact to their business. He says if they’re not taking advantage of their collection of data, then it’s just a pile of useless data. When Facebook’s data is processed into useful information they are able to make out “new products, understand user reactions, and modify designs in near real-time.” The beneficial data that Facebook possesses is passed on to its advertisers. Facebook tracks how ads perform across different dimensions of users based on gender, age, interests, so they can see which ads are being more effective. Then those specific ads are shown more to make the ad successful. Lastly, this article talks about their “Project Prism”, which is Facebook’s plan to have their live data-set hosted across their data centers in different states across the country. The article also mentions that users might be uncomfortable with the idea that Facebook employees have access to their information and activity, but they assure that it has numerous protections against abuse. If data is being accessed then it is logged so Facebook can track which employees are looking at what. The VP assures that if there is any employee prying where they’re not supposed to, they’re fired. They have a “zero-tolerance policy.” read more...

Problem in The Process of Getting Data {4}

by Phuong H
The article I picked for this week is “The Big Problem with Big Data” written by Jill Duffy. In the article, the author mentions several problems with having big data. It is good to have a lot of information but most of the time people don’t know what to do with the data. One of the biggest problem with big data is data generation. Almost everyone have an account on either Facebook, Google+, or Twitter. Users are asked to provide or share their information in an exchange of some kind of service. In the end, the information that they collected is what the companies want not the information they want to collect. In another word, user generate data to get what they want. The information might not represent the user because they do it for a reason not voluntary. For example, you are asked to fill out a form and some survey questions before you can download the software. How many people would fill in their “real” information? Most of the time, people just fill in the blank to get pass by. According to the survey in the article written by Richard Karpinski, “The survey found that more than 50% of buyers said they provide a valid name, email address, industry, job title and company name when they register; although less than 40% provide accurate phone numbers” (Karpinski, 2007). read more...

Big Data and Analytics {Comments Off on Big Data and Analytics}

by Jennifer R
The author talks about how the business intelligence tools used to interpret data are changing as people continually work on adjusting databases process massive amounts of data. Traditionally the data to be analyzed was relational and stored in cubes, with retrieved information “delivered as standard reports”. There is a demand for tools with ‘data discovery’ properties, where they work with near-real time data “to create adhoc reports and graphs”. The location for data storage for databases is changing in conjunction with the analytic tools. Databases used to be stored on disks due to the size being impractical for storage on RAM. Improvements have made database storage on memory feasible; such databases are described by the author as in-memory databases. There’s also been talk about ‘predictive analytics’ tools where they “try to anticipate what will happen based on trends they spot in the data”. read more...

New Software for Big Data Analytics {1}

by Renee L
In his article “New GigaSpaces Release Promises ‘Big Data’ Analytics in Real Time,” Mark Brunelli talks about the latest software from GigaSpaces Technologies, that gives users the ability to build their own real-time big data analytics platforms.  GigaSpaces Technologies is the leading provider of a new generation of application platforms for Java and .net environments, and provides scaling solutions and cloud enabling technologies. Big data analytics is the process of examining large amounts of structured and semi-structured data (big data), and its goal is to help organizations make better decisions while analyzing huge amounts of data. GigaSpaces new software called XAP 9.0 allows users to create and launch high-performance real-time analytics systems for applications that are processing large amounts of data sets. According to GigaSpaces, XAP 9.0 was created to help reliability and scalability issues when dealing with other big data technologies in real time, such as Hadoop. Some of the great features that XAP 9.0 offers are real-time streaming data processing, parallel processing, fine grained data compression, reduced memory footprint and the ability to integrate with databases, such as Hbase, Cassandra, and MongoDB. XAP 9.0 also supports cloud environments by allowing users to run applications on public or private clouds, and offers scaling, which reduces the cost of running big data applications. read more...

Microsoft SQL Update Caters to Big Data Needs! {2}

by Joshua L
The article I read for this week talks about Microsoft’s most recent update to SQL server. With the release of SQL server 2012 Microsoft has turned SQL Server into a tool that can help organizations analyze large amounts of unstructured data, an increasingly popular practice called big data. The new software is helpful for social networking analysis services. The new software is also very adaptable for use with cloud computing services. The software has already been proven to process about 350GB of social networking data each day by a company called Klout. The article says that SQL Server 2012 is a modern data platform that embraces the world of traditional structured data and brings in the world of unstructured big data. read more...

What is Big Data and How Can We Manage It? {2}

by Renee L
In his article “‘Big Data’ Technologies Emerge to Battle Large, Complex Data Sets,” Mark Whitehorn talks about the new database features that help organizations easily store, manage, and analyze huge amounts of data. The term “big data” is defined as large volumes of structured data (eg. text) and semi-structured data (eg. images). Organizations find it difficult to deal with semi-structured data, for the databases they use are usually relational databases, which store data in tables comprising rows and columns. These columnar databases work very well with well-structured data because it makes data faster to read; however, it doesn’t do much good for semi-structured data. That is why experts have came up with new technologies that can easily store big data. Such technologies are NoSQL, Hadoop, and MapReduce. NoSQL is a non-relational, distributed, and open source database and it can handle large amounts of data across many servers. Hadoop is an open source shared storage system and provide redundant storage and MapReduce is a software framework and programming model that can combine and organize huge amounts of data from many different sources. With these new technologies, organizations are able to cope with large volumes of both structured and semi-structured data and can add more value to their service with customers. read more...

Our Information’s Importance to Corporations {Comments Off on Our Information’s Importance to Corporations}

by Tyler K
 

In the Guardian article chosen, the author describes how Wal-Mart has recently purchased Facebook’s popular “Social Calendar” application, and the implications of the purchase – the corporation now has access to the millions of users, and all of the friends of the users that they may refer to using the application. The author goes on to describe the future role of information, and how information on people, as consumers and for other roles, is quickly becoming a massive component of the modern world. The article details how important information really is, “Facebook’s projected $100bn value is based on the data it offers people who want to exploit its social graph,” (Krotoski, 2012) – essentially, the success of the social media site relies heavily upon offering all of the information the user believes is just to be shared with friends. This leads to a discussion on the future of individual privacy and what having information online might lead to – it even refers to my previous blog about Target analysts discovering a woman is pregnant before even her father can! The author chooses the phrase “aggregated data,” or “Big Data” to describe what is happening to all of the information on us online; powerful organizations are collecting various facets of data about individuals, whether it be to market to them better, determine whether to hire individuals (several recent cases involve companies asking for Facebook passwords, or simply collecting the information on their own), and one example even details using social media to gather information on a man that attempted murder! read more...

‘Big Data’ concern and confusion {Comments Off on ‘Big Data’ concern and confusion}

by Quoc L
Companies are not prepare for the influx of data coming into their systems. These ‘big data’ is hammering these companies systems and yet most employee still don’t know how to deal with it. Those that understand the important of ‘big data’ do not have the require tool to efficiency manage these data. A survey conduct by Echelon One, finds that most companies does not put, managing big data in the fore front of their operations or does not know what it is. The survey was collected from a multitude of industries, such as manufacturing, educations, governments, healthcare and many others. Responses was taken from a director level or higher. Those that do manage big data are not using the right tool to monitor and manage it. They would use separated spreadsheets and/or another databases for it. I find it interesting how important these data is to companies but they do not seem to care about monitor or managing it. These data can improve security or provide insights on improving the companies. Only 49% of all the companies that are surveyed, are concern about big data. However they all know how important data is for their companies. It seem like these director level employees are only look at the short term management of the companies only. read more...

Big Data, Big Problems {8}

by Jongwoo Y
Zappos and 6PM, two of the largest online clothing retailers in the United States, have recently experienced a security breach in their big data. This cyber attack has effected the information of 24 million customers, the largest amount since the attacks that Sony had received last year. Barbara Scott, a director of a technology services business and victim to these recent attacks, believes that this type of attack should not be viable for companies with so much financial backing. “You would think companies like eBay or Amazon have the financial backing wherewithal to take the proper security measures.(Perlroth, 2012)” The attacks have been able to retrieve the customer names, encrypted passwords, phone numbers, e-mail addresses, and the last four digits of customer credit cards. What was even more outrageous than the attacks was the email that millions of Zappos customers received from the CEO, Tony Hsieh (who is usually held in high regard). In the email, Hseih apologizes, but instructs customers to send emails for questions rather than calling the support lines because they “simply aren’t capable” of taking the expected number of calls(Perlroth, 2012). This incident has left a sour taste in many Zappos customers, not only did they feel less secure with the breach, but also as if they have been not taken care of after the incident had occurred. read more...