Showing category results for Big Data

Jul 21, 2015

Extracting XML Data from HDFS Sequence Files

Beat Schwegler

In this post, you will learn how to implement a MapReduce job to efficiently extract XML data from HDFS sequence files using a custom RecordReader.

Big Data
Jul 21, 2015

Parallelized Bulk Copy of Data from AWS to Azure

Anthony Turner

Many customers want to move data easily and efficiently from Amazon Web Services’ Simple Storage Service (S3) to Microsoft Azure storage. In this case study, you’ll see how to take the simple notion of a cloud file copy and make it fast and efficient using parallel techniques.

Big Data
Jul 21, 2015

Building a ReactJS Spreadsheet Component

Anthony Turner

During the Microsoft Ventures hackathon in May 2015, it became obvious that one of the startups required a standalone Excel-like spreadsheet component for the web. This post describes the resulting React component, how it was built, and how it can be used today.

Big Data
Jul 21, 2015

Prediction of diabetes hypoglycemic events

Beat Schwegler

This post describes creating a Microsoft Azure Machine Learning (MAML) model that predicts diabetics' hypoglycemic events based on blood glucose measurements alone.

Big Data
Jul 21, 2015

Processing Time Series Data with HDInsight Storm

Tim Park

At Microsoft, we are working with a number of car manufacturers on processing telemetry streams. One of the key business scenarios for the incoming car telemetry is to offer real-time driver risk scoring for end-user driving guidance applications, fleet administration, and usage-based insurance.

Big Data