Showing category results for Big Data

May 13, 2021

Observability for Event Stream Processing with Azure Functions, Event Hubs, and Application Insights

Shervyna Ruan

With distributed tracing handled by Azure Functions, Azure Event Hubs, and Azure Application Insights behind the scenes, Azure Application Insights provides useful monitoring visualizations of the system that helped us easily understand system performances and troubleshoot failures.

Feb 5, 2019

Assessing the Severity of Acne via Cell Phone Selfie Images Using A Deep Learning Model

Hang Zhang

Nestlé Skin Health partnered with Microsoft to develop a deep learning model powered mobile app able to assess acne severity using only uploaded selfie images as a source.

Jan 18, 2019

Running Parallel Apache Spark Notebook Workloads On Azure Databricks

Clemens Wolff

This article walks through the development of a technique for running Spark jobs in parallel on Azure Databricks. The technique enabled us to reduce the processing times for JetBlue's reporting threefold while keeping the business logic implementation straight forward. The technique can be re-used for any notebooks-based Spark workload on Azure Dat...

Jan 2, 2019

Real-Time Time Series Analysis at Scale for Trending Topics Detection

Omri Mendels

This code story describes a collaboration with ZenCity around detecting trending topics at scale. We discuss the datasets, data preparation, models used and the deployment story for this scenario.

Dec 12, 2018

Social Stream Pipeline on Databricks with auto-scaling and CI/CD using Travis

Mor Shemesh

This code story describes CSE's work with ZenCity to create a data pipeline on Azure Databricks supported by a CI/CD pipeline on TravisCI. The aim of the collaboration was to create a pipeline capable of processing a stream of social posts, analyzing them, and identifying trends.

Jul 30, 2018

Unsupervised driver safety estimation at scale, a collaboration with Pointer Telocation

Omri Mendels

A scalable unsupervised approach for driver safety estimation on Pointer Telocation's dataset

May 1, 2018

Runtime Configuration of Spark Streaming Jobs

Kevin Hartman

We achieved zero-downtime reconfiguration and management of the Spark Streaming job used in Project Fortis with Azure Service Bus.

Jan 24, 2018

Azure Event Hub Ingestion at Scale with Python and Kubernetes

Tomer Rosenthal

We created a solution to ingest Azure Event Hubs events at scale using Python and Kubernetes.

Jun 29, 2017

IoT Sports Sensor Machine Learning Helps Amateurs Up Their Game

Patty Ryan

We use IoT sensors to collect positional and motion data from professional and amateur skiers to classify expertise and skill level through machine learning.

May 10, 2017

Project Fortis: Accelerating UN Humanitarian Aid Planning with GraphQL

Erik Schlegel

Using GraphQL and Azure to create a data processing pipeline for identifying trends and providing insights about global humanitarian crises.