August 26th, 2015

Real-time Maps with Event Hubs and SignalR Using Leaflet

Michael Lanzetta
ML Manufacturing Lead

Problem

Here on the Partner Catalyst team at Microsoft, we work with partners from all over the world on everything from long-term projects to week-long hackfests. Recently, we spent a week in Hamburg, Germany, working with Ströer to help them tackle some thorny data visualization and data-handling problems.

They had a large corpus of geographic data on the foot-traffic patterns of people within the city (entirely anonymized), and wanted to gain some insight as to where traffic might congregate and when, getting a rough sense of “page views” for their physical advertising emplacements (e.g. billboards). Naturally, as in any mature industry, they had groups and standards providing rough values for these numbers, but real data was harder to come by and until now had required large surveys and estimation.

If we could gather this data in real-time, we could not only more effectively target content to where it will be seen, but we could measure engagement. If we could measure engagement, we could work with the city of Hamburg to try and make the people happier, and measure results (this was my first inkling I wasn’t working with a US company).

Visualizing the Data

These were grandiose notions, to be sure, but the first step was to visualize the data. As with all geographic data, the only way to get a proper sense of what it looks like is to visualize it on a map, so we needed a way to get real-time data from mobile applications to our map visualization. The resulting website after all was said and done (remember, we had less than a week 🙂 ) looked something like this:

Image route animation

The Infrastructure

We built a Cordova-based mobile app driving geo data into Event Hubs (for future data), and a C#-based CLI for driving their existing data into Event Hubs as a spoof “real-time” feed. Now that we had our geo data, we built an EventProcessorHost to consume it from within our ASP.Net website, do any required pre-processing, and drive the resulting data through to listening clients via SignalR. Our main client was a single-page web application using Leaflet.js to render the most recent paths, and per-person and aggregated heatmaps.

Image geo data flow

The ASP.Net application was released on GitHub, and a more detailed walkthrough can be found on my blog.

Future work involves integrating our more precise and stable Geodesy library for location binning and accurate heatmaps, and follow-on work involves driving a Spark pipeline from the EventHubs data to do additional aggregation and processing. These works in progress will result in future case studies, but the Geodesy library is out now on Nuget.

Using the Website

To use this application yourself, you just need to set up an appropriate Event Hub and drive traffic through it. With the CLI I have checked in, it sends random point data through the pipeline and then renders it, so you don’t even need a data source to get started.

I’ve created a utility method that allows you to pull configuration data from either the cloud config, web/app.config or environment variables of the same name:

    private static ConcurrentDictionary<string, string> _ConfigurationEntries = new ConcurrentDictionary<string, string>();

    public static string FromConfiguration(string name)
    {
        return _ConfigurationEntries.GetOrAdd(name,
            x => CloudConfigurationManager.GetSetting(x) ?? Environment.GetEnvironmentVariable(name));
    }

This allows you to set environment variables with the configuration entries for your Event Hub, and then easily run the CLI and website without having to muck around in your app or web configs (or worry about checking in keys, in my case).

To run both, you can see from EventHubRoutePointSource that you just need to set the following variables:

    public class EventHubRoutePointSource
    {
        public async Task StartAsync()
        {
            var ehConnStr = AzureUtilities.ServiceBusConnectionString(
                AzureUtilities.FromConfiguration("MappyServiceBusNamespace"),
                AzureUtilities.FromConfiguration("MappyEventHubSASName"),
                AzureUtilities.FromConfiguration("MappyEventHubSASKey"));
            var storageConnStr = AzureUtilities.StorageConnectionString(
                AzureUtilities.FromConfiguration("MappyStorageName"),
                AzureUtilities.FromConfiguration("MappyStorageKey"));
            var eventHubName = AzureUtilities.FromConfiguration("MappyEventHubName");
            var consumerGroup = AzureUtilities.FromConfiguration("MappyConsumerGroupName") ?? "$Default";
    ...

Once those are set, build and run the CLI and then run the website, and you should see your little people start cruising around on the map.

Author

Michael Lanzetta
ML Manufacturing Lead

I've been a professional software developer and machine learning engineer for over 15 years, working with Python, R, Node.js, C#, Java, and C++ in a variety of companies from start-ups to large firms. My experience spans both Windows/.NET and *nix/Linux-based, with heavy DB and NoSQL experience and a focus on global distributed systems. Over the past few years, my focus has shifted to large-scale ML deployments using Spark, Scikit-Learn, Spacy, AzureML, TensorFlow, Keras, and PyTorch. I've worked with everyone from small startups to companies and organizations like Disney and the World Bank. While at my previous position in Amazon I worked heavily in the SOA space, writing HA/DR services in both Java and C++. I've also worked on heavy compute jobs with core libraries written in Matlab for demand forecasting at the scale of Amazon's product catalog. Now at Microsoft, I've worked on large-scale C# services, Petabyte-scale distributed map/reduce data processing jobs, and a stint working on Java/Spring for the Farecast (Bing Travel) team. I spent some time in MSR at FUSE Labs where I worked on So.cl and a Paxos-based distributed data-store, but left to work more closely with Microsoft partners in DX (now CSE). After running the ML Technical Working Group, I now lead a team in Asia and Europe focused on ML/DL solutions for our largest partners in the Manufacturing space, and advising the AI+R division on future directions with a focus on the developer.

0 comments

Discussion are closed.

Feedback