If I zoom in to the street level you can see that I used our Azure Cognitive Services and AI to identify the pools and correlate them with Bing Maps to get their addresses. With the addresses I could quickly do a lookup to my sales database to determine if the address was already a customer (the yellow square) or not a customer (the red square). Note: it’s not perfect, it clearly missed a pool…but this is something I created in just a few days.
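The lookup step can be sketched in a few lines of Python. This is a minimal illustration and not the code behind the map: the sample addresses, the `normalize_address` helper, and the color labels are hypothetical stand-ins for the geocoded pool detections and the sales database.

```python
import re

def normalize_address(address: str) -> str:
    """Lowercase, strip punctuation, and collapse whitespace so trivial
    formatting differences do not block a match."""
    cleaned = re.sub(r"[^\w\s]", "", address.lower())
    return " ".join(cleaned.split())

def classify_pools(pool_addresses, customer_addresses):
    """Label each detected pool address as an existing customer (yellow)
    or a non-customer (red), mirroring the squares on the map."""
    customers = {normalize_address(a) for a in customer_addresses}
    return {
        address: "yellow" if normalize_address(address) in customers else "red"
        for address in pool_addresses
    }

# Hypothetical sample data standing in for the real sources
detected_pools = ["12 Oak St.", "48 Elm Ave", "7 Pine Rd"]
sales_db_addresses = ["12 oak st", "7  Pine Rd."]

labels = classify_pools(detected_pools, sales_db_addresses)
```

Exact matching after normalization is the simplest possible choice. Real address data would likely need a proper address-standardization step before the comparison.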
There are lots of stories about hedge funds using drone data to determine retail store traffic or the number of container ships in Chinese ports. This can provide broad strokes on the health of a region or the economy in general, or we can laser focus to a particular address.
Think of all the apps you run on your smartphone. On the surface it would seem the business model for many of them is ad-driven. They monetize the ad impressions and are carefully targeting the CTR (clickthrough rate). But that’s only part of the story. Many are selling altdata to businesses to semantically enrich their existing data.
Thought experiment: out of the apps you use, which ones might be tracking users in a way that would provide lift to your business?
Here’s one example: Uber. With the user’s permission, Uber (likely Lyft too, but I’m not sure) can sell location data to food and retail industry players. Other companies can leverage this data to provide discounts and promotions personalized to the specific customer.
What is Digital Transformation? My definition is simple. A digitally-transformed company has learned how to monetize its data. That could mean leveraging its data to control costs or increase revenue, but at the extreme it means the company is selling its valuable data assets to others.
Thought experiment: If you had access to any one single company’s data assets, which one would it be and why? Now, go research if that company has monetized its data.
Here are some companies that have:
There is a general trend in the worldwide economy where intangible assets are a higher percentage of the balance sheet than ever. The biggest factor is likely monetized data, but someone should do some research to confirm this. What is really interesting is that tangible assets depreciate over time. Intangible assets, like data, don’t. How can you leverage this asset class?
The Financial Services industry loves seeking “alpha” (the industry’s equivalent term for “lift”) in altdata sources. Some interesting altdata ideas for the finance industry:
Named Entity Recognition or simple sentiment analysis. We can also look for specific phrases and terms.
Customers always ask me for interesting ideas for altdata in their industry. Fact is, I don’t know your industry as well as you do. Any ideas I may have, you’ve probably researched. My response is to think of use cases that your competition might not also be researching. A big area to focus on is Risk Management. Finding altdata that can mitigate risk should provide lift. Every industry has different risk management profiles, but let’s look at an example to get you thinking creatively.
Cambridge Mobile Telematics recently acquired TrueMotion. Both companies provide vehicle telematics data to auto insurers to reduce risk. Well, why couldn’t you leverage similar data? Traditional auto insurance risk rating factors such as age, gender, credit score, zip code, moving violations, and type of vehicle are less predictive of accident risk than actually looking at driver behavior…via OBD-II on-board vehicle telematics. Those traditional risk rating factors are just proxies for likely driver behavior. Younger drivers tend to be riskier, as are middle-aged men driving red sports cars. Or, that’s the theory.
I will NEVER install a telematics device in my vehicle that will send data to my insurer. I can assure you that my risk profile using the traditional rating factors is much, Much, MUCH better than my actual driving behaviors. (I probably shouldn’t admit that).
CMT will likely create additional datasets to monetize for industries beyond auto insurance. You might be able to glean valuable insights about your customers if you knew their driving habits. How can knowing my customers’ risky behaviors provide me with competitive advantage? The bulk of CMT’s employees are data professionals, so I’m sure they are dreaming up new data monetization avenues.
You can acquire telematics driving altdata from lots of vendors.
Thought experiment: Who better to provide auto insurance than the auto manufacturers that have access to all of your vehicle telematics, service history, credit, etc? General Motors has announced they are planning to offer their own auto insurance that they will bundle with OnStar. Brill-yunt! They are monetizing their data. That is Digital Transformation!
Banking and insurance are highly regulated industries and, by necessity, tend to be slow to change. This has allowed innovators from micro-lenders to payment processors to leverage data and invest heavily in digital services. One of the enablers of this trend is better risk management from altdata.
These companies are leveraging altdata like:
Yep, all of this data, in some de-identified fashion, is available to many industries. Surprising, isn’t it?
Consumption data is its own category of altdata. Right now this is huge in financial services, and its potential in other industries is enormous. Quite simply, consumption data is business transaction-related information that can augment your predictive analytics.
Consumption Data Analytics is the aggregating of online and offline (brick-and-mortar) consumer purchase activity, merged with consumer behavioral datasets, geolocation data (where was your smartphone when you made that online purchase), and other point-of-sale vendor data (also available for a fee).
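As a rough illustration of that aggregation, here is a minimal Python sketch that combines online and offline purchase records into a per-customer view. Everything in it is an assumption made for the example: the record layout, the de-identified customer keys, and the `spend_by_customer` function are hypothetical, and real vendor datasets will look different.

```python
from collections import defaultdict

# Hypothetical records: (de-identified customer key, channel, amount in dollars)
online_purchases = [("c1", "online", 40.0), ("c2", "online", 15.5)]
offline_purchases = [("c1", "store", 60.0), ("c3", "store", 22.0)]

def spend_by_customer(*sources):
    """Combine purchase records from any number of sources into
    per-customer totals, broken out by channel."""
    totals = defaultdict(lambda: defaultdict(float))
    for source in sources:
        for customer, channel, amount in source:
            totals[customer][channel] += amount
    # Convert the nested defaultdicts to plain dicts for easier inspection
    return {customer: dict(channels) for customer, channels in totals.items()}

profile = spend_by_customer(online_purchases, offline_purchases)
```

Geolocation or behavioral datasets could be folded in the same way, as additional sources keyed on the same customer identifier.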
Where can you get offline purchase activity? Well, the credit card companies (among many others) provide various levels of aggregated datasets for sale. This includes offline purchase activity.
Consumption Data Analytics in 2021 focuses on consumer consumption. I expect that to slowly shift to B2B consumption behaviors. An example: right now we have a global computer chip shortage. There are theories as to why that is, but if I am an automobile manufacturer that relies on certain chips for my vehicles, I want to know if my chip supplier is itself experiencing supply chain issues so I can plan accordingly.
Data exhaust is the trail of data that remains after a business activity has occurred on a computing system. Data exhaust provides valuable insights. Some examples:
* web server logs: this can tell you how long a consumer browsed your site before making a purchase, how long an item remained in their shopping cart before it was abandoned, etc.
Data exhaust is a great way to understand the behaviors of your customers…and your potential customers.
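To make the web server log example concrete, here is a minimal Python sketch that measures how long an item sat in a cart before it was purchased or removed. The event names, the tuple layout, and the `cart_durations` function are hypothetical. It assumes the raw logs have already been parsed into (timestamp, session, event, item) records.

```python
from datetime import datetime

# Hypothetical, already-parsed log events: (ISO timestamp, session id, event, item)
events = [
    ("2021-05-01T10:00:00", "s1", "add_to_cart", "grill"),
    ("2021-05-01T10:12:30", "s1", "purchase", "grill"),
    ("2021-05-01T11:00:00", "s2", "add_to_cart", "pool-cover"),
    ("2021-05-01T11:45:00", "s2", "remove_from_cart", "pool-cover"),
]

def cart_durations(events):
    """Return how each cart stay ended and how many seconds it lasted,
    keyed by (session, item)."""
    added = {}
    results = {}
    # ISO timestamps sort chronologically as plain strings
    for timestamp, session, event, item in sorted(events):
        key = (session, item)
        when = datetime.fromisoformat(timestamp)
        if event == "add_to_cart":
            added[key] = when
        elif event in ("purchase", "remove_from_cart") and key in added:
            seconds = (when - added.pop(key)).total_seconds()
            results[key] = (event, seconds)
    return results

durations = cart_durations(events)
```

Items that are added but never purchased or removed simply do not appear in the result. A fuller version might treat those as abandoned after a session timeout.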
Treat your software like IoT data. It is throwing off a lot of interesting browsing events for your users. If you can ingest that data and react to it in real-time you should be able to provide a better experience for your users.
If you are a B2C company where your customer is a consumer then you need to know as much about them as possible.
The grocery industry has mastered consumer-profile data and it might be worthwhile to research how they do customer analytics. Grocers and CPG suppliers have been sharing data for years to learn about shopper habits and their shopping journey. Stores are analyzing broad buying trends to prevent shortages like we saw with toilet paper and Lysol during the early days of the pandemic. The CPG companies can leverage the POS data from the grocers to generate better consumer engagement and product offers and determine brand loyalty (which also suffered during the early pandemic).
Economic data that broadly shows the state of the economy and your industry is very valuable. Imagine you are a homebuilder…could you get a competitive advantage by knowing that lumber prices are forecast to rise substantially over the next few years because an invasive bug species is decimating Douglas Fir trees in the Pacific Northwest?
Jobs reports and inflation data are commonly used in many industries. If you are a QSR (Quick Service Restaurant) it’s valuable to understand the prevailing wage in your area. How will this affect your margins?
Nielsen is a century-old research firm that measures TV viewership, among MANY other things. They are a monopoly for this data and they provide different datasets for lots of different use cases. Recently they created a new dataset that allows them to make comparisons of how many people are streaming entertainment vs watching traditional broadcast channels. This could be beneficial to your next marketing campaign.
Advertisers have been using altdata for years (sometimes called incidental data), they just struggle to integrate it into their value-stream. Usually the integration is done on a one-time basis, usually in Excel. We can do better.
All data has structure, otherwise it’s worthless, but unstructured data has come to mean data like images, PDFs, and video where you can extract value creatively. I mentioned above that many images have metadata that you can extract.
Every organization has a wealth of data that doesn’t sit in a traditional database. This means it’s difficult to do analytics on it. I call this latent data. It has value, but it’s difficult to extract. If you can find this latent data in your organization you can leverage it with your structured data. Examples:
At the MTC we work with a lot of manufacturing companies. Each one has stressed that they have what I call a shifting demographics problem. They have older workers nearing retirement and the younger generations are not interested in doing those dirty, manual labor jobs anymore. Recently, companies have been deploying IoT solutions to understand how they can automate some of these processes. Another approach is to look at all of the handwritten operator logs that these workers have maintained for decades and may not be digitized even today. Azure’s Cognitive Services can OCR even the worst handwriting, allowing you to use NLP to find the patterns in the notes.
competitive analysis. These altdata-sets will show industry trends, growth rates, and demographics.
Don’t value judge me. I think we are living in the most contentious, politically-charged environment ever. Probably everyone throughout history has said that.
Now, imagine you are targeting me as a potential high value customer lead. My CLV (customer lifetime value) is 2x your average customer lead. You’ve collected all of the common demographics about me using altdata and existing transactional data. Would you agree that you might want to tailor your advertising to me if you knew what my political views were? Well, you can’t know who I voted for in the last Presidential election (supposedly we have a secret ballot), but in most areas you CAN determine my party registration. And voter registration lists are free in most areas. There are aggregator firms that will sell you this data.
Voter registration data, I believe, will be the hottest altdata-set in the near future.
Are you convinced that your company is ready to leverage some of these altdata ideas?
I am a Microsoft Technology Center (MTC) Architect focused on data solutions. The MTCs are a service Microsoft provides to our customers. We strive to be the Trusted Advisors for our customers. Others have Know-How, we have Know-What. We want to understand your business problems and ideas for altdata analytics. Then, we’ll help you ingest and enrich the data using our cloud solutions. Technology alone cannot solve these problems without smart people and processes that work. We offer services ranging from human-centered Design Thinking Workshops – where we help you determine which use cases are the best for altdata – to hackathons where we quickly ingest some altdata, do the semantic enrichment with you, and quickly determine if the altdata provides lift.
Listen, we aren’t experts in your business, but we are great enablers. Within a few days we can build a rapid prototype and show you the Art of the Possible. We’ll show you what it takes to start a data sharing initiative and we’ll help you solve data problems in days that would’ve taken months in the past.
Does that sound compelling? Contact me on LinkedIn and we’ll get you started on your journey.
Are you convinced your data or cloud project will be a success?
Most companies aren’t. I have lots of experience with these projects. I speak at conferences, host hackathon events, and am a prolific open source contributor. I love helping companies with Data problems. If that sounds like someone you can trust, contact me.
Thanks for reading. If you found this interesting please subscribe to my blog.
Dave Wentzel