Digital transformation is so 2015! Companies have started thinking about how artificial intelligence (AI) can transform the way they run their businesses, and they are already including it in their wider data analytics strategy. As a certified Google Analytics Cloud Platform partner, our team was present at the 2017 Google NEXT conference, where there were many productive discussions around developments in the AI field. In this blog post, I am excited to introduce you to three of my favorite AI-powered APIs, which can provide us as digital analysts with new opportunities for insight and improved customer experience.
As digital analysts, we are stewards of:
- clean data collection mechanisms
- efficient data governance practices
- demonstrating the impact of data on the business bottom line
We have come a long way through the power of storytelling, change management, and compassionate leadership. Yet I believe that there is still one group that we need to speak with more often: the Innovation Team.
While your innovation team may not be interested to hear about data warehouses, data lakes, data models, or data management platforms, they will still be interested to hear how they can create an innovative competitive edge for the organization with the combined power of data and artificial intelligence. It’s on you, the digital analyst, to show them the what, why, and how, and today I am going to help you with that.
So, let’s dance!
1. Cloud Video Intelligence API
This is a REST API which makes video content searchable. Thanks to this API, you can find the right entity in the video, learn what a video is about, find the best moment in the video and recognize inappropriate content in a video.
- Label detection
- Integrated with Google Cloud Platform in 7 languages
Not available yet. You can apply to access the private beta.
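For the developers on your innovation team, here is a minimal sketch of what a label detection request could look like once you have access. It only builds the JSON body and sends nothing. The endpoint version, the bucket name, and the `build_video_request` helper are my assumptions for illustration, and the private beta may expect a different version path.

```python
import json

# Assumed endpoint (v1 path); the private beta may use a different version.
ANNOTATE_URL = "https://videointelligence.googleapis.com/v1/videos:annotate"


def build_video_request(gcs_uri, features=("LABEL_DETECTION",)):
    """Return the JSON body for a videos:annotate call on a Cloud Storage video.

    build_video_request is a hypothetical helper, not part of any Google library.
    """
    if not gcs_uri.startswith("gs://"):
        raise ValueError("expected a Cloud Storage URI such as gs://bucket/video.mp4")
    return {"inputUri": gcs_uri, "features": list(features)}


if __name__ == "__main__":
    # Hypothetical bucket and file name for a campaign video.
    body = build_video_request("gs://my-campaign-videos/entry-001.mp4")
    print("POST", ANNOTATE_URL)
    print(json.dumps(body, indent=2))
```

Your developers would send this body as an authenticated POST and then poll the long-running operation that comes back for the labels.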
With this API, you can take the analysis of your content marketing to the next level. Picture yourself as the Social Media Manager of Banana Republic. You have just finished a successful campaign in which you asked your customers to send one-minute videos of themselves wearing Banana Republic and pitching why they should win your most recent contest. By the end of the campaign, you have compiled a lot of videos from your most valuable asset: your customers! As a next step, you’d like to start analyzing the content by gender. The Video Intelligence API allows you to search the videos by gender, so you can segment your video library content by gender, uncovering insights for merchandising and marketing.
To learn more, you can watch the following overview from the Google NEXT 2017:
2. Cloud Vision API
This powerful REST API provides image data, which allows your developers to build applications that can see and understand the content of images. Talk about Content Marketing 3.0!
This powerful tool enables you to classify images and analyze emotional facial attributes. The Cloud Vision API is accessible to the public.
- Label detection
- Face detection
- Explicit content detection
- Optical character recognition
- Logo detection
- Image attributes
- Landmark detection
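To make the feature list above concrete, here is a rough sketch of how each feature maps to a `type` value in the body of an `images:annotate` REST request. It builds the body only and sends nothing. The short keys (`label`, `face`, and so on) and the `build_vision_request` helper are my own naming for illustration.

```python
import base64
import json

# Feature type names used by the Vision REST API, keyed by hypothetical short names.
FEATURES = {
    "label": "LABEL_DETECTION",
    "face": "FACE_DETECTION",
    "explicit": "SAFE_SEARCH_DETECTION",
    "ocr": "TEXT_DETECTION",
    "logo": "LOGO_DETECTION",
    "attributes": "IMAGE_PROPERTIES",
    "landmark": "LANDMARK_DETECTION",
}


def build_vision_request(image_bytes, wanted=("label", "face"), max_results=10):
    """Return the JSON body for one image and the selected features."""
    # The REST API expects the raw image bytes as a base64 string.
    content = base64.b64encode(image_bytes).decode("ascii")
    features = [{"type": FEATURES[name], "maxResults": max_results} for name in wanted]
    return {"requests": [{"image": {"content": content}, "features": features}]}


if __name__ == "__main__":
    # Placeholder bytes stand in for a real photo file.
    body = build_vision_request(b"not a real image", wanted=("label", "face", "logo"))
    print(json.dumps(body["requests"][0]["features"], indent=2))
```

Each feature you request on an image counts as one unit, which matters for the pricing described next.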
Google pricing is based on blocks of 1000 units (that is, instances of API features applied to images) and on monthly usage. The first 1000 units per month are completely free, and there is a minimal fee as your usage increases to 1 million, 5 million, and 20 million units per month.
Learn more about Cloud Vision API pricing.
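If you want a feel for how units add up before you talk to finance, here is a back-of-the-envelope sketch. It assumes that partial blocks are rounded up to a full block, which you should verify against the pricing page, and it deliberately leaves out dollar amounts because they depend on the feature and the usage tier. The `billable_blocks` helper is hypothetical.

```python
import math

FREE_UNITS_PER_MONTH = 1000  # free tier described above
BLOCK_SIZE = 1000            # pricing is quoted per block of 1000 units


def billable_blocks(images, features_per_image):
    """Return (total units, billable blocks) for one month.

    One unit is one feature applied to one image. Rounding partial blocks
    up is an assumption, not something the pricing summary above states.
    """
    units = images * features_per_image
    billable_units = max(0, units - FREE_UNITS_PER_MONTH)
    return units, math.ceil(billable_units / BLOCK_SIZE)


if __name__ == "__main__":
    units, blocks = billable_blocks(images=5000, features_per_image=3)
    print(f"{units} units this month, {blocks} billable blocks")
```

For example, running three features on 5000 images uses 15000 units, of which 14000 fall outside the free tier.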
I tested some of the features by analyzing the following photo which I took with 2 of my E-Nor colleagues recently.
I found the results quite impressive. The API identified three faces in the photo, face 1, face 2, and face 3. The result of the analysis for face 1 (Zara) has a confidence level of 77% for expressing joy.
For the other two faces, the confidence level for expressing joy came in at 91% and 100%. The API scores each face on its own, so the differences most likely reflect how clearly each expression reads in the photo.
Additionally, the API identified interesting labels for our photo, each with its own confidence level. We were taking a photo on our way to an early supper in time to catch the sunset at the beach, which explains why the analysis is confident that the photo was taken during the day. I was using a selfie stick, which explains why the Cloud Vision API’s confidence that this photo is a selfie is only 56%. The car in the background is not ours; we were walking past the cars in a parking lot on our way to the restaurant.
Let’s say that, as a manager for the TSA (the US airport security agency) at LAX (Los Angeles International Airport), you have a mandate to improve travelers’ satisfaction with the security process. As part of this initiative, you ask travelers to take a photo before or after going through security and share it with the TSA or other airport staff. The Vision API can help you search through the content and separate the happy and unhappy passengers. The next step is on you: apply design thinking to come up with a creative solution that lowers stress levels while people go through the TSA line.
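Here is a minimal sketch of how that happy versus unhappy split could work on a face detection response. The field names (`faceAnnotations`, `joyLikelihood`, `sorrowLikelihood`, `angerLikelihood`) follow the Vision REST response format, but the sample response is hand-written for illustration and the rule for "unhappy" is my own assumption.

```python
# Likelihood values the API returns range from VERY_UNLIKELY to VERY_LIKELY.
LIKELY = {"LIKELY", "VERY_LIKELY"}


def classify_faces(response):
    """Count happy, unhappy, and unclear faces in one annotate response."""
    counts = {"happy": 0, "unhappy": 0, "unclear": 0}
    for face in response.get("faceAnnotations", []):
        if face.get("joyLikelihood") in LIKELY:
            counts["happy"] += 1
        elif (face.get("sorrowLikelihood") in LIKELY
              or face.get("angerLikelihood") in LIKELY):
            # Assumption: sorrow or anger counts as an unhappy traveler.
            counts["unhappy"] += 1
        else:
            counts["unclear"] += 1
    return counts


if __name__ == "__main__":
    # Hand-written sample, not real API output.
    sample = {
        "faceAnnotations": [
            {"joyLikelihood": "VERY_LIKELY", "sorrowLikelihood": "VERY_UNLIKELY"},
            {"joyLikelihood": "UNLIKELY", "angerLikelihood": "LIKELY"},
            {"joyLikelihood": "POSSIBLE", "sorrowLikelihood": "POSSIBLE"},
        ]
    }
    print(classify_faces(sample))
```

Keeping an "unclear" bucket avoids forcing neutral faces into either group, which would skew a satisfaction report.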
You can watch the following overview video to learn more:
3. Speech API
Thanks to the Speech API, your developers can build products to support your global customers by converting audio to text in over 80 languages and variants. The API can handle noisy environments and can support any device that sends either REST or gRPC requests.
- Automatic speech recognition
- Real-time or pre-recorded audio support
- Global vocabulary (over 80 languages)
- Noise robustness
- Streaming recognition
- Inappropriate content filtering
- Word hints
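Several of the features above show up as plain fields in the request. As a rough sketch, here is the JSON body for a `speech:recognize` REST call with a language code, the inappropriate content filter, and word hints. It builds the body only and sends nothing. The audio encoding and sample rate are assumptions about your recording, and `build_speech_request` is a hypothetical helper.

```python
import base64
import json


def build_speech_request(audio_bytes, language_code="en-US", hints=(), filter_profanity=True):
    """Return the JSON body for a synchronous speech:recognize call."""
    config = {
        # Assumed audio format: 16 kHz, 16-bit linear PCM.
        "encoding": "LINEAR16",
        "sampleRateHertz": 16000,
        "languageCode": language_code,         # for example en-US, fr-FR, fa-IR
        "profanityFilter": filter_profanity,   # inappropriate content filtering
    }
    if hints:
        # Word hints bias recognition toward the given phrases.
        config["speechContexts"] = [{"phrases": list(hints)}]
    audio = {"content": base64.b64encode(audio_bytes).decode("ascii")}
    return {"config": config, "audio": audio}


if __name__ == "__main__":
    # Placeholder bytes stand in for a real recording.
    body = build_speech_request(b"\x00\x01", language_code="fr-FR", hints=["E-Nor"])
    print(json.dumps(body["config"], indent=2))
```

Switching languages is a one-field change, which is what makes a multilingual app practical.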
This API is priced per 15 seconds of audio processed after a 60-minute free tier. Please note that usage is capped at one million minutes per month. The pricing is for applications on personal systems such as phones, tablets, and desktops. You will have to contact Google for pricing and approval for embedded devices such as cars, TVs, appliances, or speakers.
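To estimate usage, you can count 15-second increments. The sketch below assumes that each clip is rounded up to the next 15-second increment and that the free 60 minutes are applied to the monthly total; both are my assumptions, so check them against the pricing page. The `billable_increments` helper is hypothetical, and no dollar amount is included.

```python
import math

INCREMENT_SECONDS = 15
FREE_SECONDS_PER_MONTH = 60 * 60  # the 60-minute free tier


def billable_increments(clip_seconds):
    """Return the number of billable 15-second increments for one month.

    clip_seconds is a list of clip durations in seconds. Rounding each clip
    up to a whole increment is an assumption.
    """
    increments = sum(math.ceil(s / INCREMENT_SECONDS) for s in clip_seconds)
    free_increments = FREE_SECONDS_PER_MONTH // INCREMENT_SECONDS
    return max(0, increments - free_increments)


if __name__ == "__main__":
    # 300 voice orders of 20 seconds each.
    print(billable_increments([20] * 300))
```

Under these assumptions, a 20-second clip is counted as two increments, so short clips cost more per second than clips that land on a 15-second boundary.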
I did a quick test of the Speech API. You can see the results in the following video. I say the same sentence in English, French, and Persian. The API picked up each sentence correctly despite my accent. As described below, we as analysts can mine the voice information for business insights. Here’s my demo:
My only recommendation is to add Le Français (Canadian) in addition to Le Français (France) 🙂
This API could allow you to expand your reach. More customers, better bottom line 🙂 As we approach summertime (in the northern hemisphere), the number of visitors in most cities worldwide typically increases, especially in Canada, where many young immigrants invite family members from overseas. Not every visitor to Canada speaks English or French. That means the guests must wait for the host to get home after work to take them out. Imagine how much independence the retailers of the city could offer a new visitor if their app allowed them to take voice orders in the visitor’s native language and translate them into English or French. By the end of summer, a full analysis of all languages used could possibly open more doors to new markets or cultures.
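The end-of-summer language analysis can start out very simple. Here is a minimal sketch, assuming your app logs one record per voice order with the language code it sent to the Speech API; the record shape and the `language_share` helper are hypothetical.

```python
from collections import Counter


def language_share(orders):
    """Return (language code, order count, percent of orders), most used first."""
    counts = Counter(order["languageCode"] for order in orders)
    total = sum(counts.values())
    return [(code, n, round(100 * n / total, 1)) for code, n in counts.most_common()]


if __name__ == "__main__":
    # Hand-written sample log, not real data.
    orders = [
        {"languageCode": "fa-IR"},
        {"languageCode": "fa-IR"},
        {"languageCode": "fa-IR"},
        {"languageCode": "es-MX"},
    ]
    for code, n, pct in language_share(orders):
        print(code, n, f"{pct}%")
```

From here, you could join the counts with order value in your analytics tool to see which languages point to promising new markets.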
You can watch the following overview video to learn more:
Broader Datasets, New Opportunities
I hope that this blog post has provided you with enough knowledge to be dangerous about the cloud APIs. You now have better opportunities to combine structured and unstructured data for even more interesting analysis. Should you start the discussions with your innovation team and implement these new products in your larger data analytics strategy, you will open up new and fresh sets of data streaming into your analytics tools. Have fun and drop me a line if you have any questions! I would love to hear your stories of using any of these APIs in your business.
Cloud Vision API
Cloud Video Intelligence
Cloud Speech API