Big Data for Small Business For Dummies®
Published by: John Wiley & Sons, Ltd., The Atrium, Southern Gate, Chichester, www.wiley.com
This edition first published 2016
© 2016 by John Wiley & Sons, Ltd., Chichester, West Sussex
Registered Office
John Wiley & Sons, Ltd., The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom
For details of our global editorial offices, for customer services and for information about how to apply for permission to reuse the copyright material in this book, please see our website at www.wiley.com
.
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the permission of this publisher.
Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners. The publisher is not associated with any product or vendor mentioned in this book.
LIMIT OF LIABILITY/DISCLAIMER OF WARRANTY: WHILE THE PUBLISHER AND AUTHOR HAVE USED THEIR BEST EFFORTS IN PREPARING THIS BOOK, THEY MAKE NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE ACCURACY OR COMPLETENESS OF THE CONTENTS OF THIS BOOK AND SPECIFICALLY DISCLAIM ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. IT IS SOLD ON THE UNDERSTANDING THAT THE PUBLISHER IS NOT ENGAGED IN RENDERING PROFESSIONAL SERVICES AND NEITHER THE PUBLISHER NOR THE AUTHOR SHALL BE LIABLE FOR DAMAGES ARISING HEREFROM. IF PROFESSIONAL ADVICE OR OTHER EXPERT ASSISTANCE IS REQUIRED, THE SERVICES OF A COMPETENT PROFESSIONAL SHOULD BE SOUGHT.
For general information on our other products and services, please contact our Customer Care Department within the U.S. at 877-762-2974, outside the U.S. at 317-572-3993, or fax 317-572-4002. For technical support, please visit www.wiley.com/techsupport
.
Wiley publishes in a variety of print and electronic formats and by print-on-demand. Some material included with standard print versions of this book may not be included in e-books or in print-on-demand. If this book refers to media such as a CD or DVD that is not included in the version you purchased, you may download this material at http://booksupport.wiley.com
. For more information about Wiley products, visit www.wiley.com
.
A catalogue record for this book is available from the British Library.
Library of Congress Control Number: 2015955036
ISBN 978-1-119-02703-4 (pbk); ISBN 978-1-119-02704-1 (ebk); ISBN 978-1-119-02758-4 (ebk)
Almost everything we do now leaves a digital trace. If you bought this book online, you left a trail of digital crumbs in your wake, from browsing the online retailer’s website, to the transaction itself. Even if you walked into a physical bookstore and paid with cash, there’s still likely to be a digital trail of your activities, including CCTV (closed-circuit television) footage and location data from your own phone.
These digital traces can be summed up in one phrase: big data. Big data refers to the ability to collect and analyse the vast amounts of data now being generated in the world. This ability to harness the ever-expanding amounts of data is completely transforming our ability to understand the world and everything within it – from healthcare and science to how entire cities and countries are run. And, of course, it’s transforming the way we do business.
Some business owners and managers dismiss big data as being only for big-budget corporations. I think this is a huge mistake. Of course it’s true that some companies have eye-watering budgets for big data analytics, but most simply don’t. In fact, I work with plenty of small- and medium-sized businesses that successfully harness the power of data without spending a fortune.
The key is to start with a clear strategy. This allows you to focus solely on the data that’s right for you – the data that will help you achieve your long-term business goals. Having a clear strategy helps you cut through the hype and noise surrounding big data and get straight to how it can realistically help you improve the way you do business. That’s why I wrote Big Data for Small Business For Dummies: to help SMEs (small and medium enterprises) use big data in a practical and strategic way.
Whether you’re planning a one-off data project or want to incorporate data into your ongoing business operations, this book can help you understand what big data is, how you can apply it to your business, how to create your own big data strategy and get underway and how to build a culture that emphasises data-based decision making and continuous improvement.
Think of this book as a no-nonsense tour guide to help you on your big data journey. There are lots of inspirational examples of how other businesses are already using data, but the focus remains on practical tips to get you using data in your business. As well as examples and tips, the book is packed with step-by-step guidelines and lists designed to help you get the most out of big data. All the information is designed to be accessible and easy to understand. And where I have to resort to technical jargon, I give clear definitions. Sidebars (the grey boxes) contain nice-to-know but not essential information, so you can easily skip over them if you like.
The book is designed as a resource that you can dip in and out of and return to time and time again. As such, you don’t need to read it from cover to cover (although, if you want to, go ahead!). It’s designed to be read in whatever order works best for you.
Finally, if you decide to visit a website listed in the book then you just need to copy the URL (uniform resource locator) exactly as it appears in the book. This is true even if the address falls between two lines or two pages – no extra characters (such as hyphens) were inserted.
Every author has a target audience in mind when he writes. For this book, I assume that you’re the owner of a small/medium business or a manager in such a business. I assume that you’ve heard a little about big data already – perhaps what a powerful tool it can be for businesses – and you want to know more. I don’t assume you have any prior technical knowledge whatsoever. Crucially, whether you’re a business owner or manager, I assume that you want to improve the way you do business and you’re in a position to make strategic decisions … and then act upon them.
If you would like to supplement this book with more technical information, you might like to check out Big Data For Dummies by Judith Hurwitz, Alan Nugent, Fern Halper and Marcia Kaufman, published by Wiley.
When I see a huge wall of text, I start to switch off. So in this book I use a number of icons to break the text up, and to make it easier and more enjoyable to read. The icons also help you spot key information quickly.
In addition to the material in the print or e-book you’re reading right now, this product also comes with some access-anywhere goodies on the Web.
Check out the free Cheat Sheet at www.dummies.com/cheatsheet/bigdataforsmallbusiness
for some helpful key information and checklists. It’s designed as a quick-check reference for some crucial big data information, including a handy list of key terminology.
There are also some useful bonus articles and an additional Part of Tens chapter available on the website. Head to www.dummies.com/extras/bigdataforsmallbusiness
to access these.
You may also like to check out the website of the Advanced Performance Institute, which I founded and head up. There you’ll find many relevant case studies, white papers and reading material on big data: www.ap-institute.com
. I also write regularly for Forbes magazine on all things big data and you can find my articles at www.forbes.com/sites/bernardmarr
. My LinkedIn page also contains a wealth of articles and posts on big data: www.linkedin.com/in/bernardmarr
.
The short answer is: It’s up to you. You don’t have to start at Chapter 1 and work your way through the book in a linear way – but you can if you want to.
If you’re completely new to big data, I recommend you start with the Chapters in Part I for an explanation of what big data is and the main ways it can be used in business. Otherwise, simply use the table of contents to find what you’re most interested in and jump straight to that section. If you want to start by finding out exactly how to create a big data strategy, turn to Chapter 10. If you’re interested more in big data skills and competencies, start with Chapter 8. Wherever you go from here, you’ll find a wealth of information and tips to help you start using big data in your business.
Part I
In this part …
Understand what big data is and why you need to know about it.
Find out the key characteristics that define big data.
See why there’s so much hype around big data right now – and why all the fuss is justified.
Check out key big data uses for small businesses.
Chapter 1
In This Chapter
Understanding what big data is and why it’s so important
Taking a peek at the different types of data
Putting big data to work in your business
Big data has been making big headlines over the last couple of years, but it’s much more than just a buzz phrase or the latest business fad. The phenomenon is very real and is producing concrete benefits in so many different areas – from business to medical research to national security.
In this chapter, I look at how this phenomenon is transforming the way you do business. I also look at what sorts of data are available these days and introduce my step-by-step processes for using big data in business.
Personally, I don’t love the term big data because I think it places far too much emphasis on the sheer volume of data, when, as I talk about in this chapter, what you do with the data is much more important than how much of it you have. I have a feeling the term will gradually disappear and what’s now called big data will, in the future, just be known as data.
Given all the hype around big data, it’s no surprise that market researchers Gartner found in 2014 that 73 per cent of businesses have already invested in a big data plan or are planning to do so in the next few years.
The online behemoths that have come to dominate business in the Internet age – Google, Facebook, Amazon, you know the ones – all base their business models on big data. It’s by collecting and analysing huge amounts of information from us that they’re able to determine precisely what we want. The data also enables them to sell advertising services capable of precisely targeting their clients’ preferred demographics.
But big data isn’t just for giant corporations, it matters to every company – no matter how small or traditional. To cater for this huge demand, many companies have sprung up to offer services to other businesses, enabling them to launch big data initiatives of their own.
Of course, data collection itself isn’t new. But technological advances like chip and sensor technology, the Internet, cloud computing and the ability to store and analyse data have changed the quantity of data you can collect.
And the amount of data being generated every day is staggering. For example, users of Facebook upload around one billion pieces of content to the social network site every day. In industry, machinery and vehicles are fitted with sensors and trackers that record their every move, and whenever you call a call centre, an audio recording of your conversation is made and stored in a huge digital database.
Eventually, every aspect of your lives will be affected by big data. However, there are some areas where big data is already making a real difference today – in business and in other areas. Let’s look at the main areas where big data is most widely used right now.
Big data might seem like it’s something that only big business can make use of. When people first hear that massive volumes of information are being used to fight terrorism, cure cancer or predict the spread of Ebola, it sounds expensive, difficult and time-consuming. But that doesn’t have to be the case.
Huge datasets on everything from demographics to weather and consumer spending habits are freely available online for small businesses to use. Plus, the basic tools to make sense of the data are also free and becoming increasingly simple for anyone to use. For example, if you’re using Google’s AdWords to track what your customers are searching for online, you’re engaging in big data analysis, even if you don’t know it.
Plenty of small businesses are already using big data to better understand and target customers. Retailers can predict what products will sell, car insurance companies can understand how well their customers actually drive and detect potential fraud and takeaway companies can tailor their services to meet local customer preferences and demand. Social media has become a particularly valuable source of data for understanding customers, trends and markets.
Big data can also help improve business processes. Retailers are able to optimise their stock levels based on what’s trending on social media, what people are searching for on the web or even weather forecasts. Supply chains can be optimised so that delivery drivers use less gas and reach customers faster. And you can use data to understand and improve staff engagement or improve your hiring process.
There’s more detail on the many big data uses in Chapter 3 – and there are examples dotted throughout the book. Just look out for the Example icon.
The first thing to understand is that data in itself isn’t a new business phenomenon. Business data is as old as, well, business itself. Just think of sales and financial ledgers or, in more recent history, customer databases. It’s specifically big data that’s the new phenomenon. But, as I mention at the start of the chapter, big data isn’t just about how big it is. In fact, volume is just one of the key defining factors of big data.
To understand big data, and what separates it from normal data, you need to understand four main factors, which all handily start with a V. It’s these Vs that define what’s really special about big data, why it’s different to regular data and why it’s so transformative for businesses. You can find more information on the Vs in Chapter 2.
The four Vs are:
I think there are three main reasons why big data is in the news so much these days:
I look at each of these reasons in Chapter 2.
Another exciting aspect of big data is that it’s only going to get bigger and more widely used. As the tools to collect and analyse data become less and less expensive and more and more accessible, we’ll develop more and more uses for it – everything from smart yoga mats (no, really) to better healthcare tools and a more effective police force.
It may seem like big data has exploded onto the business scene out of nowhere. But in fact it’s been a more like a gradual evolution: from dusty archive rooms to the microfiche to databases and on to data centres. I think it’s part of human nature to want to continually gather information and make sense of what’s going on around us – we’ve just developed sleeker technology for it over the years.
There’s more detail on the technology changes that underpin big data in Chapter 6. But here’s a short overview of these three advances.
Distributed computing gives you greater storage capacity, but it also allows you to connect data faster than ever before. With data being spread across many different locations, you need to be able to bring that data back together quickly. This is where faster networks come into play.
These massive increases in storage and computing power make number crunching possible on a very large scale. Without faster networks that connect data sets together for analysis, big data just wouldn’t be possible for the average business. Now you can break up the analysis of data into manageable chunks, meaning that no one machine has to bear the whole load. This makes analysis faster and far more efficient – and cheaper.
It used to be that data would fit neatly into tables, spreadsheets and databases: think of data like sales figures, customer records, wholesale prices and so on. But now you can look at all sorts of data – including emails, Facebook posts, photos, blog comments and voice recordings – and extract meaning.
In this section I give an overview of the different types of data, but you can find more detail (and some great examples) in Chapters 4 and 5.
There are two main types of data: structured and unstructured.
Structured data has three main things going for it: it’s usually cheap to use, it’s easy to store, and it’s easy to mine for information. But, on the downside, it represents only a small proportion of all the data available these days – as the digital traces you leave behind get bigger and bigger, only a small amount of this data is structured in format.
Another downside is that structured data is simply less rich in insights than unstructured data, meaning it can be more difficult (maybe even impossible) to really understand what’s going on if you’re using only structured data. For best results, structured data often needs to be paired with other data to get a fuller picture. For example, structured data can tell you that hits on your website increased 20 per cent last month, but you need other forms of data to explore why that happened.
Unstructured and semi-structured data tends to be much more difficult to store (not least because so darn much of it is created every day). Now, thanks to massive increases in storage capacities and the ability to tag and categorise this data, as well as huge leaps in analytical technologies, you can finally make use of this data.
The advantages of unstructured and semi-structured data are that there’s absolutely loads of it (it accounts for around 80 per cent of all business-relevant data being generated today), and it providers a richer picture than structured data. However, it’s harder to store and more difficult to analyse, which makes it more expensive to work with.
The beauty of internal data is that it’s cheap (or maybe even free) and, as you own the data, there are no access issues to deal with. But, the downsides include having to maintain and secure the data (especially if it includes personal data). You may also find that internal data on its own doesn’t provide enough information to meet your strategic goals and you may need to supplement it with external data.
External data is powerful because it gives you access to information that’s often more up to date and richer than any information you could gather yourself. And, as it’s someone else’s data, you have the added bonus of not worrying about the security and data protection issues. But, the obvious downside is that you don’t own the data, and you usually have to pay for access (although not always – check out Chapter 15 for some great free data sources).
You leave more and more digital traces of your activities than ever before. If you think about what you’ve done today so far, most of those activities have left some digital trace (data) that can and is being collected and analysed. Some of the data you can now collect is new; some has been around a while but we’ve only just found ways to really analyse it.
Some of the exciting new types of data include:
There are three keys areas of decision making that relate to big data: one is pulling out insights from the data and using that information to guide your decision making, another is deciding how to build your big data skills and competencies, and the final aspect relates to infrastructure decisions. I look at each in turn in the next sections.
In today’s competitive business world, success often comes down to a company’s ability to learn faster than the competition and act on what they learn faster than the competition. The process of turning data into insights and actionable knowledge is the key to that success.
A key part of this process is making sure the right information is delivered to the right people at the right time. In order to aid decision making and ensure the necessary action is taken, insights need to be presented in a clear, concise and interesting way. People are less likely to act if they have to work hard to understand what the data is telling them.
Data and insights can also feed into the machines in your company, as well as your people. This applies to any machine or technology that’s a key part of how the business operates on a day-to-day basis, such as stock control systems or machinery on a production line. Connecting data and machines allows businesses to increase efficiency, improve product quality, cut costs and much more. Processes and systems can also be connected with data, so that you can improve how you do things based on what the data shows.
There’s more on focusing on insights and feeding data to your people, machines and systems in Chapter 7.
There’s currently a skill shortage in big data, meaning there’s more demand for big data experts than there are available experts. This can make it hard for smaller businesses to recruit good data staff. But there are alternatives to hiring in-house staff. You can try training up your existing people, working with external data providers (of which there are now many, big and small) and partnering with other organisations, such as universities.
Whether you want to hire new people or boost your existing skills, I think there are six key skills required to successfully use big data in business:
There’s more on these skills in Chapter 8.
If data is going to be a key part of your business, then it’s a good idea to consider hiring a data scientist to work in-house. The six skills I list are a good starting point when you’re searching for the right person, and I also list some helpful recruitment questions in Chapter 8. If you don’t have any tech experience at all then recruiting in the tech field can seem daunting – with these questions and by focusing on the core skills, you’ll be able to assess candidates with confidence.
Your ultimate goal is to gather insights which will lead to better decision making and improved business performance. In order to do this, you’ll need to invest in some tools or services.
I explore the main options for each element in Chapter 9, along with some of the most commonly used software packages.
The first step is to assess your existing infrastructure so, for each of these four elements, you need to consider what related technology or resources you already have in-house and how they might need to be improved or supplemented. For example, you may already be collecting useful customer data through your website or customer service department but don’t yet have the analytics in place to work with that data.