@DevOpsSummit: Blog Post

Are you a Fox or Hedgehog? | @DevOpsSummit [#DevOps]

In the Log Management World: Are you a Fox or Hedgehog?

I’ve recently been reading Nate Silver’s book, “The Signal and the Noise.” In the book, Silver looks at a number of areas where predictions have been made and considers how successful they have been, as well as the reasons why they have been accurate (or not).

I couldn’t help but draw parallels with how most companies use log management tools today.

Silver’s particular interests are political forecasting (see www.fivethirtyeight.com) and baseball, particularly predicting player performance. He doesn’t always get his predictions right, but he does explain the rationale behind his predictions and seeks to be unbiased.

In his book, he also considers other areas such as meteorology, where forecasts are based on established scientific models and prediction accuracy increases as more computing power is applied to the problem.

Silver shows that predictions in areas such as economics have been less successful. He examines why many economists missed the recession, and why supposedly expert forecasters get election predictions wrong so often. For example, before the recession of 2008, the assumption was made that house prices would continue to rise, whereas history has shown they can decline in certain circumstances. Consequently, there was false confidence about the risks associated with a housing bubble, including the risk that a fall in house prices could trigger a global crisis.

Other than giving us interesting problems around prediction where computing power can be applied to big data sets, why should his findings be interesting to me or anyone interested in logs and big data?

Firstly, I like his statement that “before we demand more of our data, we need to demand more of ourselves.”

In terms of the data being generated, “most of it is just noise, and the noise is increasing faster than the signal.”

Silver’s approach can be summarized as an attitude based on Bayes’ theorem. While that is a mathematical formula, he uses it as a basis for incorporating probability, uncertainty and testing into analysis, as well as for questioning assumptions and beliefs.
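
To make the Bayesian attitude concrete, here is a minimal sketch in Python that applies Bayes’ theorem twice in a row. The scenario and every number in it (a 5% prior that an alert reflects a real incident, and the 0.9 and 0.1 likelihoods) are assumptions invented for illustration, not figures from Silver’s book.

```python
def bayes_update(prior, p_evidence_if_true, p_evidence_if_false):
    """Return P(hypothesis | evidence) using Bayes' theorem."""
    numerator = p_evidence_if_true * prior
    total_evidence = numerator + p_evidence_if_false * (1.0 - prior)
    return numerator / total_evidence


# Hypothetical numbers: we start out believing there is a 5% chance that an
# alert reflects a real incident. A warning signal in the logs shows up 90% of
# the time when there is a real incident and 10% of the time when there is not.
prior = 0.05
after_one_signal = bayes_update(prior, 0.9, 0.1)

# Seeing a second signal updates the belief again. This step assumes the two
# signals are independent of each other, which is itself an assumption to question.
after_two_signals = bayes_update(after_one_signal, 0.9, 0.1)

print(f"prior:             {prior:.2f}")
print(f"after one signal:  {after_one_signal:.2f}")
print(f"after two signals: {after_two_signals:.2f}")
```

The point of the sketch is the habit rather than the numbers: state a prior, revise it as evidence arrives, and stay explicit about the assumptions behind each revision.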

The following are key lessons I took from the book:

Consider Risk vs Uncertainty
Appreciate the value of a domain expert (using computer/data support)
Be aware of your biases
Ask yourself if you are a fox or a hedgehog

Risk vs Uncertainty

The relationship between Risk (a gamble with odds that you can put a price on) and Uncertainty (a risk that is hard to measure) matters because we often ignore uncertainty or make incorrect assumptions about it. Silver considers this relationship central to many of the problems seen in predictions in the finance industry.

In that case, there is plenty of computing power available, but predictions often turn out to be wrong because of incorrect assumptions about the level of uncertainty. One example is a mortgage-backed security whose risk is calculated on the basis that the individual mortgages are independent of each other, using models that assume a manageable downturn in house prices. Of course, these assumptions turned out not to hold when house prices fell globally, and so the risks were in fact greater than calculated.
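
The effect of the independence assumption can be illustrated with a small calculation. The following is a rough sketch in Python, not a real pricing model: the pool size, the default probabilities and the simple "downturn" model are all hypothetical values chosen so that each mortgage has the same 5% chance of default in both cases.

```python
def p_all_default_independent(p_default, n_mortgages):
    """Chance that every mortgage defaults if defaults are independent."""
    return p_default ** n_mortgages


def p_all_default_common_shock(p_downturn, p_default_bad, p_default_good, n_mortgages):
    """Chance that every mortgage defaults when a shared downturn drives defaults.

    Defaults are independent only once we know whether a downturn happened.
    """
    in_downturn = p_downturn * p_default_bad ** n_mortgages
    in_good_times = (1.0 - p_downturn) * p_default_good ** n_mortgages
    return in_downturn + in_good_times


# Hypothetical pool of 5 mortgages, each with a 5% overall chance of default.
n = 5
independent = p_all_default_independent(0.05, n)

# Same 5% overall chance per mortgage (0.1 * 0.41 + 0.9 * 0.01 = 0.05), but
# defaults cluster in a downturn that happens 10% of the time.
correlated = p_all_default_common_shock(0.1, 0.41, 0.01, n)

print(f"all default, independent:  {independent:.2e}")
print(f"all default, common shock: {correlated:.2e}")
print(f"ratio: {correlated / independent:.0f}x")
```

With these made-up inputs, the chance of losing the whole pool is thousands of times higher once the mortgages share a common cause, even though each individual mortgage looks exactly as risky as before.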

Appreciate the value of a domain expert

The second bullet is good news to anyone who thinks they have developed expertise based on experience, such as the expert sysadmin or DevOps developer.

Models are best when applied with human judgement to understand risk vs. uncertainty and the weighting to be applied to different factors. For example, in baseball prediction Silver cites the skills a scout may have in judging a prospect under different headings, such as work ethic, focus and humility.

It is important to use this judgement so that the greater computing power available is not just used to make seemingly more accurate predictions that rest on incorrect assumptions or on a greater amount of noise rather than signal.

Be aware of your biases

The third bullet is a warning for us to be aware of our biases, because there is massive value in the experience of a domain expert (provided they are not biased!). Silver says that pursuing the objective truth is a goal for all those making predictions, but forecasters must realize that they perceive it imperfectly.

He points out that we often focus on signals that tell the story we want, and not the story we have. Or we make assumptions that are not true. We may not think we have any biases, but ask yourself a few questions.

Have you got used to one tool for looking at system performance or failure analysis?
Are you prone to always blame a particular webserver instance, a certain application, database or a single vendor and begin by looking at those components before (or even to the exclusion of) other ones when a failure happens?
Would you be inclined to use the available data selectively to focus on one particular component or technology, or would you consider it in an open way to identify the component at fault?

I hope that, in selecting the four bullets above as key points from the book, I have not shown any bias of my own!

Are you a fox or a hedgehog?

The final bullet is important in terms of how much information you rely on and gather to find errors, analyze trends and maybe even make predictions.

Silver says hedgehogs believe “in governing principles about the world that behave as though they were physical laws” and foxes “are scrappy creatures who believe in a plethora of little ideas and in taking a multitude of approaches toward a problem.”

Or put another way, “The fox knows many things, but the hedgehog knows one big thing.”

This is inspired by Isaiah Berlin’s essay “The Hedgehog and the Fox”, whose title is borrowed from the Greek poet Archilochus (see http://fivethirtyeight.com/features/what-the-fox-knows/).

A key point is that a hedgehog may only gather the information that confirms their existing views and/or ignore new information that conflicts with them. For example, Silver cites work on political pundits which shows that those “experts” who do the most interviews tend to be the most confident and most strident in their views, but make the worst predictions.

The key attributes of a fox are that they are multi-disciplinary, adaptable, self-critical, tolerant of complexity, cautious and empirical. On the other hand, a hedgehog tends to be specialized, stalwart, stubborn, order-seeking, confident and ideological.

How do log management tools fit into this world of foxes and hedgehogs?

By allowing logs of all sorts from your data servers to be uploaded, searched and analyzed using a powerful UI to highlight areas of interest, we enable a pluralistic or foxlike approach that can contribute to your data analysis for a range of purposes.

You could of course limit yourself to being a hedgehog by only loading logs from those systems you have a bias against, but note the third bullet above and you don’t really have an excuse any more.
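
As a rough illustration of the difference, the Python sketch below counts errors per source twice: once looking only at the usual suspect, and once across everything that was loaded. The log lines, their format (timestamp, source, level, message) and the host names are made up for this example and do not reflect any particular log management product.

```python
from collections import Counter

# Made-up sample log lines in a hypothetical "timestamp source level message" format.
SAMPLE_LOG_LINES = [
    "2015-01-12T10:00:01 web-1 ERROR upstream timeout",
    "2015-01-12T10:00:02 db-1 ERROR connection pool exhausted",
    "2015-01-12T10:00:03 db-1 ERROR connection pool exhausted",
    "2015-01-12T10:00:04 web-1 INFO request served",
    "2015-01-12T10:00:05 cache-1 WARN eviction rate high",
    "2015-01-12T10:00:06 db-1 ERROR slow query",
]


def error_counts_by_source(lines, only_sources=None):
    """Count ERROR lines per source, optionally restricted to some sources."""
    counts = Counter()
    for line in lines:
        fields = line.split(maxsplit=3)
        if len(fields) < 3:
            continue  # skip lines that do not match the assumed format
        source, level = fields[1], fields[2]
        if only_sources is not None and source not in only_sources:
            continue
        if level == "ERROR":
            counts[source] += 1
    return counts


# Hedgehog: only look at the component you already blame.
hedgehog_view = error_counts_by_source(SAMPLE_LOG_LINES, only_sources={"web-1"})

# Fox: look at every source before deciding where the problem is.
fox_view = error_counts_by_source(SAMPLE_LOG_LINES)

print("hedgehog:", dict(hedgehog_view))
print("fox:     ", dict(fox_view))
```

In this toy data, the web server does log an error, so the hedgehog’s suspicion appears confirmed. Only the wider view shows that most of the errors come from the database.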

The earlier point about combining the reasoning and experience of experts with computing power to analyze the right sets of data should inspire those of us who believe in the experience of computer professionals and who see the potential in all the data we are gathering and analyzing, from logs and elsewhere.

So, just check your biases and try to be a fox.

More Stories By Trevor Parsons

Trevor Parsons is Chief Scientist and Co-founder of Logentries. Trevor has over 10 years’ experience in enterprise software and, in particular, has specialized in developing enterprise monitoring and performance tools for distributed systems. He is also a research fellow at the Performance Engineering Lab Research Group and was formerly a Scientist at the IBM Center for Advanced Studies. Trevor holds a PhD from University College Dublin, Ireland.
