[Accede aquí a la versión en español]
From the definition of key terms around data analysis that are related today to the concept of Big Data, how could we have a clearer idea of their origin, when the relevance of each concept dates back to and how that relevance has evolved along the years? Modern web search tools allow us to visualize historical information that has been extracted automatically from different sources and present it in this interesting example of analysis on those terms related to data analysis itself and Big Data.
From the definition of key terms around data analysis that are related today to the concept of Big Data, how could we have a clearer idea of their origin, when the relevance of each concept dates back to and how that relevance has evolved along the years? Modern web search tools allow us to visualize historical information that has been extracted automatically from different sources and present it in this interesting example of analysis on those terms related to data analysis itself and Big Data.
If we want to examine the historical trend of the use of these terms, but we also want to go back to a period of time before the emergence of the Web and its massive use in our society, we can use Google Books and its search feature on the frequency of occurrence of terms and expressions along all books that have been digitally processed in this service. This is the chart comparing the frequency of occurrence for "big data", "business intelligence", "data mining", "data science" and "machine learning", between the years 1970 and 2008 (last available year):
If we focus on web searches and we compare how often these terms have been searched on Google since 2004 until today, we can identify a "second part" of this evolution showing very different trends from those seen before:
Google Trends chart comparing the volume of Google searches for “big data”, “business intelligence”, “data mining”, “data science” and “machine learning”.
[Click on the image to enlarge it]
During the first years of this interval of time it can be clearly noticed that the concept of Big Data had not obtained yet any popularity, something that changes drastically in 2011. We can easily relate this milestone with the publication that same year of the report by McKinsey Global Institute titled "Big Data: The next frontier for innovation, competition and productivity", which puts this concept of the radar of every analysis on global trends in technology. We can also see an increasing interest in the concept of Data Science since 2012, just when an article in Harvard Business Review paraphrases the words by Hal Varian in 2009 about the future of statisticians, introducing the profile of "data scientist" as "the sexiest job of the 21st century".
By contrast, the old preponderance of the concept of Data Mining drops significantly during this period and it currently has a level of relevance very similar to Business Intelligence, although both stay in the shadow of Big Data. This can also be related to the usual confusion of miscalling "big data" to every data analysis, regardless of the analytics approach or the volume or complexity on those data. The concept that clearly increments its popularity during these last years is Machine Learning, which fits perfectly with the "second youth" that machine learning techniques (which, as we have seen, date back to decades ago) have been experimenting since the rise of Big Data.
No hay comentarios:
Publicar un comentario