{"id":231,"date":"2017-10-26T19:57:12","date_gmt":"2017-10-26T19:57:12","guid":{"rendered":"http:\/\/www.ibertech.org\/?p=231"},"modified":"2018-01-15T12:22:30","modified_gmt":"2018-01-15T12:22:30","slug":"big-data-y-analytics-el-tratamiento-del-dato","status":"publish","type":"post","link":"https:\/\/www.ibertech.org\/en\/big-data-y-analytics-el-tratamiento-del-dato\/","title":{"rendered":"Big Data &#038; Analytics: Data Treatment"},"content":{"rendered":"<p><\/p>\n<h5>Data Treatment<\/h5>\n<p>The era of big data is already here. We\u2019re talking about a reality, not a prediction in a short-medium term. In today&#8217;s society the <strong>data is information<\/strong>. And we\u2019ve always been taught that<strong> information is power.<\/strong> The main question about the data management is: how do I manage the vast amount of data generated daily? And most importantly: how am I sure that my database is composed of 100% reliable data?<br \/>\nIt\u2019s needed to be clarified that the level of specificity and contextualization of data generates a clear differentiation in terms of quality. There are remarkable differences between the absolute record of visits on a website (raw data), and recording visits classified based on the time, the geographical area of ??the visitor, etc. (explicit data). From this point, the most important thing is to manage this information in an optimal way to ensure we have a high-quality database.<br \/>\nData quality is an issue that has increasingly worried to the IT departments of companies. This is achieved by developing a strict quality policy (Data Quality):<\/p>\n<ul>\n<li>Prior to the collection of information well defined functional strategy.<\/li>\n<li>Unification of criteria in data collection.<\/li>\n<li>Strict and methodical collecting information habits, developed based on the final research goal.<\/li>\n<li>Continuous subsequent feedback for the optimization of resources in future actions.<\/li>\n<\/ul>\n<p>The Data Quality industry is experiencing an exponential growth in 2016, and the reason is simple: the today\u2019s speed of data generation is far superior to the ability of human beings to accumulate and classify all of them in an optimized way. In fact, according to Martin Doyle, the following Experian qualitative data statistics are a clear example of this:<\/p>\n<ul>\n<li>63% of companies <strong>has not developed <\/strong>a clear strategy regarding the <strong>Data Quality.<\/strong><\/li>\n<li>78% of companies has numerous <strong>problems in e-mail sending.<\/strong><\/li>\n<li>81% of companies <strong>does not rely 100% <\/strong>the reports generated from<strong> their databases<\/strong> in because of its quality.<\/li>\n<li>83% of companies fights against <strong>data silos.\u00a0<\/strong><\/li>\n<\/ul>\n<p>When our BI strategy is well planned, when the entire <strong>team participating<\/strong> in the data collection and reporting h<strong>as unified the ultimate goal of the work<\/strong>, and, last but not least, when <strong>we have a quality database<\/strong>, with lots of filtered, cleaned and optimized data forming a good raw material on which to build our work, we will have established the culture of data in our company, and we will can benefit from all the descriptive and predictive information that the big data is contributing to the pioneers in the use of BI.<br \/>\nThey are already ahead, what are you waiting for?<\/p>\n<p><strong>Sources<\/strong><\/p>\n<ul>\n<li>Doyle,Martin. \u201cWill 2016 be the Year you Clean up your Dirty Data?. Datasciencecentral.com 12\/2015. 10 de Mayo de 2016. www.datasciencecentral.com\/profiles\/blogs\/will-2016-be-the-year-you-clean-up-your-dirty-data<\/li>\n<li>Guerrero, David, \u201cCalidad de datos: mucho m\u00e1s que una acci\u00f3n puntual\u201d blogs.deusto.es. 12\/2015. 10 de mayo de 2016. https:\/\/blogs.deusto.es\/bigdata\/calidad-de-datos-mucho-mas-que-una-accion-puntual\/<\/li>\n<\/ul>\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>The amount of data generated daily has increased in a way so that databases have to be treated adequately to grant their quality&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"wps_subtitle":"","acf":[],"_links":{"self":[{"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/posts\/231"}],"collection":[{"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/comments?post=231"}],"version-history":[{"count":3,"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/posts\/231\/revisions"}],"predecessor-version":[{"id":470,"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/posts\/231\/revisions\/470"}],"wp:attachment":[{"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/media?parent=231"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/categories?post=231"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ibertech.org\/en\/wp-json\/wp\/v2\/tags?post=231"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}