{"id":1658,"date":"2023-09-09T10:12:31","date_gmt":"2023-09-09T10:12:31","guid":{"rendered":"https:\/\/staticalmo.com\/?p=1658"},"modified":"2024-10-24T08:04:20","modified_gmt":"2024-10-24T08:04:20","slug":"come-riconoscere-un-informatico-che-fa-statistica-e-perche-bisogna-fare-attenzione","status":"publish","type":"post","link":"https:\/\/staticalmo.com\/en\/come-riconoscere-un-informatico-che-fa-statistica-e-perche-bisogna-fare-attenzione\/","title":{"rendered":"How to recognize a computer scientist doing statistics and why you need to be careful"},"content":{"rendered":"<h2 class=\"wp-block-heading\">Introduction<\/h2>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">One of the objectives of statistics is to explain the variability of a phenomenon of interest through a mathematical expression, i.e. an equation called a model, made up of a target (dependent) variable and explanatory (independent) variables. If the target variable has no variability, then it is not a variable but a constant, and we don't need statistics.\u00a0<\/p>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">In statistics there are basic models, called inferential, and classification models. The former serve as a starting point for the latter, and also give an idea of baseline performance. The name of the basic model depends on the type of target variable: linear regression for quantitative ones (e.g. sales), the logistic model for dichotomous ones (e.g. customer\/non-customer, recurring purchase\/occasional purchase, etc.) 
or categorical ones (e.g. expense categories).<\/p>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">Example of a statistical model:<\/p>\r\n\r\n\r\n\r\n<pre>ice cream sales = coefficient_1*daily temperature + coefficient_2*number of tourists + error\u00a0<\/pre>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">Compared to the models you saw in high school, as in <a href=\"https:\/\/www.youtube.com\/watch?v=P8_TdG0HZhQ&amp;t=6s\">geometry<\/a> or in physics, statistical models have an error term and \u201cdirtier\u201d coefficients, i.e. non-round numbers.\u00a0<\/p>\r\n\r\n\r\n\r\n<h2 class=\"wp-block-heading\">Computer scientists vs statisticians<\/h2>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">\u2013\u00a0 They tend to compute only linear correlations, without testing their <a href=\"https:\/\/staticalmo.com\/hai-ragione-o-torto-come-la-statistica-tramite-test-e-significativita-permette-di-valutare-opinioni-ipotesi\/\">significance<\/a>. Unless the dataset has so many rows that significance can be judged by eye from the size of the correlation, adding non-significant variables produces an unreliable and highly unstable model, and therefore one that is useless for explaining phenomena or for making predictions or classifications.<\/p>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph translation-block\">\u2013 They jump straight into models used for classification without starting from basic models. This immediately risks producing rigid, overfit models, which appear to perform excellently during testing. It also entails a dangerous and potentially costly automatism in the choice of variables, especially with solutions where the model is trained on a third-party computer (cloud).<\/p>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">\u2013 They use variable importance rather than classical or Bayesian significance. The former is a purely descriptive metric; the latter two are inferential. 
This means building many more models: without a method for selecting variables, one is left with trial and error, running the risks described in the previous point. That purely descriptive metric can make sense when classifying certain images, where the variables are the columns of an image (a full HD image, for example, has 1080 rows and 1920 columns), or in the field of natural language processing (NLP). Computer vision has less to do with statistics than other areas of machine learning, and requires less <i>critical thinking<\/i> about what you are doing or want to do, since it involves several automated steps.\u00a0<\/p>\r\n\r\n\r\n\r\n<p class=\"wp-block-paragraph\">In general you will notice that the <a href=\"https:\/\/www.youtube.com\/@staticalmo\/search?query=informatici\">computer scientist approach<\/a> compensates for theoretical <em>gaps<\/em> in statistics with computational brute force, an expression familiar from the world of passwords. In other words, <a href=\"https:\/\/www.youtube.com\/watch?v=_-rH3okYA4I\">they shoot a lot of bullets<\/a> but few hit the mark. Statisticians, on the other hand, have fewer IT skills, which risks making them less autonomous, but their knowledge of theory makes them more effective in certain practical activities. 
Computer scientists who write statistical code, meanwhile, are useful above all to large companies.<\/p>\r\n<p>&nbsp;<\/p>\r\n<p><img fetchpriority=\"high\" decoding=\"async\" class=\"size-medium aligncenter\" src=\"https:\/\/media.istockphoto.com\/id\/1185389397\/vector\/many-arrows-missed-hitting-target-mark-shot-miss-multiple-failed-inaccurate-attempts-to-hit.jpg?s=612x612&amp;w=0&amp;k=20&amp;c=BMkp7Do_maAUtIQWbcJmL8tEVa3gDiN0D8U5oErp6zE=\" width=\"612\" height=\"408\" \/><\/p>","protected":false},"excerpt":{"rendered":"<p>Introduction One of the goals of statistics lies in explaining the variability of a phenomenon of interest by means of a mathematical expression, i.e., an equation called a model, made up of an objective (dependent) variable and explaining (explanatory, independent) variables. If the target variable has no variability, then we do not have a variable but a constant. E ...<\/p>\n<p class=\"read-more\"> <a class=\"\" href=\"https:\/\/staticalmo.com\/en\/come-riconoscere-un-informatico-che-fa-statistica-e-perche-bisogna-fare-attenzione\/\"> <span class=\"screen-reader-text\">How to recognize a computer scientist doing statistics and why you need to be careful<\/span> Read More 
&raquo;<\/a><\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_crdt_document":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"site-sidebar-layout":"default","site-content-layout":"default","ast-global-header-display":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"default","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","_themeisle_gutenberg_block_has_review":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-1658","post","type-post","status-publish","format-standard","hentry","category-senza-categoria"],"aioseo_notices":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/posts\/1658","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/comments?post=1658"}],"version-history":[{"count":8,"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/posts\/1658\/revisions"}],"predecessor-version":[{"id":1666,"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/posts\/1658\/revisions\/1666"}],"wp:attachment":[{"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/media?parent=1658"}],"wp:ter
m":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/categories?post=1658"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/staticalmo.com\/en\/wp-json\/wp\/v2\/tags?post=1658"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}