{"id":78,"date":"2026-05-27T10:15:18","date_gmt":"2026-05-27T17:15:18","guid":{"rendered":"https:\/\/drstats.pipelinedatascience.org\/?p=78"},"modified":"2026-05-27T10:15:18","modified_gmt":"2026-05-27T17:15:18","slug":"a-data-science-lesson-inspired-by-ucsfs-health-atlas-part-2","status":"publish","type":"post","link":"https:\/\/drstats.pipelinedatascience.org\/?p=78","title":{"rendered":"A Data Science Lesson Inspired by UCSF&#8217;s Health Atlas &#8212; Part 2"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"> <\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Teaching Moment: So, What Exactly Is the Lesson Here?<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Now if you\u2019re an instructor, and especially if you teach statistics, data science, public health, sociology, economics, geography, or honestly any subject where data show up wearing a fake mustache pretending to be \u201cobjective truth\u201d!<\/p>\n\n\n\n<p class=\"is-style-default wp-block-paragraph\">And the beautiful thing is that students do not need an advanced knowledge in machine learning to participate in meaningful data science.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">That point is  important.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Far too many students think data science begins when somebody starts throwing around terms like \u201cdeep neural networks,\u201d \u201ctransformers,\u201d or \u201cBayesian hierarchical spatiotemporal latent processes with adaptive priors.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Listen.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Most students are still trying to remember where they saved the CSV file.<\/p>\n\n\n\n<p class=\"is-style-text-subtitle has-medium-font-size is-style-text-subtitle--1 wp-block-paragraph\"><strong>What students REALLY need first is not technical intimidation.<\/strong><\/p>\n\n\n\n<p class=\"is-style-text-subtitle has-medium-font-size is-style-text-subtitle--2 wp-block-paragraph\"><strong>They need guided curiosity.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">They need to learn how to ask sensible questions, inspect patterns carefully, challenge assumptions, visualize relationships, and understand that data rarely speak clearly the first time you interrogate them.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This Health Atlas example provides exactly that kind of environment.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Students can begin with a genuinely important public-health question:<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why do some communities exhibit substantially higher disability prevalence than others?<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">And immediately, the investigation becomes interdisciplinary.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Now students are talking about:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>education<\/li>\n\n\n\n<li>poverty<\/li>\n\n\n\n<li>healthcare access<\/li>\n\n\n\n<li>geography<\/li>\n\n\n\n<li>regional history<\/li>\n\n\n\n<li>public policy<\/li>\n\n\n\n<li>and socioeconomic inequality<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">That right there is real data science.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Not because the models are complicated.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>But because the questions matter.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">________________________________<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Sample Learning Objectives for this Lesson<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">By the end of this lesson, students should be able to:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Import, clean, and organize publicly available data<\/li>\n\n\n\n<li>Use visualization as a scientific thinking tool<\/li>\n\n\n\n<li>Distinguish association from causation<\/li>\n\n\n\n<li>Interpret statistical models in plain language<\/li>\n\n\n\n<li>Understand that geography matters<\/li>\n\n\n\n<li>Appreciate the iterative nature of data science<\/li>\n<\/ol>\n\n\n\n<h3 class=\"wp-block-heading\">Import, clean, and organize publicly available data<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Students learn that real-world datasets are rarely neat and perfectly labeled. They must inspect variables, identify missingness, interpret documentation, and construct usable analytic tables.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Use visualization as a scientific thinking tool<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Students generate histograms, scatterplots, correlation maps, and geographic visualizations to identify patterns before formal modeling begins.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This is enormously important pedagogically.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Visualization is not merely decoration.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Visualization is reasoning.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Distinguish association from causation<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Students learn that county-level observational relationships do not imply that one variable directly causes another. Instead, the analysis motivates deeper questions and competing explanations.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Frankly, society could use a LOT more people who understand this distinction.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Interpret statistical models in plain language<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Students move beyond simply \u201crunning regression\u201d and instead learn to explain what coefficients, predictions, uncertainty, and residuals actually mean in a substantive public-health context.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Understand that geography matters<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Spatial clustering teaches students that nearby regions often share historical, economic, environmental, and healthcare structures. Students begin recognizing that spatial dependence violates the unrealistic fantasy that all observations are completely independent.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Appreciate the iterative nature of data science<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Perhaps most importantly, students learn that a good analysis rarely concludes with:<br>\u201cWe solved it.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Instead, good analyses end with:<br>\u201cNow we know what to ask next.\u201d<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">And honestly, that may be one of the healthiest intellectual habits we can teach anybody.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Because the true spirit of data science is not about worshipping algorithms.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">It is about learning how to think carefully in the presence of uncertainty.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">_________________________________________<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Postscript<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>What do you think? How would YOU modify this lesson?<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Would you change the public-health question? Add additional variables? Introduce different visualizations or modeling approaches? Expand the spatial component? Simplify the statistical modeling? Push students toward deeper policy discussions?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>And how do the learning objectives sound to you?<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Are they realistic? Too ambitious? Not ambitious enough?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">One of the beautiful things about teaching data science is that <strong>there is rarely a single \u201ccorrect\u201d pathway through the data<\/strong>. <strong>Every instructor brings a different perspective<\/strong>, a different intuition, and a different set of experiences into the classroom.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">So, let\u2019s continue the conversation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Dr. Stats would truly love to hear your thoughts, suggestions, questions, critiques, and classroom ideas.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Teaching Moment: So, What Exactly Is the Lesson Here? Now if you\u2019re an instructor, and especially if you teach statistics, data science, public health, sociology, economics, geography, or honestly any subject where data show up wearing a fake mustache pretending to be \u201cobjective truth\u201d! And the beautiful thing is that students do not need an [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6,5],"tags":[9,7],"class_list":["post-78","post","type-post","status-publish","format-standard","hentry","category-health-atlas-lesson","category-lesson-ideas","tag-data-science-lesson","tag-ucsf-health-atlas"],"_links":{"self":[{"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=\/wp\/v2\/posts\/78","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=78"}],"version-history":[{"count":3,"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=\/wp\/v2\/posts\/78\/revisions"}],"predecessor-version":[{"id":93,"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=\/wp\/v2\/posts\/78\/revisions\/93"}],"wp:attachment":[{"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=78"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=78"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/drstats.pipelinedatascience.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=78"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}