• taiyang@lemmy.world
      link
      fedilink
      English
      arrow-up
      28
      arrow-down
      1
      ·
      11 months ago

      If you’re in a college statistics course and you’re doing graphs by hand and not generated entirely be statistics software, the skills you’re learning are useless anyway.

        • taiyang@lemmy.world
          link
          fedilink
          English
          arrow-up
          13
          ·
          11 months ago

          To be fair, I’m snarky because plenty of colleges (and way too many high schools) still do this shit because it’s not about the knowledge, it’s about the signalling to employers that the student will make a good cog in their machine.

          To anyone struggling in a stats course: real data science is programming, not math. If you’re on Lemmy there is a good chance you’re a better data scientist than your hack of a teacher.

          • Ecco the dolphin@lemmy.ml
            link
            fedilink
            English
            arrow-up
            6
            ·
            11 months ago

            …my stats professor is a programmer, though. Are you not talking about high level statistics courses? A lot has changed since R and Rstudio has been developed. (It’s FOSS!). All of my assignments are either proofs in LaTeX or questions that involve programming.

            ( If you’re in a stats course and using excel, you are learning stats for babies. Your class has business majors in it.)

        • callouscomic@lemm.ee
          link
          fedilink
          English
          arrow-up
          8
          ·
          11 months ago

          Memories of my professor in early 2010s teaching us to do it by hand in case the power at work ever goes out and we don’t wanna get fired … based on his 90s work experience.

          He was fun though.

    • reallykindasorta@slrpnk.net
      link
      fedilink
      English
      arrow-up
      18
      ·
      11 months ago

      Yeah that would be bad practice, industry standard is to run all the tests simultaneously and if something comes out statistically significant make up a narrative then try to split it into 4 papers.

      • NocturnalMorning@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        ·
        11 months ago

        Tell that to the reviewers who constantly ask my wife why she didn’t do linear regression in her analysis. She rages against linear regression constantly. But some people swear by it, which i think is weird.

    • TexasDrunk@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      11 months ago

      Folks in observation and analytics are gonna be real mad when they realize you’re giving away their secrets.

  • observantTrapezium@lemmy.ca
    link
    fedilink
    English
    arrow-up
    32
    ·
    11 months ago

    Just saw the scatter plot and line and my mind immediately screamed “bullshit” without knowing what this was about at all. Only then I read the text.

    • Rhaedas@fedia.io
      link
      fedilink
      arrow-up
      13
      ·
      11 months ago

      Could be valid. Now if it had been logarithmic the pro tip might still be true, since many don’t look at the axis either.

  • fossilesque@mander.xyzM
    link
    fedilink
    English
    arrow-up
    30
    arrow-down
    2
    ·
    11 months ago

    Label your x and y, you dirty heathen. Such offense, you’re lucky you’re not catching a b&.

    • rustydrd@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      11 months ago

      My guess is lower. I’d put the correlation at about -.35 to -.45, so that’d correspond to an R² of .1225 to .2025. But eyeballing correlations is hard.

    • taiyang@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      1
      ·
      11 months ago

      Assuming it’s a correction line, I don’t think you can tell from the slope of that line alone as the clustering will matter and correlations are finicky. Now, if it was a regression coefficient, that sexy line can be calculated just by looking at it (although we’d want to know if it was significant, lol).

      • sp3ctr4l@lemmy.zip
        link
        fedilink
        English
        arrow-up
        5
        ·
        11 months ago

        I was assuming its a simple linear regression fit, and attempting to eyeball the r², haha.

    • MeowZedong@lemmygrad.ml
      link
      fedilink
      English
      arrow-up
      2
      ·
      11 months ago

      Delete enough data points and it will be 1. You’ll only have two data points, but you’ll have bragging rights.

  • azi@mander.xyz
    link
    fedilink
    English
    arrow-up
    16
    arrow-down
    1
    ·
    11 months ago

    Actual graph used to inform government decisions

    Scatter plot correlating parked vehicles at supermarkets with the store's number of employees. There's only two data points and the trend line is drawn in the exact opposite direction of what they show (line says the store with more employees had more cars, the data shows otherwise). Page 602 of the Parking Generation Manual 5th Edition