• bss03@infosec.pub
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    4
    ·
    1 day ago

    Sounds like those statistics output would the heavily biased by whatever process you were using to turn names into genders. In short, a bad idea.

    • TangledHyphae@lemmy.world
      link
      fedilink
      arrow-up
      3
      arrow-down
      1
      ·
      17 hours ago

      “Since the dataset isn’t 100% perfectly annotated for analysis, we should give up the whole project entirely.”

      • Shanmugha@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        9 hours ago

        No, since the dataset is bound to give nonsensical results, we search for sources that are more precise. Hint: “Andrea” already mentioned and Japanese names