• onlinepersona
    6 months ago

    5M to protect against scraping? That sounds… a bit much, no? 34 employees with that one task for 2 years doesn’t sound believable to me. Why is WorldCat worth anything anyway?

    Anti Commercial-AI license

    • lightnsfw@reddthat.com
      6 months ago

      “Defendants, through the Anna’s Archive domains, have made, and continue to make, all 2.2 TB of WorldCat® data available for public download through its torrents,” OCLC wrote in the complaint it filed in an Ohio federal court.

      It was 2.2 TB. That is nothing…

      • blindsight@beehaw.org
        6 months ago

        Seriously… I’ve downloaded 2TB in a week before.

        I get that it’s not about the bandwidth, though; it’s about needing to upgrade their security, since the site was scraped without anyone needing to log in, so obviously it wasn’t secure. They’re claiming IT costs as damages.

        • lightnsfw@reddthat.com
          6 months ago

          They should have had security in place beforehand if they didn’t want people to scrape their site. If AA hadn’t done it someone else would have. Don’t make it public if you don’t want people to use it.

      • Bakkoda@sh.itjust.works
        6 months ago

        Just set a torrent size of 10 TB and hit go. I’ll move that to permanent storage when it’s done.

    • WaterSword@discuss.tchncs.de
      6 months ago

      Also, calling “improving IT security” damages is kind of misleading. No, it’s not damages; you just actually got some IT security for once.

    • cmnybo@discuss.tchncs.de
      6 months ago

      They should have been able to put a stop to the scraping very quickly. It’s not that hard to block or rate limit IPs that are causing excessive load.
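
For anyone wondering what per-IP rate limiting like that can look like, here is a minimal sketch of a sliding-window limiter. It is not anything OCLC actually runs; the class name `SlidingWindowLimiter`, its `limit`, `window`, and `clock` parameters, and the example IP addresses are all made up for illustration.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most `limit` requests per `window` seconds for each client IP.

    Hypothetical helper for illustration only.
    """

    def __init__(self, limit, window, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock  # injectable so the behaviour can be tested
        self.hits = defaultdict(deque)  # ip -> timestamps of recent requests

    def allow(self, ip):
        now = self.clock()
        recent = self.hits[ip]
        # Drop timestamps that have fallen out of the window.
        while recent and now - recent[0] >= self.window:
            recent.popleft()
        if len(recent) >= self.limit:
            # Over the limit: the caller would reject or block this request.
            return False
        recent.append(now)
        return True


if __name__ == "__main__":
    limiter = SlidingWindowLimiter(limit=3, window=10)
    for i in range(5):
        print(i, limiter.allow("203.0.113.7"))
```

A real deployment would more likely do this at the reverse proxy or firewall, and a scraper spread across many IPs would need more than this, but the basic idea is about this small.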