Gives an idea of the amount of data YouTube is storing, if this one channel alone is 250 GB!
And that 250 GB is probably just the downloaded and HEVC-compressed files. YouTube actually promotes uploading in raw formats for best quality, so just 3-4 full-length movies would be enough to fill 250 GB for them.
And mind you, they have a high number of videos, but most are short clips and all of them are low-res, 360p or 480p max. Any other channel uploading HD or 4K content will be orders of magnitude larger for fewer videos.
Nice!
Could you tell us what tool you used to also get the description text and the comments? With yt-dlp I only found the option of downloading the video itself.
yt-dlp does support fetching comments and description text: if you use the --write-info-json and --write-comments options, it will save them as a JSON file alongside other video metadata.
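For anyone who wants to script this, here is a minimal sketch that assembles the command line with those two flags. The helper name build_ytdlp_command, the "archive" output folder, and the placeholder URL are my own assumptions for illustration; the output template uses yt-dlp's standard %(title)s, %(id)s and %(ext)s fields. It only builds and prints the command, it does not download anything.

```python
import shlex


def build_ytdlp_command(url, output_dir="archive"):
    """Return an argv list for a yt-dlp run that also saves metadata and comments.

    build_ytdlp_command and output_dir are hypothetical names for this sketch.
    """
    return [
        "yt-dlp",
        "--write-info-json",   # save video metadata (including the description) as JSON
        "--write-comments",    # fetch comments and include them in that JSON file
        "-o", f"{output_dir}/%(title)s [%(id)s].%(ext)s",  # yt-dlp output template
        url,
    ]


if __name__ == "__main__":
    # VIDEO_ID is a placeholder, not a real video.
    cmd = build_ytdlp_command("https://www.youtube.com/watch?v=VIDEO_ID")
    print(shlex.join(cmd))
```

Paste the printed line into a shell, or hand the list to subprocess.run if you would rather launch it from Python.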
Nice! What're you gonna do with them? Are you gonna upload them somewhere, or just hold onto them?
They still happily exist on YouTube, for now. So no point in re-hosting; they'll get squirreled away into the Giant Hard Drive of Doom.
If something happens to the actual archive project in the near future, I'll likely section them up into 20 GB pieces and post them out on a torrent someplace.
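Sectioning an archive into roughly 20 GB pieces could be sketched like this. It is only an illustration of one simple approach (greedy grouping in listing order), not necessarily how the poster would do it; the function name split_into_batches and the sample file names and sizes are made up, and 20 GB is taken as 20 * 10**9 bytes.

```python
def split_into_batches(files, limit_bytes=20 * 10**9):
    """Group (name, size_in_bytes) pairs into batches of at most limit_bytes.

    Files are taken in the order given. A single file larger than the limit
    ends up alone in its own oversized batch rather than being dropped.
    """
    batches = []
    current, current_size = [], 0
    for name, size in files:
        # Start a new batch if adding this file would exceed the limit.
        if current and current_size + size > limit_bytes:
            batches.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    # Hypothetical file list, sizes in bytes.
    sample = [
        ("clip_a.mkv", 12 * 10**9),
        ("clip_b.mkv", 9 * 10**9),
        ("clip_c.mkv", 5 * 10**9),
    ]
    for number, batch in enumerate(split_into_batches(sample), start=1):
        print(number, batch)
```

Each resulting batch could then be moved into its own folder and turned into a separate torrent.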
Just upload it to archive.org before your backup dies. No need to hoard it for yourself.
Nah. IA doesn't need to deal with this volume of shit and they already have enough of a hard time dealing with copyright trolls.
If this channel is impacted in the future, I'll probably put out a few torrents with the videos and post them here.
As if they're not having enough trouble with hosting "questionable" content! You obviously didn't read the torrentfreak article making the rounds.
Internet Archive != the Pirate Bay.
For now, DON'T contaminate the IA with the Classic Chicago Television channel.