So everyone keeps reporting that the DoJ has released 3 million files, with redactions.
But… I have all of the released data sets so far, and I’m at under 1.11 million files.
Note that this is using the fuckthissite3 release of DataSet09, and the total size and file count will be a bit short since I kept the lot of 2GB prison videos on another drive for space. But there are only a dozen or so, so that doesn’t change the file count very much at all.
Even with his release being 137GB of the 180GB total reported for Data Set 9… that leaves us with ~40GB and ~2 million files unaccounted for.
So… the news everywhere is saying the DoJ released 3 million files. Does anyone have a file count anywhere near that number? Even half that number?


I assume you also archived those files that were only accessible by changing the URL of pdfs to a video file type?
There aren’t any files like that. All of the files with different extensions are present in the actual full data set zips. If you download the full data sets you can see and browse them very easily. There’s never been any reason at all for people to sit there blindly guessing the right file extension.
Silly, i thought these were omitted but not access restricted. People making something out of nothing again on the internet.