It's trinary, and I understand why they say "1-bit" instead, but it still bugs me that they call it that.
I'd love to see how low they can push this and still get spooky results. Something with ten million parameters could fit on a Macintosh Classic II - and if it ran at any speed worth calling interactive, it'd undercut a lot of loud complaints about energy use. Training takes a zillion watts. Using the model is like running a video game.
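For what it's worth, here's the back-of-envelope arithmetic behind both points, as a rough sketch. The assumptions are mine: weights take the three values -1, 0, +1; five of them get packed per byte (since 3^5 = 243 fits in 256); and a maxed-out Classic II has 10 MB of RAM (that figure is from memory, so check it). Only weight storage is counted, not activations or the code itself.

```python
import math

PARAMS = 10_000_000   # "ten million parameters" from the comment above
STATES = 3            # assumed ternary weights: -1, 0, +1

# Information content of one three-state weight: about 1.585 bits, not 1.
bits_per_weight = math.log2(STATES)

# Theoretical minimum storage versus a simple practical packing.
ideal_bytes = math.ceil(PARAMS * bits_per_weight / 8)
TRITS_PER_BYTE = 5    # 3**5 = 243 <= 256, so five weights fit in one byte
packed_bytes = math.ceil(PARAMS / TRITS_PER_BYTE)

# Assumed maximum RAM for the machine; not verified against a spec sheet.
CLASSIC_II_RAM_BYTES = 10 * 1024 * 1024


def pack5(trits):
    """Pack five weights from {-1, 0, 1} into one byte as base-3 digits."""
    assert len(trits) == TRITS_PER_BYTE
    value = 0
    for t in trits:
        value = value * 3 + (t + 1)   # map -1, 0, 1 to digits 0, 1, 2
    return value


def unpack5(byte):
    """Inverse of pack5: recover the five weights from one byte."""
    out = []
    for _ in range(TRITS_PER_BYTE):
        out.append(byte % 3 - 1)
        byte //= 3
    return out[::-1]


if __name__ == "__main__":
    print(f"bits per weight: {bits_per_weight:.3f}")
    print(f"ideal size:  {ideal_bytes / 1e6:.2f} MB")
    print(f"packed size: {packed_bytes / 1e6:.2f} MB")
    print("fits in assumed RAM:", packed_bytes < CLASSIC_II_RAM_BYTES)
```

So the weights alone come to roughly 2 MB under those assumptions. Whether it would run at an interactive speed on that hardware is a separate question this doesn't answer.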