That’s cool! I’m really interested to know how many tokens per second you can get with a really good U.2. My gut is that it won’t actually be better than the 24VRAM+96RAM cache setup this user already tested with though.
- 0 Posts
- 16 Comments
How much do you need? Show your maths. I looked it up online for my post, and the website said 1747GB, which is completely in-line with other models.
Can you link that post?
Running R1 locally isn’t realistic. But you can rent a server and run it privately on someone else’s computer. It costs about 10 per hour to run. You can run it on CPU for a little less. You need about 2TB of RAM.
If you want to run it at home, even quantized in 4 bit, you need 20 4090s. And since you can only have 4 per computer for normal desktop mainboards, that’s 5 whole extra computers too, and you need to figure out networking between them. A more realistic setup is probably running it on CPU, with some layers offloaded to 4 GPUs. In that case you’ll need 4 4090s and 512GB of system RAM. Absolutely not cheap or what most people have, but technically still within the top top top end of what you might have on your home computer. And remember this is still the dumb 4 bit configuration.
Edit: I double-checked and 512GB of RAM is unrealistic. In fact anything higher than 192 is unrealistic. (High-end) AM5 mainboards support up to 256GB, but 64GB RAM sticks are much more expensive than 48GB ones. Most people will probably opt for 48GB or lower sticks. You need a Threadripper to be able to use 512GB. Very unlikely for your home computer, but maybe it makes sense with something else you do professionally. In which case you might also have 8 RAM slots. And such a person might then think it’s reasonable to spend 3000 Euro on RAM. If you spent 15K Euro on your home computer, you might be able to run a reduced version of R1 very slowly.
Aria@lemmygrad.mlto Linux@lemmy.ml•"I'll move to Linux if it runs every game I want" just say that you never will move3·4 months agoThe user’s library of apps are Windows apps. And Windows does support Linux programs. There are versions of Windows that don’t technically have it enabled by default, but it’s easy to install support. It has a built in command “wsl --install”, and a button in the store and start-menu. And for most users who get a pre-configured image from IT or their laptop manufacturer it’s pre-installed.
Aria@lemmygrad.mlto Linux@lemmy.ml•"I'll move to Linux if it runs every game I want" just say that you never will move25·4 months agoI really don’t get this community’s insistence on getting people to use Linux no matter how much destruction they bring. Steam games on Linux are not what anyone has in mind when they say Linux doesn’t have games. Because Linux isn’t binary compatibility, it’s libre software.
In my circles, if someone says “Linux such and such”, we assume they might be referring to their FreeBSD computer as well. Here it seems Linux is more likely to refer to Android. Emulating a sketchy Windows game doesn’t make Linux the better platform for games. The Windows games are always going to be best on Windows, and now your Linux computer has malware on it.
It’s completely open to Microsoft and the NSA for sure.
Aria@lemmygrad.mlto Linux@lemmy.ml•Phoronix: Linus Torvalds Comments On The Russian Linux Maintainers Being Delisted1·7 months agoA capitalist is someone who owns capital, not someone who supports capitalism. A liberal is someone who supports capitalism. I don’t think Linus is a liberal, given that he’s the Linux guy. But he’s obviously a capitalist, and that’s okay, that’s something you should strive towards if you live under capitalism, even if ideologically you oppose capitalism.
Aria@lemmygrad.mlto Linux@lemmy.ml•Phoronix: Linus Torvalds Comments On The Russian Linux Maintainers Being Delisted31·7 months agothought-terminating cliché
There’s no argument, it’s the definition of the word. Why do you assume there should be argument around the normal usage of a word?
Aria@lemmygrad.mlto Linux@lemmy.ml•KDE signs petition urging European Union to continue funding free software31·10 months agoGoogle Play get fined and the reward goes to Google Android.
Aria@lemmygrad.mlto Linux@lemmy.ml•Does anyone know why SteamOS is based on arch rather than Debian?2·1 year agoI used to use Gnome with a tiled window manager. It was a good combo. Don’t see why they have to be exclusive. No hate from my side, KDE and Gnome are both incredible. I can spare some hate for the Gnome-haters though.
Oh yeah I did mean cut/paste, my bad.
That’s actually not true. When you cut/paste a file on your computer (for most computers), it’s much faster than copying the file. Deleting the file is also not instant, so copy and delete should be the slowest of the three operations.
When you cut and paste a file, you’re just renaming the file or updating the file database. It’s different how that works depending on your file system, but it typically never involves rewriting much of the data of the file.
Edit: Fixed typo.
So what exactly is this? Open-source ChatGPT-alternatives have existed before and alongside ChatGPT the entire time, in the form of downloading oogabooga or a different interface and downloading an open source model from Huggingface. They aren’t competitive because users don’t have terabytes of VRAM or AI accelerators.
There’s no separate computer, it’s all in the monitor!