@theunknownmuncher

theunknownmuncher@lemmy.world · 8 days ago

Steam Controller off eBay?

theunknownmuncher@lemmy.world · 4 months ago

If it bothers you, you can overwrite the existing model by reusing its name instead of creating one under a new name. Either way, the LLM model file is not duplicated.

theunknownmuncher@lemmy.world · 4 months ago

What I am talking about is when layers are split across GPUs. I guess this is loading the full model onto each GPU to parallelize layers and do batching.

theunknownmuncher@lemmy.world · 4 months ago (edited)

Can you try setting the num_ctx and num_predict using a Modelfile with ollama? https://github.com/ollama/ollama/blob/main/docs/modelfile.md#parameter
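For illustration, here is a rough sketch of what such a Modelfile could contain, generated from Python so the values are easy to tweak. FROM and PARAMETER are the Modelfile instructions from the linked docs; the base model name llama3.2, the numeric values, and the helper render_modelfile are placeholders I picked for the example, not recommendations.

```python
def render_modelfile(base_model, params):
    """Return Modelfile text: one FROM line plus one PARAMETER line per setting."""
    lines = [f"FROM {base_model}"]
    for name, value in params.items():
        lines.append(f"PARAMETER {name} {value}")
    return "\n".join(lines) + "\n"


# Placeholder model name and values; pick ones your model actually supports.
modelfile_text = render_modelfile(
    "llama3.2",
    {"num_ctx": 8192, "num_predict": 256},
)
print(modelfile_text)
```

Saving that text as Modelfile and running ollama create my-model -f Modelfile (my-model being whatever name you like) should then give you a model with those parameters baked in.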

theunknownmuncher@lemmy.world · 4 months ago (edited)

Are you using a tiny model (1.5B-7B parameters)? ollama pulls a 4-bit quant by default. It looks like vllm does not use quantized models by default, so this is likely the difference. Tiny models are impacted more by quantization.

I have no problems with changing num_ctx or num_predict

theunknownmuncher@lemmy.world · 4 months ago

Models are computed sequentially (the output of each layer is the input to the next layer in the sequence), so splitting the layers across more GPUs does not by itself make a single request faster.
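A toy sketch of that dependency chain, in plain Python with made-up layers (nothing here touches a real GPU or framework; run_split and the step counter are purely illustrative). Each layer needs the previous layer's output, so splitting the same layers across more simulated devices leaves the number of sequential steps for a single input unchanged:

```python
def run_split(x, layers, num_devices):
    """Run layers in order, split into contiguous chunks (one per simulated device).

    Returns (output, sequential_steps). A device cannot start its chunk until
    the previous device has finished, so steps are counted one after another.
    """
    chunk = -(-len(layers) // num_devices)  # ceiling division
    steps = 0
    for start in range(0, len(layers), chunk):
        for layer in layers[start:start + chunk]:
            x = layer(x)  # needs the previous layer's output
            steps += 1
    return x, steps


# Eight made-up "layers"; each is just a small arithmetic function.
layers = [lambda v, k=k: v * 2 + k for k in range(8)]

out_1gpu, steps_1gpu = run_split(1, layers, num_devices=1)
out_4gpu, steps_4gpu = run_split(1, layers, num_devices=4)
print(steps_1gpu, steps_4gpu)
```

Both runs produce the same output in the same number of sequential steps, which is the point: for one input, the chain is just as long no matter how many devices hold the layers.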

theunknownmuncher@lemmy.world · 4 months ago (edited)

Ummm… did you try /set parameter num_ctx # and /set parameter num_predict #? Are you using a model that actually supports the context length that you desire…?

theunknownmuncher@lemmy.world · 4 months ago

We don’t, I already have a Steam Deck. Touchpads.

theunknownmuncher@lemmy.world · 4 months ago

The device in the OP is not the Steam Deck.

theunknownmuncher@lemmy.world · 4 months ago

Absurdly massive and yet no space for some touchpads? 🤡

theunknownmuncher@lemmy.world · 4 months ago

Feels good to have never owned a Razer (or Logitech) product

theunknownmuncher@lemmy.world · 4 months ago

My guess is a 32-bit x86 machine

theunknownmuncher@lemmy.world · 4 months ago

4690K was solid! Mine is retired, though. Now I self-host on ARM

theunknownmuncher@lemmy.world · 6 months ago

Okie dokie then!

continues enjoying the Steam Deck

🙂

theunknownmuncher@lemmy.world · 6 months ago

Hey, that’s great that Arma III mods are also realistic! Love to see it. Star Wars Battlefront mods also came close to photorealistic at moments. I’m sure these community-made, by-volunteers-in-their-free-time mods were SO expensive to develop that it spurred a handheld gaming conspiracy, right???

Those dastardly indie teenager devs!!! 🤪

theunknownmuncher@lemmy.world · 6 months ago

Described as “the first photorealistic shooter” by reviewers, there is quite literally no other game that comes close in terms of graphical realism.

https://www.youtube.com/watch?v=SzHfZYClTwo

https://www.gamesradar.com/games/fps/bodycam-brings-fps-action-to-an-ultra-realistic-world-that-has-to-be-seen-to-be-believed/

https://gameranx.com/features/id/505275/article/jaw-dropping-visuals-best-realistic-graphics-games-of-2024/

https://gamerant.com/most-ultra-realistic-games-like-bodycam/

theunknownmuncher@lemmy.world · 6 months ago

The game Bodycam was developed by 2 people, a 17-year-old and a 20-year-old, and has the most realistic graphics I’ve ever seen.

theunknownmuncher@lemmy.world · 6 months ago

And they’ve already lost lmao