Hacker Newsnew | past | comments | ask | show | jobs | submit | mileycyrusXOXO's commentslogin

  Location: Fort Collins, CO
  Remote: preferred
  Willing to relocate: no
  Technologies: typescript / javascript / vue / react / graphql / java / rust / sql
  Résumé/CV: https://jordanmajd.com/cv.html
  Email: me [at] jordanmajd [dot] com
I’m a full stack dev with 13+ years experience building everything from enterprise applications to VR / AR apps (even firmware for margarita machines ). I’ve led teams to build scalable cloud apps by providing a solid architecture, strong design patterns, reusable abstractions and accessibility guidelines. Teaching helped me realize I'm energized by mentoring others and supporting my team’s development.

LMK if you think I'd be a good fit!


  Location: Fort Collins, CO
  Remote: preferred
  Willing to relocate: no
  Technologies: typescript / javascript / vue / react / graphql / java / rust / sql
  Résumé/CV: https://jordanmajd.com/cv.html
  Email: me [at] jordanmajd [dot] com
I’m a full stack dev with 13+ years experience building everything from enterprise applications to VR / AR apps (even firmware for margarita machines ). I’ve led teams to build scalable cloud apps by providing a solid architecture, strong design patterns, reusable abstractions and accessibility guidelines. Teaching helped me realize I'm energized by mentoring others and supporting my team’s development.

LMK if you think I'd be a good fit!


Is it just running on a 256gb server w/ CPU or do you have GPUs as well? I think I'm going to stand up a server tomorrow to do some testing myself


In my case just CPU (it's a Hetzner server, checked in /proc/cpuinfo and it said "AMD EPYC 9454P 48-Core Processor"). I apparently had still in terminal backlog some stats, so I pasted below.

It's not a speed demon but enough to mess around and test things out. Thinking can sometimes be pretty long so it can take a while to get responses, even if 6 tokens/sec is pretty good considering pure CPU setup.

---

prompt eval time = 133.55 ms / 1 tokens ( 133.55 ms per token, 7.49 tokens per second) eval time = 392205.46 ms / 2220 tokens ( 176.67 ms per token, 5.66 tokens per second) total time = 392339.02 ms / 2221 tokens

And my exact command was:

llama-server --model DeepSeek-R1-UD-Q2_K_XL-00001-of-00005.gguf --temp 0.6 -c 9000 --min-p 0.1 --top-k 0 --top-p 1 --timeout 3600 --slot-save-path ~/llama_kv_path --port 8117 -ctk q8_0

(IIRC slot save path argument does absolutely nothing unless and is superfluous, but I have been pasting a similar command around and been too lazy to remove it). -ctk q8_0 reduces memory use a bit for context.

I think my 256gb is right at the limit of spilling a bit into swap, so I'm pushing the limits :)

The --min-p 0.1 was a recommendation from Unsloth page; I think because the quant is going so low in bits, some things may start to misbehave and it is a mitigation. But I haven't messed around enough to say how true that is, or any nuance about it. I think I put --temp 0.6 for the same reason.

To explain to anyone not aware of llama-server: it exposes (a somewhat) OpenAI-compatible API and then you can use it with any software that speaks that. llama-server itself also has a UI, but I haven't used it.

I had some SSH tunnels set up to use the server interface with https://github.com/oobabooga/text-generation-webui where I hacked an "OpenAI" client to it (that UI doesn't have it natively). The only reason I use the oobabooga UI is out of habit so I don't recommend this setup to others.


This is super helpful! Appreciate you taking the time to reply


For years people have dogged on North Korea, Iran, China and Russia talking about how the government controls information by banning apps and by creating firewalls blocking access to parts of the internet. Now when the US introduces censorship people like you welcome it with open arms. Something of value is lost, our ability to access information freely


Think of it like Oreos:

- Text-based blogging platforms: regular Oreos

- Image-based blogging platforms: Double Stuff Oreos

- Short-form video blogging platforms: Mega Stuff Oreos

There are still plenty of high-quality and addictive ways to share information without Tik-tok.

Everyone will be better off without Mega Stuff Oreos, they were an abomination to begin with


"The internet is obviously a massive national security risk, and I find it funny people don't see that"

"Libraries are obviously a massive national security risk, and I find it funny people don't see that."


I'd rather be able to choose to consume propaganda than for than the government to be able to decide what I should and should not consume.


Yeah, the heroin addicts say the same thing about heroin though.


And yet the systems where addicts get actual heroin instead of even worse alternatives usually work better. See Switzerland


That's a poor analogy. It's more like they censor their citizens so we should censor ours!


No, it's a perfect analogy, you just don't like it. If you actually had a valid point, you'd bother to explain the issue, but you didn't. Telling.

The US isn't censoring it based on content anyway -- in fact, the US government's ability to censor much of anything based on content is severely constrained by the First Amendment -- the US doesn't like the fact that it's controlled by the PRC. But blocking businesses from a rival nation is a trade issue, not a speech issue.

China is a rival and opponent of the US on the geopolitical stage. It's entirely reasonable to respond to trade restrictions with trade restrictions.


Exactly, Instagram started as a way for me to interact with my social circle. Show people I personally know what is going on in my life and see what is going on in their life. Instagram later on has slowly tried evolving into something else, but mentally I still view it as a place to share with people in my life. On the other hand, Tiktok is a both a global community and a small niche of people who share the same interests as you where you can make memes, enjoy the same content together, converse and witness trends and ideas in real time


I’m gonna have to get Manic Miners, lots of fond memories playing Rock Raiders with my friends


They can! It takes a while for them to grow to adulthood though


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: