i dont think theres anything that makes it essentiall that llms are non-deterministic though
if you rewrote the math to be all fixed point precision on big ints, i think you would still get the useful LLM results?
if somebody really wanted to make a compiler in an LLM, i dont think that nondetermism is problem
id really imagine an llm compiler being a set of specs, dependency versions, and test definitions to use though, and you'd introduce essential nondetermism by changing a version number, even if the only change was the version name from "experimental" to "lts"
I'm working on a project in this realm. Basically I am building a "personal" spyware/data collection software suite. Kind of in the same realm as ms recall but more focused on security/privacy with sensible cryptographic defaults where needed.
It is basically a server/data store and client agents, currently the agent is only for linux end user devices. The agent records evdev events (keystrokes/mouse movement), currently active window, clipboard history, shell commands issued and browsing behaviour. It runs as its own user and different functionality is compartmentalized into their own processes. Data is encrypted at rest. I'm still looking into how to best handle sensitive data in-memory at runtime.
It stores these events in a persistent queue on the clients and one-way syncs it to the server. If a client is offline for a bit it syncs it when it comes back online. As such, I am also trying to minimize storage used.
The idea is that rather than overwhelmingly linking stuff manually, e.g. with obsidian, locality of reference seems more useful as a baseline. In this data set, links by time are valued the most. In the future I'd like to add also the screenshot/video feature using hashes and perceptual hashes or an RDP like way to store as little data as possible.
For now I'm mostly in the architecting phase but I do have an early working version. Really looking for suggestions architecture wise too. So far I came up with my own binary format to save events on the clients but I'm unsure if it's the right way to go. There are many challenges to be thought about, such as changing hardware configuration (display plugged in), protecting against statistical analysis (e.g. keystroke bursts), deletion of data across sources if required, how to make sure the system can run smoothly for a decade, etc.
This is a super interesting (and refreshingly candid) direction. You’re basically building a local-first “life event ledger” with delayed sync.
Actually, I'm not an expert in this area, but I feel the challenge may not lie in data collection itself, but rather in ensuring the data remains secure, usable, and easy to maintain over many years.
A custom binary format can work, but it could be a long-term maintenance commitment (schema evolution, tooling, corruption recovery).
Agreed. The shell is great for chaining together atomic operations on plaintext. That is to say, it is great for one liners doing that. The main reason probably isn't how it all operates on plain text but how easy it makes it to start processes, do process substitution, redirections, etc.
As soon as you have state accumulating somewhere, branching or loops it becomes chaotic too quickly.
How do they define "VPN" in this? If I make a little wireguard mesh and use an aws vm in another country as the exit node for my traffic, would that go under VPN?
My solution: auto-export to a folder then sync using your preferred method. Use the betterbibtex plugin to rename and move all necessary files. Fiddly to set up, but reliable once it's working.
I think 18 is extreme. Having been born after the inception of the internet, I couldn't imagine not having had access to it. The amount of books, blog posts, knowledge and I suppose also entertainment/fun I would have missed.. safe to say I would be a completely different person. Never had predatory social media and still don't.
When you go that extreme with it, you hurt every kid that is actually curious and using it in a non destructive way. I largely got into computer science due to the early iOS jailbreak scene that existed from around 2009 to ~2014. I got into Linux because the debian package management tools had been ported over to jailbroken ios. I think if you had stripped this stuff from me as a child I would have been on a path to mischief instead. I don't know when you got your entry into your hobbies/things you're passionate about, though remember that the story can look different for every generation and if you cut off access to information/knowledge from curious kids you may just set them on a completely wrong path.
Heck I think the MOST sensible way to go about things is to let the parents decide but no, apparently parents (in the US anyways) need to make the entire internet safe for you little snot nosed brats. I think your dad needed to take you out to the woodshed when you jailbroke your phone, I know I would have but I probably wouldn't have bought you one anyways, lest you turn into some sort of freakshow adolescent gooner with a room full of tissue paper sculptures of your animes or whatever you guys are into nowadays.
reply