I'm so amazed to find out just how close we are to the Star Trek voice computer.
I used to use Dragon Dictation to draft my first novel; I had to learn a 'language' to tell the rudimentary engine how to recognize my speech.
And then I discovered [1] and have been using it for some basic speech recognition, amazed at what a local model can do.
But it can't transcribe any text until I finish recording a file; only then does it start working, so the feedback loop is very slow and batch-like.
And now you've posted this cool solution, which streams audio to a model in a continuous series of small chunks. Amazing, just amazing.
Now if I can only figure out how to contribute that kind of streaming speech-to-text to Handy or a similar tool, local STT will be a solved problem for me.
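In case it helps to picture the streaming loop, here's a rough Python sketch. The `StreamingASR` class is a hypothetical stand-in for whatever model you plug in, not the Handy or Nemotron API; only the microphone capture via the `sounddevice` library is real.

    # Rough sketch of the chunked-streaming idea. StreamingASR is a stand-in
    # for the actual model, not a real API; sounddevice is a real library.
    import queue

    import numpy as np
    import sounddevice as sd

    SAMPLE_RATE = 16_000            # most ASR models expect 16 kHz mono audio
    CHUNK_SECONDS = 0.5             # hand the model half a second at a time

    class StreamingASR:
        """Placeholder: a real implementation keeps encoder/decoder state
        between calls so partial text comes back while you're still talking."""
        def transcribe_chunk(self, chunk: np.ndarray) -> str:
            return ""               # a real model returns newly decoded text here

    audio_chunks: "queue.Queue[np.ndarray]" = queue.Queue()

    def on_audio(indata, frames, time, status):
        # sounddevice calls this for every captured block; queue a copy of it.
        audio_chunks.put(indata[:, 0].copy())

    model = StreamingASR()

    with sd.InputStream(samplerate=SAMPLE_RATE, channels=1,
                        blocksize=int(SAMPLE_RATE * CHUNK_SECONDS),
                        callback=on_audio):
        while True:
            chunk = audio_chunks.get()      # blocks until the next 0.5 s arrives
            print(model.transcribe_chunk(chunk), end="", flush=True)

The point is that transcription happens per chunk as you speak, instead of once after the whole recording is finished.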
Happy to answer questions about this (or work with people on further optimizing the open source inference code here). NVIDIA has more inference tooling coming, but it's also fun to hack on the PyTorch/etc stuff they've released so far.
Thank you for sharing! Does your implementation allow running the Nemotron model on Vulkan? Like whisper.cpp? I'm curious to try other models, but I don't have Nvidia, so my choices are limited.
It's an artifact of the camera. The shutter is open long enough that it averages the image over 33 ms.
At some point in the video you can see that a high-speed camera captures the display correctly.
At the 7-minute mark the industrial 14k fps camera shows essentially zero rollover. The earlier rollover does appear to be an artifact of the cheap consumer-grade high-speed camera used.
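To put rough numbers on it (assuming the consumer camera records at about 30 fps, which is where the 33 ms figure would come from):

\[
t_{\text{exposure}} \le \frac{1}{30\ \text{fps}} \approx 33\ \text{ms},
\qquad
t_{\text{frame}} = \frac{1}{14\,000\ \text{fps}} \approx 71\ \mu\text{s}
\]

So each consumer-camera frame integrates over a window several hundred times longer than a single frame of the industrial camera.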
when it comes to real people, they get sued into oblivion for downloading copyrighted content, even for the purpose of learning.
but when Facebook & OpenAI do it, at a much larger scale, suddenly the laws must be changed.
Swartz wasn’t “downloading copyrighted content…for the purpose of learning,” he was downloading with the intent to distribute. That doesn’t justify how he was treated. But it’s not analogous to the limited argument for LLMs that don’t regurgitate the copyrighted content.
This is not about memory or training. The LLM training process is not being run on books streamed directly off the internet or from real-time footage of a book.
What these companies are doing is:
1. Obtain a free copy of a work in some way.
2. Store this copy in a format that's amenable to training.
3. Train their models on the stored copy, months or years after step 1 happened.
The illegal part happens in steps 1 and/or 2. Step 3 is perhaps debatable - maybe it's fair to argue that the model is learning in the same sense as a human reading a book, so the model is perhaps not illegally created.
But the training set that the company is storing is full of illegally obtained or at least illegally copied works.
What they're doing before the training step is exactly like building a library by taking a portable copier into bookshops and copying every book there.
But making copies for yourself, without distributing them, is different than making copies for others. Google is downloading copyrighted content from everywhere online, but they don't redistribute their scraped content.
Even web browsing implies making copies of copyrighted pages; we can't tell the copyright status of a page without loading it, at which point a copy has already been made in memory.
Making copies of an original you don't own/didn't obtain legally is not fair use. Also, this type of personal copying doesn't apply to corporations making copies to be distributed among their employees (it might apply to a company making a copy for archival, though).
> when it comes to real people, they get sued into oblivion for downloading copyrighted content, even for the purpose of learning.
Really? Or do they get sued for sharing, as in republishing without transformation? Arguably, a URL providing copyrighted content is like offering a Xerox machine.
It seems most of the "sued into oblivion" cases are about resharing, not about getting a copy for yourself.
From my observations: cold start, ease of patching.
If you're running a lot of different JS code or restarting the code frequently, it's faster than Node.
Where it's useful: fuzzing. If you have a library/codebase you want to fuzz, you need to restart the code from a snapshot, and other engines seem to do it slower.
It's also really easy to patch the code, because of the codebase size. If you need to trace/observe some behavior, just do it.
Salesforce sandboxing is too easy to escape. The last time I needed to implement a feature for Salesforce, I encountered four different escapes. It was also a horrible dev experience.
It's not about being poor.
First, until about 10 years ago the climate didn't require AC in most of Europe. You had a few hot days, and that was it.
Second, thermal insulation in the US is extremely poor. I think people could cut their AC usage in half if their houses were properly insulated.
Third, the northern European countries still don't have a climate that justifies buying an AC.
Specifically, American houses lack thermal mass because they're built mainly from wood. Concrete and brick will buffer a week or so of heat before they warm up too much.
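To put rough numbers on that buffering effect, here's a back-of-the-envelope Python sketch; the wall area, thickness, material constants and heat gain are all my own illustrative assumptions, not measurements of any real house.

    # Back-of-the-envelope: how long can a masonry envelope soak up a steady
    # heat gain before the mass warms noticeably? All values are assumed.
    WALL_AREA_M2 = 120          # assumed exposed wall area
    WALL_THICKNESS_M = 0.20     # assumed solid brick thickness
    BRICK_DENSITY = 1800        # kg/m^3, typical for fired brick
    BRICK_SPECIFIC_HEAT = 840   # J/(kg*K)
    NET_HEAT_GAIN_W = 1000      # assumed average net gain during a heat wave

    mass_kg = WALL_AREA_M2 * WALL_THICKNESS_M * BRICK_DENSITY
    heat_capacity_j_per_k = mass_kg * BRICK_SPECIFIC_HEAT

    # Energy needed to warm the walls by 5 K, divided by the gain rate:
    seconds = 5 * heat_capacity_j_per_k / NET_HEAT_GAIN_W
    print(f"{mass_kg/1000:.0f} t of brick, ~{seconds/86400:.1f} days to warm by 5 K")

With these assumptions that's roughly two days; a smaller net gain (shading, cool nights to flush the heat) stretches it toward a week, while a wood-framed wall has only a small fraction of that mass to absorb the same heat.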
In Florida, most of the homes are built from concrete brick with wood trusses. There are apartments made from wood and concrete.
It's not just the heat; it's also the humidity. You can bear up to 80 °F before it starts to feel uncomfortable, but humidity will make even 75 °F uncomfortable.
Relative humidity isn't a great indicator of comfort. It's better to look at dew point. The Netherlands is not only cooler on average but also has a lower dew point. This shouldn't be surprising given each country's latitudes.
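For anyone who wants to compare for themselves, dew point can be estimated from temperature and relative humidity with the Magnus approximation. A small Python sketch, where the coefficients are the commonly used Magnus constants and the example inputs are purely illustrative, not measured data for either place:

    import math

    def dew_point_c(temp_c: float, rel_humidity_pct: float) -> float:
        """Approximate dew point (deg C) via the Magnus formula."""
        a, b = 17.27, 237.7  # common Magnus coefficients for roughly -30..+35 C
        gamma = (a * temp_c) / (b + temp_c) + math.log(rel_humidity_pct / 100.0)
        return (b * gamma) / (a - gamma)

    # Illustrative only: the same relative humidity feels very different
    # at Dutch-summer vs Florida-summer temperatures.
    print(round(dew_point_c(22, 80), 1))  # ~18 C dew point: humid but tolerable
    print(round(dew_point_c(32, 80), 1))  # ~28 C dew point: oppressive
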
Both regions have high humidity, but Florida's is higher on average, particularly in summer. Florida has a subtropical to tropical climate with high temperatures and relative humidity often in the 70-90% range year-round, and the summer months bring frequent afternoon thunderstorms.
The Netherlands has a temperate maritime climate, influenced by the North Sea.
Florida and the Netherlands are simply not comparable.
I'm sorry, but it is just mind-boggling to suggest that the Netherlands and Florida have comparable weather in any sense. You wouldn't suggest that the weather in the Netherlands is as hot as in, say, Italy or Greece, and Florida is even hotter than those two.
I'm not saying it's as hot here as it is in Florida. But we've been breaking records left and right, to the point where I bought an AC (a crappy mobile one, for lack of better options for rented apartments here), because every summer there are now months where I can barely sleep without it.
My point was that people often don't realize how humid it is here (you apparently can't believe it either), and that our buildings are built to keep heat in rather than out. So I expect many more ACs to be sold here in the coming years.
It might just be a month or two each year, and it might be worse where you are, but it's already getting pretty bad here thanks to climate change. And that's not going to improve anytime soon, thanks to all of us.
Yes, mostly by using insulating (double) glazing that lets warmth in as light, which then heats up the interior. Think greenhouses. Surround that with poorly insulated walls and limited ventilation: in cold weather they leak heat out, while in warm weather they also heat up in the sun and radiate it inward.
Any home with a natural ACH (air changes per hour) of 1 that's attempting to condition the air (heating or cooling) is wasting a mind-boggling percentage of the energy. Surely that's not the natural ventilation rate of the _typical_ home? That would imply that 50% of homes are worse.
Artificial neural networks work the following way: you have a bunch of "neurons", each with several inputs and an output. A neuron's inputs have weights associated with them; the larger the weight, the more influence that input has on the neuron. These weights need to be represented in our computers somehow, and people usually use IEEE 754 floating point numbers. But those numbers take a lot of space (32 or 16 bits each).
So one approach people have invented is to use a more compact representation of these weights (10, 8, down to 2 bits). This process is called quantisation. A smaller representation makes running the model faster because models are currently limited by memory bandwidth (how long it takes to read the weights from memory), so going from 32 bits to 2 bits potentially gives a 16x speed-up. The surprising part is that the models still produce decent results even after a lot of the information in the weights has been "thrown away".
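As a toy illustration of the idea (not how any particular framework implements it), here's a minimal Python sketch of symmetric 8-bit quantisation of a weight matrix:

    import numpy as np

    def quantize_int8(weights):
        """Symmetric per-tensor quantisation: fp32 weights -> int8 plus one fp32 scale."""
        scale = np.abs(weights).max() / 127.0   # map the largest |weight| onto 127
        q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
        return q, np.float32(scale)

    def dequantize(q, scale):
        """Recover approximate fp32 weights for use during inference."""
        return q.astype(np.float32) * scale

    w = np.random.randn(4096, 4096).astype(np.float32)   # a made-up layer's weights
    q, scale = quantize_int8(w)

    print(w.nbytes // 2**20, "MiB fp32 ->", q.nbytes // 2**20, "MiB int8")   # 64 -> 16
    print("max abs rounding error:", float(np.abs(w - dequantize(q, scale)).max()))

Real low-bit schemes (4 or 2 bits) typically store a separate scale per small block of weights rather than one per tensor, which is roughly what GGUF-style formats do, but the idea is the same: trade a little precision for a lot less memory traffic.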
Not a browser, but a PWA: a web page which you can "install" as an "app". Features like storage, background tasks, and notifications are important for many applications, a messenger for example. They were available, and there is a market for them, but Apple has decided to kill that market.
https://huggingface.co/nvidia/nemotron-speech-streaming-en-0...
https://github.com/m1el/nemotron-asr.cpp https://huggingface.co/m1el/nemotron-speech-streaming-0.6B-g...