Having been a member of the robot learning community both in grad school and now in industry, I'd like to give credit where it's due, since TRI seems to be receiving most of the praise (deservedly so, I'll agree wholeheartedly):
The core of these advancements is powered by Diffusion Policy [1], which Prof. Shuran Song's lab at Columbia (before her recent move to Stanford) developed and pioneered. I'd suggest everyone view the original project website [2]; it has a ton of amazing, challenging real-world experiments.
It was a community favorite for the Best Paper Award at the R:SS conference [3] this year. I remember our lab (and every other learning lab in our robotics department) absolutely dissecting this paper. I know people who have pivoted entirely away from their behavior cloning/imitation learning projects to this approach, which handles multi-modal action distributions much more naturally than the aforementioned approaches.
Prof. Song is an absolute rockstar in robotics right now, with several wonderful approaches that scale elegantly to the real world, including IRP [4] (which won Best Paper at R:SS 2022), FlingBot [5], Scaling Up and Distilling Down [6], and more. I recommend checking out her lab website too.
To be fair, they do credit Professor Song and the paper you linked. TRI is also listed as a collaborator on the paper.
> Diffusion Policy: TRI and our collaborators in Professor Song’s group at Columbia University developed a new, powerful generative-AI approach to behavior learning. This approach, called Diffusion Policy, enables easy and rapid behavior teaching from demonstration.
I haven't read the Diffusion Policy paper yet, so I don't know what they do differently. But I can ELI5 image diffusion models, like Stable Diffusion. Essentially, you add random noise to an image and then ask the model to predict that noise, such that if you subtract the noise the model predicts, you recover the original image. Once the model has been trained well enough on this noise-removal task, you can feed it pure random noise, ask it to predict the noise in that noise-only image, subtract a little bit of what it suggests, and repeat. After many such steps, all the noise is removed and you end up with an image "dreamed" by the model from random noise. You can also condition the noise removal on things like text or other images to guide the process toward a particular target.
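That iterative "predict the noise, subtract a bit of it, repeat" loop can be sketched in a few lines. This is a toy illustration only, not the actual Stable Diffusion or Diffusion Policy sampler: `predict_noise` here is a hypothetical stand-in for the trained neural network, and the update rule is deliberately simplified (real samplers use a learned noise schedule and re-inject some noise at each step).

```python
import numpy as np

def predict_noise(noisy_image, t):
    # Stand-in for a trained denoising network (e.g. a U-Net).
    # Here we just pretend a fixed fraction of the current signal is noise,
    # purely so the loop below is runnable.
    return noisy_image * 0.1

def sample(image_shape=(8, 8), steps=50, seed=0):
    """Start from pure Gaussian noise and iteratively remove predicted noise."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(image_shape)   # the "noise only" starting image
    for t in reversed(range(steps)):
        eps = predict_noise(x, t)          # model's guess of the noise present
        x = x - eps                        # remove a little of that noise
    return x                               # the "dreamed" image
```

Conditioning (on text, other images, etc.) would simply mean passing that extra information into `predict_noise` so the predicted noise steers each step toward the target.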
You may as well credit the information theorists, mathematicians, and physicists who laid out the fundamentals that brought us here.
They died before hardware caught up with their decades-old visions. Not much of this work is net-new description; it's more a matter of reconciling old descriptions with observation, now that we can actually build the old ideas.
[1] - https://arxiv.org/abs/2303.04137
[2] - https://diffusion-policy.cs.columbia.edu/
[3] - https://roboticsconference.org/program/awards/
[4] - https://irp.cs.columbia.edu/
[5] - https://flingbot.cs.columbia.edu/
[6] - https://www.cs.columbia.edu/~huy/scalingup/