Typical CNNs can learn any linear transformation of the input at essentially no cost: the first convolutional layer can represent it directly. Since YUV is exactly such a linear transformation of RGB, there is no benefit in converting to it beforehand.
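To make the "linear transformation" point concrete, here is a minimal NumPy sketch showing that a 1×1 convolution over an image is just a per-pixel matrix multiply, so fixing (or learning) its weights as the RGB→YUV matrix reproduces the color conversion exactly. The BT.601 coefficients below are one common choice; the function name `conv1x1` is just illustrative.

```python
import numpy as np

# BT.601 RGB -> YUV matrix (one common set of coefficients)
M = np.array([[ 0.299,    0.587,    0.114  ],
              [-0.14713, -0.28886,  0.436  ],
              [ 0.615,   -0.51499, -0.10001]])

def conv1x1(img, weights):
    """A 1x1 convolution: img is (H, W, C_in), weights is (C_out, C_in).
    Each pixel's channel vector is mapped by the same linear transform."""
    return img @ weights.T

rgb = np.random.rand(4, 4, 3)
yuv = conv1x1(rgb, M)

# Matches the deterministic per-pixel conversion exactly
reference = np.einsum('oc,hwc->hwo', M, rgb)
assert np.allclose(yuv, reference)
```

Since the first conv layer of a CNN contains this map as a special case, training can absorb (or ignore) the color-space change on its own.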
How is there no cost in forcing the machine to learn something we already have a simple, deterministic algorithm for? Won't some engineer need to double-check the AI's idea of a color-space transform?
You could probably derive a smart initialization for the first layer of a NN from domain knowledge (color spaces, Sobel filters, etc.). But since this is such a small part of what the NN has to learn, I expect it would yield only a small improvement in training time and no effect on final performance or accuracy, so it's unlikely to be worth the complexity of developing such a feature.
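For illustration, a domain-informed initialization like the one described above might look as follows: a few filters of the first conv layer start as known operators (Sobel edges, BT.601 luma weights) while the rest stay random. This is a hedged sketch; the function name `smart_init` and the choice of which filters to seed are assumptions, not an established recipe.

```python
import numpy as np

def smart_init(out_channels, in_channels=3, k=3, seed=None):
    """Initialize a (out_channels, in_channels, k, k) conv kernel bank:
    the first few filters encode known operators, the rest are random."""
    rng = np.random.default_rng(seed)
    w = rng.normal(0.0, 0.1, size=(out_channels, in_channels, k, k))

    sobel_x = np.array([[-1, 0, 1],
                        [-2, 0, 2],
                        [-1, 0, 1]], dtype=float)
    luma = np.array([0.299, 0.587, 0.114])  # BT.601 luma weights

    w[0] = sobel_x      # filter 0: horizontal edges (broadcast over channels)
    w[1] = sobel_x.T    # filter 1: vertical edges
    w[2] = 0.0          # filter 2: per-pixel luma, i.e. a 1x1 color
    w[2, :, 1, 1] = luma  # transform embedded in the kernel center
    return w

W = smart_init(16)
```

The remaining filters train from scratch as usual; only the seeded ones encode prior knowledge, which is why the expected gain is limited to early training speed.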
Your instincts are correct. Training is faster, more stable, and more efficient that way. In certain cases it is pretty much irrelevant, but the advantages of modelling the knowns and training only on the unknowns become starkly apparent when doing, e.g., sensor fusion or other ML tasks on physical systems.