Discussion about this post

User's avatar
Neural Foundry's avatar

Strong take on the limitations of scaling-first approaches. The cognitive vs statistical framing cuts through alot of the hype. I've been skeptical of RLHF solving alignment issues precisely because it's bolted on rather than fundamental to the architecture. The energy comparison (20 watts vs gigawatts) really hammers home how far we are from biological efficieny. Neuro-symbolic architectures make sense but dunno if the commercial incentives will shift anytime soon.

Richard Self's avatar

As you say, all the approaches are by advocates flailing around in desperation, trying all sorts of kludges that haven't and will never work. We can confidently say this based on the fundamentals of the transformer and diffusion model.

4 more comments...

No posts

Ready for more?