Hacker News | armanified's comments

True, but most would ignore an LM if it weren't an LLM.

This title might have triggered something in those bots; most of them have sneaky AI SaaS links in their bio.

Honestly, I never expected this post to become so popular. It was just the outcome of a weekend practice session.


OMG! Why didn't I think of this first :P

OMG! You just gave me the next idea.

Pretty neat! I'll definitely take a deeper look into this.

Uppercase letters were intentionally ignored.

My initial idea was to train a navigation decision model with 25M parameters for a Raspberry Pi, which, in testing, was getting about 60% of tool calls correct. IMO, it seems like around 20M parameters would be a good size for following some narrow & basic language instructions.
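To get a feel for what a 20-25M parameter budget means, here is a rough parameter-count sketch for a small GPT-style decoder-only transformer. The config values (vocab size, width, depth) are hypothetical, chosen only to land near that range; they are not the actual model's settings.

```python
# Rough parameter count for a small GPT-style decoder-only transformer.
# All config values below are hypothetical illustrations.

def transformer_params(vocab, d_model, n_layers, d_ff, ctx):
    emb = vocab * d_model + ctx * d_model          # token + positional embeddings
    attn = 4 * d_model * d_model + 4 * d_model     # Q, K, V, output projections (+ biases)
    mlp = 2 * d_model * d_ff + d_model + d_ff      # two MLP linear layers (+ biases)
    norms = 2 * 2 * d_model                        # two LayerNorms (scale + shift)
    per_layer = attn + mlp + norms
    return emb + n_layers * per_layer + 2 * d_model  # + final LayerNorm

# Hypothetical config that lands near 20M parameters
n = transformer_params(vocab=16000, d_model=384, n_layers=8, d_ff=1536, ctx=512)
print(f"{n / 1e6:.1f}M parameters")  # about 20.5M
```

Note how the embedding table alone eats roughly 6M of the budget here, which is part of why capacity at this scale is so tight.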

Ok. This makes me wonder about a broader question. Is there a scientific approach showing a pyramid of cognitive functions, and how many parameters are (minimally) required for each layer in this pyramid?

It mostly doesn't; at 9M parameters it has very limited capacity. The whole idea of this project is to demonstrate how Language Models work.
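In the same demonstrative spirit, here is a minimal character-level bigram "language model". This is not the project's architecture, just an illustration of the core idea that even a tiny model learns next-token statistics from text.

```python
# A minimal character-level bigram language model: count which character
# follows which, then sample from those counts to generate text.
from collections import Counter, defaultdict
import random

def train_bigram(text):
    counts = defaultdict(Counter)
    for a, b in zip(text, text[1:]):
        counts[a][b] += 1
    return counts

def generate(counts, start, length, seed=0):
    rng = random.Random(seed)
    out = [start]
    for _ in range(length):
        options = counts.get(out[-1])
        if not options:
            break  # dead end: no observed successor
        chars, weights = zip(*options.items())
        out.append(rng.choices(chars, weights=weights)[0])
    return "".join(out)

model = train_bigram("the quick brown fox jumps over the lazy dog ")
print(generate(model, "t", 20))
```

Scaling this idea up (longer context, learned representations instead of raw counts) is essentially what the neural model does with its 9M parameters.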

I haven't compared it with anything yet. Thanks for the suggestion; I'll look into these.

I intentionally removed all optimizations to keep it vanilla.
