I think spatial tokens could help, but they're not really necessary. Lots of physics/physical tasks can be solved with pencil and paper.
On the other hand, it's amazing that a 512x512 image can be represented by 85 tokens (as in OAI's API), or 263 tokens per second for video (with Gemini). It's as if the memory vs compute tradeoff has morphed into a memory vs embedding question.
This dichotomy reminds me of the "apple rotators" question - can you rotate an apple in your head? The spatial embeddings will likely solve dynamics questions a lot more intuitively (i.e., without extended thinking).
We're also working in this space at FlyShirley - training pilots to fly, then training Shirley to fly - where we benefit from established simulation tools. Looking forward to trying Fei-Fei's models!
They're building on a false premise that human equivalent performance using cameras is acceptable.
That's the whole point of AI - when you can think really fast, the world is really slow. You simulate things. Even with lifetimes of data, the cars will still fail in visual scenarios where the error bars on ground truth shoot through the roof.
Elon seems to believe his cars will fail in similar ways to humans because they use cameras. False premise. As Waymo scales, human-level just isn't good enough, except for humans.
So, I agree with what you're saying, but that doesn't matter.
The legal standing doesn't care what tech is behind it - it could be 1000 monkeys for all it matters. The point is that Level 3 is the most dangerous level, because neither the public nor the manufacturer properly operates in this space.
Definitely curious what circuits light up from a Neuralese perspective. We want reasoning traces that are both faithful to the thought process and interpretable. If the other language segments are lighting up meanings much different than their translations, that would raise questions for me.
Yes absolutely this! We're working on these problems at FlyShirley for our pilot training tool. My go-to is: I'm facing 160 degrees and want to face north. What's the quickest way to turn and by how much?
For small models, and when attention is "taken up", these sorts of questions really throw a model for a loop. Agreed - especially noticeable with small reasoning models.
I just tried this with a smaller "thinking" model (deepseek distill, running locally) and boy are you right. It keeps flipping between which direction it should turn, second guessing its thought process and then getting sidetracked with a different approach.
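The heading question above has a clean closed-form answer, which is part of why it makes a good probe: the model only has to reason about modular arithmetic on a compass. A minimal sketch (my own illustrative helper, not from any mentioned tool):

```python
def shortest_turn(heading, target):
    """Return (direction, degrees) for the quickest turn from heading to target.

    Headings are in degrees, 0 = north, increasing clockwise.
    """
    diff = (target - heading) % 360
    if diff == 0:
        return ("none", 0)
    if diff <= 180:
        return ("right", diff)
    return ("left", 360 - diff)

# Facing 160 degrees and wanting to face north (0 degrees):
# right turn would be 200 degrees, left turn is 160 degrees.
print(shortest_turn(160, 0))  # ('left', 160)
```

So the expected answer is "turn left 160 degrees" - exactly the kind of two-branch modular reasoning that small thinking models flip-flop on.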
Hello! On this topic: could you please look into making the path that 'next/image' uses for caching images user-definable? Currently I can't use Google App Engine Standard because the directory it uses is write-protected. The only real solution seems to be custom image providers, which is a drag, so I'm on App Engine Flex spending way more than I probably should :-)
I did try, but in the wrong way: I applied SVD to the quantization error to recover quality, i.e. SVD(W - Q(W)). The lightbulb moment in this paper is to do SVD on W first and then quantize the remainder.
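The two orderings can be contrasted in a few lines of NumPy. This is only a toy sketch with a made-up uniform quantizer and an arbitrary rank; it illustrates the structural difference, not the paper's actual method:

```python
import numpy as np

def quantize(W, bits=4):
    """Toy symmetric uniform quantizer (illustrative only)."""
    scale = np.abs(W).max() / (2 ** (bits - 1) - 1)
    return np.round(W / scale) * scale

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64))
k = 8  # arbitrary rank for the low-rank part

# My attempt: quantize W, then SVD the quantization error.
E = W - quantize(W)
U, S, Vt = np.linalg.svd(E, full_matrices=False)
W_mine = quantize(W) + (U[:, :k] * S[:k]) @ Vt[:k]

# The paper's ordering: take a low-rank part of W first,
# then quantize only the residual.
U, S, Vt = np.linalg.svd(W, full_matrices=False)
L = (U[:, :k] * S[:k]) @ Vt[:k]
W_paper = L + quantize(W - L)

print(np.linalg.norm(W - W_mine), np.linalg.norm(W - W_paper))
```

The key difference: in the first ordering the SVD is asked to model the quantization noise (which is nearly full-rank and structureless), while in the second the low-rank part soaks up the large singular directions of W before quantization ever happens.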