Thought

Notes about Gemini 3

Rarely has an AI model arrived with such unanimous anticipation across the industry. In many respects, Gemini 3 feels like Google’s “GPT-4 …

Anthropic recently published a blog post on Best practices for prompt engineering. After reading it, I believe it offers an excellent …

I’ve been waiting for Gemini 3 for a long time, and this week I finally got to test it on the Gemini mobile app with Canvas enabled. While …

I read the blog post from AnthropicAI and took some notes: This article explains the core components of Claude’s agentic architecture, designed …

I read the paper Puzzled by Puzzles: When Vision-Language Models Can’t Take a Hint from UC Berkeley. The authors built a hand-crafted …

Qwen3-Max Thinking was quietly released on Sunday. Earlier in the week, the team had promised it would arrive within the week. After putting it …

I found a new benchmark paper from Meituan: AMO-Bench: Large Language Models Still Struggle in High School Math Competitions. This paper …

Anthropic just released a new post on emergent introspective awareness in LLMs. Here are my notes: The key experiment: the team injected …

I saw that the paper LLMs can get “Brain Rot” is very popular on my X timeline, and there is a lot of discussion about it. I just read this …

Moonshot AI has open-sourced its own coding agent, kimi-cli. Built in Python, the codebase is approachable for anyone who wants to learn how …