o1 "thinks through" problems before providing solutions. o1 is not GPT. It excels at tasks requiring planning and iteration. o1 doesn't surpass GPT-4o in creative writing. o1 successfully solved the "river crossing" riddle and a crossword puzzle. This unlocks a new paradigm of model pre-training. o1 is great in many ways, but it isn't superior in all areas.
Backend Software Engineer
👋 I’m Cedric Chee. I’ve been a software engineer, writer, and entrepreneur.
I code and write about it sometimes. I build systems software and apps in Go/JS.
I do product engineering and web development at startups/consulting. I enjoy backend development.
I’m currently focusing on Large Language Models (LLMs). I tinker with LLMs and AI systems at night.
Recent Posts
Llama-3.1-Minitron 4B is a Smaller and Accurate LLM
NVIDIA developed a method to create a smaller yet accurate LLM, known as Llama-3.1-Minitron 4B, using structured weight pruning and knowledge distillation.
Prompt Caching with Anthropic Claude
This is huge! Prompt caching enables you to load vast amounts of data into the context window. This will unlock a wide range of new use cases. I'm so pumped.
🐐 Llama 3.1 405B Matches Or Beats The Top Foundation Models
Meta's flagship model, Llama 3.1 405B, is the first open weights model that is competitive with the top foundation models across a range of tasks, including GPT-4, GPT-4o, and Claude 3.5 Sonnet. The smaller models, 8B and 70B, are competitive with Gemma 2 9B, Mistral 7B, Mixtral 8x22B, and GPT 3.5. The upgraded versions are more capable, with a longer context length of 128k, multilingual support, and advanced tool use.
Llama 3.1 Leaks: SoTA Open Model 405B & What We Know So Far
8B gets a big bump across the board, 70B instruct shows minor improvements, and 405B is the SoTA open model. But 405B still lags behind flagship models.
Co-Intelligence: Living and Working with AI - A Book Review
📚 I recently finished "Co-Intelligence" by Ethan Mollick. Is it the Gen AI guidebook you've been waiting for? 🚀 It's all about the sociological perspective of living and working with AI. A solid primer to AI for most readers, but not suited for those with advanced AI knowledge.
Vibe Checking Claude 3.5, DeepSeek-Coder-V2, and GPT-4o for "Alien" Coding Skills
This evaluation provides insights into the current capabilities of leading AI models in solving complex coding problems. While Claude 3.5 Sonnet showed superior performance in this specific task, all models demonstrated the ability to produce correct solutions with varying degrees of assistance. These findings underscore the importance of conducting independent evaluations to verify public benchmarks and understand the nuanced strengths and limitations of different AI models.
Claude 3.5 Sonnet
Anthropic Claude 3.5 Sonnet takes the top spot on the leaderboards. It surpasses GPT-4o.
Book Therapy
For this round of book therapy, I will be reading two books, "Experts vs. Imitators" and "Just Enough Software Architecture".
Cutting through to what matters
The importance of focusing on foundational principles and high-impact work in technology.
Why You Should Learn C: Uncovering the Hidden Benefits
I argue for the importance of learning the C programming language despite its lack of trendiness in modern software development.
How Git Works
It sucks to be afraid of the tools that you use in your work every day.
Designing Machine Learning (ML) Systems Book Summary
A chapter by chapter detailed summary of the book.
The Bitter Lesson by Rich Sutton
AI research shows that leveraging computation through general methods like search and learning is far more effective than incorporating human knowledge.
Google Gemini 1.5 Vibe Check
Vibe checking the latest Google Gemini models by asking them about fine-tuning Transformer tools.
The Assembly Language Period of LLMs and Generative AI
Signs that we're living in the assembly language period of LLMs and Generative AI.
Deconstructing GPT-4o
Tech dive into GPT-4o covering techniques, data, systems, and tricks.
GPT-4o: HER Went From Science Fiction to Reality
The new advanced voice and video mode is groundbreaking! Audio AI models are having their Stable Diffusion moment too.
Voice AI Research
Making computers talk like humans. My attempts, and some notes about them along the way.
AI Agent UX
Augmentation is composed of smaller automations. AI helps generate levels of abstraction.