@nikitabier @business bloomberg being suddenly interested in your take on developer experience and ai coding tools is the new "sexy singles in your area" https://t.co/PBnXTxam0V
Alignment research often has to focus on averting concerning behaviors, but I think the positive vision for this kind of training is one where we can give models and honest and positive vision for what AI models can be and why. I'm excited about the future of this work. https://t.co/4BnLVNjnEY https://t.co/AYIilWyJLB
HTML is the new markdown.
I've stopped writing markdown files for almost everything and switched to using Claude Code to generate HTML for me. This is why. https://t.co/T97m0lIDx1
An early Claude Mythos Preview snapshot we provided METR has a time horizon of more than 2x the next best model on their 80% success rate benchmark https://t.co/xOrewjvFIF https://t.co/DVUWVaubCg
A common trend emerging in larger enterprises is token budgeting as a major topic. As agents can do more and more long running tasks, and thus take vastly more compute, allocation of tokens across teams becomes a very real thing in the enterprise.
Companies spend a meaningful amount of time deciding how much to spend on talent, marketing campai...
Downloading now... 1M token context window with supposedly usable coding agent capability all on a 128GB Macbook Pro is 🤯 https://t.co/otTL8NZMvV https://t.co/YfiNYM06Zu
The more I think about AI agents, the less obvious it is that pricing goes purely consumption-based
Token costs matter... but enterprise agents may need identities, roles, auth, budgets, audit logs etc
That sounds oddly seat-like? just not human-seat-like
Built a "YouTube realtime copilot" browser extension using OpenAI's realtime 2 API:
The agent watches the video alongside you, and can answer any question you have about what was just said via realtime voice chat.
The crazy part to me is: It can differentiate the YouTube's audio stream and your voice, so it doesn't confuse the video as comman...
All 32 Beautiful HTML Slide Templates are now available on AnyGen, it's plug-and-play even for those without a coding agent
Use them now: https://t.co/HDcFWYzJXa https://t.co/BYSpiI1Bts https://t.co/Ugi2IUbxRN
Seeing the clear dichotomy in my responses and that’s making me feel this even stronger..
Every large company manager: you know nothing, we need to be invested, this is how it’s always been done.
Every startup manager: thanks for saying this, yes, feel this in my bones.
🤷♂️
Generational opportunity for anyone in AI to play the markets given this time lag.
Guarantee everyone is psyched about Codex in a few months. Invest accordingly