stp2y

31137 Posts
AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

Last year, the team began experimenting with a tiny model that uses only a single layer of neurons. (Sophisticated LLMs have dozens of layers.) The hope was that in the simplest possible setting they could discover patterns that designate features. They ran countless experiments with no success. “We tried a whole bunch of stuff, and nothing was working. It looked like a bunch of random garbage,” says Tom Henighan, a member of Anthropic’s technical staff. Then a run dubbed “Johnny”—each experiment was assigned a random name—began associating neural patterns with concepts that appeared in its outputs.“Chris looked at it, and…
Read More
ChatGPT is now better than ever at faking human emotion and behaviour

ChatGPT is now better than ever at faking human emotion and behaviour

Earlier this week OpenAI launched GPT-4o (“o” for “omni”), a new version of the artificial intelligence (AI) system powering the popular ChatGPT chatbot. GPT-4o is promoted as a step towards more natural engagement with AI. According to the demonstration video, it can have voice conversations with users in near real-time, exhibiting human-like personality and behaviour. This emphasis on personality is likely to be a point of contention. In OpenAI’s demos, GPT-4o sounds friendly, empathetic and engaging. It tells “spontaneous” jokes, giggles, flirts and even sings. The AI system also shows it can respond to users’ body language and emotional tone.…
Read More
Predictive Human Preference: From Model Ranking to Model Routing

Predictive Human Preference: From Model Ranking to Model Routing

A challenge of building AI applications is choosing which model to use. What if we don’t have to? What if we can predict the best model for any prompt? Predictive human preference aims to predict which model users might prefer for a specific query. Table of contents Ranking Models Using Human Preference…. How Preferential Ranking Works…. Correctness of Chatbot Arena Ranking…….. Eval data…….. ResultsPredicting Human Preference For Each Prompt…. Experiment setup…. Experiment results…….. Domain-specific and query-specific leaderboardsConclusion Human preference has emerged to be both the Northstar and a powerful tool for AI model development. Human preference guides post-training techniques including…
Read More
GenAI Just Got A Little Less Opaque

GenAI Just Got A Little Less Opaque

Yesterday, the AI startup Anthropic published a paper detailing the successful interpretation of the inner workings of a large language model (LLM). LLMs are notoriously opaque — their size, complexity, and numeric representation of human language have hitherto defied explanation — so it’s impossible to understand why inputs lead to outputs. Anthropic used a technique called dictionary learning, leveraging a sparse encoder to isolate specific concepts within its Claude 3 Sonnet model. The technique allowed them to extract millions of features, including specific entities like the Golden Gate Bridge as well as more abstract ideas such as gender bias. They…
Read More
Generating fashion product descriptions by fine-tuning a vision-language model with SageMaker and Amazon Bedrock | Amazon Web Services

Generating fashion product descriptions by fine-tuning a vision-language model with SageMaker and Amazon Bedrock | Amazon Web Services

In the world of online retail, creating high-quality product descriptions for millions of products is a crucial, but time-consuming task. Using machine learning (ML) and natural language processing (NLP) to automate product description generation has the potential to save manual effort and transform the way ecommerce platforms operate. One of the main advantages of high-quality product descriptions is the improvement in searchability. Customers can more easily locate products that have correct descriptions, because it allows the search engine to identify products that match not just the general category but also the specific attributes mentioned in the product description. For example,…
Read More
Tech policy is only frustrating 90% of the time

Tech policy is only frustrating 90% of the time

Many technologists stay far away from public policy. That’s understandable. In our experience, most of the time when we engage with policymakers there is no discernible impact. But when we do make a difference to public policy, the impact is much bigger than what we can accomplish through academic work. So we find it fruitful to engage even if it feels frustrating on a day-to-day basis.In this post, we summarize some common reasons why many people are cynical about tech policy and explain why we’re cautiously optimistic. We also announce some recent writings on tech policy as well as an…
Read More
Inside the Pod: A Guide to Launching a Successful One-person Company

Inside the Pod: A Guide to Launching a Successful One-person Company

Thank you to everyone who is watching or listening to my podcast, How Do You Use ChatGPT? If you want to see a collection of all of the prompts and responses in one place, Every contributor Rhea Purohit is breaking them down for you to replicate. Let us know in the comments if you find these guides useful. —Dan ShipperWas this newsletter forwarded to you? Sign up to get it in your inbox.In the first week of April, I came face-to-face with the part of being a freelancer I hate the most: tax season.The idea of wading through invoices, account…
Read More
Scale AI

Scale AI

I’m excited to share Generational’s inaugural growth & late-stage company briefing with a deep dive on Scale AI, blending analytical rigor with feature writing. Disclaimer: I have a financial interest in Scale. Don’t take this as investment advice.In this deep dive, you’ll learn insights from conversations with Scale’s customers, ex-employees, and competitors. I could do this thanks to Tegus, which centralizes expert calls into a single platform. Nothing beats primary research when it comes to understanding a company. If you’re curious about Tegus, try them out with this link.Scale AI accelerates the development of AI applications through services and software.…
Read More
Poking parts of Sonnet’s brain to make it less annoying

Poking parts of Sonnet’s brain to make it less annoying

In a groundbreaking new paper (actually groundbreaking, IMO), researchers at Anthropic have scaled up an interpretability technique called "dictionary learning" to one of their deployed models, Claude 3 Sonnet. The results provide an unprecedented look inside the mind of a large language model, revealing millions of interpretable features that correspond to specific concepts and behaviors (like sycophancy) and shedding light on the model's inner workings. In this post, we'll explore the key findings of this research, including the discovery of interpretable features, the role of scaling laws, the abstractness and versatility of these features, and their implications for model steering…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.