Research Papers in February 2024: A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM research

Research Papers in February 2024: A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM research

Once again, this has been an exciting month in AI research. This month, I'm covering two new openly available LLMs, insights into small finetuned LLMs, and a new parameter-efficient LLM finetuning technique.The two LLMs mentioned above stand out for several reasons. One LLM (OLMo) is completely open source, meaning that everything from the training code to the dataset to the log files is openly shared.The other LLM (Gemma) also comes with openly available weights but achieves state-of-the-art performance on several benchmarks and outperforms popular LLMs of similar size, such as Llama 2 7B and Mistral 7B, by a large margin.However,…
Read More

Seoul summit showcases UK’s progress on trying to make advanced AI safe

The UK is leading an international effort to test the most advanced AI models for safety risks before they hit the public, as regulators race to create a workable safety regime before the Paris summit in six months.Britain’s AI Safety Institute, the first of its kind, is now matched by counterparts from around the world, including South Korea, the US, Singapore, Japan and France.Regulators at the Seoul AI Summit hope the bodies can collaborate to create the 21st-century version of the Montreal Protocol, the groundbreaking agreement to control CFCs and close the hole in the ozone layer.But before they do,…
Read More

On NeurIPS’ High School Paper Track • David Stutz

The decision to have a separate High School Project Track at NeurIPS 2024 has sparked quite some controversy, with many prominent AI researchers debating pros and cons and personal opinions, primarily on X/Twitter. Initially, I ignored this discussion, but eventually started thinking about it myself. Here are some of my thoughts. A short disclaimer is necessary before diving in: the below is a rather personal opinion on the subject — driven by my personal experiences in AI research. As such, it is not meant to blame, contradict or discredit anyone or anything. Instead it is an attempt to add color.…
Read More
The Goldilocks Scenario Every VC Is Hoping For 

The Goldilocks Scenario Every VC Is Hoping For 

Over the past few years, we’ve witnessed the meteoric top of the venture and startup markets where valuations were through the roof, investors were competing with each other on speed (instead of due diligence), founders were exclusively focused on raising the next round, and startups had an almost unlimited source of capital to pursue growth at all costs. Those days ended with a series of significant blows to the ecosystem including the Silicon Valley Bank collapse, global wars and rising interest rates. Now the industry has settled into a new, healthy normal where valuations have returned to reasonable levels, only…
Read More
Optimizing Databricks LLM Pipelines with DSPy

Optimizing Databricks LLM Pipelines with DSPy

If you’ve been following the world of industry-grade LLM technology for the last year, you’ve likely observed a plethora of frameworks and tools in production. Startups are building everything from Retrieval-Augmented Generation (RAG) automation to custom fine-tuning services. Langchain is perhaps the most famous of all these new frameworks, enabling easy prototypes for chained language model components since Spring 2023. However, a recent, significant development has come not from a startup, but from the world of academia. In October 2023, researchers working in Databricks co-founder Matei Zaharia’s Stanford research lab released DSPy, a library for compiling declarative language model calls into…
Read More
Mozilla says it will add Tab Groups, Vertical Tabs, Profile Management to Firefox – gHacks Tech News

Mozilla says it will add Tab Groups, Vertical Tabs, Profile Management to Firefox – gHacks Tech News

Mozilla has officially announced a roadmap that outlines some important features which will be added to Firefox. Notable additions will include support for Tab Groups, Vertical Tabs, and a better Profile Management system. Vertical Tabs are coming to Firefox Mozilla's Tweet poked fun at itself, saying that it heard users who had been asking for a vertical tab bar in the browser. The feature is already available in the Larch channel of Firefox, which Martin tested last month. Vertical tabs aren't new, Brave browser has the feature, as do Vivaldi, and Microsoft Edge. Firefox users have relied on extensions like…
Read More
AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

Last year, the team began experimenting with a tiny model that uses only a single layer of neurons. (Sophisticated LLMs have dozens of layers.) The hope was that in the simplest possible setting they could discover patterns that designate features. They ran countless experiments with no success. “We tried a whole bunch of stuff, and nothing was working. It looked like a bunch of random garbage,” says Tom Henighan, a member of Anthropic’s technical staff. Then a run dubbed “Johnny”—each experiment was assigned a random name—began associating neural patterns with concepts that appeared in its outputs.“Chris looked at it, and…
Read More
ChatGPT is now better than ever at faking human emotion and behaviour

ChatGPT is now better than ever at faking human emotion and behaviour

Earlier this week OpenAI launched GPT-4o (“o” for “omni”), a new version of the artificial intelligence (AI) system powering the popular ChatGPT chatbot. GPT-4o is promoted as a step towards more natural engagement with AI. According to the demonstration video, it can have voice conversations with users in near real-time, exhibiting human-like personality and behaviour. This emphasis on personality is likely to be a point of contention. In OpenAI’s demos, GPT-4o sounds friendly, empathetic and engaging. It tells “spontaneous” jokes, giggles, flirts and even sings. The AI system also shows it can respond to users’ body language and emotional tone.…
Read More
Predictive Human Preference: From Model Ranking to Model Routing

Predictive Human Preference: From Model Ranking to Model Routing

A challenge of building AI applications is choosing which model to use. What if we don’t have to? What if we can predict the best model for any prompt? Predictive human preference aims to predict which model users might prefer for a specific query. Table of contents Ranking Models Using Human Preference…. How Preferential Ranking Works…. Correctness of Chatbot Arena Ranking…….. Eval data…….. ResultsPredicting Human Preference For Each Prompt…. Experiment setup…. Experiment results…….. Domain-specific and query-specific leaderboardsConclusion Human preference has emerged to be both the Northstar and a powerful tool for AI model development. Human preference guides post-training techniques including…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.