stp2y

30273 Posts
DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

arXiv:2405.14899v1 Announce Type: new Abstract: In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their flexibility and generality. ICL possesses many distinct characteristics from conventional machine learning, thereby requiring new approaches to interpret this learning paradigm. Taking the viewpoint of recent works showing that transformers learn in context by formulating an internal optimizer, we propose an influence function-based attribution technique, DETAIL, that addresses the specific characteristics of ICL. We empirically verify the effectiveness of our approach for demonstration attribution…
Read More
Winamp: new music platform and source code release – gHacks Tech News

Winamp: new music platform and source code release – gHacks Tech News

Winamp used to be a popular audio player for Windows. Things went quiet after its heyday and rights changed hands several times in that period. Veteran users of the player had big hopes when a comeback of the player was announced in 2021. After a, mostly, bug fix release in 2022, a new Winamp Player was announced for release in 2023. The release turned things around significantly, as it was launched as as web-based application and not a traditional desktop app. Even worse, the web-based player did not support playing local files. Everything seemed to be designed to push the…
Read More
Research Highlights Jul-Aug 2023: Llama 2, Flash-Attention 2, and More

Research Highlights Jul-Aug 2023: Llama 2, Flash-Attention 2, and More

Every month is a busy month for LLM research. However, this month has been particularly interesting due to the release of new state-of-the-art base models, such as Meta's Llama 2 model suite. Double kudos: this new iteration of Llama models comes without any major restrictions and a very detailed 77-page report on arXiv!I am still compiling all my notes and thoughts after reading through this 77-pager for the next main issue of this newsletter -- coming soon! However, in the meantime, I wanted to share some of the main takeaways in the monthly Research Highlights (next to many other interesting…
Read More
Eat a rock a day, put glue on your pizza: how Google’s AI is losing touch with reality

Eat a rock a day, put glue on your pizza: how Google’s AI is losing touch with reality

Google has rolled out its latest experimental search feature on Chrome, Firefox and the Google app browser to hundreds of millions of users. “AI Overviews” saves you clicking on links by using generative AI — the same technology that powers rival product ChatGPT — to provide summaries of the search results. Ask “how to keep bananas fresh for longer” and it uses AI to generate a useful summary of tips such as storing them in a cool, dark place and away from other fruits like apples. But ask it a left-field question and the results can be disastrous, or even…
Read More
Data Machina #238

Data Machina #238

Non-stop AI Innovation Every Single Week. Well yeah, thats’s right: There is no single week without something new, exciting, or amazing happening in AI. This is a selection of interesting, cool stuff that happened in the last 7 days or so:OpenAI introduced new, faster, and more efficient embedding models. Buried in the blog announcement, it says: “the new embedding models were trained with a technique that allows developers to shorten embeddings without the embedding losing its concept-representing properties.” Well - for some reason- it seems the blog fails to mention that the technique is called Matryoshka Representation Learning (paper, repo),…
Read More
It’s Time To Be More Strategic About AI In Customer Service

It’s Time To Be More Strategic About AI In Customer Service

Generative AI has topped the list of customer inquiries and conversations that I have been having this year — no surprises there! Interestingly, at least half of them have been about AI in customer service and chatbots, virtual assistants, and knowledge bots for customers, which seem top of mind for most customer service leaders. This makes us wonder if this is the limit of what AI can do for customer service. I think not. These are the most obvious and simplest use cases that need the least amount of disruption in existing customer service operations, seen as easily done and…
Read More
One year update: book submitted; TIME 100; Sep 21 online workshop

One year update: book submitted; TIME 100; Sep 21 online workshop

It’s almost exactly a year since we launched this newsletter and began writing our book. Earlier this week, we turned in our manuscript to our publisher! It’s now in the hands of peer reviewers.In the book, we dig into the ideas behind generative AI, tackle fears around artificial general intelligence, explain the scientific and ethical limitations of predictive AI, categorize the many types of AI harms, explore why AI has failed to fix social media, analyze how AI hype and misinformation are created and amplified, and argue that many AI problems in fact reflect problems with capitalism requiring deeper reforms.…
Read More
Improving Text2SQL Performance with Ease on Databricks

Improving Text2SQL Performance with Ease on Databricks

Want to raise your LLM into the top 10 of Spider, a widely used benchmark for text-to-SQL tasks? Spider evaluates how well LLMs can convert text queries into SQL code.For those unfamiliar with text-to-SQL, its significance lies in transforming how businesses interact with their data. Instead of relying on SQL experts to write queries, people can simply ask questions of their data in plain English and receive precise answers. This democratizes access to data, enhancing business intelligence and enabling more informed decision-making.The Spider benchmark is a widely recognized standard for evaluating the performance of text-to-SQL systems. It challenges LLMs to…
Read More
How to manage a team of AI agents

How to manage a team of AI agents

Hey all, its been a awhile. I took time off for honeymoon and the resulting backlog of work. To those who joined the enterprise-ready AI event, thank you for making it awesome! The interest was staggering. Ashish and I will share the key takeaways soon. If you’d like to be the first to know of future events, consider subscribing! The user experience of ChatGPT & similar products is that it requires a human to pilot. Its works as a collaborator that needs live instructions. A different way to experience AI is that of fully autonomous AI agents, large language model…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.