UNLEARN Efficient Removal of Knowledge in Large Language Models

UNLEARN Efficient Removal of Knowledge in Large Language Models

arXiv:2408.04140v1 Announce Type: new Abstract: Given the prevalence of large language models (LLMs) and the prohibitive cost of training these models from scratch, dynamically forgetting specific knowledge e.g., private or proprietary, without retraining the model has become an important capability. This paper proposes a novel method to achieve this objective called UNLEARN. The approach builds upon subspace methods to identify and specifically target the removal of knowledge without adversely affecting other knowledge in the LLM. Results demonstrate 96% of targeted knowledge can be forgotten while maintaining performance on other knowledge within 2.5% of the original model, significantly outperforming the discriminatory…
Read More
Cellnex Sells Unit For €803 Million, Signals Share Buyback

Cellnex Sells Unit For €803 Million, Signals Share Buyback

Cellnex Telecom SA has agreed to sell its Austrian business to an investor consortium comprised of Vauban Infrastructure Partners, EDF Invest and MEAG for an enterprise value of €803 million ($877 million), as the Spanish tower operator works to cut debt and exit smaller markets.The deal includes an unconditional deferred payment of €272 million in December 2028 and is subject to customary regulatory approvals, Cellnex said in a statementBloomberg Terminal on Friday. Once the transaction closes, the company “will reassess its capital allocation priorities,” it said, in reference to a possible share buyback. Source link lol
Read More
The internet relies on vulnerable undersea cables — but the biggest risks aren’t the ones you think

The internet relies on vulnerable undersea cables — but the biggest risks aren’t the ones you think

The biggest threat to the undersea cables that power the internet —and, by extension, the world — may not be what you'd expect.While espionage and sabotage are growing threats, less dramatic but more frequent factors are causing the biggest problems.As The Guardian reported, the network of undersea cables is more at risk from anchors and fishing than from Russian spies.The Guardian reported on the vulnerability of the network of undersea cables that power internet connectivity worldwide, citing Tonga as an example.In 2022, an underwater eruption 1,000 times the power of the Hiroshima atomic bomb severed the country's internet connection, triggering…
Read More
WeRide Is Said to Seek Up to $400 Million in US IPO, Placement

WeRide Is Said to Seek Up to $400 Million in US IPO, Placement

China’s WeRide Inc. is seeking as much as $400 million in a US initial public offering and concurrent private placement, people familiar with the matter said. The Guangzhou-based autonomous vehicle company, which was granted approval last year by the Chinese securities regulator for a US listing, is seeking about $100 million in the IPO and around $200 million to $300 million in the placement, the people said. Source link lol
Read More
Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference

Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference

arXiv:2408.04107v1 Announce Type: new Abstract: In large-language models, memory constraints in the key-value cache (KVC) pose a challenge during inference, especially with long prompts. In this work, we observed that compressing KV values is more effective than compressing the model regarding accuracy and job completion time (JCT). However, quantizing KV values and dropping less-important tokens incur significant runtime computational time overhead, delaying JCT. These methods also cannot reduce computation time or high network communication time overhead in sequence-parallelism (SP) frameworks for long prompts. To tackle these issues, based on our insightful observations from experimental analysis, we propose ZeroC, a Zero-delay…
Read More
Intel is bringing GPUs to cars

Intel is bringing GPUs to cars

Intel has unveiled a discrete GPU for cars, the Arc A760A, designed to bring the "triple-A gaming experience" from home over to your car, the company announced. No automotive partners were revealed, but vehicles with the new chips will go on sale as soon as 2025.With car buyers increasingly focused on in-vehicle entertainment above all else, the chips are designed to "unlock a new era of AI-powered cockpit experiences," according to Intel's press release.The GPUs will allow voice, camera and gesture recognition to make it easy to control up to "seven high-definition screens rendering 3D graphics and six-in vehicle cameras…
Read More
Decoding Visual Sentiment of Political Imagery

Decoding Visual Sentiment of Political Imagery

arXiv:2408.04103v1 Announce Type: new Abstract: How can we define visual sentiment when viewers systematically disagree on their perspectives? This study introduces a novel approach to visual sentiment analysis by integrating attitudinal differences into visual sentiment classification. Recognizing that societal divides, such as partisan differences, heavily influence sentiment labeling, we developed a dataset that reflects these divides. We then trained a deep learning multi-task multi-class model to predict visual sentiment from different ideological viewpoints. Applied to immigration-related images, our approach captures perspectives from both Democrats and Republicans. By incorporating diverse perspectives into the labeling and model training process, our strategy addresses…
Read More
4 Starter Kits to Jumpstart Your ASP.NET Core SaaS Project

4 Starter Kits to Jumpstart Your ASP.NET Core SaaS Project

Launching a SaaS product can be an exciting yet daunting task. Choosing the right foundation for your project is crucial for building a scalable, secure, and maintainable application. This article explores four powerful starter kits specifically designed for ASP.NET Core SaaS development, along with a bonus option for Blazor-based SaaS projects. 1. ASP.NET Boilerplate: Lightweight and Open-Source ASP.NET Boilerplate offers a free and open-source foundation for your ASP.NET Core SaaS project. It provides a solid starting point with essential features like: Modular Design: Choose the components you need, such as user management, authentication, and authorization, without unnecessary bloat. Multi-Tenancy: Easily…
Read More

TGH Transforms Operating Room Safety and Efficiency with Apella

Tampa General Hospital (TGH) today announced the launch of Apella, a technology platform that leverages artificial intelligence (AI) to provide data and insights that enhance safety, increase efficiency and elevate the patient experience in the operating room. “We have some of the most talented surgeons and clinical teams in the world right here at Tampa General. The question is — how do you make the best better?” said John Couris, president and CEO of Tampa General Hospital. “Through innovation and technology such as Apella, we’re giving our teams the tools and information to enhance quality, strengthen safety and improve patient outcomes. We’re also…
Read More
LLM Observability: Fundamentals, Practices, and Tools

LLM Observability: Fundamentals, Practices, and Tools

LLM observability is the practice of gathering data about an LLM-based system in production to understand, evaluate, and optimize it. Developers and operators gain insight by recording prompts and user feedback, tracing user requests through the components, monitoring latency and API usage, performing LLM evaluations, and assessing retrieval performance. A range of frameworks and platforms supports the implementation of LLM observability. As new types of models are released and best practices emerge, these tools will continue to adapt and evolve. Large Language Models (LLMs) have become the driving force behind AI-powered applications, ranging from translation services to chatbots and RAG…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.